The Grammar of Copulas Across Languages

Copular verbs and copular sentences have been for many years a central issue in the theoretical discussions about the nature of (light) verbs and other grammatical categories, the ingredients of predication structures, the properties of nominal categories, agreement, and the interaction between syntax and semantics at the level of clause structure. The current research on copulas has gone beyond the investigation of what kind of objects they are, and has implications for the nature of agreement and other formal processes in syntax and morphology, as well as proposals about the types of structure building operations available in natural languages, the types of features that lexical selection is sensitive to, and the possibility that languages have access to semantically-empty elements required for the satisfaction of purely formal properties. The twelve works included in this volume illustrate the state of the art of these discussions through the analysis of detailed patterns of data from a variety of languages.

2020 ◽  
Vol 17 (29) ◽  
pp. 181-204
Author(s):  
PAULA GUERRA

Neste artigo procuraremos analisar os motivos para a invisibilidade feminina no rockportuguêscomo aspeto central da construção da feminilidade da contemporaneidade portuguesa. Noutro lugar demonstramos a existência de uma consistente dominação masculina no rockportuguês. Parece que as mulheres apenas são recordadas pela lente dos estereótipos dominantes, ou como meras namoradas, acompanhantes e atores sociais submissos em espaço público. Para combater esse esquecimento propomos, primeiro, um estado da arte que cruze género e estudos juvenis, depois uma curta apresentação do estado da participação feminina no rock português, para depois nos centrarmos na questão central do artigo: a história de vida de Xana, vocalista dos RádioMacau. Uma trajetória paradigmática não só de uma músicaportuguesa, mas de toda a construção da feminilidade no mundo das artes e da cultura na história recente de Portugal. Palavras-chave: Portugal.Rock.Dominação Masculina. Género. Xana. Rádio Macau. A PLACE WITH NO PLACE... IN PORTUGUESE ROCK Abstract: In this article we analyze the reasons for female invisibility in Portuguese rock as a central aspect in the construction of the femininity of the Portuguesecontemporaneity. Elsewhere we showed the existence of a consistent male domination in Portuguese rockscene.It seems that the women are barely remembered through the dominant stereotypeslenses, such as mere lovers, companions and submissive social actressesin public space. To combat this invisibility, we propose, first, a state of the art about gender and youth studies, then a brief presentation of the state of female participation in Portuguese rock, and then the central issue of the article: the life historyof Xana, vocalist of Radio Macau.A paradigmatic trajectory not only of Portuguese music, but of the entire construction of femininity in the world of arts and culture in recent Portuguese history. Keywords: Portugal. Rock. Male Domination. Gender. Xana. Radio Macau. UN LUGAR SIN LUGAR ... EN EL ROCK PORTUGUÉS Resumen: En este artículo analizaremos las razones de la invisibilidad femenina en el punk portugués. En otras partes4demostramos la existencia de una profunda misoginia en las letras punk portuguesas.Parece que las mujeres solo son recordadasa través de la lente de los estereotipos dominantes, o como meras novias, chaperonas y actores sociales sumisos en el espacio público. Para combatir este olvido, proponemos, primero, un estado del arte que cruza los estudios de género y juventud, luego una breve presentación del estado de la participación femenina en el rockportugués, y luego nos centramos en el tema central del artículo: la historia de vida de Xana, la vocalista de Rádio Macau.Una trayectoria paradigmática no solo de la música portuguesa, sino de toda la construcción de la feminidad en el mundo de las artes y la cultura en la historia portuguesa reciente.Palabras clave: Portugal. Rock. Dominación Masculina.Gender.Xana.Rádio Macau.


Author(s):  
Sergiy Pogorilyy ◽  
Artem Kramov

The detection of coreferent pairs within a text is one of the basic tasks in the area of natural language processing (NLP). The state‑ of‑ the‑ art methods of coreference resolution are based on machine learning algorithms. The key idea of the methods is to detect certain regularities between the semantic or grammatical features of text entities. In the paper, the comparative analysis of current methods of coreference resolution in English and Ukrainian texts has been performed. The key disadvantage of many methods consists in the interpretation of coreference resolution as a classification problem. The result of coreferent pairs detection is the set of groups in which elements refer to a common entity. Therefore it is advisable to consider the coreference resolution as a clusterization task. The method of coreference resolution using the set of filtering sieves and a convolutional neural network has been suggested. The set of filtering sieves to find candidates for coreferent pairs formation has been implemented. The training process of a multichannel convolutional neural network on a marked Ukrainian corpus has been performed. The usage of a multichannel structure allows analyzing of the different components of text units: semantic, lexical, and grammatical features of words and sentences. Furthermore, it is possible to process input data with unfixed size (words or sentences of a text) using a convolutional layer. The output result of the method is the set of clusters. In order to form clusters, it is necessary to take into account the previous steps of the model’s workflow. Nevertheless, such an approach contradicts the traditional methodology of machine learning. Thus, the training process of the network has been performed using the SEARN algorithm that allows the solving of tasks with unfixed output structures using a classifier model. An experimental examination of the method on the corpus of Ukrainian news has been performed. In order to estimate the accuracy of the method the corresponding common metrics for clusterization tasks have been calculated. The results obtained can indicate that the suggested method can be used to find coreferent pairs within Ukrainian texts. The method can be also easily adapted and applied to other natural languages.


2021 ◽  
Author(s):  
Himanshu Yadav ◽  
Shubham Mittal ◽  
Samar Husain

Dependency length minimization is widely regarded as a cross-linguistic universal capturing syntactic complexity in natural languages. A typical way to operationalize dependency length in corpus-based studies is to count the number of words between syntactically related words. However, such a formulation ignores the syntactic nature of the linguistic material that intervenes the dependency. We present a cross-linguistic corpus study to argue that the number of syntactic heads (rather than words) that intervene a dependency better captures the syntactic complexity across languages. We demonstrate that the dependency length minimization constraint in terms of number of words is a consequence of constraints on intervening heads and tree properties such as arity. The current study shows that when syntactically related words are non-adjacent, increased structure building in the intervening region is avoided.


2020 ◽  
Vol 34 (01) ◽  
pp. 1169-1176
Author(s):  
Huangzhao Zhang ◽  
Zhuo Li ◽  
Ge Li ◽  
Lei Ma ◽  
Yang Liu ◽  
...  

Automated processing, analysis, and generation of source code are among the key activities in software and system lifecycle. To this end, while deep learning (DL) exhibits a certain level of capability in handling these tasks, the current state-of-the-art DL models still suffer from non-robust issues and can be easily fooled by adversarial attacks.Different from adversarial attacks for image, audio, and natural languages, the structured nature of programming languages brings new challenges. In this paper, we propose a Metropolis-Hastings sampling-based identifier renaming technique, named \fullmethod (\method), which generates adversarial examples for DL models specialized for source code processing. Our in-depth evaluation on a functionality classification benchmark demonstrates the effectiveness of \method in generating adversarial examples of source code. The higher robustness and performance enhanced through our adversarial training with \method further confirms the usefulness of DL models-based method for future fully automated source code processing.


2020 ◽  
Vol 34 (05) ◽  
pp. 9693-9700
Author(s):  
Yinhe Zheng ◽  
Rongsheng Zhang ◽  
Minlie Huang ◽  
Xiaoxi Mao

Endowing dialogue systems with personas is essential to deliver more human-like conversations. However, this problem is still far from well explored due to the difficulties of both embodying personalities in natural languages and the persona sparsity issue observed in most dialogue corpora. This paper proposes a pre-training based personalized dialogue model that can generate coherent responses using persona-sparse dialogue data. In this method, a pre-trained language model is used to initialize an encoder and decoder, and personal attribute embeddings are devised to model richer dialogue contexts by encoding speakers' personas together with dialogue histories. Further, to incorporate the target persona in the decoding process and to balance its contribution, an attention routing structure is devised in the decoder to merge features extracted from the target persona and dialogue contexts using dynamically predicted weights. Our model can utilize persona-sparse dialogues in a unified manner during the training process, and can also control the amount of persona-related features to exhibit during the inference process. Both automatic and manual evaluation demonstrates that the proposed model outperforms state-of-the-art methods for generating more coherent and persona consistent responses with persona-sparse data.


2017 ◽  
Vol 21 (1) ◽  
pp. 13-38
Author(s):  
Götz König

The Nask Bayān, once part of the so called Greater (Sasanian) Avesta, but now lost, became a central issue of discussion in the Avestan Studies over the last 15 years. These discussions tried to clarify its relation to the collection of the Yašts (and to the Xorde Avesta) and to reconstruct types of a liturgical celebration of this Bayān Nask or of parts of it. The following article gives an overview on the recent research. It gives also some new suggestions concerning the structure of the collection of the Yašts.


2021 ◽  
Vol 31 ◽  
Author(s):  
MARTIN ELSMAN ◽  
NIELS HALLENBERG

Abstract We present a region-based memory management scheme with support for generational garbage collection. The scheme features a compile-time region inference algorithm, which associates values with logical regions, and builds on a region type system that deploys region types at runtime to avoid the overhead of write barriers and to support partly tag-free garbage collection. The scheme is implemented in the MLKit Standard ML compiler, which generates native x64 machine code. Besides demonstrating a number of important formal properties of the scheme, we measure the scheme’s characteristics, for a number of benchmarks, and compare the performance of the generated executables with the performance of executables generated with the MLton state-of-the-art Standard ML compiler and configurations of the MLKit with and without region inference and generational garbage collection enabled. Although region inference often serves the purpose of generations, combining region inference with generational garbage collection is shown often to be superior to combining region inference with non-generational collection despite the overhead introduced by a larger amount of memory waste, due to region fragmentation.


Author(s):  
Kareem Darwish ◽  
Ahmed Abdelali ◽  
Hamdy Mubarak ◽  
Mohamed Eldesouki

Diacritics (short vowels) are typically omitted when writing Arabic text, and readers have to reintroduce them to correctly pronounce words. There are two types of Arabic diacritics: The first are core-word diacritics (CW), which specify the lexical selection, and the second are case endings (CE), which typically appear at the end of word stems and generally specify their syntactic roles. Recovering CEs is relatively harder than recovering core-word diacritics due to inter-word dependencies, which are often distant. In this article, we use feature-rich recurrent neural network model that use a variety of linguistic and surface-level features to recover both core word diacritics and case endings. Our model surpasses all previous state-of-the-art systems with a CW error rate (CWER) of 2.9% and a CE error rate (CEER) of 3.7% for Modern Standard Arabic (MSA) and CWER of 2.2% and CEER of 2.5% for Classical Arabic (CA). When combining diacritized word cores with case endings, the resultant word error rates are 6.0% and 4.3% for MSA and CA, respectively. This highlights the effectiveness of feature engineering for such deep neural models.


2010 ◽  
Vol 66 (1) ◽  
pp. 1-16 ◽  
Author(s):  
Tom Baikie ◽  
Stevin S. Pramana ◽  
Cristiano Ferraris ◽  
Yizhong Huang ◽  
Emma Kendrick ◽  
...  

Certain complex structures are logically regarded as intergrowths of chemically or topologically discrete modules. When the proportions of these components vary systematically a polysomatic series is created, whose construction provides a basis for understanding defects, symmetry alternation and trends in physical properties. Here, we describe the polysomatic family A 5N B 3N O9N + 6 X Nδ (2 ≤ N ≤ ∞) that is built by condensing N apatite modules (A 5 B 3O18 X δ) in configurations to create B n O3n + 1 (1 ≤ n ≤ ∞) tetrahedral chains. Hydroxyapatite [Ca10(PO4)6(OH)2] typifies a widely studied polysome where N = 2 and the tetrahedra are isolated in A 10(BO4)6 X 2 compounds, but N = 3 A 15(B 2O7)3(BO4)3 X 3 (ganomalite) and N = 4 A 20(B 2O7)6 X 4 (nasonite) are also known, with the X site untenanted or partially occupied as required for charge balance. The apatite modules, while topologically identical, are often compositionally or symmetrically distinct, and an infinite number of polysomes is feasible, generally with the restriction being that an A:B = 5:3 cation ratio be maintained. The end-members are the N = 2 polysome with all tetrahedra separated, and N = ∞, in which the hypothetical compound A 5 B 3O9 X contains infinite, corner-connected tetrahedral strings. The principal characteristics of a polysome are summarized using the nomenclature apatite-(A B X)-NS, where A/B/X are the most abundant species in these sites, N is the number of modules in the crystallographic repeat, and S is the symmetry symbol (usually H, T, M or A). This article examines the state-of-the-art in polysomatic apatite synthesis and crystallochemical design. It also presents X-ray and neutron powder diffraction investigations for several polysome chemical series and examines the prevalence of stacking disorder by electron microscopy. These insights into the structure-building principles of apatite polysomes will guide their development as functional materials.


Sign in / Sign up

Export Citation Format

Share Document