The Grammar of Copulas Across Languages

Mapping Intimacies ◽

10.1093/oso/9780198829850.001.0001 ◽

2019 ◽

Keyword(s):

State Of The Art ◽

Central Issue ◽

Lexical Selection ◽

Clause Structure ◽

Natural Languages ◽

Light Verbs ◽

Structure Building ◽

Nominal Categories ◽

Copular Sentences ◽

Formal Properties

Copular verbs and copular sentences have been for many years a central issue in the theoretical discussions about the nature of (light) verbs and other grammatical categories, the ingredients of predication structures, the properties of nominal categories, agreement, and the interaction between syntax and semantics at the level of clause structure. The current research on copulas has gone beyond the investigation of what kind of objects they are, and has implications for the nature of agreement and other formal processes in syntax and morphology, as well as proposals about the types of structure building operations available in natural languages, the types of features that lexical selection is sensitive to, and the possibility that languages have access to semantically-empty elements required for the satisfaction of purely formal properties. The twelve works included in this volume illustrate the state of the art of these discussions through the analysis of detailed patterns of data from a variety of languages.


Noam Chomsky and George A. Miller. Introduction to the formal analysis of natural languages. Handbook of mathematical psychology, Volume II, edited by R. Duncan Luce, Robert R. Bush, and Eugene Galanter, John Wiley and Sons, Inc., New York and London 1963, pp. 269–321. - Noam Chomsky. Formal properties of grammars. Handbook of mathematical psychology, Volume II, edited by R. Duncan Luce, Robert R. Bush, and Eugene Galanter, John Wiley and Sons, Inc., New York and London 1963, pp. 323–418. - George A. Miller and Noam Chomsky. Finitary models of language users. Handbook of mathematical psychology, Volume II, edited by R. Duncan Luce, Robert R. Bush, and Eugene Galanter, John Wiley and Sons, Inc., New York and London 1963, pp. 419–491.

Journal of Symbolic Logic ◽

10.2307/2269904 ◽

1968 ◽

Vol 33 (2) ◽

pp. 299-300

Author(s):

Joseph S. Ullian

Keyword(s):

New York ◽

Formal Analysis ◽

Mathematical Psychology ◽

Natural Languages ◽

Noam Chomsky ◽

Formal Properties


A PLACE WITH NO PLACE... IN PORTUGUESE ROCK (UM LUGAR SEM LUGAR... NO ROCK PORTUGUÊS)

Outros Tempos – Pesquisa em Foco - História ◽

10.18817/ot.v17i29.757 ◽

2020 ◽

Vol 17 (29) ◽

pp. 181-204

Author(s):

PAULA GUERRA

Keyword(s):

Public Space ◽

State Of The Art ◽

Central Issue ◽

Female Participation ◽

Male Domination ◽

The World ◽

Arts And Culture ◽

Life History ◽

Women ◽

Portuguese Music

In this article we analyze the reasons for female invisibility in Portuguese rock as a central aspect of the construction of femininity in contemporary Portugal. Elsewhere we have shown the existence of consistent male domination in the Portuguese rock scene. It seems that women are remembered only through the lens of dominant stereotypes, as mere girlfriends, companions, and submissive social actors in public space. To combat this invisibility, we propose, first, a state of the art that crosses gender and youth studies, then a brief presentation of the state of female participation in Portuguese rock, and finally turn to the central issue of the article: the life history of Xana, vocalist of Rádio Macau. Hers is a paradigmatic trajectory not only of a Portuguese musician, but of the entire construction of femininity in the world of arts and culture in recent Portuguese history. Keywords: Portugal. Rock. Male Domination. Gender. Xana. Rádio Macau.


COREFERENT PAIRS DETECTION IN UKRAINIAN TEXTS USING A CONVOLUTIONAL NEURAL NETWORK

Visnyk Universytetu “Ukraina” ◽

10.36994/2707-4110-2019-2-23-25 ◽

2019 ◽

Author(s):

Sergiy Pogorilyy ◽

Artem Kramov

Keyword(s):

Neural Network ◽

Machine Learning ◽

Convolutional Neural Network ◽

Language Processing ◽

State Of The Art ◽

Classification Problem ◽

Machine Learning Algorithms ◽

Coreference Resolution ◽

Training Process ◽

Natural Languages

The detection of coreferent pairs within a text is one of the basic tasks in natural language processing (NLP). State-of-the-art methods of coreference resolution are based on machine learning algorithms. The key idea of these methods is to detect regularities between the semantic or grammatical features of text entities. The paper presents a comparative analysis of current methods of coreference resolution in English and Ukrainian texts. A key disadvantage of many methods is that they treat coreference resolution as a classification problem. The result of coreferent pair detection is a set of groups whose elements refer to a common entity, so it is advisable to treat coreference resolution as a clustering task. A method of coreference resolution using a set of filtering sieves and a convolutional neural network is proposed. The filtering sieves, which find candidates for coreferent pairs, have been implemented, and a multichannel convolutional neural network has been trained on an annotated Ukrainian corpus. The multichannel structure allows different components of text units to be analyzed: semantic, lexical, and grammatical features of words and sentences. Furthermore, a convolutional layer makes it possible to process input of variable size (words or sentences of a text). The output of the method is a set of clusters. To form clusters, the previous steps of the model's workflow must be taken into account; such an approach, however, contradicts the traditional methodology of machine learning. The network was therefore trained using the SEARN algorithm, which allows tasks with variable output structures to be solved with a classifier model. The method was evaluated experimentally on a corpus of Ukrainian news. To estimate its accuracy, common metrics for clustering tasks were calculated. The results indicate that the suggested method can be used to find coreferent pairs within Ukrainian texts. The method can also be easily adapted and applied to other natural languages.
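
The difference between a list of pairwise decisions and a set of clusters can be illustrated with a minimal sketch. It is not the authors' SEARN-based method: it only shows how accepted coreferent pairs could be merged into groups that each refer to a common entity, using a union-find structure. The function name `cluster_pairs` and the toy mentions are hypothetical.

```python
from typing import Dict, Iterable, List, Set, Tuple


def cluster_pairs(mentions: Iterable[str],
                  pairs: Iterable[Tuple[str, str]]) -> List[Set[str]]:
    """Merge pairwise coreference links into clusters (union-find sketch)."""
    mentions = list(mentions)
    parent: Dict[str, str] = {m: m for m in mentions}

    def find(x: str) -> str:
        # Follow parent links to the representative, compressing the path.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b in pairs:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a  # union the two groups

    clusters: Dict[str, Set[str]] = {}
    for m in mentions:
        clusters.setdefault(find(m), set()).add(m)
    return list(clusters.values())


# Hypothetical mentions and pair decisions, for illustration only.
toy_mentions = ["Kyiv", "the city", "it", "the mayor", "he"]
toy_pairs = [("Kyiv", "the city"), ("the city", "it"), ("the mayor", "he")]
toy_clusters = cluster_pairs(toy_mentions, toy_pairs)
```

Here three pair decisions yield two clusters, which is the kind of output that clustering metrics are then computed on.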


A reappraisal of dependency length minimization as a linguistic universal

10.31234/osf.io/95x3g ◽

2021 ◽

Author(s):

Himanshu Yadav ◽

Shubham Mittal ◽

Samar Husain

Keyword(s):

Syntactic Complexity ◽

Natural Languages ◽

Corpus Study ◽

Structure Building ◽

Linguistic Universal ◽

Linguistic Material

Dependency length minimization is widely regarded as a cross-linguistic universal capturing syntactic complexity in natural languages. A typical way to operationalize dependency length in corpus-based studies is to count the number of words between syntactically related words. However, such a formulation ignores the syntactic nature of the linguistic material that intervenes the dependency. We present a cross-linguistic corpus study to argue that the number of syntactic heads (rather than words) that intervene a dependency better captures the syntactic complexity across languages. We demonstrate that the dependency length minimization constraint in terms of number of words is a consequence of constraints on intervening heads and tree properties such as arity. The current study shows that when syntactically related words are non-adjacent, increased structure building in the intervening region is avoided.
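
The two ways of measuring a dependency can be contrasted with a small sketch. It assumes a sentence encoded as a list of head indices (0-based, with -1 for the root) and treats a word as a "syntactic head" if at least one other word depends on it; both the encoding and that definition are assumptions made for illustration, not necessarily the operationalization used in the study.

```python
from typing import List


def intervening_words(head: int, dep: int) -> int:
    """Number of words strictly between two word positions."""
    return abs(head - dep) - 1


def intervening_heads(heads: List[int], head: int, dep: int) -> int:
    """Number of intervening words that govern at least one dependent."""
    lo, hi = sorted((head, dep))
    governing = {h for h in heads if h >= 0}  # positions that act as heads
    return sum(1 for i in range(lo + 1, hi) if i in governing)


# Toy parse (hypothetical): "the dog that chased the cat barked"
words = ["the", "dog", "that", "chased", "the", "cat", "barked"]
heads = [1, 6, 3, 1, 5, 3, -1]

# Dependency between "dog" (position 1) and its head "barked" (position 6).
n_words = intervening_words(6, 1)          # 4 intervening words
n_heads = intervening_heads(heads, 6, 1)   # 2 intervening heads
```

In this toy parse four words intervene, but only two of them ("chased" and "cat") are heads, so the two measures can rank dependencies differently.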


Generating Adversarial Examples for Holding Robustness of Source Code Processing Models

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i01.5469 ◽

2020 ◽

Vol 34 (01) ◽

pp. 1169-1176

Author(s):

Huangzhao Zhang ◽

Zhuo Li ◽

Ge Li ◽

Lei Ma ◽

Yang Liu ◽

...

Keyword(s):

Programming Languages ◽

State Of The Art ◽

Source Code ◽

Natural Languages ◽

Current State ◽

Automated Processing ◽

Adversarial Examples ◽

Adversarial Training ◽

New Challenges ◽

And Performance

Automated processing, analysis, and generation of source code are among the key activities in the software and system lifecycle. While deep learning (DL) exhibits a certain level of capability in handling these tasks, current state-of-the-art DL models still suffer from robustness issues and can be easily fooled by adversarial attacks. Unlike adversarial attacks on images, audio, and natural languages, the structured nature of programming languages brings new challenges. In this paper, we propose a Metropolis-Hastings sampling-based identifier renaming technique, which generates adversarial examples for DL models specialized for source code processing. Our in-depth evaluation on a functionality classification benchmark demonstrates the effectiveness of the technique in generating adversarial examples of source code. The higher robustness and performance obtained through adversarial training with the technique further confirm the usefulness of DL model-based methods for future fully automated source code processing.


A Pre-Training Based Personalized Dialogue Generation Model with Persona-Sparse Data

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i05.6518 ◽

2020 ◽

Vol 34 (05) ◽

pp. 9693-9700

Author(s):

Yinhe Zheng ◽

Rongsheng Zhang ◽

Minlie Huang ◽

Xiaoxi Mao

Keyword(s):

State Of The Art ◽

Language Model ◽

Sparse Data ◽

Generation Model ◽

Dialogue Systems ◽

Inference Process ◽

Training Process ◽

Natural Languages ◽

Dialogue Model ◽

Proposed Model

Endowing dialogue systems with personas is essential to deliver more human-like conversations. However, this problem is still far from well explored due to the difficulties of both embodying personalities in natural languages and the persona sparsity issue observed in most dialogue corpora. This paper proposes a pre-training based personalized dialogue model that can generate coherent responses using persona-sparse dialogue data. In this method, a pre-trained language model is used to initialize an encoder and decoder, and personal attribute embeddings are devised to model richer dialogue contexts by encoding speakers' personas together with dialogue histories. Further, to incorporate the target persona in the decoding process and to balance its contribution, an attention routing structure is devised in the decoder to merge features extracted from the target persona and dialogue contexts using dynamically predicted weights. Our model can utilize persona-sparse dialogues in a unified manner during the training process, and can also control the amount of persona-related features to exhibit during the inference process. Both automatic and manual evaluations demonstrate that the proposed model outperforms state-of-the-art methods in generating more coherent and persona-consistent responses with persona-sparse data.


Bayān Yasn: State of the Art

Iran and the Caucasus ◽

10.1163/1573384x-90000003 ◽

2017 ◽

Vol 21 (1) ◽

pp. 13-38

Author(s):

Götz König

Keyword(s):

State Of The Art ◽

Central Issue

The Nask Bayān, once part of the so-called Greater (Sasanian) Avesta but now lost, has become a central issue of discussion in Avestan Studies over the last 15 years. These discussions have tried to clarify its relation to the collection of the Yašts (and to the Xorde Avesta) and to reconstruct types of liturgical celebration of this Bayān Nask or of parts of it. The following article gives an overview of the recent research. It also offers some new suggestions concerning the structure of the collection of the Yašts.


Integrating region memory management and tag-free generational garbage collection

Journal of Functional Programming ◽

10.1017/s0956796821000010 ◽

2021 ◽

Vol 31 ◽

Author(s):

MARTIN ELSMAN ◽

NIELS HALLENBERG

Keyword(s):

Memory Management ◽

Garbage Collection ◽

State Of The Art ◽

Type System ◽

Inference Algorithm ◽

Machine Code ◽

Standard Ml ◽

Management Scheme ◽

Time Region ◽

Formal Properties

We present a region-based memory management scheme with support for generational garbage collection. The scheme features a compile-time region inference algorithm, which associates values with logical regions, and builds on a region type system that deploys region types at runtime to avoid the overhead of write barriers and to support partly tag-free garbage collection. The scheme is implemented in the MLKit Standard ML compiler, which generates native x64 machine code. Besides demonstrating a number of important formal properties of the scheme, we measure the scheme’s characteristics, for a number of benchmarks, and compare the performance of the generated executables with the performance of executables generated with the MLton state-of-the-art Standard ML compiler and configurations of the MLKit with and without region inference and generational garbage collection enabled. Although region inference often serves the purpose of generations, combining region inference with generational garbage collection is shown often to be superior to combining region inference with non-generational collection despite the overhead introduced by a larger amount of memory waste, due to region fragmentation.


Arabic Diacritic Recovery Using a Feature-rich biLSTM Model

ACM Transactions on Asian and Low-Resource Language Information Processing ◽

10.1145/3434235 ◽

2021 ◽

Vol 20 (2) ◽

pp. 1-18

Author(s):

Kareem Darwish ◽

Ahmed Abdelali ◽

Hamdy Mubarak ◽

Mohamed Eldesouki

Keyword(s):

Error Rate ◽

State Of The Art ◽

Error Rates ◽

Arabic Text ◽

Lexical Selection ◽

Neural Models ◽

Short Vowels ◽

Previous State ◽

Classical Arabic ◽

Modern Standard

Diacritics (short vowels) are typically omitted when writing Arabic text, and readers have to reintroduce them to correctly pronounce words. There are two types of Arabic diacritics: the first are core-word diacritics (CW), which specify the lexical selection, and the second are case endings (CE), which typically appear at the end of word stems and generally specify their syntactic roles. Recovering CEs is relatively harder than recovering core-word diacritics due to inter-word dependencies, which are often distant. In this article, we use a feature-rich recurrent neural network model that uses a variety of linguistic and surface-level features to recover both core-word diacritics and case endings. Our model surpasses all previous state-of-the-art systems with a CW error rate (CWER) of 2.9% and a CE error rate (CEER) of 3.7% for Modern Standard Arabic (MSA) and a CWER of 2.2% and CEER of 2.5% for Classical Arabic (CA). When combining diacritized word cores with case endings, the resultant word error rates are 6.0% and 4.3% for MSA and CA, respectively. This highlights the effectiveness of feature engineering for such deep neural models.


Polysomatic apatites

Acta Crystallographica Section B Structural Science ◽

10.1107/s0108768109053981 ◽

2010 ◽

Vol 66 (1) ◽

pp. 1-16 ◽

Cited By ~ 19

Author(s):

Tom Baikie ◽

Stevin S. Pramana ◽

Cristiano Ferraris ◽

Yizhong Huang ◽

Emma Kendrick ◽

...

Keyword(s):

State Of The Art ◽

Functional Materials ◽

Abundant Species ◽

Complex Structures ◽

Charge Balance ◽

Principal Characteristics ◽

X Ray ◽

Structure Building ◽

End Members ◽

Stacking Disorder

Certain complex structures are logically regarded as intergrowths of chemically or topologically discrete modules. When the proportions of these components vary systematically a polysomatic series is created, whose construction provides a basis for understanding defects, symmetry alternation and trends in physical properties. Here, we describe the polysomatic family $A_{5N}B_{3N}O_{9N+6}X_{N\delta}$ ($2 \le N \le \infty$) that is built by condensing $N$ apatite modules ($A_5B_3O_{18}X_\delta$) in configurations to create $B_nO_{3n+1}$ ($1 \le n \le \infty$) tetrahedral chains. Hydroxyapatite [$\mathrm{Ca_{10}(PO_4)_6(OH)_2}$] typifies a widely studied polysome where $N = 2$ and the tetrahedra are isolated in $A_{10}(BO_4)_6X_2$ compounds, but $N = 3$ $A_{15}(B_2O_7)_3(BO_4)_3X_3$ (ganomalite) and $N = 4$ $A_{20}(B_2O_7)_6X_4$ (nasonite) are also known, with the $X$ site untenanted or partially occupied as required for charge balance. The apatite modules, while topologically identical, are often compositionally or symmetrically distinct, and an infinite number of polysomes is feasible, generally with the restriction being that an $A:B = 5:3$ cation ratio be maintained. The end-members are the $N = 2$ polysome with all tetrahedra separated, and $N = \infty$, in which the hypothetical compound $A_5B_3O_9X$ contains infinite, corner-connected tetrahedral strings. The principal characteristics of a polysome are summarized using the nomenclature apatite-($A\,B\,X$)-$NS$, where $A$/$B$/$X$ are the most abundant species in these sites, $N$ is the number of modules in the crystallographic repeat, and $S$ is the symmetry symbol (usually H, T, M or A). This article examines the state-of-the-art in polysomatic apatite synthesis and crystallochemical design. It also presents X-ray and neutron powder diffraction investigations for several polysome chemical series and examines the prevalence of stacking disorder by electron microscopy. These insights into the structure-building principles of apatite polysomes will guide their development as functional materials.
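
As a quick arithmetic check, the general formula can be evaluated for the named polysomes. The sketch below computes the A, B, O and X counts for a given N and compares them with the counts implied by the tetrahedral groupings quoted in the abstract. The function names are hypothetical, and full X-site occupancy (δ = 1) is assumed purely for illustration.

```python
from typing import Dict, List, Tuple


def polysome_composition(n_modules: int, delta: float = 1.0) -> Dict[str, float]:
    """Site totals for the general formula A(5N) B(3N) O(9N+6) X(N*delta)."""
    if n_modules < 2:
        raise ValueError("the family is defined for N >= 2")
    return {
        "A": 5 * n_modules,
        "B": 3 * n_modules,
        "O": 9 * n_modules + 6,
        "X": n_modules * delta,  # delta = 1 is an assumption (full occupancy)
    }


def count_b_and_o(groups: List[Tuple[int, int, int]]) -> Tuple[int, int]:
    """Sum B and O over groups given as (B per group, O per group, multiplicity)."""
    b_total = sum(b * mult for b, _, mult in groups)
    o_total = sum(o * mult for _, o, mult in groups)
    return b_total, o_total


# Tetrahedral groupings as written in the abstract.
n2_groups = [(1, 4, 6)]              # (BO4)6
n3_groups = [(2, 7, 3), (1, 4, 3)]   # (B2O7)3 (BO4)3, ganomalite
n4_groups = [(2, 7, 6)]              # (B2O7)6, nasonite

for n, groups in ((2, n2_groups), (3, n3_groups), (4, n4_groups)):
    comp = polysome_composition(n)
    assert (comp["B"], comp["O"]) == count_b_and_o(groups)
```

For N = 2, 3 and 4 the formula gives 24, 33 and 42 oxygen atoms, matching the three written compositions, while the A:B ratio stays at 5:3 throughout.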
