scholarly journals Rapid radiation of the inner Indo-European languages: an advanced approach to Indo-European lexicostatistics

Linguistics ◽  
2021 ◽  
Vol 0 (0) ◽  
Author(s):  
Alexei S. Kassian ◽  
Mikhail Zhivlov ◽  
George Starostin ◽  
Artem A. Trofimov ◽  
Petr A. Kocharov ◽  
...  

Abstract In this article we present a new reconstruction of Indo-European phylogeny based on 13 110-item basic wordlists for protolanguages of IE subgroups (Proto-Germanic, Proto-Slavic, etc.) or ancient languages of the corresponding subgroups (Hittite, Ancient Greek, etc.). We apply reasonably formal techniques of linguistic data collection and post-processing (onomasiological reconstruction, derivational drift elimination, homoplastic optimization) that have been recently proposed or specially developed for the present study. We use sequential phylogenetic workflow and obtain a consensus tree based on several algorithms (Bayesian inference, maximum parsimony, neighbor joining; without topological constraints applied). The resulting tree topology and datings are entirely compatible with established expert views. Our main finding is the multifurcation of the Inner IE clade into four branches ca. 3357–2162 bc: (1) Greek-Armenian, (2) Albanian, (3) Italic-Germanic-Celtic, (4) Balto-Slavic–Indo-Iranian. The proposed radiation scenario may be reconciled with diverse opinions on Inner IE branchings previously expressed by Indo-Europeanists.

2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Tom Struck ◽  
Javed Lindner ◽  
Arne Hollmann ◽  
Floyd Schauer ◽  
Andreas Schmidbauer ◽  
...  

AbstractEstablishing low-error and fast detection methods for qubit readout is crucial for efficient quantum error correction. Here, we test neural networks to classify a collection of single-shot spin detection events, which are the readout signal of our qubit measurements. This readout signal contains a stochastic peak, for which a Bayesian inference filter including Gaussian noise is theoretically optimal. Hence, we benchmark our neural networks trained by various strategies versus this latter algorithm. Training of the network with 106 experimentally recorded single-shot readout traces does not improve the post-processing performance. A network trained by synthetically generated measurement traces performs similar in terms of the detection error and the post-processing speed compared to the Bayesian inference filter. This neural network turns out to be more robust to fluctuations in the signal offset, length and delay as well as in the signal-to-noise ratio. Notably, we find an increase of 7% in the visibility of the Rabi oscillation when we employ a network trained by synthetic readout traces combined with measured signal noise of our setup. Our contribution thus represents an example of the beneficial role which software and hardware implementation of neural networks may play in scalable spin qubit processor architectures.


2016 ◽  
Vol 28 (4) ◽  
pp. 353-364 ◽  
Author(s):  
Peter Lipar ◽  
Irena Strnad ◽  
Martin Česnik ◽  
Tomaž Maher

This paper presents GIS-based methodology for urban area driving cycle construction. The approach reaches beyond the frames of usual driving cycle development methods and takes into account another perspective of data collection. Rather than planning data collection, the approach is based on available in-vehicle measurement data post processing using Geographic Information Systems to manipulate the excessive database and extract only the representative and geographically limited individual trip data. With such data post processing the data was carefully adjusted to include only the data that describe representative driving in Ljubljana urban area. The selected method for the driving cycle development is based on searching for the best microtrips combination while minimizing the difference between two vectors; one based on generated cycle and the other on the database. Accounting for a large random sample of actual trip data, our approach enables more representative area-specific driving cycle development than the previously used techniques.


2017 ◽  
Vol 51 (s38) ◽  
Author(s):  
Alexei S. Kassian

AbstractThis paper deals with the problem of linguistic homoplasy (parallel or backward development), how it can be detected, what kinds of linguistic homoplasy can be distinguished and which varieties of the phenomenon are the most deleterious for the reconstruction of language phylogeny. It is proposed that language phylogeny reconstruction should consist of two main stages. Firstly, a strict consensus tree should be built on the basis of high-quality input data elaborated with the help of the main phylogenetic methods (such as Neighbor-joining, Bayesian MCMC, and Maximum parsimony), and ancestral character states, allowing us to reveal a certain number of homoplastic characters. Secondly, after the detected instances of homoplasy are eliminated from the input matrix, the consensus tree is to be compiled again. It is expected that after homoplastic optimization it will be possible to better resolve individual “problem clades”, and generally the homoplasy-optimized phylogeny should be more robust than the tree constructed initially. The proposed procedure is tested on the 110-item Swadesh wordlists of the Lezgian and Tsezic groups. The Lezgian and Tsezic results generally support theoretical expectations. The MLN (minimal lateral network) method, currently implemented in the LingPy software, is a helpful tool for the detection of linguistic homoplasy.


2014 ◽  
Vol 577 ◽  
pp. 856-859
Author(s):  
Fan Yang ◽  
Long Zhang Liu ◽  
Xing Jia Lu

Radio Frequency Identification (RFID) technologies are used in many applications for data collection. However, raw RFID readings are usually of low quality and may contain many anomalies. The solution should take advantage of the resulting data redundancy for data cleaning. In this paper we propose a Bayesian inference based approach for cleaning RFID raw data. Our approach takes full advantage of data redundancy. To capture the likelihood, we design a 3-state detection model and formally prove this model can maximize the system performance.


2009 ◽  
Vol 32 (1) ◽  
pp. 55-76 ◽  
Author(s):  
Nunzio La Fauci ◽  
Liana Tronci

This paper deals with the complex interaction between form and function in the verb morphosyntax of four Indo-European languages (French, Italian, Ancient Greek and Sanskrit). Beyond the difference in form, auxiliation patterns in French and Italian, and verb inflections in Ancient Greek and Sanskrit correlate, thanks to the agreement for number and person, to the expression of the relationship with the Subject. The different auxiliation patterns (sum and habeo) and the different inflections (middle and active) correlate to different properties of the Subject. In particular, these forms depend on the syntactic opposition between middle and non-middle. The ways of this dependency are regulated and systematic, although they appear fuzzy and chaotic, not only if the four languages are compared to each other, but also if different morphosyntactic combinations, inside the same language, are concerned.


Zootaxa ◽  
2017 ◽  
Vol 4232 (4) ◽  
pp. 523 ◽  
Author(s):  
XIAOHONG ZHANG ◽  
JINFENG HAO ◽  
YU XIA ◽  
YAGE CHANG ◽  
DAOCHUAN ZHANG ◽  
...  

The higher taxa classification and phylogeny of the insect order Orthoptera have long been controversial. Hexamerin, as a member of the highly conserved arthropod hemocyanin superfamily, has been shown to be a good marker for the phylogenetic study of insects. However, few studies have used hexamerins on the phylogeny of Orthoptera. In this study, we determined twenty-seven different hexamerin subunit type sequences in seventeen speices of Orthoptera. In order to infer the phylogenetic relationships among the superfamilies within Orthoptera and test the monophyly of Orthoptera, phylogenic trees were reconstructed using Neighbor-Joining (NJ) and Bayesian inference (BI) methods with two dipluran and three hymenopteran hexamerin sequences as outgroups. The result supported the monophyly of Orthoptera, which includes two monophyletic suborders Caelifera and Ensifera. The Caelifera includes Acridoidea, Eumastacoidea, Tetrigoidea and Tridactyloidea, and the Ensifera includes Tettigonioidea, Grylloidea and Gryllotalpoidea. Our study is basically consistent with the study of morphological classification. In addition, our study indicates that a relatively comprehensive taxa sampling is essential to solve some problems in phylogenetic reconstruction. 


2017 ◽  
pp. 155-161
Author(s):  
Liudmyla Marchuk

According to the genetic classification of borrowings, one of the largest groups of foreign language vocabulary in European languages is formed by lexemes of Latin origin. They are the result of interaction between languages, which are often characterized by a significant degree of genetic and temporal distances. In the article the terminology of Forest industry with the most common terminological elements of Latin and Greek origin is analyzed. Element – is not a word, but a part of the word (prefix, suffix, root), which is being in grammatical relationship with the other elements, forms independent words-terms. Thus, the element “micro” has no complete meaning of the words but it has the specific semantic loading, which is transferred by the concept “small, not big one”. In words-terms, it indicates the correlation of those words and constant objects, phenomena. And the precise scientific definition is received by term in the process of studying the specific scientific discipline. Knowledge of the structure of term’s elements explains the meaning of foreign language term, it helps in better understanding. It is well-known that the terms reflects either one dominant or secondary, or occasional feature, underlying the phenomenon, which reflects, such as color, shape etc. Terminology concept with elements of Greek and Latin origin in the Forest field, can be grouped into major thematic sections. The initial point for placing elements is a substantive concept, followed by an element that expresses this concept, the indicator of the origin of element, its importance in translation. Elements are usually placed according to antonymous meaning (micro – macro). Due to the polysemy some elements get into different sections. The group of terms, in which ancient Greek and Latin term elements are distinguishes is very numerous. Terms, formed with the help of classical elements, reflect the process of adaptation (in bigger or less degree) by modern languages. Thus, the group of terms with the initial parts, the etymons of which are Greek and Latin languages, get into synonymous relations, varying, intersecting or overlapping each other . The doublet terms – are the words or phrases that are combined by special terminological correlation with the same scientific concepts and object of reality. Thus, a large part of Greek and Latin elements in the terms and terminological combinations function as units of scientific style (root words, derivation and other elements) have the ability to influence the linguistic and cultural nature of language, and, thanks to aesthetically complete words, they intellectualize both speech and a speaker, the a specialist of the field.


Slovene ◽  
2017 ◽  
Vol 6 (1) ◽  
pp. 76-94
Author(s):  
Mikhail N. Saenko

According to one of the most well-founded hypotheses, in the Proto-Indo-European language *sem-, meant ‘unus,’ whereas *Hoi̯H- meant ‘solus.’ In this article arguments for and against this hypothesis are examined in detail. In Proto-Slavic the reverse distribution is observed: *samъ, indirectly originating from *sem-, meant ‘solus,’ whereas *edinъ, going back to *Hoi̯H-, meant ‘unus.’ This article is an at tempt to determine how *somHos (> *samъ) ‘idem’ in Proto-Slavic extended its meaning first to ‘ipse’ and then to ‘solus’ and to analyze exactly how it happened. Although for the Indo-European languages the reverse situation is more common (‘ipse’ acquires the meaning ‘idem’), a similar pattern for such a shift in meaning can be found in the history of Ancient Greek αὐτός.


1996 ◽  
Vol 74 (1) ◽  
pp. 125-134 ◽  
Author(s):  
Hongping Liang ◽  
Khidir W. Hilu

The 3′ region of the matK gene from 17 species, representing 13 tribes and six subfamilies of the Poaceae, is used to investigate the potential of the gene in addressing systematic questions in the family. The aligned sequences were analyzed by the Wagner parsimony methods using PAUP and PHYLIP and by the neighbor-joining method. Out of the 583 bp sequenced, 30% were variable and 14.9% were informative. The strict consensus tree, rooted by Joinvillea (Joinvilleaceae), showed well-resolved major clades that represent the grass subfamilies. The bambusoid Phyllostachys appeared as a basal clade in the family. Oryza diverged either before or after the Pooideae in the parsimony and neighbor-joining methods, respectively. The three members of the Pooideae grouped in a monophyletic lineage in both analyses. Arundo (Arundinoideae) was basal to the subfamilies Panicoideae and Chloridoideae. Analysis of the various types of DNA mutations underscores the potential of the matK gene in providing insight into grass systematic and evolution. Keywords: matK, Poaceae, grasses, phylogeny, chloroplast, DNA sequence.


Sign in / Sign up

Export Citation Format

Share Document