scholarly journals Token-based typology and word order entropy: A study based on Universal Dependencies

2019 ◽  
Vol 23 (3) ◽  
pp. 533-572 ◽  
Author(s):  
Natalia Levshina

AbstractThe present paper discusses the benefits and challenges of token-based typology, which takes into account the frequencies of words and constructions in language use. This approach makes it possible to introduce new criteria for language classification, which would be difficult or impossible to achieve with the traditional, type-based approach. This point is illustrated by several quantitative studies of word order variation, which can be measured as entropy at different levels of granularity. I argue that this variation can be explained by general functional mechanisms and pressures, which manifest themselves in language use, such as optimization of processing (including avoidance of ambiguity) and grammaticalization of predictable units occurring in chunks. The case studies are based on multilingual corpora, which have been parsed using the Universal Dependencies annotation scheme.

2018 ◽  
Vol 10 (2) ◽  
pp. 269-305 ◽  
Author(s):  
Dirk Pijpops ◽  
Isabeau De Smet ◽  
Freek Van de Velde

Abstract In every-day language use, two or more structurally unrelated constructions may occasionally give rise to strings that look very similar on the surface. As a result of this superficial resemblance, a subset of instances of one of these constructions may deviate in the probabilistic preference for either of several possible formal variants. This effect is called ‘constructional contamination’, and was introduced in Pijpops & Van de Velde (2016). Constructional contamination bears testimony to the hypothesis that language users do not always execute a full parse of the utterances they interpret, but instead often rely on ‘shallow parsing’ and the storage of large, unanalyzed chunks of language in memory, as proposed in Ferreira, Bailey, & Ferraro (2002), Ferreira & Patson (2007), and Dąbrowska (2014). Pijpops & Van de Velde (2016) investigated a single case study in depth, namely the Dutch partitive genitive. This case study is reviewed, and three new case studies are added, namely the competition between long and bare infinitives, word order variation in verbal clusters, and preterite formation. We find evidence of constructional contamination in all case studies, albeit in varying degrees. This indicates that constructional contamination is not a particularity of the Dutch partitive genitive but appears to be more wide-spread, affecting both morphology and syntax. Furthermore, we distinguish between two forms of constructional contamination, viz. first degree and second degree contamination, with first degree contamination producing greater effects than second degree contamination.


Author(s):  
Julia Bacskai-Atkari

This chapter examines word order variation and change in the high CP-domain of Hungarian embedded clauses containing the finite subordinating C head hogy ‘that’. It is argued that the complementizer hogy developed from an operator of the same morphophonological form, meaning ‘how’, and that its grammaticalization path develops in two steps. In addition to the change from an operator, located in a specifier, into a C head (specifier-to-head reanalysis), the fully grammaticalized complementizer hogy also changed its relative position on the CP-periphery, ultimately occupying the higher of two C head positions (upward reanalysis). Other complementizers that could co-occur with hogy in Old Hungarian eventually underwent similar reanalysis processes. Hence the possibility of accommodating two separate C heads in the left periphery was lost and variation in the relative position of complementizers was replaced by a fixed order.


Author(s):  
Svetlana Petrova ◽  
Helmut Weiß

This chapter surveys the word order variation in the right periphery of the clause in OHG. The investigation is based on a corpus including all dependent clauses introduced by the complementizer thaz ‘that’ in the minor OHG documents, a collection of up to forty smaller texts of various genres. The analysis shows that the majority of the data can be explained within a standard OV grammar, assuming additional extraposition of heavy XPs to the right. But apart from these cases, there is evidence supporting the assumption of leftward movement of the verb to an intermediate functional projection vP which is optional with basic OV but obligatory with basic VO. In addition, the chapter presents patterns which evidently involve verb movement to a higher functional head, above vP, and discusses the nature of the landing site of the verb in these cases.


2009 ◽  
Vol 13 (3) ◽  
pp. 365-384 ◽  
Author(s):  
ELENA SEOANE

The aim of this article is to provide an overview of the syntactic, pragmatic and semantic determinants of word-order variation in Modern English, exemplified by the specific case of the use of long passives as order-rearranging devices. Word order in English and in most other SVO languages is affected by a number of factors such as animacy, semantic role, discourse status and syntactic complexity (Sornicola 2006). In this article, which analyses the influence of such factors in the use of long passives, I will try to show that their effects are construction-specific; in particular, that factors which are crucial in determining word order in some constructions – factors such as the animacy of the constituents involved – are entirely overruled by others in the case of Modern English long passives. Corpus data presented here will also serve to address issues pertaining to the nature of the determinants of grammatical variation, such as their independent versus epiphenomenal character, their interactions, and the locus of their effects on word order.


Sign in / Sign up

Export Citation Format

Share Document