When missing NPs make double center-embedding sentences acceptable

2021 ◽  
Vol 6 (1) ◽  
pp. 37
Author(s):  
Nick Huang ◽  
Colin Phillips
2019 ◽  
Vol 75 (10) ◽  
pp. 6324-6360 ◽  
Author(s):  
Ameni Hbaieb ◽  
Mahdi Khemakhem ◽  
Maher Ben Jemaa

2007 ◽  
Vol 43 (2) ◽  
pp. 365-392 ◽  
Author(s):  
FRED KARLSSON

A common view in theoretical syntax and computational linguistics holds that there are no grammatical restrictions on multiple center-embedding of clauses. Syntax would thus be characterized by unbounded recursion. An analysis of 119 genuine multiple clausal center-embeddings from seven ‘Standard Average European’ languages (English, Finnish, French, German, Latin, Swedish, Danish) uncovers usage-based regularities, constraints, that run counter to these and several other widely held views, such as that any type of multiple self-embedding (of the same clause type) would be possible, or that self-embedding would be more complex than multiple center-embedding of different clause types. The maximal degree of center-embedding in written language is three. In spoken language, multiple center-embedding is practically absent. Typical center-embeddings of any degree involve relative clauses specifying the referent of the subject NP of the superordinate clause. Only postmodifying clauses, especially relative clauses and that-clauses acting as noun complements, allow central self-embedding. Double relativization of objects (The rat the cat the dog chased killed ate the malt) does not occur. These corpus-based ‘soft constraints’ suggest that full-blown recursion creating multiple clausal center-embedding is not a central design feature of language in use. Multiple center-embedding emerged with the advent of written language, with Aristotle, Cicero, and Livy in the Greek and Latin stylistic tradition of ‘periodic’ sentence composition.
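Karlsson's degree-of-embedding measure can be illustrated with a small sketch. This is not the paper's own methodology: it simply approximates the degree of clausal embedding by bracket nesting depth, assuming embedded clauses have already been marked with square brackets by a prior clause analysis.

```python
# Illustrative sketch (not Karlsson's tooling): measure the degree of clausal
# embedding from a pre-bracketed sentence, where each embedded clause is
# delimited by '[' and ']'. Degree 0 means no embedding; Karlsson's corpus
# finding is that written language tops out at degree three.

def max_embedding_degree(bracketed: str) -> int:
    """Return the maximal nesting depth of bracketed embedded clauses."""
    depth = 0
    max_depth = 0
    for ch in bracketed:
        if ch == "[":
            depth += 1
            max_depth = max(max_depth, depth)
        elif ch == "]":
            depth -= 1
    return max_depth

# The classic double object relativization, bracketed by hand:
example = "The rat [the cat [the dog chased] killed] ate the malt"
print(max_embedding_degree(example))  # 2
```

Note that bracket depth alone does not distinguish center-embedding from left- or right-branching embedding; a fuller analysis would also check that superordinate material flanks the embedded clause on both sides, as in the example above.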


2021 ◽  
Author(s):  
Eric Martinez ◽  
Francis Mollica ◽  
Edward Gibson

Although contracts and other legal documents have long been known to cause processing difficulty in laypeople, the source and nature of this difficulty have remained unclear. To better understand this mismatch, we conducted a corpus analysis (~10 million words) to investigate to what extent difficult-to-process features that are reportedly common in contracts--such as center embedding, low-frequency jargon, passive voice, and non-standard capitalization--are in fact present in contracts relative to normal texts. We found that all of these features were strikingly more prevalent in contracts relative to standard-English texts. We also conducted an experimental study (n = 108 subjects) to determine to what extent such features cause processing difficulties for laypeople of different reading levels. We found that contractual excerpts containing these features were recalled and comprehended at a lower rate than excerpts without these features, even for experienced readers, and that center-embedded clauses led to greater decreases in recall than other features. These findings confirm long-standing anecdotal accounts of the presence of difficult-to-process features in contracts, and show that these features inhibit comprehension and recall of legal content for readers of all levels. Our findings also suggest such difficulties may largely result from working memory costs imposed by complex syntactic features--such as center-embedded clauses--as opposed to a mere lack of understanding of specialized legal concepts, and that removing these features would be both tractable and beneficial for society at large.
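The corpus analysis reports how prevalent each feature is in contracts versus standard texts. As an illustrative sketch (not the authors' pipeline), the simplest of these features, non-standard all-caps capitalization, can be counted as a normalized rate per 1,000 words:

```python
import re

# Illustrative sketch: rate of all-caps words per 1,000 words, one easily
# measurable proxy for the "non-standard capitalization" feature. The example
# strings below are invented, not drawn from the study's corpus.

def all_caps_rate(text: str) -> float:
    """All-caps words (length > 1) per 1,000 words of text."""
    words = re.findall(r"[A-Za-z]+", text)
    if not words:
        return 0.0
    caps = [w for w in words if len(w) > 1 and w.isupper()]
    return 1000 * len(caps) / len(words)

contract = "The PARTY OF THE FIRST PART shall INDEMNIFY the other party."
plain = "The first party shall indemnify the other party."
print(all_caps_rate(contract) > all_caps_rate(plain))  # True
```

Features like passive voice or center embedding would require syntactic parsing rather than surface pattern matching, which is why corpus studies of this kind typically combine simple counts with parsed annotations.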


2015 ◽  
Vol 13 (5) ◽  
pp. 1661-1670 ◽  
Author(s):  
Eder Samir Correa ◽  
Luis Alejandro Fletscher ◽  
Juan Felipe Botero

2021 ◽  
Author(s):  
R. Thomas McCoy ◽  
Jennifer Culbertson ◽  
Paul Smolensky ◽  
Géraldine Legendre

Human language is often assumed to make "infinite use of finite means" - that is, to generate an infinite number of possible utterances from a finite number of building blocks. From an acquisition perspective, this assumed property of language is interesting because learners must acquire their languages from a finite number of examples. To acquire an infinite language, learners must therefore generalize beyond the finite bounds of the linguistic data they have observed. In this work, we use an artificial language learning experiment to investigate whether people generalize in this way. We train participants on sequences from a simple grammar featuring center embedding, where the training sequences have at most two levels of embedding, and then evaluate whether participants accept sequences of a greater depth of embedding. We find that, when participants learn the pattern for sequences of the sizes they have observed, they also extrapolate it to sequences with a greater depth of embedding. These results support the hypothesis that the learning biases of humans favor languages with an infinite generative capacity.
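The kind of grammar described here can be sketched minimally. The paper does not publish its exact stimuli in this abstract, so the following assumes the simplest center-embedding pattern, sequences of the form aⁿbⁿ, where each "a" pairs with a "b" in mirror order; training covers depths 1-2 and the extrapolation test probes depth 3:

```python
# Hypothetical sketch of an a^n b^n center-embedding grammar, the simplest
# pattern of this type. Training sequences have at most two levels of
# embedding; the extrapolation item is one level deeper than anything seen.

def generate(depth: int) -> str:
    """Generate the a^n b^n sequence with the given depth of embedding."""
    return "a" * depth + "b" * depth

def accepts(seq: str) -> bool:
    """Recognize a^n b^n (n >= 1): every 'a' closed by a mirrored 'b'."""
    n = len(seq) // 2
    return n >= 1 and seq == "a" * n + "b" * n

training = [generate(d) for d in (1, 2)]  # depths shown to participants
novel = generate(3)                       # deeper than any training item
print(all(accepts(s) for s in training), accepts(novel))  # True True
```

A learner who accepts `novel` despite never seeing depth-3 items has generalized beyond the finite training data, which is the behavior the experiment tests for.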

