HistorEx: Exploring Historical Text Corpora Using Word and Document Embeddings

Author(s):  
Sven Müller ◽  
Michael Brunzel ◽  
Daniela Kaun ◽  
Russa Biswas ◽  
Maria Koutraki ◽  
...  
Keyword(s):  
Author(s):  
Rhona Alcorn ◽  
Joanna Kopaczyk ◽  
Bettelou Los ◽  
Benjamin Molineaux

This chapter provides an overview of the historical text corpora and digital repositories hosted by the Angus McIntosh Centre for Historical Linguistics and created by its predecessor, the Institute of Historical Dialectology: A Linguistic Atlas of Late Middle English (LALME), and its remodelled electronic version eLALME; A Linguistic Atlas of Early Middle English (LAEME), A Linguistic Atlas of Older Scots (LAOS) and The Corpus of Narrative Etymologies from Proto-Old English to Early Middle English (CoNE). The chapter also highlights related resources created at the University of Stavanger, most prominently the Middle English Scribal Texts programme (MEST), and its offshoot, The Middle English Grammar Corpus (MEG-C), which provides tagged and annotated diplomatic transcriptions of 410 LALME texts; and the Corpus of Middle English Local Documents (MELD) which comprises transcriptions of over 2000 fifteenth-century documents.


2021 ◽  
Author(s):  
Aotao Xu ◽  
Jennifer Ellen Stellar ◽  
Yang Xu

Humans possess the unique ability to communicate emotions through language. Although concepts like anger or awe are abstract, there is a shared consensus about what these English emotion words mean. This consensus may give the impression that their meaning is static, but we propose this is not the case. We cannot travel back to earlier periods to study emotion concepts directly, but we can examine text corpora, which have partially preserved the meaning of emotion words. Using natural language processing of historical text, we found evidence for semantic change in emotion words over the past century and that varying rates of change were predicted in part by an emotion concept's prototypicality - how representative it is of the broader category of "emotion". Prototypicality negatively correlated with historical rates of emotion semantic change obtained from text-based word embeddings, beyond more established variables including usage frequency in English and a second comparison language, French. This effect for prototypicality did not consistently extend to the semantic category of birds, suggesting its relevance for predicting semantic change may be category-dependent. Our results suggest emotion semantics are evolving over time, with prototypical emotion words remaining semantically stable, while other emotion words evolve more freely.


Author(s):  
Oleh Tyshchenko

The presented research reveals imagery-metaphoric and phraseological objectivities of the conceptual spheres Soul, Consciousness, Envy, Jealousy and Greed in Polish, Russian, Ukrainian, Czech and Slovak languages and conceptual picture of the world (first of all in proverbs and sayings, idioms, imagery means of secondary nomination both in standard language and its regional or dialectal variants) according to the indication of holistic characteristic and semantic intersection of these concepts. It describes the spheres of their typological coincidence and differences from the point of imagery motivation. It defines the symbolic functions of these ethno cultural concepts (object sphere) with respect to the specificity of manifestation of Envy in archaic texts, believes, in the language of traditional folk culture and archaic expressions with religious sense that reach Christian ideology, ideas of moral purity and dirt, Body and Soul. It has been defined the collocations with the components envy and jealousy in some thesauri and dictionaries in terms of the specificity of interlingual equivalence and expressions of envy and similar negative emotions and their functioning in the Ukrainian and English text corpora. The analysis demonstrated that practically in all compared languages and linguistic cultures Envy is associated with greed and jealousy, psychic disorders with a corresponding complex of feelings, expressed by metaphoric predicates of destruction and remorse that encode the moral and legal aspect of conscience (conscience is a judge, witness and executioner). Metaphor of Envy containing nominations of colours differ in the Slavonic and Germanic languages whereas those denoting spatial, gustatory, odour, acoustic and parametrical meaning are similar. Many imagery contexts of Envy correlate with such conceptual oppositions as richness and poverty, light and darkness; success is associated with the frames “foreign is better than domestic” where Envy encodes the meaning of encroachment upon another's property, “envy is better than sympathy”, “envy dominates where there are richness, success, welfare, happiness” which confirms the ideas of representatives in the field of psychoanalysis, cultural anthropology and sociology. In some languages the motives of black magic, evil eye (in Polish, Ukrainian and Russian) are rooted in the sphere of folk believes and invocations, as well as cultural anthroponyms.


2020 ◽  
Vol 11 (2) ◽  
pp. 113
Author(s):  
Novarina Novarina ◽  
Mamlahatun Buduroh

This paper is the result of a study of the Nusantara manuscripts using the historical text sources of Madura. The object of this research is the transliteration of a manuscript from the collection of the Central Library of Indonesia entitled Sajarah Proza Begin Brawijaya (SPBB) code SJ.230 Novarina edition (2020). In examining the manuscript, the philological method and literary theory framework were used. From the field of literature, Jan van Luxemburg's structural theory, Julia Kristeva's intertextuality, and Teeuw's concept of literary representation are used. From the structural study, it can be seen that the SPBB text framework is composed of literary structures and content structures (history), which as a whole serve to legitimize the power of the 17-18 century Madurese king. Meanwhile, the results of the intertextual analysis showed that the elements built into the content structure (history) of the SPBB text were connected with M.C. Ricklefs and H.J. De Graaf in representing Cakraningrat as the main figure in the history of Java, Madura, and VOC based on the author's life view to raise one of the values of the Javanese philosophy of life in this text. This linkage results in the conclusion that as a traditional Javanese historical literary work, the SPBB text is representative of its creator's culture, one of which is as a representation of the philosophy of mikul dhuwur mendhem jero in the Javanese view of life.


2017 ◽  
Vol 29 (12) ◽  
pp. 2265
Author(s):  
Yi Zhang ◽  
Yudong Shao ◽  
Jiawan Zhang

Author(s):  
Ekaterina A. LOBANOVA

This article studies the cognitive features of the “power” frame and its gender implementation in the historical tragedy by W. Shakespeare “Macbeth”. Here, the author examines the concepts of “frame” and “gender” in linguistics, studying different approaches to their definition. The relevance of this work is determined by the close attention of the contemporary linguistics to these concepts, as well as their place in the contemporary academic paradigm. The academic affirmation of the “frame” and “gender” concepts designates a new step in understanding the ways and peculiarities of the language interaction, consciousness, and culture, and, consequently, it shows new aspects of the relationship of linguistics with other sciences. Nevertheless, the problems of both frame and gender are not yet fully understood. This study allows describing in detail the essence of the frame “power” and showing its meaning, use, and ways of its gender implementation in fiction, which explains the novelty of this article. The study’s methodology is based on the cognitive-discursive analysis of the text, as well as on an integrative approach to the discourse study, which combines methods of both cognitive and gender linguistics, as well as the discourse analysis. Common research methods were used along with private linguistic methods. The application of cognitive-discursive analysis has significantly increased the depth of understanding of the “power” frame that dominates Shakespeare’s historical tragedy. This historical text presents the central theme of political tragedy: the overthrow of the rightful ruler and the usurpation of power. The motive for the seizure of power forms a thematic core and is presented from the usurpers’ point of view. In this article, the author observes the gender shift and duality of the female and male beginnings: Shakespeare puts the female protagonist, hungry for power, among men, thus the images of Lady Macbeth and her husband come into conflict with the gender characteristics attributed to them. The play clearly traces the main idea of Machiavellianism: the goal justifies the means. The results conclude that the “power” frame is the leading one in Lady Macbeth’s monologue, thus setting one of the main themes of this tragedy.


Author(s):  
Alvin Cheng-Hsien Chen

AbstractIn this study, we aim to demonstrate the effectiveness of network science in exploring the emergence of constructional semantics from the connectedness and relationships between linguistic units. With Mandarin locative constructions (MLCs) as a case study, we extracted constructional tokens from a representative corpus, including their respective space particles (SPs) and the head nouns of the landmarks (LMs), which constitute the nodes of the network. We computed edges based on the lexical similarities of word embeddings learned from large text corpora and the SP-LM contingency from collostructional analysis. We address three issues: (1) For each LM, how prototypical is it of the meaning of the SP? (2) For each SP, how semantically cohesive are its LM exemplars? (3) What are the emerging semantic fields from the constructional network of MLCs? We address these questions by examining the quantitative properties of the network at three levels: microscopic (i.e., node centrality and local clustering coefficient), mesoscopic (i.e., community) and macroscopic properties (i.e., small-worldness and scale-free). Our network analyses bring to the foreground the importance of repeated language experiences in the shaping and entrenchment of linguistic knowledge.


Sign in / Sign up

Export Citation Format

Share Document