TEI

Author(s):  
Syd Bauman

TEI, the Text Encoding Initiative, was founded in 1987 to develop guidelines for encoding machine-readable texts of interest to the humanities and social sciences. The Text Encoding Initiative (TEI) is a text-centric community of practice in the academic field of digital humanities, operating continuously since the 1980s. The community currently runs a mailing list, meetings and conference series, and maintains an eponymous technical standard, a journal, a wiki, a GitHub repository and a toolchain. The TEI Guidelines, which collectively define an XML format, are the defining output of the community of practice. The format differs from other well-known open formats for text (such as HTML and OpenDocument) in that it’s primarily semantic rather than presentational.

Author(s):  
Raffaele Viglianti

TEI, the Text Encoding Initiative, was founded in 1987 to develop guidelines for encoding machine-readable texts of interest to the humanities and social sciences. The TEI is a text-centric community of practice in the academic field of digital humanities, operating continuously since the 1980s. The community currently runs several mailing lists, holds an annual conference, and maintains an eponymous technical standard, an online journal, a wiki, a GitHub repository, and a toolchain. The TEI Guidelines, which collectively define an XML format, are the defining output of the community of practice. The format differs from other well-known open formats for text (such as HTML and OpenDocument) in that it’s main mission is for encoding “extant” texts such that they are amenable to scholarly processing. After a brief introduction to the TEI, we will discuss the mechanisms built in to the TEI for customization.


Ethnohistory ◽  
2021 ◽  
Vol 68 (4) ◽  
pp. 493-518
Author(s):  
Rafael C. Alvarado ◽  
Aldo Ismael Barriente ◽  
Allison Margaret Bigelow

Abstract The Popol Wuj is one of the most important, commonly studied, and widely circulated Indigenous literary works from colonial Mesoamerica. By some accounts, there are 1,200 editions of the work published in thirty world languages, all of which trace back to a single manuscript—itself a copy of an earlier Mayan work. To protect their work from being destroyed by colonial officials or Inquisitional authorities, the original K’iche’ authors of the Popol Wuj had to embed their ways of knowing in a language and narrative structure that could not be detected by Spanish readers. Each edition of the Popol Wuj therefore helps to uncover different elements of the cosmovisión that is embedded in the text. This article draws from recent collaborative efforts to prepare a digital critical edition of the Popol Wuj based on the editorial standards and scholarly conventions of the Text Encoding Initiative (TEI). By comparing and contrasting the advantages and drawbacks of this edition relative to printed works and digital editions, we suggest how methods from the digital humanities can shed new light on texts like the Popol Wuj.


2016 ◽  
Vol 5 (1) ◽  
pp. 205-277 ◽  
Author(s):  
Giuliano Lancioni ◽  
N. Peter Joosse

The Arabic Diatessaron Project (henceforth ADP) is an international research project in Digital Humanities that aims to collect, digitalise and encode all known manuscripts of the Arabic Diatessaron (henceforth AD), a text that has been relatively neglected in scholarly research. ADP’s final goal is to provide a number of tools that can enable scholars to effectively query, compare and investigate all known variants of the text that will be encoded as far as possible in compliance with the Text Encoding Initiative (TEI) guidelines. The paper addresses a number of issues involved in the process of digitalising manuscripts included in the two existing editions (Ciasca 1888 and Marmardji 1935), adding variants in unedited manuscripts, encoding and lemmatising the text. Issues involved in the design of the ADP include presentation of variants, choice of the standard text, applicability of TEI guidelines, automatic translation between different encodings, cross-edition concordances and principles of lemmatisation.


2017 ◽  
Vol 58 (2) ◽  
pp. 295-326 ◽  
Author(s):  
Thierry Rossier ◽  
Felix Bühlmann ◽  
André Mach

AbstractThis paper studies the rise of professors of economics and business studies in the second half of the 20th century in Switzerland. It focuses on three types of power resources: positions in the university hierarchy, scientific reputation and extra-academic positions in the economic and political spheres. Based on a biographical database of N = 487 professors, it examines how these resources developed from 1957 to 2000. We find that professors of economic sciences were increasingly and simultaneously successful on all three studied dimensions – especially when compared to disciplines such as law, social sciences or humanities. This evolution seems to challenge the notorious trade-off between scientific and society poles of the academic field: professors of economics and business increased their scientific reputation while becoming more powerful in worldly positions. However, zooming in on their individual endowment with capital, we see that the same professors rarely hold simultaneously a significant amount of scientific and institutional capital.


2020 ◽  
pp. 232-238
Author(s):  
Michelle Taylor ◽  
Andrew Keck

The Text Encoding Initiative (TEI), a branch of XML, is a mature standard for encoding texts that was developed three decades ago and continues to be improved and expanded upon today. Learn about how TEI was centrally imagined for a project devoted to a corpus of John Wesley material. We will begin by explaining why we chose to use TEI for the project and reviewing the considerations inherent in transitioning from a longstanding print-based project to a digital project, including the challenges of converting thousands of pages of text across different file types into rudimentary TEI. Next, we will move into topics specific to TEI encoding practices, including the creation of XML tagsets designed to maximize the use value of the Wesley Works for its various audiences: scholars, librarians, and clergy. Finally, we will show the TEI in action by sharing an example of an XML file from our first round of encoding.


2019 ◽  
Author(s):  
John Rigby ◽  
Barbara Jones

This paper reflects on alternatives to the traditional form of doctoral thesis which are emerging to reflect a new approach to the valuation and designation of scientific outputs. We examine the changes and consider some implications. We suggest that the adoption of co-citation as underpinning principle for the measurement of knowledge structures has led to re-designation of the value of knowledge and knowledge producers in increasingly quantitative terms. We use notions of ‘institution’ and ‘logic’ to better understand such a change and its implications. Under a new logic that is gradually embedding itself across the higher education sector, the ‘constitutive rules’ concerned with the value of research now prioritize quantification, and tangibility of output, and quality is increasingly equated with citation. Whilst the scientific disciplines have traditionally been closer to this model, albeit with significant national variations, subjects within the Social Sciences and Humanities are now being affected. We present evidence from a small study of the UK higher education sector of university regulation of doctoral degree submission format in two disciplines in the Humanities and Social Sciences (History and Sociology). Our evidence shows the recent and gradual adoption of a practice, previously more common in scientific disciplines, that allows the doctoral thesis to be constituted by a series of publishable papers, known by a variety of names, the most common being ‘Thesis by Published Papers’, ‘Journal Format Thesis’, ‘Alternative Format Thesis’, and ‘Integrated Thesis’. As the thesis of the Social Sciences and Humanities – itself an important institution in the academic field - begins to reflect a greater emphasis upon quantity of knowledge outputs, a tension emerges with the most central of all scientific institutions, the peer-reviewed journal paper.


2018 ◽  
Vol 6 (2) ◽  
pp. 221-242 ◽  
Author(s):  
Mohamed A. H. Ahmed

Abstract The main aim of this study is to introduce a model of TEI (Text Encoding Initiative) annotation of Hebrew elements in Judeo-Arabic texts, i.e., code switching (CS), borrowing, and Hebrew quotations. This article will provide an introduction to using XML (Extensible Markup Language) to investigate sociolinguistic aspects in medieval Judeo-Arabic texts. Accordingly, it will suggest to what extent using XML is useful for investigating linguistic and sociolinguistic features in the Judeo-Arabic paradigm. To provide an example for how XML annotation could be applied to Judeo-Arabic texts, a corpus of 300 pages selected from three Judeo-Arabic books has been manually annotated using the TEI P5. The annotation covers all instances of CS, borrowing, and Hebrew quotations in that corpus.


2020 ◽  
Vol 02 ◽  
Author(s):  
Rooweither Mabuya ◽  
Dimakatso Mathe ◽  
Mmasibidi Setaka ◽  
Menno van Zaanen

South Africa has eleven official languages. However, not all have received similar amounts of attention. In particular, for many of the languages, only a limited number of digital language resources (data sets and computational tools) exist. This scarcity hinders (computational) research in the fields of humanities and social sciences for these languages. Additionally, using existing computational linguistics tools in a practical setting requires expert knowledge on the usage of these tools. In South Africa, only a small number of people currently have this expertise, further limiting the type of research that relies on computational linguistic tools. The South African Centre for Digital Language Resources (SADiLaR) aims to enable and enhance research in the area of language technology by focusing on the development, management, and distribution of digital language resources for all South African languages. Additionally, it aims to build research capacity, specifically in the field of digital humanities. This requires several challenges to be resolved that we cluster under resources, training, and community building. SADiLaR hosts a repository of existing digital language resources and supports the development of new resources. Additionally, it provides training on the use of these resources, specifically for (but not limited to) researchers in the fields of humanities and social sciences. Through this training, SADiLaR tries to build a community of practice to boost information sharing in the area of digital humanities.


Author(s):  
Ryan Cordell ◽  
Benjamin J. Doyle ◽  
Elizabeth Hopwood

Ryan Cordell, Benjamin Doyle, and Elizabeth Hopwood’s essay seizes a nineteenth-century invention, the kaleidoscope, as a model and metaphor for pedagogical practices and learning spaces that encourage play and experimentation. Through examples that involve setting letterpress type, the Text Encoding Initiative (TEI) encoding of nineteenth-century texts as an interpretive process, and the collaborative creation of Wikipedia pages, the authors describe how experiments with contemporary technologies help students claim scholarly agency over the texts and tools central to their study of the nineteenth century. Kaleidoscopic pedagogy encourages students to discover how C19 competencies like close reading and contemporary methods of coding and data analysis have the potential to be mutually constitutive, inspiring a more nuanced understanding of both periods.


Sign in / Sign up

Export Citation Format

Share Document