TEI

Proceedings of the Symposium on Markup Vocabulary Ecosystems ◽

10.4242/balisagevol22.bauman01 ◽

2018 ◽

Author(s):

Syd Bauman

Keyword(s):

Social Sciences ◽

Community Of Practice ◽

Digital Humanities ◽

Mailing List ◽

Technical Standard ◽

Text Encoding ◽

Academic Field ◽

Conference Series ◽

Text Encoding Initiative ◽

Machine Readable

TEI, the Text Encoding Initiative, was founded in 1987 to develop guidelines for encoding machine-readable texts of interest to the humanities and social sciences. The Text Encoding Initiative (TEI) is a text-centric community of practice in the academic field of digital humanities, operating continuously since the 1980s. The community currently runs a mailing list, meetings and conference series, and maintains an eponymous technical standard, a journal, a wiki, a GitHub repository and a toolchain. The TEI Guidelines, which collectively define an XML format, are the defining output of the community of practice. The format differs from other well-known open formats for text (such as HTML and OpenDocument) in that it’s primarily semantic rather than presentational.

One Document Does-it-all (ODD): a language for documentation, schema generation, and customization from the Text Encoding Initiative

Proceedings of the Symposium on Markup Vocabulary Customization ◽

10.4242/balisagevol24.viglianti01 ◽

2019 ◽

Cited By ~ 2

Author(s):

Raffaele Viglianti

Keyword(s):

Social Sciences ◽

Community Of Practice ◽

Annual Conference ◽

Technical Standard ◽

Online Journal ◽

Text Encoding ◽

Academic Field ◽

Text Encoding Initiative ◽

Mailing Lists ◽

Machine Readable

TEI, the Text Encoding Initiative, was founded in 1987 to develop guidelines for encoding machine-readable texts of interest to the humanities and social sciences. The TEI is a text-centric community of practice in the academic field of digital humanities, operating continuously since the 1980s. The community currently runs several mailing lists, holds an annual conference, and maintains an eponymous technical standard, an online journal, a wiki, a GitHub repository, and a toolchain. The TEI Guidelines, which collectively define an XML format, are the defining output of the community of practice. The format differs from other well-known open formats for text (such as HTML and OpenDocument) in that its main mission is to encode “extant” texts so that they are amenable to scholarly processing. After a brief introduction to the TEI, we will discuss the mechanisms built into the TEI for customization.

Popol Wujs: Culture, Complexity, and the Encoding of Maya Cosmovisión

Ethnohistory ◽

10.1215/00141801-9157219 ◽

2021 ◽

Vol 68 (4) ◽

pp. 493-518

Author(s):

Rafael C. Alvarado ◽

Aldo Ismael Barriente ◽

Allison Margaret Bigelow

Keyword(s):

Critical Edition ◽

Digital Humanities ◽

Ways Of Knowing ◽

Narrative Structure ◽

Text Encoding ◽

Literary Works ◽

Text Encoding Initiative ◽

World Languages ◽

Trace Back ◽

Collaborative Efforts

The Popol Wuj is one of the most important, commonly studied, and widely circulated Indigenous literary works from colonial Mesoamerica. By some accounts, there are 1,200 editions of the work published in thirty world languages, all of which trace back to a single manuscript, itself a copy of an earlier Mayan work. To protect their work from being destroyed by colonial officials or Inquisitional authorities, the original K’iche’ authors of the Popol Wuj had to embed their ways of knowing in a language and narrative structure that could not be detected by Spanish readers. Each edition of the Popol Wuj therefore helps to uncover different elements of the cosmovisión that is embedded in the text. This article draws from recent collaborative efforts to prepare a digital critical edition of the Popol Wuj based on the editorial standards and scholarly conventions of the Text Encoding Initiative (TEI). By comparing and contrasting the advantages and drawbacks of this edition relative to printed works and digital editions, we suggest how methods from the digital humanities can shed new light on texts like the Popol Wuj.

The Arabic Diatessaron Project: Digitalizing, Encoding, Lemmatization

Journal of Religion Media and Digital Culture ◽

10.1163/21659214-90000075 ◽

2016 ◽

Vol 5 (1) ◽

pp. 205-277 ◽

Cited By ~ 2

Author(s):

Giuliano Lancioni ◽

N. Peter Joosse

Keyword(s):

International Research ◽

Digital Humanities ◽

Research Project ◽

Scholarly Research ◽

Text Encoding ◽

Automatic Translation ◽

Text Encoding Initiative ◽

Final Goal ◽

Standard Text ◽

International Research Project

The Arabic Diatessaron Project (henceforth ADP) is an international research project in Digital Humanities that aims to collect, digitalise and encode all known manuscripts of the Arabic Diatessaron (henceforth AD), a text that has been relatively neglected in scholarly research. ADP’s final goal is to provide a number of tools that can enable scholars to effectively query, compare and investigate all known variants of the text that will be encoded as far as possible in compliance with the Text Encoding Initiative (TEI) guidelines. The paper addresses a number of issues involved in the process of digitalising manuscripts included in the two existing editions (Ciasca 1888 and Marmardji 1935), adding variants in unedited manuscripts, encoding and lemmatising the text. Issues involved in the design of the ADP include presentation of variants, choice of the standard text, applicability of TEI guidelines, automatic translation between different encodings, cross-edition concordances and principles of lemmatisation.

The Rise of Professors of Economics and Business Studies in Switzerland

European Journal of Sociology ◽

10.1017/s000397561700011x ◽

2017 ◽

Vol 58 (2) ◽

pp. 295-326 ◽

Cited By ~ 3

Author(s):

Thierry Rossier ◽

Felix Bühlmann ◽

André Mach

Keyword(s):

Social Sciences ◽

20th Century ◽

Trade Off ◽

Academic Field ◽

Business Studies ◽

Power Resources ◽

Institutional Capital ◽

Scientific Reputation ◽

The University ◽

Academic Positions

This paper studies the rise of professors of economics and business studies in the second half of the 20th century in Switzerland. It focuses on three types of power resources: positions in the university hierarchy, scientific reputation and extra-academic positions in the economic and political spheres. Based on a biographical database of N = 487 professors, it examines how these resources developed from 1957 to 2000. We find that professors of economic sciences were increasingly and simultaneously successful on all three studied dimensions, especially when compared to disciplines such as law, social sciences or humanities. This evolution seems to challenge the notorious trade-off between the scientific and societal poles of the academic field: professors of economics and business increased their scientific reputation while becoming more powerful in worldly positions. However, zooming in on their individual endowment with capital, we see that the same professors rarely hold significant amounts of scientific and institutional capital simultaneously.

Proposing a TEI-Encoding Project for the Wesley Works

ATLA Summary of Proceedings ◽

10.31046/proceedings.2020.1850 ◽

2020 ◽

pp. 232-238

Author(s):

Michelle Taylor ◽

Andrew Keck

Keyword(s):

Use Value ◽

John Wesley ◽

Text Encoding ◽

Text Encoding Initiative ◽

The Creation

The Text Encoding Initiative (TEI), an XML-based encoding standard, is a mature standard for encoding texts that was developed three decades ago and continues to be improved and expanded today. Learn how TEI was centrally imagined for a project devoted to a corpus of John Wesley material. We will begin by explaining why we chose to use TEI for the project and reviewing the considerations inherent in transitioning from a longstanding print-based project to a digital project, including the challenges of converting thousands of pages of text across different file types into rudimentary TEI. Next, we will move into topics specific to TEI encoding practices, including the creation of XML tagsets designed to maximize the use value of the Wesley Works for its various audiences: scholars, librarians, and clergy. Finally, we will show the TEI in action by sharing an example of an XML file from our first round of encoding.

Bringing the doctoral thesis by published papers to the Social Sciences and the Humanities: a quantitative easing?

10.31235/osf.io/25j6s ◽

2019 ◽

Author(s):

John Rigby ◽

Barbara Jones

Keyword(s):

Social Sciences ◽

Higher Education ◽

Doctoral Degree ◽

Education Sector ◽

Academic Field ◽

Doctoral Thesis ◽

Social Sciences And Humanities ◽

Scientific Disciplines ◽

The Social ◽

The Uk

This paper reflects on alternatives to the traditional form of doctoral thesis which are emerging to reflect a new approach to the valuation and designation of scientific outputs. We examine the changes and consider some implications. We suggest that the adoption of co-citation as the underpinning principle for the measurement of knowledge structures has led to a re-designation of the value of knowledge and knowledge producers in increasingly quantitative terms. We use notions of ‘institution’ and ‘logic’ to better understand such a change and its implications. Under a new logic that is gradually embedding itself across the higher education sector, the ‘constitutive rules’ concerned with the value of research now prioritize quantification and tangibility of output, and quality is increasingly equated with citation. Whilst the scientific disciplines have traditionally been closer to this model, albeit with significant national variations, subjects within the Social Sciences and Humanities are now being affected. We present evidence from a small study of the UK higher education sector of university regulation of doctoral degree submission format in two disciplines in the Humanities and Social Sciences (History and Sociology). Our evidence shows the recent and gradual adoption of a practice, previously more common in scientific disciplines, that allows the doctoral thesis to be constituted by a series of publishable papers, known by a variety of names, the most common being ‘Thesis by Published Papers’, ‘Journal Format Thesis’, ‘Alternative Format Thesis’, and ‘Integrated Thesis’. As the thesis of the Social Sciences and Humanities (itself an important institution in the academic field) begins to reflect a greater emphasis upon quantity of knowledge outputs, a tension emerges with the most central of all scientific institutions, the peer-reviewed journal paper.

Manus OnLine and the Text Encoding Initiative Schema

Journal of the Text Encoding Initiative ◽

10.4000/jtei.1054 ◽

2014 ◽

Cited By ~ 1

Author(s):

Giliola Barbero ◽

Francesca Trasselli

Keyword(s):

Text Encoding ◽

Text Encoding Initiative

XML Annotation of Hebrew Elements in Judeo-Arabic Texts

Journal of Jewish Languages ◽

10.1163/22134638-06021122 ◽

2018 ◽

Vol 6 (2) ◽

pp. 221-242 ◽

Cited By ~ 1

Author(s):

Mohamed A. H. Ahmed

Keyword(s):

Code Switching ◽

Extensible Markup Language ◽

Markup Language ◽

Text Encoding ◽

Text Encoding Initiative ◽

Extensible Markup

The main aim of this study is to introduce a model of TEI (Text Encoding Initiative) annotation of Hebrew elements in Judeo-Arabic texts, i.e., code switching (CS), borrowing, and Hebrew quotations. This article will provide an introduction to using XML (Extensible Markup Language) to investigate sociolinguistic aspects in medieval Judeo-Arabic texts. Accordingly, it will suggest to what extent XML is useful for investigating linguistic and sociolinguistic features in the Judeo-Arabic paradigm. To provide an example of how XML annotation could be applied to Judeo-Arabic texts, a corpus of 300 pages selected from three Judeo-Arabic books has been manually annotated using TEI P5. The annotation covers all instances of CS, borrowing, and Hebrew quotations in that corpus.

Digitizing Humanities in South Africa: Computational linguistic resources, training, and community building

Pop! Public. Open. Participatory ◽

10.54590/pop.2020.007 ◽

2020 ◽

Vol 02 ◽

Author(s):

Rooweither Mabuya ◽

Dimakatso Mathe ◽

Mmasibidi Setaka ◽

Menno van Zaanen

Keyword(s):

Social Sciences ◽

South Africa ◽

Computational Linguistic ◽

South African ◽

Digital Humanities ◽

Community Building ◽

Language Resources ◽

Language Technology ◽

Humanities And Social Sciences ◽

Digital Language

South Africa has eleven official languages. However, not all have received similar amounts of attention. In particular, for many of the languages, only a limited number of digital language resources (data sets and computational tools) exist. This scarcity hinders (computational) research in the fields of humanities and social sciences for these languages. Additionally, using existing computational linguistics tools in a practical setting requires expert knowledge on the usage of these tools. In South Africa, only a small number of people currently have this expertise, further limiting the type of research that relies on computational linguistic tools. The South African Centre for Digital Language Resources (SADiLaR) aims to enable and enhance research in the area of language technology by focusing on the development, management, and distribution of digital language resources for all South African languages. Additionally, it aims to build research capacity, specifically in the field of digital humanities. This requires several challenges to be resolved that we cluster under resources, training, and community building. SADiLaR hosts a repository of existing digital language resources and supports the development of new resources. Additionally, it provides training on the use of these resources, specifically for (but not limited to) researchers in the fields of humanities and social sciences. Through this training, SADiLaR tries to build a community of practice to boost information sharing in the area of digital humanities.

Kaleidoscopic Pedagogy in the Classroom Laboratory

Teaching with Digital Humanities ◽

10.5622/illinois/9780252042232.003.0001 ◽

2018 ◽

pp. 3-23

Author(s):

Ryan Cordell ◽

Benjamin J. Doyle ◽

Elizabeth Hopwood

Keyword(s):

Nineteenth Century ◽

Data Analysis ◽

Pedagogical Practices ◽

Close Reading ◽

Learning Spaces ◽

Text Encoding ◽

Text Encoding Initiative ◽

Classroom Laboratory ◽

Nuanced Understanding ◽

Interpretive Process

Ryan Cordell, Benjamin Doyle, and Elizabeth Hopwood’s essay seizes a nineteenth-century invention, the kaleidoscope, as a model and metaphor for pedagogical practices and learning spaces that encourage play and experimentation. Through examples that involve setting letterpress type, the Text Encoding Initiative (TEI) encoding of nineteenth-century texts as an interpretive process, and the collaborative creation of Wikipedia pages, the authors describe how experiments with contemporary technologies help students claim scholarly agency over the texts and tools central to their study of the nineteenth century. Kaleidoscopic pedagogy encourages students to discover how C19 competencies like close reading and contemporary methods of coding and data analysis have the potential to be mutually constitutive, inspiring a more nuanced understanding of both periods.
