scholarly journals Designing an API-Based Protocol for the Interoperability of Textual Resources

2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Pascal Belouin ◽  
Shih-Pei Chen ◽  
Sean Wang

Designing a protocol for the interoperability of digital textual resources—or, more simply, a “IIIF for texts”—remains a challenge, as such a protocol must cater to their vastly heterogenous formats, structures, languages, text encodings and metadata. There have been many attempts to propose a standard for textual resource interoperability, from the ubiquitous Text Encoding Initiative (TEI) format to more recent proposals like the Distributed Text Services (DTS) protocol. In this paper, we introduce our proposal called SHINE, which prioritizes instead the ease for software developers to represent and exchange textual resources and their associated metadata. We do so by combining a hierarchical model of textual structure with a flexible metadata scheme in SHINE, and we continue to define and develop it based on user-centered and iterative design principles. Therefore, we argue that SHINE is a protocol for textual interoperability that successfully balances flexibility of resource representation, consistency across resource representation, and overall simplicity of implementation.RésuméConcevoir un protocole pour l’interopérabilité des ressources textuelles numériques – c’est-à-dire, un IIIF pour des textes – demeure un défi, puisqu’un tel protocole doit correspondre à leurs formats considérablement hétérogènes, ainsi qu’à leurs structures, langues, encodages textuels et métadonnées. Il existe déjà plusieurs tentatives de proposer des standards pour l’interopérabilité des ressources textuelles, tel que l’ubiquiste Text Encoding Initiative (TEI – Initiative d’encodage textuel) ou des propositions plus récentes comme le protocole de Distributed Text Services (DTS – Services de texte distribuées). Dans cet article, nous présenterons une proposition que nous appelons SHINE, qui priorise la facilité de la représentation et de l’échange des ressources textuelles et des métadonnées associées pour les développeurs de logiciel. Nous le ferons en combinant un modèle de structure textuelle hiérarchique avec un schéma de métadonnées flexible dans SHINE et nous le définirons et le développerons selon des principes axés sur l’utilisateur et selon des principes de conceptions itératifs. Par conséquent, nous avançons que SHINE est un protocole pour l’interopérabilité textuelle qui équilibre systématiquement la flexibilité de la représentation de ressources, ainsi que la simplicité globale de l’implémentation, pour toute représentation de ressources.Mots-clés: format d’échange; modélisation de documents; métadonnées; infrastructure numériques; interopérabilité

2019 ◽  
Author(s):  
Pascal Belouin ◽  
Sean H. Wang

Designing a protocol for the interoperability of digital textual resources—or, more simply, a “IIIF for texts”—remains a challenge, as such a protocol must cater to their vastly heterogenous formats, structures, languages, text encodings and metadata. There have been many attempts to propose a standard for textual resource interoperability, from the ubiquitous Text Encoding Initiative (TEI) format to more recent proposals like the Distributed Text Services (DTS) protocol. In this paper, we critically survey these attempts and introduce our proposal called SHINE, which aims to escape from TEI’s legacy and prioritize instead the ease for software developers to representation and exchange textual resources and their associated metadata. We do so by combining a hierarchical model of textual structure with a flexible metadata scheme in SHINE, and we continue to define and develop it based on user-centered and iterative design principles. Therefore, we argue that SHINE is a protocol for textual interoperability that successfully balances flexibility of resource representation, consistency across resource representation, and overall simplicity of implementation.


2021 ◽  
Author(s):  
Jeonghwan Hwang ◽  
Taeheon Lee ◽  
Honggu Lee ◽  
Seonjeong Byun

BACKGROUND Despite the unprecedented performances of deep learning algorithms in clinical domains, full reviews of algorithmic predictions by human experts remain mandatory. Under these circumstances, artificial intelligence (AI) models are primarily designed as clinical decision support systems (CDSSs). However, from the perspective of clinical practitioners, the lack of clinical interpretability and user-centered interfaces block the adoption of these AI systems in practice. OBJECTIVE The aim of this study was to develop an AI-based CDSS for assisting polysomnographic technicians in reviewing AI-predicted sleep staging results. This study proposed and evaluated a CDSS that provides clinically sound explanations for AI predictions in a user-centered fashion. METHODS User needs for the system were identified during interviews with polysomnographic technicians. User observation sessions were conducted to understand the workflow of the practitioners during sleep scoring. Iterative design process was performed to ensure easy integration of the tool into clinical workflows. Then, we evaluated the system with polysomnographic technicians. We measured the improvements in sleep staging accuracies after adopting our tool and assessed qualitatively how the participants perceived and used the tool. RESULTS The user study revealed that technicians desire explanations relevant to key electroencephalogram (EEG) patterns for sleep staging when assessing the correctness of the AI predictions. Here, technicians could evaluate whether AI models properly locate and use those patterns during prediction. Based on this, information in AI models that is closely related to sleep EEG patterns was formulated and visualized during the iterative design process. Furthermore, we developed a different visualization strategy for each pattern based on the way the technicians interpreted the EEG recordings with these patterns during their workflows. Generally, the tool evaluation results from the nine polysomnographic technicians were positive. Quantitatively, technicians achieved better classification performances after reviewing the AI-generated predictions with the proposed system; classification accuracies measured with Macro-F1 scores improved from 60.20 to 62.71. Qualitatively, participants reported that the provided information from the tool effectively supported them, and they were able to develop notable adoption strategies for the tool. CONCLUSIONS Our findings indicate that formulating clinical explanations for automated predictions using the information in the AI with a user-centered design process is an effective strategy for developing a CDSS for sleep staging.


2020 ◽  
pp. 232-238
Author(s):  
Michelle Taylor ◽  
Andrew Keck

The Text Encoding Initiative (TEI), a branch of XML, is a mature standard for encoding texts that was developed three decades ago and continues to be improved and expanded upon today. Learn about how TEI was centrally imagined for a project devoted to a corpus of John Wesley material. We will begin by explaining why we chose to use TEI for the project and reviewing the considerations inherent in transitioning from a longstanding print-based project to a digital project, including the challenges of converting thousands of pages of text across different file types into rudimentary TEI. Next, we will move into topics specific to TEI encoding practices, including the creation of XML tagsets designed to maximize the use value of the Wesley Works for its various audiences: scholars, librarians, and clergy. Finally, we will show the TEI in action by sharing an example of an XML file from our first round of encoding.


2018 ◽  
Vol 6 (2) ◽  
pp. 221-242 ◽  
Author(s):  
Mohamed A. H. Ahmed

Abstract The main aim of this study is to introduce a model of TEI (Text Encoding Initiative) annotation of Hebrew elements in Judeo-Arabic texts, i.e., code switching (CS), borrowing, and Hebrew quotations. This article will provide an introduction to using XML (Extensible Markup Language) to investigate sociolinguistic aspects in medieval Judeo-Arabic texts. Accordingly, it will suggest to what extent using XML is useful for investigating linguistic and sociolinguistic features in the Judeo-Arabic paradigm. To provide an example for how XML annotation could be applied to Judeo-Arabic texts, a corpus of 300 pages selected from three Judeo-Arabic books has been manually annotated using the TEI P5. The annotation covers all instances of CS, borrowing, and Hebrew quotations in that corpus.


Author(s):  
Ryan Cordell ◽  
Benjamin J. Doyle ◽  
Elizabeth Hopwood

Ryan Cordell, Benjamin Doyle, and Elizabeth Hopwood’s essay seizes a nineteenth-century invention, the kaleidoscope, as a model and metaphor for pedagogical practices and learning spaces that encourage play and experimentation. Through examples that involve setting letterpress type, the Text Encoding Initiative (TEI) encoding of nineteenth-century texts as an interpretive process, and the collaborative creation of Wikipedia pages, the authors describe how experiments with contemporary technologies help students claim scholarly agency over the texts and tools central to their study of the nineteenth century. Kaleidoscopic pedagogy encourages students to discover how C19 competencies like close reading and contemporary methods of coding and data analysis have the potential to be mutually constitutive, inspiring a more nuanced understanding of both periods.


2020 ◽  
Vol 17 (6) ◽  
pp. 703-711
Author(s):  
Anita Walden ◽  
Lynsi Garvin ◽  
Michelle Smerek ◽  
Constance Johnson

Background: Increasing and sustaining the engagement of participants in clinical research studies is a goal for clinical investigators, especially for studies that require long-term or frequent involvement of participants. Technology can be used to reduce barriers to participation by providing multiple options for clinical data entry and form submission. However, electronic systems used in clinical research studies should be user-friendly while also ensuring data quality. Directly involving study participants in evaluating the effectiveness and usability of electronic tools may promote wider adoption, maintain involvement, and increase user satisfaction of the technology. While developers of healthcare applications have incorporated user-centered designs, these methods remain uncommon in the design of clinical study tools such as patient-reported outcome surveys or electronic data capture digital health tools. Methods: Our study evaluated whether the clinical research setting may benefit from implementing user-centered design principles. Study participants were recruited to test the web-based form for the Measurement to Understand the Reclassification of Disease of Cabarrus/Kannapolis (MURDOCK) Study Community Translational Population Health Registry and Biorepository that would enable them to complete their study forms electronically. The study enrollment form collects disease history, conditions, smoking status, medications, and other information. The system was initially evaluated by the data management team through traditional user-acceptance testing methods. During the tool evaluation phase, a decision was made to incorporate a small-scale usability study to directly test the system. Results: Results showed that a majority of participants found the system easy to use. Of the eight required tasks, 75% were completed successfully. Of the 72 heuristics violated, language was the most frequent violation. Conclusion: Our study showed that user-centered usability methods can identify important issues and capture information that can enhance the participant’s experience and may improve the quality of study tools.


Author(s):  
Steven M. Belz

Success in the marketplace doesn't happen by accident but through the application of human factors/ergonomics user-centered design principles.


2008 ◽  
Vol 18 (1) ◽  
pp. 103-119
Author(s):  
JANICE CARRUTHERS

ABSTRACTThe objective of this paper is to describe and evaluate the application of the Text Encoding Initiative (TEI) Guidelines to a corpus of oral French, this being the first corpus of oral French where the TEI has been used. The paper explains the purpose of the corpus, both in creating a specialist corpus ofnéo-contagethat will broaden the range of oral corpora available, and, more importantly, in creating a dataset to explore a variety of oral French that has a particularly interesting status in terms of factors such asconception orale/écrite, réalisation médialeandcomportement communicatif(Koch and Oesterreicher 2001). The linguistic phenomena to be encoded are both stylistic (speech and thought presentation) and syntactic (negation, detachment, inversion), and all represent areas where previous research has highlighted the significance of factors such as medium, register and discourse type, as well as a host of linguistic factors (syntactic, phonetic, lexical). After a discussion of how a tagset can be designed and applied within the TEI to encode speech and thought presentation, negation, detachment and inversion, the final section of the paper evaluates the benefits and possible drawbacks of the methodology offered by the TEI when applied to a syntactic and stylistic markup of an oral corpus.


Sign in / Sign up

Export Citation Format

Share Document