Automatic document-level semantic metadata annotation using folksonomies and domain ontologies

For all research data collected, data descriptions and information about the corresponding variables are essential for data analysis and reuse. To enable cross-study comparisons and analyses, semantic interoperability of metadata is one of the most important requirements. In the area of clinical and epidemiological studies, data collection instruments such as case report forms (CRFs), data dictionaries and questionnaires are critical for metadata collection. Even though data collection instruments are often created in a digital form, they are mostly not machine readable; i.e., they are not semantically coded. As a result, the comparison between data collection instruments is complex. The German project NFDI4Health is dedicated to the development of national research data infrastructure for personal health data, and as such searches for ways to enhance semantic interoperability. Retrospective integration of semantic codes into study metadata is important, as ongoing or completed studies contain valuable information. However, this is labor intensive and should be eased by software. To understand the market and find out what techniques and technologies support retrospective semantic annotation/enrichment of metadata, we conducted a literature review. In NFDI4Health, we identified basic requirements for semantic metadata annotation software in the biomedical field and in the context of the FAIR principles. Ten relevant software systems were summarized and aligned with those requirements. We concluded that despite active research on semantic annotation systems, no system meets all requirements. Consequently, further research and software development in this area is needed, as interoperability of data dictionaries, questionnaires and data collection tools is key to reusing and combining results from independent research studies.

Download Full-text

FolksAnnotation: A Semantic Metadata Tool for Annotating Learning Resources Using Folksonomies and Domain Ontologies

2006 Innovations in Information Technology ◽

10.1109/innovations.2006.301927 ◽

2006 ◽

Cited By ~ 6

Author(s):

Hend Al-khalifa ◽

Hugh Davis

Keyword(s):

Learning Resources ◽

Semantic Metadata ◽

Domain Ontologies

Download Full-text

KeyPhrase Extraction Tool (KET) for Semantic Metadata Annotation of Learning Materials

2009 International Conference on Signal Processing Systems ◽

10.1109/icsps.2009.192 ◽

2009 ◽

Cited By ~ 6

Author(s):

Sonal Jain ◽

Jyoti Pareek

Keyword(s):

Keyphrase Extraction ◽

Learning Materials ◽

Semantic Metadata ◽

Metadata Annotation

Download Full-text

Developing a K-CDA Implementation Guide for Applying Health Information Exchange Service in South Korea (Preprint)

10.2196/preprints.25485 ◽

2020 ◽

Author(s):

Sung Won Jung ◽

Sungchul Bae ◽

Donghyeong Seong ◽

Byoung-Kee Yi

Keyword(s):

Information Exchange ◽

Health Information Exchange ◽

Expert Committee ◽

Healthcare Information ◽

Entry Level ◽

Template Library ◽

Value Sets ◽

Definition Of ◽

Document Level ◽

Section Level

BACKGROUND Through several years of the healthcare information exchange based on the HIE project, some problems were found in the CDA documents generated. OBJECTIVE To fix some problems, we developed the K-CDA Implementation Guide (K means S. Korea) that conforms to the HL7 CDA, and suits the domestic conditions regarding the healthcare information. METHODS We achieved by analyzing HIE guideline and the U.S. C-CDA, and comparing each item. The items that required further discussion were reviewed by the expert committee. Based on the reviews, the previously developed templates were revised. RESULTS A total of 35 CDA templates were developed: five document-level templates, fourteen section-level templates, and sixteen entry-level templates. The 28 value sets used in the templates have been improved and the OIDs for HIE have been redefined CONCLUSIONS The K-CDA IG allows management in the form of a template library based on the definition of the General K-Header and the structured templates. This enables the K-CDA IG to respond to the expansion of national HIE templates with flexibility. For the K-CDA IG, the CDA template in current use was incorporated to the greatest extent possible, to minimize the scope of modifications. It enables the national HIE and the HIE with countries abroad.

Download Full-text

Document Level Emotion Detection from Bangla Text Using Machine Learning Techniques

2021 International Conference on Information and Communication Technology for Sustainable Development (ICICT4SD) ◽

10.1109/icict4sd50815.2021.9397036 ◽

2021 ◽

Author(s):

Sadia Afrin Purba ◽

Sadia Tasnim ◽

Mobasshira Jabin ◽

Tahmim Hossen ◽

Md. Khairul Hasan

Keyword(s):

Machine Learning ◽

Machine Learning Techniques ◽

Emotion Detection ◽

Learning Techniques ◽

Document Level

Download Full-text

Incorporating Multi-Type External Information for Document-Level Sentiment Classification

2020 International Conference on Asian Language Processing (IALP) ◽

10.1109/ialp51396.2020.9310480 ◽

2020 ◽

Author(s):

Pengyuan Liu ◽

Chenghao Zhu

Keyword(s):

Sentiment Classification ◽

External Information ◽

Document Level

Download Full-text

A Survey on Document-level Neural Machine Translation

ACM Computing Surveys ◽

10.1145/3441691 ◽

2021 ◽

Vol 54 (2) ◽

pp. 1-36

Author(s):

Sameen Maruf ◽

Fahimeh Saleh ◽

Gholamreza Haffari

Keyword(s):

Machine Translation ◽

Language Processing ◽

Research Field ◽

Translation Process ◽

Future Directions ◽

Translation Quality ◽

Current State ◽

Evaluation Strategies ◽

Almost All ◽

Document Level

Machine translation (MT) is an important task in natural language processing (NLP), as it automates the translation process and reduces the reliance on human translators. With the resurgence of neural networks, the translation quality surpasses that of the translations obtained using statistical techniques for most language-pairs. Up until a few years ago, almost all of the neural translation models translated sentences independently , without incorporating the wider document-context and inter-dependencies among the sentences. The aim of this survey article is to highlight the major works that have been undertaken in the space of document-level machine translation after the neural revolution, so researchers can recognize the current state and future directions of this field. We provide an organization of the literature based on novelties in modelling and architectures as well as training and decoding strategies. In addition, we cover evaluation strategies that have been introduced to account for the improvements in document MT, including automatic metrics and discourse-targeted test sets. We conclude by presenting possible avenues for future exploration in this research field.

Download Full-text

An End-to-End Approach for Document-level Event Factuality Identification in Chinese

2020 International Conference on Asian Language Processing (IALP) ◽

10.1109/ialp51396.2020.9310484 ◽

2020 ◽

Author(s):

Xiaojia Li ◽

Yun Zhang ◽

Zhong Qian

Keyword(s):

End To End ◽

Document Level ◽

Event Factuality

Download Full-text

Sentiment Analysis of Comparative Sentences for Chinese Document

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.157-158.1079 ◽

2012 ◽

Vol 157-158 ◽

pp. 1079-1082

Author(s):

Guo Shi Wu ◽

Xiao Yin Wu ◽

Jing Jing Wei

Keyword(s):

Sentiment Analysis ◽

Opinion Mining ◽

Sentiment Classification ◽

The Third ◽

Sentence Patterns ◽

Simple Sentences ◽

Document Level ◽

Level Analysis

One of the most widely-studied sub-problems of opinion mining is sentiment classification, which includes three study levels: word, sentence and document. At the third level, most of the existing methods ignore comparative sentences which have particular sentence patterns and may lower the precision of the document-level analysis. This paper studies sentiment analysis of comparative sentences. The aim is to determine whether opinions expressed in a comparative sentence are positive or negative. Experiments of comparing with document-level sentiment analysis based on simple sentences shows the effectiveness of the proposed method.

Download Full-text