scholarly journals Automatic document-level semantic metadata annotation using folksonomies and domain ontologies

2008 ◽  
Vol 2008 (Autumn) ◽  
pp. 1-3 ◽  
Author(s):  
Hend S. Al-Khalifa ◽  
Jessica Rubart
2022 ◽  
Vol 12 (2) ◽  
pp. 796
Author(s):  
Julia Sasse ◽  
Johannes Darms ◽  
Juliane Fluck

For all research data collected, data descriptions and information about the corresponding variables are essential for data analysis and reuse. To enable cross-study comparisons and analyses, semantic interoperability of metadata is one of the most important requirements. In the area of clinical and epidemiological studies, data collection instruments such as case report forms (CRFs), data dictionaries and questionnaires are critical for metadata collection. Even though data collection instruments are often created in a digital form, they are mostly not machine readable; i.e., they are not semantically coded. As a result, the comparison between data collection instruments is complex. The German project NFDI4Health is dedicated to the development of national research data infrastructure for personal health data, and as such searches for ways to enhance semantic interoperability. Retrospective integration of semantic codes into study metadata is important, as ongoing or completed studies contain valuable information. However, this is labor intensive and should be eased by software. To understand the market and find out what techniques and technologies support retrospective semantic annotation/enrichment of metadata, we conducted a literature review. In NFDI4Health, we identified basic requirements for semantic metadata annotation software in the biomedical field and in the context of the FAIR principles. Ten relevant software systems were summarized and aligned with those requirements. We concluded that despite active research on semantic annotation systems, no system meets all requirements. Consequently, further research and software development in this area is needed, as interoperability of data dictionaries, questionnaires and data collection tools is key to reusing and combining results from independent research studies.


2020 ◽  
Author(s):  
Sung Won Jung ◽  
Sungchul Bae ◽  
Donghyeong Seong ◽  
Byoung-Kee Yi

BACKGROUND Through several years of the healthcare information exchange based on the HIE project, some problems were found in the CDA documents generated. OBJECTIVE To fix some problems, we developed the K-CDA Implementation Guide (K means S. Korea) that conforms to the HL7 CDA, and suits the domestic conditions regarding the healthcare information. METHODS We achieved by analyzing HIE guideline and the U.S. C-CDA, and comparing each item. The items that required further discussion were reviewed by the expert committee. Based on the reviews, the previously developed templates were revised. RESULTS A total of 35 CDA templates were developed: five document-level templates, fourteen section-level templates, and sixteen entry-level templates. The 28 value sets used in the templates have been improved and the OIDs for HIE have been redefined CONCLUSIONS The K-CDA IG allows management in the form of a template library based on the definition of the General K-Header and the structured templates. This enables the K-CDA IG to respond to the expansion of national HIE templates with flexibility. For the K-CDA IG, the CDA template in current use was incorporated to the greatest extent possible, to minimize the scope of modifications. It enables the national HIE and the HIE with countries abroad.


2021 ◽  
Vol 54 (2) ◽  
pp. 1-36
Author(s):  
Sameen Maruf ◽  
Fahimeh Saleh ◽  
Gholamreza Haffari

Machine translation (MT) is an important task in natural language processing (NLP), as it automates the translation process and reduces the reliance on human translators. With the resurgence of neural networks, the translation quality surpasses that of the translations obtained using statistical techniques for most language-pairs. Up until a few years ago, almost all of the neural translation models translated sentences independently , without incorporating the wider document-context and inter-dependencies among the sentences. The aim of this survey article is to highlight the major works that have been undertaken in the space of document-level machine translation after the neural revolution, so researchers can recognize the current state and future directions of this field. We provide an organization of the literature based on novelties in modelling and architectures as well as training and decoding strategies. In addition, we cover evaluation strategies that have been introduced to account for the improvements in document MT, including automatic metrics and discourse-targeted test sets. We conclude by presenting possible avenues for future exploration in this research field.


2012 ◽  
Vol 157-158 ◽  
pp. 1079-1082
Author(s):  
Guo Shi Wu ◽  
Xiao Yin Wu ◽  
Jing Jing Wei

One of the most widely-studied sub-problems of opinion mining is sentiment classification, which includes three study levels: word, sentence and document. At the third level, most of the existing methods ignore comparative sentences which have particular sentence patterns and may lower the precision of the document-level analysis. This paper studies sentiment analysis of comparative sentences. The aim is to determine whether opinions expressed in a comparative sentence are positive or negative. Experiments of comparing with document-level sentiment analysis based on simple sentences shows the effectiveness of the proposed method.


Sign in / Sign up

Export Citation Format

Share Document