Improving Semantic Relation Extraction System with Compositional Dependency Unit on Enriched Shortest Dependency Path

Author(s):  
Duy-Cat Can ◽  
Hoang-Quynh Le ◽  
Quang-Thuy Ha
Author(s):  
Wei Shen ◽  
Jianyong Wang ◽  
Ping Luo ◽  
Min Wang

Relation extraction from the Web data has attracted a lot of attention recently. However, little work has been done when it comes to the enterprise data regardless of the urgent needs to such work in real applications (e.g., E-discovery). One distinct characteristic of the enterprise data (in comparison with the Web data) is its low redundancy. Previous work on relation extraction from the Web data largely relies on the data's high redundancy level and thus cannot be applied to the enterprise data effectively. This chapter reviews related work on relation extraction and introduces an unsupervised hybrid framework REACTOR for semantic relation extraction over enterprise data. REACTOR combines a statistical method, classification, and clustering to identify various types of relations among entities appearing in the enterprise data automatically. REACTOR was evaluated over a real-world enterprise data set from HP that contains over three million pages and the experimental results show its effectiveness.


PLoS ONE ◽  
2011 ◽  
Vol 6 (8) ◽  
pp. e23862 ◽  
Author(s):  
Yue Shang ◽  
Yanpeng Li ◽  
Hongfei Lin ◽  
Zhihao Yang

2018 ◽  
Vol 31 (9) ◽  
pp. 4563-4576 ◽  
Author(s):  
Shengli Song ◽  
Yulong Sun ◽  
Qiang Di

2020 ◽  
Vol 23 (3) ◽  
pp. 2043-2077 ◽  
Author(s):  
Yongpan Sheng ◽  
Zenglin Xu ◽  
Yafang Wang ◽  
Gerard de Melo

2020 ◽  
Vol 509 ◽  
pp. 183-192 ◽  
Author(s):  
ZhiQiang Geng ◽  
GuoFei Chen ◽  
YongMing Han ◽  
Gang Lu ◽  
Fang Li

2017 ◽  
Vol 24 (6) ◽  
pp. 1062-1071 ◽  
Author(s):  
Tian Kang ◽  
Shaodian Zhang ◽  
Youlan Tang ◽  
Gregory W Hruby ◽  
Alexander Rusanov ◽  
...  

Abstract Objective To develop an open-source information extraction system called Eligibility Criteria Information Extraction (EliIE) for parsing and formalizing free-text clinical research eligibility criteria (EC) following Observational Medical Outcomes Partnership Common Data Model (OMOP CDM) version 5.0. Materials and Methods EliIE parses EC in 4 steps: (1) clinical entity and attribute recognition, (2) negation detection, (3) relation extraction, and (4) concept normalization and output structuring. Informaticians and domain experts were recruited to design an annotation guideline and generate a training corpus of annotated EC for 230 Alzheimer’s clinical trials, which were represented as queries against the OMOP CDM and included 8008 entities, 3550 attributes, and 3529 relations. A sequence labeling–based method was developed for automatic entity and attribute recognition. Negation detection was supported by NegEx and a set of predefined rules. Relation extraction was achieved by a support vector machine classifier. We further performed terminology-based concept normalization and output structuring. Results In task-specific evaluations, the best F1 score for entity recognition was 0.79, and for relation extraction was 0.89. The accuracy of negation detection was 0.94. The overall accuracy for query formalization was 0.71 in an end-to-end evaluation. Conclusions This study presents EliIE, an OMOP CDM–based information extraction system for automatic structuring and formalization of free-text EC. According to our evaluation, machine learning-based EliIE outperforms existing systems and shows promise to improve.


Sign in / Sign up

Export Citation Format

Share Document