Improving Semantic Relation Extraction System with Compositional Dependency Unit on Enriched Shortest Dependency Path

Relation extraction from the Web data has attracted a lot of attention recently. However, little work has been done when it comes to the enterprise data regardless of the urgent needs to such work in real applications (e.g., E-discovery). One distinct characteristic of the enterprise data (in comparison with the Web data) is its low redundancy. Previous work on relation extraction from the Web data largely relies on the data's high redundancy level and thus cannot be applied to the enterprise data effectively. This chapter reviews related work on relation extraction and introduces an unsupervised hybrid framework REACTOR for semantic relation extraction over enterprise data. REACTOR combines a statistical method, classification, and clustering to identify various types of relations among entities appearing in the enterprise data automatically. REACTOR was evaluated over a real-world enterprise data set from HP that contains over three million pages and the experimental results show its effectiveness.

Download Full-text

Semantic Relation Extraction from Legislative Text Using Generalized Syntactic Dependencies and Support Vector Machines

Theory, Practice, and Applications of Rules on the Web - Lecture Notes in Computer Science ◽

10.1007/978-3-642-39617-5_20 ◽

2013 ◽

pp. 218-225 ◽

Cited By ~ 12

Author(s):

Guido Boella ◽

Luigi Di Caro ◽

Livio Robaldo

Keyword(s):

Support Vector Machines ◽

Relation Extraction ◽

Semantic Relation ◽

Support Vector ◽

Vector Machines ◽

Syntactic Dependencies

Download Full-text

Enhancing Biomedical Text Summarization Using Semantic Relation Extraction

PLoS ONE ◽

10.1371/journal.pone.0023862 ◽

2011 ◽

Vol 6 (8) ◽

pp. e23862 ◽

Cited By ~ 17

Author(s):

Yue Shang ◽

Yanpeng Li ◽

Hongfei Lin ◽

Zhihao Yang

Keyword(s):

Relation Extraction ◽

Semantic Relation ◽

Text Summarization ◽

Biomedical Text

Download Full-text

Multiple order semantic relation extraction

Neural Computing and Applications ◽

10.1007/s00521-018-3453-x ◽

2018 ◽

Vol 31 (9) ◽

pp. 4563-4576 ◽

Cited By ~ 3

Author(s):

Shengli Song ◽

Yulong Sun ◽

Qiang Di

Keyword(s):

Relation Extraction ◽

Semantic Relation

Download Full-text

Multi-document semantic relation extraction for news analytics

World Wide Web ◽

10.1007/s11280-020-00790-2 ◽

2020 ◽

Vol 23 (3) ◽

pp. 2043-2077 ◽

Cited By ~ 1

Author(s):

Yongpan Sheng ◽

Zenglin Xu ◽

Yafang Wang ◽

Gerard de Melo

Keyword(s):

Relation Extraction ◽

Semantic Relation

Download Full-text

Semantic relation extraction using sequential and tree-structured LSTM with attention

Information Sciences ◽

10.1016/j.ins.2019.09.006 ◽

2020 ◽

Vol 509 ◽

pp. 183-192 ◽

Cited By ~ 17

Author(s):

ZhiQiang Geng ◽

GuoFei Chen ◽

YongMing Han ◽

Gang Lu ◽

Fang Li

Keyword(s):

Relation Extraction ◽

Semantic Relation

Download Full-text

EliIE: An open-source information extraction system for clinical trial eligibility criteria

Journal of the American Medical Informatics Association ◽

10.1093/jamia/ocx019 ◽

2017 ◽

Vol 24 (6) ◽

pp. 1062-1071 ◽

Cited By ~ 20

Author(s):

Tian Kang ◽

Shaodian Zhang ◽

Youlan Tang ◽

Gregory W Hruby ◽

Alexander Rusanov ◽

...

Keyword(s):

Information Extraction ◽

Open Source ◽

Relation Extraction ◽

Extraction System ◽

Free Text ◽

Source Information ◽

Eligibility Criteria ◽

Negation Detection ◽

Attribute Recognition ◽

Information Extraction System

Abstract Objective To develop an open-source information extraction system called Eligibility Criteria Information Extraction (EliIE) for parsing and formalizing free-text clinical research eligibility criteria (EC) following Observational Medical Outcomes Partnership Common Data Model (OMOP CDM) version 5.0. Materials and Methods EliIE parses EC in 4 steps: (1) clinical entity and attribute recognition, (2) negation detection, (3) relation extraction, and (4) concept normalization and output structuring. Informaticians and domain experts were recruited to design an annotation guideline and generate a training corpus of annotated EC for 230 Alzheimer’s clinical trials, which were represented as queries against the OMOP CDM and included 8008 entities, 3550 attributes, and 3529 relations. A sequence labeling–based method was developed for automatic entity and attribute recognition. Negation detection was supported by NegEx and a set of predefined rules. Relation extraction was achieved by a support vector machine classifier. We further performed terminology-based concept normalization and output structuring. Results In task-specific evaluations, the best F1 score for entity recognition was 0.79, and for relation extraction was 0.89. The accuracy of negation detection was 0.94. The overall accuracy for query formalization was 0.71 in an end-to-end evaluation. Conclusions This study presents EliIE, an OMOP CDM–based information extraction system for automatic structuring and formalization of free-text EC. According to our evaluation, machine learning-based EliIE outperforms existing systems and shows promise to improve.

Download Full-text