Chinese Scientific And Technological Named Entity identification and Relation extraction based on knowledge tree

With the advent of Web 2.0, there exist many online platforms that result in massive textual-data production. With ever-increasing textual data at hand, it is of immense importance to extract information nuggets from this data. One approach towards effective harnessing of this unstructured textual data could be its transformation into structured text. Hence, this study aims to present an overview of approaches that can be applied to extract key insights from textual data in a structured way. For this, Named Entity Recognition and Relation Extraction are being majorly addressed in this review study. The former deals with identification of named entities, and the latter deals with problem of extracting relation between set of entities. This study covers early approaches as well as the developments made up till now using machine learning models. Survey findings conclude that deep-learning-based hybrid and joint models are currently governing the state-of-the-art. It is also observed that annotated benchmark datasets for various textual-data generators such as Twitter and other social forums are not available. This scarcity of dataset has resulted into relatively less progress in these domains. Additionally, the majority of the state-of-the-art techniques are offline and computationally expensive. Last, with increasing focus on deep-learning frameworks, there is need to understand and explain the under-going processes in deep architectures.

Download Full-text

An Attention-Based Model Using Character Composition of Entities in Chinese Relation Extraction

Information ◽

10.3390/info11020079 ◽

2020 ◽

Vol 11 (2) ◽

pp. 79 ◽

Cited By ~ 2

Author(s):

Xiaoyu Han ◽

Yue Zhang ◽

Wenkai Zhang ◽

Tinglei Huang

Keyword(s):

Language Processing ◽

Large Scale ◽

Named Entity Recognition ◽

Relation Extraction ◽

Entity Recognition ◽

Additional Information ◽

Named Entity ◽

Proposed Model ◽

The Relationship ◽

Crucial Part

Relation extraction is a vital task in natural language processing. It aims to identify the relationship between two specified entities in a sentence. Besides information contained in the sentence, additional information about the entities is verified to be helpful in relation extraction. Additional information such as entity type getting by NER (Named Entity Recognition) and description provided by knowledge base both have their limitations. Nevertheless, there exists another way to provide additional information which can overcome these limitations in Chinese relation extraction. As Chinese characters usually have explicit meanings and can carry more information than English letters. We suggest that characters that constitute the entities can provide additional information which is helpful for the relation extraction task, especially in large scale datasets. This assumption has never been verified before. The main obstacle is the lack of large-scale Chinese relation datasets. In this paper, first, we generate a large scale Chinese relation extraction dataset based on a Chinese encyclopedia. Second, we propose an attention-based model using the characters that compose the entities. The result on the generated dataset shows that these characters can provide useful information for the Chinese relation extraction task. By using this information, the attention mechanism we used can recognize the crucial part of the sentence that can express the relation. The proposed model outperforms other baseline models on our Chinese relation extraction dataset.

Download Full-text

An unsupervised learning method for named entity relation extraction of space knowledge graph

Journal of Physics Conference Series ◽

10.1088/1742-6596/1871/1/012051 ◽

2021 ◽

Vol 1871 (1) ◽

pp. 012051

Author(s):

Zhanji Wei ◽

Lingyong Huang ◽

Gang Wan ◽

Yao Mu ◽

Yunxia Yin

Keyword(s):

Unsupervised Learning ◽

Relation Extraction ◽

Knowledge Graph ◽

Learning Method ◽

Named Entity ◽

Entity Relation Extraction

Download Full-text

Named Entity Identification and Cyberinfrastructure

Research and Advanced Technology for Digital Libraries - Lecture Notes in Computer Science ◽

10.1007/978-3-540-74851-9_22 ◽

2007 ◽

pp. 259-270 ◽

Cited By ~ 5

Author(s):

Alison Babeu ◽

David Bamman ◽

Gregory Crane ◽

Robert Kummer ◽

Gabriel Weaver

Keyword(s):

Named Entity ◽

Entity Identification

Download Full-text

Optimizing Relation Extraction Based on the Type Tag of Named Entity

Lecture Notes in Computer Science - Chinese Lexical Semantics ◽

10.1007/978-3-030-04015-4_45 ◽

2018 ◽

pp. 532-541

Author(s):

Yixing Zhang ◽

Yangsen Zhang ◽

Gaijuan Huang ◽

Zhengbin Guo

Keyword(s):

Relation Extraction ◽

Named Entity

Download Full-text

A Military Named Entity Relation Extraction Approach Based on Deep Learning

Proceedings of the 2018 International Conference on Algorithms, Computing and Artificial Intelligence - ACAI 2018 ◽

10.1145/3302425.3302473 ◽

2018 ◽

Author(s):

Xuefeng Wang ◽

Ruopeng Yang ◽

Yulong Feng ◽

Dongsheng Li ◽

Jianfeng Hou

Keyword(s):

Deep Learning ◽

Relation Extraction ◽

Named Entity ◽

Entity Relation Extraction

Download Full-text

Linking chemical and disease entities to ontologies by integrating PageRank with extracted relations from literature

Journal of Cheminformatics ◽

10.1186/s13321-020-00461-4 ◽

2020 ◽

Vol 12 (1) ◽

Author(s):

Pedro Ruas ◽

Andre Lamurias ◽

Francisco M. Couto

Keyword(s):

Digital Libraries ◽

Information Overload ◽

Relation Extraction ◽

Knowledge Bases ◽

Entity Linking ◽

Personalized Pagerank ◽

Named Entity ◽

Manual Curation ◽

Low Performance ◽

Gold Standards

Abstract Background Named Entity Linking systems are a powerful aid to the manual curation of digital libraries, which is getting increasingly costly and inefficient due to the information overload. Models based on the Personalized PageRank (PPR) algorithm are one of the state-of-the-art approaches, but these have low performance when the disambiguation graphs are sparse. Findings This work proposes a Named Entity Linking framework designated by Relation Extraction for Entity Linking (REEL) that uses automatically extracted relations to overcome this limitation. Our method builds a disambiguation graph, where the nodes are the ontology candidates for the entities and the edges are added according to the relations established in the text, which the method extracts automatically. The PPR algorithm and the information content of each ontology are then applied to choose the candidate for each entity that maximises the coherence of the disambiguation graph. We evaluated the method on three gold standards: the subset of the CRAFT corpus with ChEBI annotations (CRAFT-ChEBI), the subset of the BC5CDR corpus with disease annotations from the MEDIC vocabulary (BC5CDR-Diseases) and the subset with chemical annotations from the CTD-Chemical vocabulary (BC5CDR-Chemicals). The F1-Score achieved by REEL was 85.8%, 80.9% and 90.3% in these gold standards, respectively, outperforming baseline approaches. Conclusions We demonstrated that RE tools can improve Named Entity Linking by capturing semantic information expressed in text missing in Knowledge Bases and use it to improve the disambiguation graph of Named Entity Linking models. REEL can be adapted to any text mining pipeline and potentially to any domain, as long as there is an ontology or other knowledge Base available.

Download Full-text