Named Entity Linking mit Wikidata und GND – Das Potenzial handkuratierter und strukturierter Datenquellen für die semantische Anreicherung von Volltexten

Web enables to retrieve concise information about specific entities including people, organizations, movies and their features. Additionally, large amount of Web resources generally lies on a unstructured form and it tackles to find critical information for specific entities. Text analysis approaches such as Named Entity Recognizer and Entity Linking aim to identify entities and link them to relevant entities in the given knowledge base. To evaluate these approaches, there are a vast amount of general purpose benchmark datasets. However, it is difficult to evaluate domain-specific approaches due to lack of evaluation datasets for specific domains. This study presents WeDGeM that is a multilingual evaluation set generator for specific domains exploiting Wikipedia category pages and DBpedia hierarchy. Also, Wikipedia disambiguation pages are used to adjust the ambiguity level of the generated texts. Based on this generated test data, a use case for well-known Entity Linking systems supporting Turkish texts are evaluated in the movie domain.

Download Full-text

Domain-Specific Entity Linking via Fake Named Entity Detection

Database Systems for Advanced Applications - Lecture Notes in Computer Science ◽

10.1007/978-3-319-32025-0_7 ◽

2016 ◽

pp. 101-116 ◽

Cited By ~ 5

Author(s):

Jiangtao Zhang ◽

Juanzi Li ◽

Xiao-Li Li ◽

Yao Shi ◽

Junpeng Li ◽

...

Keyword(s):

Entity Linking ◽

Named Entity ◽

Domain Specific ◽

Named Entity Detection

Download Full-text

Named entity linking of geospatial and host metadata in GenBank for advancing biomedical research

Database ◽

10.1093/database/bax093 ◽

2017 ◽

Vol 2017 ◽

Cited By ~ 5

Author(s):

Tasnia Tahsin ◽

Davy Weissenbacher ◽

Demetrius Jones-Shargani ◽

Daniel Magee ◽

Matteo Vaiente ◽

...

Keyword(s):

Biomedical Research ◽

Entity Linking ◽

Named Entity

Download Full-text

Linking chemical and disease entities to ontologies by integrating PageRank with extracted relations from literature

Journal of Cheminformatics ◽

10.1186/s13321-020-00461-4 ◽

2020 ◽

Vol 12 (1) ◽

Author(s):

Pedro Ruas ◽

Andre Lamurias ◽

Francisco M. Couto

Keyword(s):

Digital Libraries ◽

Information Overload ◽

Relation Extraction ◽

Knowledge Bases ◽

Entity Linking ◽

Personalized Pagerank ◽

Named Entity ◽

Manual Curation ◽

Low Performance ◽

Gold Standards

Abstract Background Named Entity Linking systems are a powerful aid to the manual curation of digital libraries, which is getting increasingly costly and inefficient due to the information overload. Models based on the Personalized PageRank (PPR) algorithm are one of the state-of-the-art approaches, but these have low performance when the disambiguation graphs are sparse. Findings This work proposes a Named Entity Linking framework designated by Relation Extraction for Entity Linking (REEL) that uses automatically extracted relations to overcome this limitation. Our method builds a disambiguation graph, where the nodes are the ontology candidates for the entities and the edges are added according to the relations established in the text, which the method extracts automatically. The PPR algorithm and the information content of each ontology are then applied to choose the candidate for each entity that maximises the coherence of the disambiguation graph. We evaluated the method on three gold standards: the subset of the CRAFT corpus with ChEBI annotations (CRAFT-ChEBI), the subset of the BC5CDR corpus with disease annotations from the MEDIC vocabulary (BC5CDR-Diseases) and the subset with chemical annotations from the CTD-Chemical vocabulary (BC5CDR-Chemicals). The F1-Score achieved by REEL was 85.8%, 80.9% and 90.3% in these gold standards, respectively, outperforming baseline approaches. Conclusions We demonstrated that RE tools can improve Named Entity Linking by capturing semantic information expressed in text missing in Knowledge Bases and use it to improve the disambiguation graph of Named Entity Linking models. REEL can be adapted to any text mining pipeline and potentially to any domain, as long as there is an ontology or other knowledge Base available.

Download Full-text