scholarly journals Named Entity Linking mit Wikidata und GND – Das Potenzial handkuratierter und strukturierter Datenquellen für die semantische Anreicherung von Volltexten

2021 ◽  
pp. 229-258
Author(s):  
Sina Menzel ◽  
Hannes Schnaitter ◽  
Josefine Zinck ◽  
Vivien Petras ◽  
Clemens Neudecker ◽  
...  
Keyword(s):  
Author(s):  
Emrah Inan ◽  
Vahab Mostafapour ◽  
Fatif Tekbacak

Web enables to retrieve concise information about specific entities including people, organizations, movies and their features. Additionally, large amount of Web resources generally lies on a unstructured form and it tackles to find critical information for specific entities. Text analysis approaches such as Named Entity Recognizer and Entity Linking aim to identify entities and link them to relevant entities in the given knowledge base. To evaluate these approaches, there are a vast amount of general purpose benchmark datasets. However, it is difficult to evaluate domain-specific approaches due to lack of evaluation datasets for specific domains. This study presents WeDGeM that is a multilingual evaluation set generator for specific domains exploiting Wikipedia category pages and DBpedia hierarchy. Also, Wikipedia disambiguation pages are used to adjust the ambiguity level of the generated texts. Based on this generated test data, a use case for well-known Entity Linking systems supporting Turkish texts are evaluated in the movie domain.


Author(s):  
Jiangtao Zhang ◽  
Juanzi Li ◽  
Xiao-Li Li ◽  
Yao Shi ◽  
Junpeng Li ◽  
...  

Database ◽  
2017 ◽  
Vol 2017 ◽  
Author(s):  
Tasnia Tahsin ◽  
Davy Weissenbacher ◽  
Demetrius Jones-Shargani ◽  
Daniel Magee ◽  
Matteo Vaiente ◽  
...  

2020 ◽  
Vol 12 (1) ◽  
Author(s):  
Pedro Ruas ◽  
Andre Lamurias ◽  
Francisco M. Couto

Abstract Background Named Entity Linking systems are a powerful aid to the manual curation of digital libraries, which is getting increasingly costly and inefficient due to the information overload. Models based on the Personalized PageRank (PPR) algorithm are one of the state-of-the-art approaches, but these have low performance when the disambiguation graphs are sparse. Findings This work proposes a Named Entity Linking framework designated by Relation Extraction for Entity Linking (REEL) that uses automatically extracted relations to overcome this limitation. Our method builds a disambiguation graph, where the nodes are the ontology candidates for the entities and the edges are added according to the relations established in the text, which the method extracts automatically. The PPR algorithm and the information content of each ontology are then applied to choose the candidate for each entity that maximises the coherence of the disambiguation graph. We evaluated the method on three gold standards: the subset of the CRAFT corpus with ChEBI annotations (CRAFT-ChEBI), the subset of the BC5CDR corpus with disease annotations from the MEDIC vocabulary (BC5CDR-Diseases) and the subset with chemical annotations from the CTD-Chemical vocabulary (BC5CDR-Chemicals). The F1-Score achieved by REEL was 85.8%, 80.9% and 90.3% in these gold standards, respectively, outperforming baseline approaches. Conclusions We demonstrated that RE tools can improve Named Entity Linking by capturing semantic information expressed in text missing in Knowledge Bases and use it to improve the disambiguation graph of Named Entity Linking models. REEL can be adapted to any text mining pipeline and potentially to any domain, as long as there is an ontology or other knowledge Base available.


Author(s):  
Simone Tedeschi ◽  
Simone Conia ◽  
Francesco Cecconi ◽  
Roberto Navigli

Author(s):  
Tom Oberhauser ◽  
Tim Bischoff ◽  
Karl Brendel ◽  
Maluna Menke ◽  
Tobias Klatt ◽  
...  

2020 ◽  
Author(s):  
Mohammad Golam Sohrab ◽  
Khoa Duong ◽  
Makoto Miwa ◽  
Goran Topić ◽  
Ikeda Masami ◽  
...  
Keyword(s):  

2016 ◽  
Author(s):  
Toni Gruetze ◽  
Gjergji Kasneci ◽  
Zhe Zuo ◽  
Felix Naumann

Sign in / Sign up

Export Citation Format

Share Document