Entity Linking: Finding Extracted Entities in a Knowledge Base

The exponential growth of data emerging out of social media is causing challenges in decision-making systems and poses a critical hindrance in searching for the potential information. The major objective of this chapter is to convert the unstructured data in social media into the meaningful structure format, which in return brings the robustness to the information extraction process. Further, it has the inherent capability to prune for named entities from the unstructured data and store the entities into the knowledge base for important facts. In this chapter, the authors explain the methods to identify all the critical interpretations taken over to find the named entities from Twitter streams and the techniques to proportionally link it with appropriate knowledge sources such as DBpedia.

Download Full-text

WikiIdRank++: EXTENSIONS AND IMPROVEMENTS OF THE WikiIdRank SYSTEM FOR ENTITY LINKING

International Journal of Artificial Intelligence Tools ◽

10.1142/s0218213013500188 ◽

2013 ◽

Vol 22 (03) ◽

pp. 1350018 ◽

Cited By ~ 1

Author(s):

M. D. JIMÉNEZ ◽

N. FERNÁNDEZ ◽

J. ARIAS FISTEUS ◽

L. SÁNCHEZ

Keyword(s):

Knowledge Base ◽

Text Analysis ◽

Base Population ◽

Entity Linking ◽

Named Entities ◽

Named Entity ◽

Global Comparison ◽

Knowledge Base Population ◽

Reference Knowledge ◽

Unstructured Information

The amount of information available on the Web has grown considerably in recent years, leading to the need to structure it in order to access it in a quick and accurate way. In order to develop techniques to automate the structuring process, the Knowledge Base Population (KBP) track of the Text Analysis Conference (TAC) was created. This forum aims to encourage research in automated systems capable of capturing knowledge in unstructured information. One of the tasks proposed in the context of the KBP track is named entity linking, and its goal is to link named entities mentioned in a document to instances in a reference knowledge base built from Wikipedia. This paper focuses on the entity linking task in the context of KBP 2010, where two different varieties of this task were considered, depending on whether the use of the text from Wikipedia was allowed or not. Specifically, the paper proposes a set of modifications to a system that participated in KBP 2010, named WikiIdRank, in order to improve its performance. The different modifications were evaluated in the official KBP 2010 corpus, showing that the best combination increases the accuracy of the initial system in a 7.04%. Though the resultant system, named WikiIdRank++, is unsupervised and does not take advantage of Wikipedia text, a comparison with other approaches in KBP indicates that the system would rank as 4th (out of 16) in the global comparison, outperforming other approaches that use human supervision and take advantage of Wikipedia textual contents. Furthermore, the system would rank as 1st in the category of systems that do not use Wikipedia text.

Download Full-text

Entity Linking: An Issue to Extract Corresponding Entity With Knowledge Base

IEEE Access ◽

10.1109/access.2017.2787787 ◽

2018 ◽

Vol 6 ◽

pp. 6220-6231 ◽

Cited By ~ 10

Author(s):

Gongqing Wu ◽

Ying He ◽

Xuegang Hu

Keyword(s):

Knowledge Base ◽

Entity Linking

Download Full-text

A collective entity linking algorithm with parallel computing on large-scale knowledge base

The Journal of Supercomputing ◽

10.1007/s11227-019-03046-7 ◽

2019 ◽

Vol 76 (2) ◽

pp. 948-963 ◽

Cited By ~ 3

Author(s):

Yingchun Xia ◽

Xingyue Wang ◽

Lichuan Gu ◽

Qijuan Gao ◽

Jun Jiao ◽

...

Keyword(s):

Parallel Computing ◽

Knowledge Base ◽

Large Scale ◽

Entity Linking

Download Full-text

Automatic Linking of Terms from Scientific Texts with Knowledge Base Entities

Vestnik NSU Series Information Technologies ◽

10.25205/1818-7900-2021-19-2-65-75 ◽

2021 ◽

Vol 19 (2) ◽

pp. 65-75

Author(s):

A. A. Mezentseva ◽

E. P. Bruches ◽

T. V. Batura

Keyword(s):

Knowledge Base ◽

Text Processing ◽

Semantic Content ◽

Knowledge Bases ◽

Scientific Article ◽

Entity Linking ◽

Scientific Publications ◽

Scientific Texts ◽

Two Stages

Due to the growth of the number of scientific publications, the tasks related to scientific article processing become more actual. Such texts have a special structure, lexical and semantic content that should be taken into account while processing. Using information from knowledge bases can significantly improve the quality of text processing systems. This paper is dedicated to the entity linking task for scientific articles in Russian, where we consider scientific terms as entities. During our work, we annotated a corpus with scientific texts, where each term was linked with an entity from a knowledge base. Also, we implemented an algorithm for entity linking and evaluated it on the corpus. The algorithm consists of two stages: candidate generation for an input term and ranking this set of candidates to choose the best match. We used string matching of an input term and an entity in a knowledge base to generate a set of candidates. To rank the candidates and choose the most relevant entity for a term, information about the number of links to other entities within the knowledge base and to other sites is used. We analyzed the obtained results and proposed possible ways to improve the quality of the algorithm, for example, using information about the context and a knowledge base structure. The annotated corpus is publicly available and can be useful for other researchers.

Download Full-text