Domain Specific Named Entity Extraction for Modeling and Populating Ontologies

Named Entity Extraction (NEE) is the process of identifying entities in texts and, very commonly, linking them to related (Web) resources. This task is useful in several applications, e.g. for question answering, annotating documents, post-processing of search results, etc. However, existing NEE tools lack an open or easy configuration although this is very important for building domain-specific applications. For example, supporting a new category of entities, or specifying how to link the detected entities with online resources, is either impossible or very laborious. In this paper, we show how we can exploit semantic information (Linked Data) at real-time for configuring (handily) a NEE system and we propose a generic model for configuring such services. To explicitly define the semantics of the proposed model, we introduce an RDF/S vocabulary, called “Open NEE Configuration Model”, which allows a NEE service to describe (and publish as Linked Data) its entity mining capabilities, but also to be dynamically configured. To allow relating the output of a NEE process with an applied configuration, we propose an extension of the Open Annotation Data Model which also enables an application to run advanced queries over the annotated data. As a proof of concept, we present X-Link, a fully-configurable NEE framework that realizes this approach. Contrary to the existing tools, X-Link allows the user to easily define the categories of entities that are interesting for the application at hand by exploiting one or more semantic Knowledge Bases. The user is also able to update a category and specify how to semantically link and enrich the identified entities. This enhanced configurability allows X-Link to be easily configured for different contexts for building domain-specific applications. To test the approach, we conducted a task-based evaluation with users that demonstrates its usability, and a case study that demonstrates its feasibility.

Download Full-text

Tibetan–Chinese named entity extraction based on Wikipedia

Advances in Computer Science and Technology ◽

10.2495/iccst140531 ◽

2014 ◽

Author(s):

Y. Sun ◽

Q. Zhao

Keyword(s):

Entity Extraction ◽

Named Entity ◽

Named Entity Extraction

Download Full-text

Improved named entity translation and bilingual named entity extraction

Proceedings. Fourth IEEE International Conference on Multimodal Interfaces ◽

10.1109/icmi.2002.1167002 ◽

2003 ◽

Cited By ~ 11

Author(s):

Fei Huang ◽

S. Vogel

Keyword(s):

Entity Extraction ◽

Named Entity ◽

Named Entity Extraction

Download Full-text

Named entity extraction and disambiguation for informal text : the missing link

10.3990/1.9789036536479 ◽

2014 ◽

Author(s):

Mena Badieh Habib Morgan

Keyword(s):

Entity Extraction ◽

Missing Link ◽

Named Entity ◽

Named Entity Extraction

Download Full-text

A Named Entity Extraction using Word Information Repeatedly Collected from Unlabeled Data

Computational Linguistics and Intelligent Text Processing - Lecture Notes in Computer Science ◽

10.1007/978-3-642-12116-6_18 ◽

2010 ◽

pp. 212-223

Author(s):

Tomoya Iwakura

Keyword(s):

Unlabeled Data ◽

Entity Extraction ◽

Named Entity ◽

Named Entity Extraction

Download Full-text

Wide–Coverage Spanish Named Entity Extraction

Advances in Artificial Intelligence — IBERAMIA 2002 - Lecture Notes in Computer Science ◽

10.1007/3-540-36131-6_69 ◽

2002 ◽

pp. 674-683 ◽

Cited By ~ 1

Author(s):

Xavier Carreras ◽

Lluís Màrquez ◽

Lluís Padró

Keyword(s):

Entity Extraction ◽

Named Entity ◽

Named Entity Extraction

Download Full-text

An Approach to Named Entity Extraction from Mongolian Historical Documents

2015 International Conference on Culture and Computing (Culture Computing) ◽

10.1109/culture.and.computing.2015.41 ◽

2015 ◽

Cited By ~ 2

Author(s):

Biligsaikhan Batjargal ◽

Garmaabazar Khaltarkhuu ◽

Akira Maeda

Keyword(s):

Historical Documents ◽

Entity Extraction ◽

Named Entity ◽

Named Entity Extraction

Download Full-text

Named Entity Extraction for Knowledge Graphs: A Literature Overview

IEEE Access ◽

10.1109/access.2020.2973928 ◽

2020 ◽

Vol 8 ◽

pp. 32862-32881 ◽

Cited By ~ 2

Author(s):

Tareq Al-Moslmi ◽

Marc Gallofre Ocana ◽

Andreas L. Opdahl ◽

Csaba Veres

Keyword(s):

Entity Extraction ◽

Named Entity ◽

Named Entity Extraction ◽

Literature Overview ◽

Knowledge Graphs

Download Full-text

Unsupervised named-entity extraction from the Web: An experimental study

Artificial Intelligence ◽

10.1016/j.artint.2005.03.001 ◽

2005 ◽

Vol 165 (1) ◽

pp. 91-134 ◽

Cited By ~ 408

Author(s):

Oren Etzioni ◽

Michael Cafarella ◽

Doug Downey ◽

Ana-Maria Popescu ◽

Tal Shaked ◽

...

Keyword(s):

Experimental Study ◽

Entity Extraction ◽

Named Entity ◽

Named Entity Extraction ◽

The Web

Download Full-text

The Study on Enlarging Specific Extractor for Technology-Related Named Entity Extraction from Text Collections of Applied Mechanics Field

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.145.451 ◽

2011 ◽

Vol 145 ◽

pp. 451-454

Author(s):

Han Gi Kim ◽

Kuk Jin Bae ◽

Eun Sun Kim ◽

Hyuk Hahn

Keyword(s):

Planning Process ◽

General Term ◽

Entity Extraction ◽

Machinery Industry ◽

Named Entity ◽

Named Entity Extraction ◽

Term Extraction ◽

Information Research ◽

New Business ◽

Machine Industry

This paper presents additional linguistic factors that should be considered to more effectively extract terms from the machinery industry documents by augmenting the general extraction patterns. We expand on the general term extraction patterns with patterns that are tailored for machinery industry documents to improve precision and recall. We establish a theoretical basis for developing a system to support information research in the machinery industry. Using this system, we expect to increase the efficiency of new business planning process in the machine industry.

Download Full-text