Domain Specific Named Entity Extraction for Modeling and Populating Ontologies

Author(s):  
B. Damayanthi Jesudas ◽  
B. Gurumoorthy
2015 ◽  
Vol 24 (02) ◽  
pp. 1540012 ◽  
Author(s):  
Pavlos Fafalios ◽  
Manolis Baritakis ◽  
Yannis Tzitzikas

Named Entity Extraction (NEE) is the process of identifying entities in texts and, very commonly, linking them to related (Web) resources. This task is useful in several applications, e.g. for question answering, annotating documents, post-processing of search results, etc. However, existing NEE tools lack an open or easy configuration although this is very important for building domain-specific applications. For example, supporting a new category of entities, or specifying how to link the detected entities with online resources, is either impossible or very laborious. In this paper, we show how we can exploit semantic information (Linked Data) at real-time for configuring (handily) a NEE system and we propose a generic model for configuring such services. To explicitly define the semantics of the proposed model, we introduce an RDF/S vocabulary, called “Open NEE Configuration Model”, which allows a NEE service to describe (and publish as Linked Data) its entity mining capabilities, but also to be dynamically configured. To allow relating the output of a NEE process with an applied configuration, we propose an extension of the Open Annotation Data Model which also enables an application to run advanced queries over the annotated data. As a proof of concept, we present X-Link, a fully-configurable NEE framework that realizes this approach. Contrary to the existing tools, X-Link allows the user to easily define the categories of entities that are interesting for the application at hand by exploiting one or more semantic Knowledge Bases. The user is also able to update a category and specify how to semantically link and enrich the identified entities. This enhanced configurability allows X-Link to be easily configured for different contexts for building domain-specific applications. To test the approach, we conducted a task-based evaluation with users that demonstrates its usability, and a case study that demonstrates its feasibility.


IEEE Access ◽  
2020 ◽  
Vol 8 ◽  
pp. 32862-32881 ◽  
Author(s):  
Tareq Al-Moslmi ◽  
Marc Gallofre Ocana ◽  
Andreas L. Opdahl ◽  
Csaba Veres

2005 ◽  
Vol 165 (1) ◽  
pp. 91-134 ◽  
Author(s):  
Oren Etzioni ◽  
Michael Cafarella ◽  
Doug Downey ◽  
Ana-Maria Popescu ◽  
Tal Shaked ◽  
...  

2011 ◽  
Vol 145 ◽  
pp. 451-454
Author(s):  
Han Gi Kim ◽  
Kuk Jin Bae ◽  
Eun Sun Kim ◽  
Hyuk Hahn

This paper presents additional linguistic factors that should be considered to more effectively extract terms from the machinery industry documents by augmenting the general extraction patterns. We expand on the general term extraction patterns with patterns that are tailored for machinery industry documents to improve precision and recall. We establish a theoretical basis for developing a system to support information research in the machinery industry. Using this system, we expect to increase the efficiency of new business planning process in the machine industry.


Sign in / Sign up

Export Citation Format

Share Document