scholarly journals Training set augmentation in training neural- network language model for ontology population

2021 ◽  
Vol 12 (5-2021) ◽  
pp. 22-34
Author(s):  
Pavel A. Lomov ◽  
◽  
Marina L. Malozemova ◽  

This paper is a continuation of the research focused on solving the problem of ontology population using training on an automatically generated training set and the subsequent use of a neural-network language model for analyzing texts in order to discover new concepts to add to the ontology. The article is devoted to the text data augmentation - increasing the size of the training set by modification of its samples. Along with this, a solution to the problem of clarifying concepts (i.e. adjusting their boundaries in sentences), which were found during the automatic formation of the training set, is considered. A brief overview of existing approaches to text data augmentation, as well as approaches to extracting so-called nested named entities (nested NER), is presented. A procedure is proposed for clarifying the boundaries of the discovered concepts of the training set and its augmentation for subsequent training a neural-network language model in order to identify new concepts of ontology in the domain texts. The results of the experimental evaluation of the trained model and the main directions of further research are considered.

2021 ◽  
Vol 12 (5-2021) ◽  
pp. 166-170
Author(s):  
Pavel A. Lomov ◽  
◽  
Marina L. Malozemova ◽  

The paper considers one of the subtasks of ontology learning - the ontology population, which implies the extension of existing ontology by new instances without changing the structure of its classes and relations. A brief overview of existing ontology learning approaches is presented. A highly automated technology for ontology population based on training and application of the neural-network language model to identify and extract potential instances of ontology classes from domain texts is proposed. The main stages of its application, as well as the results of its experimental evaluation and the main directions of its further improvement are considered.


2016 ◽  
Vol 140 (4) ◽  
pp. 3115-3116
Author(s):  
Aiko Hagiwara ◽  
Hitoshi Ito ◽  
Manon Ichiki ◽  
Takeshi Mishima ◽  
Akio Kobayashi ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document