Understanding Horizon 2020 Data: A Knowledge Graph-Based Approach

This paper aims to meaningfully analyse the Horizon 2020 data existing in the CORDIS repository of EU, and accordingly offer evidence and insights to aid organizations in the formulation of consortia that will prepare and submit winning research proposals to forthcoming calls. The analysis is performed on aggregated data concerning 32,090 funded projects, 34,295 organizations participated in them, and 87,067 public deliverables produced. The modelling of data is performed through a knowledge graph-based approach, aiming to semantically capture existing relationships and reveal hidden information. The main contribution of this work lies in the proper utilization and orchestration of keyphrase extraction and named entity recognition models, together with meaningful graph analytics on top of an efficient graph database. The proposed approach enables users to ask complex questions about the interconnection of various entities related to previously funded research projects. A set of representative queries demonstrating our data representation and analysis approach are given at the end of the paper.

Download Full-text

Building the Knowledge Graph for Zakat (KGZ) in Indonesian Language

ASM Science Journal ◽

10.32802/asmscj.2021.758 ◽

2021 ◽

Vol 16 ◽

pp. 1-10

Author(s):

Husni Teja Sukmana ◽

JM Muslimin ◽

Asep Fajar Firmansyah ◽

Lee Kyung Oh

Keyword(s):

Named Entity Recognition ◽

General Purpose ◽

Entity Recognition ◽

Basic Knowledge ◽

Knowledge Graph ◽

Specific Domain ◽

Named Entity ◽

Description Framework ◽

General Source ◽

Domain Information

In Indonesia, philanthropy is identical to Zakat. Zakat belongs to a specific domain because it has its characteristics of knowledge. This research studied knowledge graph in the Zakat domain called KGZ which is conducted in Indonesia. This area is still rarely performed, thus it becomes the first knowledge graph for Zakat in Indonesia. It is designed to provide basic knowledge on Zakat and managing the Zakat in Indonesia. There are some issues with building KGZ, firstly, the existing Indonesian named entity recognition (NER) is non-restricted and general-purpose based which data is obtained from a general source like news. Second, there is no dataset for NER in the Zakat domain. We define four steps to build KGZ, involving data acquisition, extracting entities and their relationship, mapping to ontology, and deploying knowledge graphs and visualizations. This research contributed a knowledge graph for Zakat (KGZ) and a building NER model for Zakat, called KGZ-NER. We defined 17 new named entity classes related to Zakat with 272 entities, 169 relationships and provided labelled datasets for KGZ-NER that are publicly accessible. We applied the Indonesian-Open Domain Information Extractor framework to process identifying entities’ relationships. Then designed modeling of information using resources description framework (RDF) to build the knowledge base for KGZ and store it to GraphDB, a product from Ontotext. This NER model has a precision 0.7641, recall 0.4544, and F1-score 0.5655. The increasing data size of KGZ is required to discover all of the knowledge of Zakat and managing Zakat in Indonesia. Moreover, sufficient resources are required in future works.

Download Full-text

Deep Learning-Based Named Entity Recognition and Knowledge Graph Construction for Geological Hazards

ISPRS International Journal of Geo-Information ◽

10.3390/ijgi9010015 ◽

2019 ◽

Vol 9 (1) ◽

pp. 15 ◽

Cited By ~ 2

Author(s):

Runyu Fan ◽

Lizhe Wang ◽

Jining Yan ◽

Weijing Song ◽

Yingqian Zhu ◽

...

Keyword(s):

Deep Learning ◽

Large Scale ◽

Conditional Random Field ◽

Named Entity Recognition ◽

Entity Recognition ◽

Knowledge Graph ◽

Geological Hazard ◽

Geological Hazards ◽

Named Entity ◽

Corpus Construction

Constructing a knowledge graph of geological hazards literature can facilitate the reuse of geological hazards literature and provide a reference for geological hazard governance. Named entity recognition (NER), as a core technology for constructing a geological hazard knowledge graph, has to face the challenges that named entities in geological hazard literature are diverse in form, ambiguous in semantics, and uncertain in context. This can introduce difficulties in designing practical features during the NER classification. To address the above problem, this paper proposes a deep learning-based NER model; namely, the deep, multi-branch BiGRU-CRF model, which combines a multi-branch bidirectional gated recurrent unit (BiGRU) layer and a conditional random field (CRF) model. In an end-to-end and supervised process, the proposed model automatically learns and transforms features by a multi-branch bidirectional GRU layer and enhances the output with a CRF layer. Besides the deep, multi-branch BiGRU-CRF model, we also proposed a pattern-based corpus construction method to construct the corpus needed for the deep, multi-branch BiGRU-CRF model. Experimental results indicated the proposed deep, multi-branch BiGRU-CRF model outperformed state-of-the-art models. The proposed deep, multi-branch BiGRU-CRF model constructed a large-scale geological hazard literature knowledge graph containing 34,457 entities nodes and 84,561 relations.

Download Full-text

Design and Evaluation of a Prescription Drug Monitoring Program for Chinese Patent Medicine based on Knowledge Graph

Evidence-based Complementary and Alternative Medicine ◽

10.1155/2021/9970063 ◽

2021 ◽

Vol 2021 ◽

pp. 1-8

Author(s):

Wangping Xiong ◽

Jun Cao ◽

Xian Zhou ◽

Jianqiang Du ◽

Bin Nie ◽

...

Keyword(s):

Prescription Drug ◽

Drug Monitoring ◽

Named Entity Recognition ◽

Monitoring Program ◽

Entity Recognition ◽

Knowledge Graph ◽

Named Entity ◽

Patent Medicines ◽

Prescription Drug Monitoring Program ◽

Chinese Patent

Background. Chinese patent medicines are increasingly used clinically, and the prescription drug monitoring program is an effective tool to promote drug safety and maintain health. Methods. We constructed a prescription drug monitoring program for Chinese patent medicines based on knowledge graphs. First, we extracted the key information of Chinese patent medicines, diseases, and symptoms from the domain-specific corpus by the information extraction. Second, based on the extracted entities and relationships, a knowledge graph was constructed to form a rule base for the monitoring of data. Then, the named entity recognition model extracted the key information from the electronic medical record to be monitored and matched the knowledge graph to realize the monitoring of the Chinese patent medicines in the prescription. Results. Named entity recognition based on the pretrained model achieved an F1 value of 83.3% on the Chinese patent medicines dataset. On the basis of entity recognition technology and knowledge graph, we implemented a prescription drug monitoring program for Chinese patent medicines. The accuracy rate of combined medication monitoring of three or more drugs of the program increased from 68% to 86.4%. The accuracy rate of drug control monitoring increased from 70% to 97%. The response time for conflicting prescriptions with two drugs was shortened from 1.3S to 0.8S. The response time for conflicting prescriptions with three or more drugs was shortened from 5.2S to 1.4S. Conclusions. The program constructed in this study can respond quickly and improve the efficiency of monitoring prescriptions. It is of great significance to ensure the safety of patients’ medication.

Download Full-text

DeNERT-KG: Named Entity and Relation Extraction Model Using DQN, Knowledge Graph, and BERT

Applied Sciences ◽

10.3390/app10186429 ◽

2020 ◽

Vol 10 (18) ◽

pp. 6429

Author(s):

SungMin Yang ◽

SoYeop Yoo ◽

OkRan Jeong

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Language Model ◽

Named Entity Recognition ◽

Relation Extraction ◽

Entity Recognition ◽

Knowledge Graph ◽

Named Entity ◽

Artificial Intelligence Technology

Along with studies on artificial intelligence technology, research is also being carried out actively in the field of natural language processing to understand and process people’s language, in other words, natural language. For computers to learn on their own, the skill of understanding natural language is very important. There are a wide variety of tasks involved in the field of natural language processing, but we would like to focus on the named entity registration and relation extraction task, which is considered to be the most important in understanding sentences. We propose DeNERT-KG, a model that can extract subject, object, and relationships, to grasp the meaning inherent in a sentence. Based on the BERT language model and Deep Q-Network, the named entity recognition (NER) model for extracting subject and object is established, and a knowledge graph is applied for relation extraction. Using the DeNERT-KG model, it is possible to extract the subject, type of subject, object, type of object, and relationship from a sentence, and verify this model through experiments.

Download Full-text

Named Entity Recognition in Traditional Chinese Medicine Clinical Cases Combining BiLSTM-CRF with Knowledge Graph

Knowledge Science, Engineering and Management - Lecture Notes in Computer Science ◽

10.1007/978-3-030-29551-6_48 ◽

2019 ◽

pp. 537-548 ◽

Cited By ~ 2

Author(s):

Zhe Jin ◽

Yin Zhang ◽

Haodan Kuang ◽

Liang Yao ◽

Wenjin Zhang ◽

...

Keyword(s):

Chinese Medicine ◽

Traditional Chinese Medicine ◽

Named Entity Recognition ◽

Entity Recognition ◽

Knowledge Graph ◽

Named Entity ◽

Clinical Cases

Download Full-text

DOZEN: Cross-Domain Zero Shot Named Entity Recognition with Knowledge Graph

Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval ◽

10.1145/3404835.3463113 ◽

2021 ◽

Author(s):

Hoang-Van Nguyen ◽

Francesco Gelli ◽

Soujanya Poria

Keyword(s):

Named Entity Recognition ◽

Entity Recognition ◽

Knowledge Graph ◽

Named Entity ◽

Cross Domain

Download Full-text

Chinese Named Entity Recognition Method in History and Culture Field Based on BERT

International Journal of Computational Intelligence Systems ◽

10.1007/s44196-021-00019-8 ◽

2021 ◽

Vol 14 (1) ◽

Author(s):

Shuang Liu ◽

Hui Yang ◽

Jiayi Li ◽

Simon Kolmanič

Keyword(s):

Short Term Memory ◽

Conditional Random Field ◽

Language Model ◽

Named Entity Recognition ◽

Entity Recognition ◽

Knowledge Graph ◽

Recognition Method ◽

Short Term ◽

Named Entity ◽

Long Short Term Memory

AbstractWith rapid development of the Internet, people have undergone tremendous changes in the way they obtain information. In recent years, knowledge graph is becoming a popular tool for the public to acquire knowledge. For knowledge graph of Chinese history and culture, most researchers adopted traditional named entity recognition methods to extract entity information from unstructured historical text data. However, the traditional named entity recognition method has certain defects, and it is easy to ignore the association between entities. To extract entities from a large amount of historical and cultural information more accurately and efficiently, this paper proposes one named entity recognition model combining Bidirectional Encoder Representations from Transformers and Bidirectional Long Short-Term Memory-Conditional Random Field (BERT-BiLSTM-CRF). First, a BERT pre-trained language model is used to encode a single character to obtain a vector representation corresponding to each character. Then one Bidirectional Long Short-Term Memory (BiLSTM) layer is applied to semantically encode the input text. Finally, the label with the highest probability is output through the Conditional Random Field (CRF) layer to obtain each character’s category. This model uses the Bidirectional Encoder Representations from Transformers (BERT) pre-trained language model to replace the static word vectors trained in the traditional way. In comparison, the BERT pre-trained language model can dynamically generate semantic vectors according to the context of words, which improves the representation ability of word vectors. The experimental results prove that the model proposed in this paper has achieved excellent results in the task of named entity recognition in the field of historical culture. Compared with the existing named entity identification methods, the precision rate, recall rate, and $$F_1$$ F 1 value have been significantly improved.

Download Full-text

A Knowledge Graph System for the Maintenance of Coal Mine Equipment

Mathematical Problems in Engineering ◽

10.1155/2021/2866751 ◽

2021 ◽

Vol 2021 ◽

pp. 1-13

Author(s):

Guozhen Zhang ◽

Xiangang Cao ◽

Mengyuan Zhang

Keyword(s):

Knowledge Management ◽

Coal Mine ◽

Large Scale ◽

Named Entity Recognition ◽

Entity Recognition ◽

Knowledge Graph ◽

Equipment Maintenance ◽

Data Set ◽

Named Entity ◽

Graph System

With the rapid development of coal mine intelligent technology, the complexity of coal mine equipment has been continuously improved and the equipment maintenance resources have been continuously enriched. The traditional coal mine equipment maintenance knowledge management technology can no longer meet the current needs of equipment maintenance knowledge management, and the problems of low utilization rate, poor interoperability, and serious loss of knowledge have gradually emerged. It is urgent to study new knowledge system construction and knowledge management application technology for large-scale coal mine equipment maintenance resources. Knowledge graph is a technical method to describe the relationship between things in the objective world by using a graph model. It can effectively solve the problem of knowledge dynamic mining and management under large-scale data. Therefore, this paper focuses on the establishment of a coal mine equipment maintenance knowledge graph system by using knowledge graph technology. The main research contents are as follows: Firstly, based on the current situation that there is no unified basic knowledge system in the field of coal mine equipment maintenance, this paper establishes the coal mine equipment maintenance ontology (CMEMO) to effectively solve the problem that there are no unified representation, integration, and sharing of coal mine equipment maintenance knowledge in this field and provide support for the construction of coal mine equipment maintenance knowledge graph. Then, aiming at the problem that the traditional named-entity recognition method has a poor recognition effect and relies too much on artificial feature design, this paper proposes a named-entity recognition model for coal mine equipment maintenance based on neural network (BERT-BiLSTM-CRF) and applies the model to the coal mine equipment maintenance data set for verification. The experimental results show that, under the same data set, the entity recognition effect of this model is more leading than that of other models. Finally, through demand analysis and architecture design, combined with the constructed ontology model of coal mine equipment maintenance field, the entity identification of coal mine equipment maintenance is completed based on the BERT-BiLSTM-CRF model and the Django application framework is used to build the coal mine equipment maintenance knowledge graph system to realize the functions of each module of the knowledge graph system.

Download Full-text

Research on Named Entity Recognition Technology of Knowledge Graph for Flipped Classroom

2021 4th International Conference on Artificial Intelligence and Big Data (ICAIBD) ◽

10.1109/icaibd51990.2021.9459080 ◽

2021 ◽

Author(s):

Yifeng Li ◽

Yuan Tan ◽

Ming Zhou ◽

Guangjun Zeng ◽

Zhe Chen

Keyword(s):

Flipped Classroom ◽

Named Entity Recognition ◽

Entity Recognition ◽

Knowledge Graph ◽

Named Entity

Download Full-text

An Algorithm of Vocabulary Enhanced Intelligent Question Answering Based on FLAT1

10.3233/faia210460 ◽

2021 ◽

Author(s):

Jing Sheng Lei ◽

Shi Chao Ye ◽

Sheng Ying Yang ◽

Wei Song ◽

Guan Mian Liang

Keyword(s):

Question Answering ◽

Named Entity Recognition ◽

Recognition Task ◽

Recognition Algorithm ◽

Entity Recognition ◽

Knowledge Graph ◽

Question Answering System ◽

Named Entity ◽

Natural Language Question ◽

Ultimate Failure

The main purpose of the intelligent question answering system based on the knowledge graph is to accurately match the natural language question and the triple information in the knowledge graph. Among them, the entity recognition part is one of the key points. The wrong entity recognition result will cause the error to be done propagated, resulting in the ultimate failure to get the correct answer. In recent years, the lexical enhancement structure of word nodes combined with word nodes has been proved to be an effective method for Chinese named entity recognition. In order to solve the above problems, this paper proposes a vocabulary-enhanced entity recognition algorithm (KGFLAT) based on FLAT for intelligent question answering system. This method uses a new dictionary that combines the entity information of the knowledge graph, and only uses layer normalization for the removal of residual connection for the shallower network model. The system uses data provided by the NLPCC 2018 Task7 KBQA task for evaluation. The experimental results show that this method can effectively solve the entity recognition task in the intelligent question answering system and achieve the improvement of the FLAT model, and the average F1 value is 94.72

Download Full-text