An Experimental Study of Hybrid Machine Learning Models for Extracting Named Entities

10.29007/dp5m ◽  
2019 ◽  
Author(s):  
Lei Jiang ◽  
Elena Bolshakova

The paper describes two hybrid neural network models for named entity recognition (NER) in texts, together with the results of experiments with them. The first model, Bi-LSTM-CRF, is well known and widely used for NER, while the other, named Gated-CNN-CRF, is proposed in this work. It combines a convolutional neural network (CNN), gated linear units, and conditional random fields (CRF). Both models were tested for NER on datasets in three different languages: English, Russian, and Chinese. All resulting precision, recall, and F1-measure scores for both models are close to the state of the art for NER, and on the English CoNLL-2003 dataset the Gated-CNN-CRF model achieves an F1-measure of 92.66, outperforming the previously reported result.
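The gated linear unit at the core of a Gated-CNN-CRF-style model can be sketched as follows. This is a minimal NumPy illustration, not the authors' implementation; the matrix names `W`, `V` and the toy dimensions are assumptions:

```python
import numpy as np

def glu(X, W, b, V, c):
    """Gated linear unit: (XW + b) scaled elementwise by sigmoid(XV + c).

    The sigmoid gate decides how much of each linear feature passes through,
    which is what lets a convolutional stack model long spans without recurrence.
    """
    gate = 1.0 / (1.0 + np.exp(-(X @ V + c)))  # gate values in (0, 1)
    return (X @ W + b) * gate

# Toy example: 4 token positions, 3 input features, 2 output features.
rng = np.random.default_rng(0)
X = rng.standard_normal((4, 3))
W, V = rng.standard_normal((3, 2)), rng.standard_normal((3, 2))
b, c = np.zeros(2), np.zeros(2)
out = glu(X, W, b, V, c)
print(out.shape)  # (4, 2)
```

Because the gate lies strictly between 0 and 1, each output element is a damped copy of the corresponding linear feature.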

2021 ◽  
Vol 11 (15) ◽  
pp. 7026
Author(s):  
Jangwon Lee ◽  
Jungi Lee  ◽  
Minho Lee  ◽  
Gil-Jin Jang

Neural machine translation (NMT) methods based on various artificial neural network models have shown remarkable performance in diverse tasks and are currently the mainstream approach to machine translation. Despite the recent successes of NMT applications, a predefined vocabulary is still required, meaning that such systems cannot cope with out-of-vocabulary (OOV) or rarely occurring words. In this paper, we propose a postprocessing method for correcting machine translation outputs using a named entity recognition (NER) model to overcome the problem of OOV words in NMT tasks. We use attention alignment mapping (AAM) between the named entities of the input and output sentences, and mistranslated named entities are corrected using word look-up tables. The proposed method corrects named entities only, so it does not require retraining of existing NMT models. We carried out translation experiments on a Chinese-to-Korean translation task for Korean historical documents, and the evaluation results demonstrate that the proposed method improves the bilingual evaluation understudy (BLEU) score by 3.70 over the baseline.
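The correction step described above can be sketched as follows. This is an illustrative simplification: the attention-derived alignments and the NER outputs are assumed to be given as a plain list of pairs, and the function name and data shapes are invented for the sketch:

```python
def correct_entities(output_tokens, aligned_pairs, lookup):
    """Replace mistranslated named entities in an NMT output sentence.

    aligned_pairs: list of (src_entity, out_index) pairs, standing in for the
    attention alignment mapping between source and output named entities.
    lookup: word look-up table mapping a source entity to its correct
    target-language form.
    """
    corrected = list(output_tokens)
    for src_entity, out_index in aligned_pairs:
        if src_entity in lookup and corrected[out_index] != lookup[src_entity]:
            corrected[out_index] = lookup[src_entity]  # fix only named entities
    return corrected

# Toy example: the model mistranslated the person name at output position 0.
output = ["Lee", "visited", "Hanyang"]
pairs = [("李舜臣", 0)]               # source NE aligned to output slot 0
table = {"李舜臣": "Yi Sun-sin"}      # dictionary of known entity translations
print(correct_entities(output, pairs, table))  # ['Yi Sun-sin', 'visited', 'Hanyang']
```

Only aligned entity slots are touched, which is why no retraining of the underlying NMT model is needed.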


Information ◽  
2020 ◽  
Vol 11 (1) ◽  
pp. 30
Author(s):  
Xieraili Seti ◽  
Aishan Wumaier ◽  
Turgen Yibulayin ◽  
Diliyaer Paerhati ◽  
Lulu Wang ◽  
...  

Traditional named-entity recognition methods ignore the correlation between named entities and lose the hierarchical structural information among the named entities in a given text. Although traditional named-entity methods are effective for conventional datasets with simple structures, they are less effective for sports texts. This paper proposes a named-entity recognition method for Chinese sports text based on a character graph convolutional neural network (Char GCN) with a self-attention mechanism. In this method, each Chinese character in the sports text is regarded as a node. The edges between nodes are constructed from similar character positions and the character features of the named entities in the sports text. The internal structural information of each entity is extracted with the character graph convolutional network. The hierarchical semantic information of the sports text is captured by the self-attention model, which strengthens the relationships between named entities and captures the relevance and dependency between characters. A conditional random fields classification function then accurately identifies the named entities in the Chinese sports text. Experiments conducted on four datasets demonstrate that the proposed method significantly improves the F-Score values to 92.51%, 91.91%, 93.98%, and 95.01%, respectively, in comparison with traditional named-entity recognition methods.
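A single graph-convolution step over a character graph can be sketched as below. The adjacency here links adjacent character positions, which is only a stand-in for the paper's edge construction; the normalization follows the common symmetric GCN form, which the paper may or may not use:

```python
import numpy as np

def gcn_layer(A, H, W):
    """One graph-convolution step: H' = ReLU(D^-1/2 (A+I) D^-1/2 H W)."""
    A_hat = A + np.eye(A.shape[0])            # add self-loops
    d = A_hat.sum(axis=1)
    D_inv_sqrt = np.diag(1.0 / np.sqrt(d))    # symmetric degree normalization
    return np.maximum(0.0, D_inv_sqrt @ A_hat @ D_inv_sqrt @ H @ W)

# Toy character graph: 4 characters, edges between adjacent positions.
A = np.zeros((4, 4))
for i in range(3):
    A[i, i + 1] = A[i + 1, i] = 1.0
rng = np.random.default_rng(1)
H = rng.standard_normal((4, 5))   # initial character embeddings
W = rng.standard_normal((5, 3))   # layer weights
H_next = gcn_layer(A, H, W)
print(H_next.shape)  # (4, 3)
```

Each character's new representation mixes in its neighbors' features, which is how structural information between characters propagates.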


Author(s):  
Zeyu Dai ◽  
Hongliang Fei ◽  
Ping Li

Recent neural network models have achieved state-of-the-art performance on the task of named entity recognition (NER). However, previous neural network models typically treat the input sentences as a linear sequence of words, ignoring rich structural information such as the coreference relations among non-adjacent words, phrases, or entities. In this paper, we propose a novel approach to learn coreference-aware word representations for the NER task at the document level. In particular, we enrich the well-known "CNN-BiLSTM-CRF" neural architecture with a coreference layer on top of the BiLSTM layer to incorporate coreferential relations. Furthermore, we introduce a coreference regularization that encourages coreferential entities to share similar representations and consistent predictions within the same coreference cluster. Our proposed model achieves new state-of-the-art performance on two NER benchmarks: CoNLL-2003 and OntoNotes v5.0. More importantly, we demonstrate that our framework does not rely on gold coreference knowledge and still works well even when the coreferential relations are generated by a third-party toolkit.
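One simple way to realize a coreference regularizer of the kind described is to penalize each mention representation's distance from its cluster mean. The L2-to-centroid form below is an assumption for illustration; the paper's exact formulation may differ:

```python
import numpy as np

def coref_regularizer(reps, clusters):
    """Penalize distance of each entity representation from its cluster mean.

    reps: (num_mentions, dim) array of mention representations.
    clusters: list of index lists, one per coreference cluster.
    """
    loss = 0.0
    for idxs in clusters:
        centroid = reps[idxs].mean(axis=0)
        loss += ((reps[idxs] - centroid) ** 2).sum()  # pull members together
    return loss

reps = np.array([[1.0, 0.0], [1.0, 0.0], [0.0, 4.0]])
# Mentions 0 and 1 corefer; mention 2 forms a singleton cluster.
print(coref_regularizer(reps, [[0, 1], [2]]))  # 0.0: cluster members identical
```

Added to the tagging loss, this term drives coreferential mentions toward shared representations, and hence toward consistent predictions.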


2019 ◽  
Vol 9 (19) ◽  
pp. 3945 ◽  
Author(s):  
Houssem Gasmi ◽  
Jannik Laval ◽  
Abdelaziz Bouras

Extracting cybersecurity entities and the relationships between them from online textual resources such as articles, bulletins, and blogs, and converting these resources into more structured and formal representations, has important applications in cybersecurity research and is valuable for professional practitioners. Previous work on this task mainly relied on feature-based models. Feature-based models are time-consuming and need labor-intensive feature engineering to describe the properties of entities, domain knowledge, entity context, and linguistic characteristics. Therefore, to alleviate the need for feature engineering, we propose the use of neural network models, specifically long short-term memory (LSTM) models, to accomplish the tasks of Named Entity Recognition (NER) and Relation Extraction (RE). We evaluated the proposed models on two tasks. The first task is performing NER and evaluating the results against the state-of-the-art Conditional Random Fields (CRF) method. The second task is performing RE using three LSTM models and comparing their results to assess which model is more suitable for the cybersecurity domain. The proposed models achieved competitive performance with less feature-engineering work. We demonstrate that exploiting neural network models in cybersecurity text mining is effective and practical.
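The LSTM unit that replaces hand-built features in approaches like this reads tokens one at a time through learned gates. A single cell step can be sketched in NumPy as follows (a generic textbook LSTM, not the paper's specific architecture; weight shapes and gate ordering are conventions chosen for the sketch):

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def lstm_step(x, h, c, W, U, b):
    """One LSTM step; the four gates are slices of one stacked projection.

    W: (4*hidden, input), U: (4*hidden, hidden), b: (4*hidden,)
    Stacked gate order used here: input, forget, output, candidate.
    """
    z = W @ x + U @ h + b
    H = h.shape[0]
    i, f, o = sigmoid(z[:H]), sigmoid(z[H:2 * H]), sigmoid(z[2 * H:3 * H])
    g = np.tanh(z[3 * H:])
    c_new = f * c + i * g          # update the memory cell
    h_new = o * np.tanh(c_new)     # expose a gated view of the cell
    return h_new, c_new

rng = np.random.default_rng(2)
x = rng.standard_normal(6)        # one token embedding
h = c = np.zeros(4)
W = rng.standard_normal((16, 6))
U = rng.standard_normal((16, 4))
b = np.zeros(16)
h, c = lstm_step(x, h, c, W, U, b)
print(h.shape)  # (4,)
```

The hidden state `h` produced at each token is what a downstream NER or RE classifier consumes, in place of hand-engineered features.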


2019 ◽  
Vol 8 (2) ◽  
pp. 4211-4216

One of the important tasks of Natural Language Processing (NLP) is Named Entity Recognition (NER). The primary operation of NER is to identify proper nouns, i.e., to locate all the named entities in a text and tag them with named-entity categories such as Entity, Time expression, and Numeric expression. Previous work on NER for the Telugu language used Conditional Random Fields (CRF) and Maximum Entropy models; however, these failed to handle ambiguous named-entity tags for the same named entity. This paper presents a hybrid statistical system for Named Entity Recognition in Telugu in which named entities are identified both by a dictionary-based approach and by a statistical Hidden Markov Model (HMM). The proposed method uses a lexicon-lookup dictionary and contexts based on semantic features for predicting named-entity tags. An HMM is then used to resolve ambiguities in the predicted named-entity tags. The present work reports an average accuracy of 86.3% for finding the named entities.
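The HMM disambiguation step can be sketched with standard Viterbi decoding: when the dictionary offers several possible tags for a word, the tag sequence with the highest joint probability under the HMM wins. All probabilities and the tag set below are toy values invented for illustration, not the paper's model:

```python
def viterbi(obs, states, start_p, trans_p, emit_p):
    """Most likely tag sequence for obs under an HMM (toy, no log-space)."""
    V = [{s: start_p[s] * emit_p[s].get(obs[0], 1e-9) for s in states}]
    back = [{}]
    for t in range(1, len(obs)):
        V.append({}); back.append({})
        for s in states:
            prob, prev = max(
                (V[t - 1][p] * trans_p[p][s] * emit_p[s].get(obs[t], 1e-9), p)
                for p in states)
            V[t][s], back[t][s] = prob, prev
    # Trace the best path back from the final column.
    last = max(V[-1], key=V[-1].get)
    path = [last]
    for t in range(len(obs) - 1, 0, -1):
        path.append(back[t][path[-1]])
    return path[::-1]

# Toy ambiguity: "Krishna" may be a PERSON or a LOCATION; context decides.
states = ["PERSON", "LOCATION", "O"]
start = {"PERSON": 0.4, "LOCATION": 0.3, "O": 0.3}
trans = {s: {t: 1 / 3 for t in states} for s in states}
trans["O"] = {"LOCATION": 0.6, "PERSON": 0.2, "O": 0.2}
emit = {
    "PERSON":   {"Krishna": 0.5},
    "LOCATION": {"Krishna": 0.5, "river": 0.3},
    "O":        {"the": 0.9, "near": 0.8},
}
print(viterbi(["near", "Krishna"], states, start, trans, emit))  # ['O', 'LOCATION']
```

Here the transition probability from a non-entity word toward LOCATION outweighs PERSON, so the ambiguous tag is resolved by context, as in the proposed hybrid system.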


2020 ◽  
Author(s):  
Xie-Yuan Xie

Abstract Named Entity Recognition (NER) is a key task that automatically extracts Named Entities (NEs) from text. Names of persons, places, dates, and times are examples of NEs. We apply Conditional Random Fields (CRFs) to NER in the biomedical domain, where examples of NEs are genes and proteins. We used a minimal set of features to train the CRF algorithm and obtained good results on biomedical texts.
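A "minimal set of features" for a CRF tagger is typically a small per-token feature dictionary, in the style used by common CRF toolkits. The specific features below are an illustrative guess, not the paper's feature set:

```python
def word_features(sent, i):
    """Minimal per-token feature dict for a CRF tagger (illustrative set)."""
    w = sent[i]
    feats = {
        "word.lower": w.lower(),
        "word.isupper": w.isupper(),   # all-caps tokens (gene symbols, acronyms)
        "word.istitle": w.istitle(),
        "word.isdigit": w.isdigit(),
        "suffix3": w[-3:],             # crude morphology signal
    }
    # Context features: the neighboring surface forms.
    feats["prev.lower"] = sent[i - 1].lower() if i > 0 else "<BOS>"
    feats["next.lower"] = sent[i + 1].lower() if i < len(sent) - 1 else "<EOS>"
    return feats

sent = ["BRCA1", "encodes", "a", "protein"]
feats = word_features(sent, 0)
print(feats["word.isupper"], feats["next.lower"])  # True encodes
```

The CRF then learns weights over these feature-tag combinations jointly with tag-to-tag transitions, so even a small feature set can perform well.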

