Semi-Supervised Bidirectional Long Short-Term Memory and Conditional Random Fields Model for Named-Entity Recognition Using Embeddings from Language Models Representations

Increasingly, popular online museums have significantly changed the way people acquire cultural knowledge. These online museums have been generating abundant amounts of cultural relics data. In recent years, researchers have used deep learning models that can automatically extract complex features and have rich representation capabilities to implement named-entity recognition (NER). However, the lack of labeled data in the field of cultural relics makes it difficult for deep learning models that rely on labeled data to achieve excellent performance. To address this problem, this paper proposes a semi-supervised deep learning model named SCRNER (Semi-supervised model for Cultural Relics’ Named Entity Recognition) that utilizes the bidirectional long short-term memory (BiLSTM) and conditional random fields (CRF) model trained by seldom labeled data and abundant unlabeled data to attain an effective performance. To satisfy the semi-supervised sample selection, we propose a repeat-labeled (relabeled) strategy to select samples of high confidence to enlarge the training set iteratively. In addition, we use embeddings from language model (ELMo) representations to dynamically acquire word representations as the input of the model to solve the problem of the blurred boundaries of cultural objects and Chinese characteristics of texts in the field of cultural relics. Experimental results demonstrate that our proposed model, trained on limited labeled data, achieves an effective performance in the task of named entity recognition of cultural relics.

Download Full-text

Bidirectional Long Short-Term Memory (BILSTM) with Conditional Random Fields (CRF) for Knowledge Named Entity Recognition in Online Judges (OJS)

International Journal on Natural Language Computing ◽

10.5121/ijnlc.2018.7401 ◽

2018 ◽

Vol 7 (4) ◽

pp. 01-08

Author(s):

Muhammad Asif Khan ◽

Tayyab Naveed ◽

Elmaam Yagoub ◽

Guojin Zhu

Keyword(s):

Random Fields ◽

Conditional Random Fields ◽

Short Term Memory ◽

Named Entity Recognition ◽

Entity Recognition ◽

Short Term ◽

Term Memory ◽

Named Entity ◽

Long Short Term Memory

Download Full-text

Cybersecurity named entity recognition using bidirectional long short-term memory with conditional random fields

Tsinghua Science & Technology ◽

10.26599/tst.2019.9010033 ◽

2021 ◽

Vol 26 (3) ◽

pp. 259-265

Author(s):

Pingchuan Ma ◽

Bo Jiang ◽

Zhigang Lu ◽

Ning Li ◽

Zhengwei Jiang

Keyword(s):

Random Fields ◽

Conditional Random Fields ◽

Short Term Memory ◽

Named Entity Recognition ◽

Entity Recognition ◽

Short Term ◽

Term Memory ◽

Named Entity ◽

Long Short Term Memory

Download Full-text

Biomedical named entity recognition using deep neural networks with contextual information

BMC Bioinformatics ◽

10.1186/s12859-019-3321-4 ◽

2019 ◽

Vol 20 (1) ◽

Cited By ~ 8

Author(s):

Hyejin Cho ◽

Hyunju Lee

Keyword(s):

Deep Learning ◽

Short Term Memory ◽

Contextual Information ◽

Named Entity Recognition ◽

Entity Recognition ◽

Learning Approaches ◽

Short Term ◽

Term Memory ◽

Named Entity ◽

Long Short Term Memory

Abstract Background In biomedical text mining, named entity recognition (NER) is an important task used to extract information from biomedical articles. Previously proposed methods for NER are dictionary- or rule-based methods and machine learning approaches. However, these traditional approaches are heavily reliant on large-scale dictionaries, target-specific rules, or well-constructed corpora. These methods to NER have been superseded by the deep learning-based approach that is independent of hand-crafted features. However, although such methods of NER employ additional conditional random fields (CRF) to capture important correlations between neighboring labels, they often do not incorporate all the contextual information from text into the deep learning layers. Results We propose herein an NER system for biomedical entities by incorporating n-grams with bi-directional long short-term memory (BiLSTM) and CRF; this system is referred to as a contextual long short-term memory networks with CRF (CLSTM). We assess the CLSTM model on three corpora: the disease corpus of the National Center for Biotechnology Information (NCBI), the BioCreative II Gene Mention corpus (GM), and the BioCreative V Chemical Disease Relation corpus (CDR). Our framework was compared with several deep learning approaches, such as BiLSTM, BiLSTM with CRF, GRAM-CNN, and BERT. On the NCBI corpus, our model recorded an F-score of 85.68% for the NER of diseases, showing an improvement of 1.50% over previous methods. Moreover, although BERT used transfer learning by incorporating more than 2.5 billion words, our system showed similar performance with BERT with an F-scores of 81.44% for gene NER on the GM corpus and a outperformed F-score of 86.44% for the NER of chemicals and diseases on the CDR corpus. We conclude that our method significantly improves performance on biomedical NER tasks. Conclusion The proposed approach is robust in recognizing biological entities in text.

Download Full-text

Probabilistic vs deep learning based approaches for narrow domain NER in Spanish

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-179868 ◽

2020 ◽

Vol 39 (2) ◽

pp. 2015-2025

Author(s):

Orlando Ramos-Flores ◽

David Pinto ◽

Manuel Montes-y-Gómez ◽

Andrés Vázquez

Keyword(s):

Deep Learning ◽

Conditional Random Fields ◽

Short Term Memory ◽

Named Entity Recognition ◽

Training Data ◽

Entity Recognition ◽

Mexican Spanish ◽

Named Entity ◽

Long Short Term Memory ◽

Deep Learning Model

This work presents an experimental study on the task of Named Entity Recognition (NER) for a narrow domain in Spanish language. This study considers two approaches commonly used in this kind of problem, namely, a Conditional Random Fields (CRF) model and Recurrent Neural Network (RNN). For the latter, we employed a bidirectional Long Short-Term Memory with ELMO’s pre-trained word embeddings for Spanish. The comparison between the probabilistic model and the deep learning model was carried out in two collections, the Spanish dataset from CoNLL-2002 considering four classes under the IOB tagging schema, and a Mexican Spanish news dataset with seventeen classes under IOBES schema. The paper presents an analysis about the scalability, robustness, and common errors of both models. This analysis indicates in general that the BiLSTM-ELMo model is more suitable than the CRF model when there is “enough” training data, and also that it is more scalable, as its performance was not significantly affected in the incremental experiments (by adding one class at a time). On the other hand, results indicate that the CRF model is more adequate for scenarios having small training datasets and many classes.

Download Full-text

Attention-Based Bidirectional Long Short-Term Memory Networks for Chinese Named Entity Recognition

Proceedings of the 2019 4th International Conference on Machine Learning Technologies - ICMLT 2019 ◽

10.1145/3340997.3341002 ◽

2019 ◽

Cited By ~ 1

Author(s):

Chaoyi Huang ◽

Youguang Chen ◽

Qiancheng Liang

Keyword(s):

Short Term Memory ◽

Named Entity Recognition ◽

Entity Recognition ◽

Short Term ◽

Term Memory ◽

Named Entity ◽

Long Short Term Memory

Download Full-text

Character convolutions for Arabic Named Entity Recognition with Long Short-Term Memory Networks

Computer Speech & Language ◽

10.1016/j.csl.2019.05.003 ◽

2019 ◽

Vol 58 ◽

pp. 335-346 ◽

Cited By ~ 4

Author(s):

Muhammad Khalifa ◽

Khaled Shaalan

Keyword(s):

Short Term Memory ◽

Named Entity Recognition ◽

Entity Recognition ◽

Short Term ◽

Term Memory ◽

Named Entity ◽

Long Short Term Memory

Download Full-text

Named entity recognition with long short-term memory

10.3115/1119176.1119202 ◽

2003 ◽

Cited By ~ 47

Author(s):

James Hammerton

Keyword(s):

Short Term Memory ◽

Named Entity Recognition ◽

Entity Recognition ◽

Short Term ◽

Term Memory ◽

Named Entity ◽

Long Short Term Memory

Download Full-text

Long short-term memory RNN for biomedical named entity recognition

BMC Bioinformatics ◽

10.1186/s12859-017-1868-5 ◽

2017 ◽

Vol 18 (1) ◽

Cited By ~ 36

Author(s):

Chen Lyu ◽

Bo Chen ◽

Yafeng Ren ◽

Donghong Ji

Keyword(s):

Short Term Memory ◽

Named Entity Recognition ◽

Entity Recognition ◽

Short Term ◽

Term Memory ◽

Named Entity ◽

Long Short Term Memory ◽

Biomedical Named Entity Recognition

Download Full-text

Pengenalan Entitas Bernama Otomatis untuk Bahasa Indonesia dengan Pendekatan Pembelajaran Mesin

10.31227/osf.io/vud2p ◽

2018 ◽

Cited By ~ 2

Author(s):

Yudi Wibisono ◽

Masayu Leylia Khodra

Keyword(s):

Short Term Memory ◽

Conditional Random Field ◽

Named Entity Recognition ◽

Entity Recognition ◽

Short Term ◽

Term Memory ◽

Named Entity ◽

Long Short Term Memory ◽

F Measure ◽

Bahasa Indonesia

Pengenalan entitas bernama (named-entity recognition atau NER) adalah proses otomatis mengekstraksi entitas bernama yang dianggap penting di dalam sebuah teks dan menentukan kategorinya ke dalam kategori terdefinisi. Sebagai contoh, untuk teks berita, NER dapat mengekstraksi nama orang, nama organisasi, dan nama lokasi. NER bermanfaat dalam berbagai aplikasi analisis teks, misalnya pencarian, sistem tanya jawab, peringkasan teks dan mesin penerjemah. Tantangan utama NER adalah penanganan ambiguitas makna karena konteks kata pada kalimat, misalnya kata “Cendana” dapat merupakan nama lokasi (Jalan Cendana), atau nama organisasi (Keluarga Cendana), atau nama tanaman. Tantangan lainnya adalah penentuan batas entitas, misalnya “[Istora Senayan] [Jakarta]”. Berbagai kakas NER telah dikembangkan untuk berbagai bahasa terutama Bahasa Inggris dengan kinerja yang baik, tetapi kakas NER bahasa Indonesia masih memiliki kinerja yang belum baik. Makalah ini membahas pendekatan berbasis pembelajaran mesin untuk menghasilkan model NER bahasa Indonesia. Pendekatan ini sangat bergantung pada korpus yang menjadi sumber belajar, dan teknik pembelajaran mesin yang digunakan. Teknik yang akan digunakan adalah LSTM - CRF (Long Short Term Memory – Conditional Random Field). Hasil terbaik (F-measure = 0.72) didapatkan dengan menggunakan word embedding GloVe Wikipedia Bahasa Indonesia.

Download Full-text

Chinese Named Entity Recognition Method in History and Culture Field Based on BERT

International Journal of Computational Intelligence Systems ◽

10.1007/s44196-021-00019-8 ◽

2021 ◽

Vol 14 (1) ◽

Author(s):

Shuang Liu ◽

Hui Yang ◽

Jiayi Li ◽

Simon Kolmanič

Keyword(s):

Short Term Memory ◽

Conditional Random Field ◽

Language Model ◽

Named Entity Recognition ◽

Entity Recognition ◽

Knowledge Graph ◽

Recognition Method ◽

Short Term ◽

Named Entity ◽

Long Short Term Memory

AbstractWith rapid development of the Internet, people have undergone tremendous changes in the way they obtain information. In recent years, knowledge graph is becoming a popular tool for the public to acquire knowledge. For knowledge graph of Chinese history and culture, most researchers adopted traditional named entity recognition methods to extract entity information from unstructured historical text data. However, the traditional named entity recognition method has certain defects, and it is easy to ignore the association between entities. To extract entities from a large amount of historical and cultural information more accurately and efficiently, this paper proposes one named entity recognition model combining Bidirectional Encoder Representations from Transformers and Bidirectional Long Short-Term Memory-Conditional Random Field (BERT-BiLSTM-CRF). First, a BERT pre-trained language model is used to encode a single character to obtain a vector representation corresponding to each character. Then one Bidirectional Long Short-Term Memory (BiLSTM) layer is applied to semantically encode the input text. Finally, the label with the highest probability is output through the Conditional Random Field (CRF) layer to obtain each character’s category. This model uses the Bidirectional Encoder Representations from Transformers (BERT) pre-trained language model to replace the static word vectors trained in the traditional way. In comparison, the BERT pre-trained language model can dynamically generate semantic vectors according to the context of words, which improves the representation ability of word vectors. The experimental results prove that the model proposed in this paper has achieved excellent results in the task of named entity recognition in the field of historical culture. Compared with the existing named entity identification methods, the precision rate, recall rate, and $$F_1$$ F 1 value have been significantly improved.

Download Full-text