A Military Named Entity Recognition Method based on pre-training language model and BiLSTM-CRF

2020 · Vol. 1693 · pp. 012161
Author(s):  
Yiwei Lu ◽  
Ruopeng Yang ◽  
Xuping Jiang ◽  
Changsheng Yin ◽  
Xiaoyu Song
Author(s):  
Shuang Liu ◽  
Hui Yang ◽  
Jiayi Li ◽  
Simon Kolmanič

Abstract: With the rapid development of the Internet, the way people obtain information has changed dramatically, and in recent years the knowledge graph has become a popular tool for the public to acquire knowledge. For knowledge graphs of Chinese history and culture, most researchers have adopted traditional named entity recognition methods to extract entity information from unstructured historical text. However, traditional named entity recognition methods have certain defects and easily ignore the associations between entities. To extract entities from large amounts of historical and cultural text more accurately and efficiently, this paper proposes a named entity recognition model combining Bidirectional Encoder Representations from Transformers with a Bidirectional Long Short-Term Memory-Conditional Random Field architecture (BERT-BiLSTM-CRF). First, the BERT pre-trained language model encodes each character to obtain a corresponding vector representation. Then, a Bidirectional Long Short-Term Memory (BiLSTM) layer semantically encodes the input text. Finally, a Conditional Random Field (CRF) layer outputs the label with the highest probability to obtain each character's category. The model replaces the static word vectors of traditional training with the BERT pre-trained language model, which generates semantic vectors dynamically according to the context of each word and thus improves the representational power of the word vectors. The experimental results show that the proposed model achieves excellent results on named entity recognition in the field of historical culture: compared with existing named entity recognition methods, precision, recall, and the $F_1$ value are all significantly improved.
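The pipeline this abstract describes (BERT character encoding, then BiLSTM semantic encoding, then CRF decoding) maps directly onto a short model definition. Below is a minimal sketch, not the authors' implementation, assuming PyTorch, the HuggingFace transformers library, and the pytorch-crf package; the bert-base-chinese checkpoint and the hidden size are illustrative choices.

```python
# Sketch of a BERT-BiLSTM-CRF tagger; hyperparameters are illustrative.
import torch
import torch.nn as nn
from transformers import BertModel
from torchcrf import CRF  # pip install pytorch-crf

class BertBiLstmCrf(nn.Module):
    def __init__(self, num_tags, lstm_hidden=128,
                 bert_name="bert-base-chinese"):
        super().__init__()
        self.bert = BertModel.from_pretrained(bert_name)    # per-character vectors
        self.lstm = nn.LSTM(self.bert.config.hidden_size, lstm_hidden,
                            batch_first=True, bidirectional=True)
        self.fc = nn.Linear(2 * lstm_hidden, num_tags)      # emission scores
        self.crf = CRF(num_tags, batch_first=True)          # label-transition layer

    def forward(self, input_ids, attention_mask, tags=None):
        # 1) BERT: a context-dependent vector for each character
        h = self.bert(input_ids, attention_mask=attention_mask).last_hidden_state
        # 2) BiLSTM: sentence-level semantic encoding
        h, _ = self.lstm(h)
        emissions = self.fc(h)
        mask = attention_mask.bool()
        if tags is not None:                                # training: NLL loss
            return -self.crf(emissions, tags, mask=mask, reduction="mean")
        return self.crf.decode(emissions, mask=mask)        # inference: best tag path
```

The CRF layer is what captures the association between adjacent labels that the abstract says pure per-token classifiers ignore: it scores whole label sequences rather than each character independently.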


2020
Author(s):  
Usman Naseem ◽  
Matloob Khushi ◽  
Vinay Reddy ◽  
Sakthivel Rajendran ◽  
Imran Razzak ◽  
...  

Abstract: Background: With the growing number of biomedical documents and advances in natural language processing algorithms, research on biomedical named entity recognition (BioNER) has increased exponentially in recent years. BioNER remains challenging, however, because NER in the biomedical domain is (i) often restricted by the limited amount of training data, (ii) complicated by entities that can refer to multiple types and concepts depending on context, and (iii) heavily reliant on sub-domain-specific acronyms. Existing BioNER approaches often neglect these issues and directly adopt state-of-the-art (SOTA) models trained on general corpora, which often yields unsatisfactory results. Results: We propose biomedical ALBERT (A Lite Bidirectional Encoder Representations from Transformers for Biomedical Text Mining), bioALBERT, an effective domain-specific pre-trained language model trained on a large biomedical corpus and designed to capture biomedical context-dependent NER. We adopted the self-supervised loss function used in ALBERT, which targets modelling inter-sentence coherence, to better learn context-dependent representations, and incorporated parameter-reduction strategies to minimise memory usage and reduce training time in BioNER. In our experiments, bioALBERT outperformed comparative SOTA BioNER models on eight biomedical NER benchmark datasets covering four entity types. Performance increased on (i) disease-type corpora by 7.47% (NCBI-disease) and 10.63% (BC5CDR-disease); (ii) drug/chemical-type corpora by 4.61% (BC5CDR-Chem) and 3.89% (BC4CHEMD); (iii) gene/protein-type corpora by 12.25% (BC2GM) and 6.42% (JNLPBA); and (iv) species-type corpora by 6.19% (LINNAEUS) and 23.71% (Species-800), leading to state-of-the-art results. Conclusions: The performance of the proposed model on four different biomedical entity types shows that it is robust and generalizes well in recognizing biomedical entities in text. We trained four variants of the bioALBERT model, which are available for the research community to use in future research.
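As a rough illustration of how an ALBERT-style encoder is applied to token-level BioNER, the sketch below runs an ALBERT model with a token-classification head via HuggingFace transformers. The checkpoint name albert-base-v2 is a generic stand-in, not the paper's released bioALBERT weights, and the three-label tag set is an assumption for the example.

```python
# Sketch: token classification with an ALBERT encoder (HuggingFace
# transformers). A real BioNER run would load a BioALBERT checkpoint
# and fine-tune on labelled data; this untuned head outputs arbitrary tags.
import torch
from transformers import AlbertTokenizerFast, AlbertForTokenClassification

labels = ["O", "B-Disease", "I-Disease"]            # illustrative tag set
tok = AlbertTokenizerFast.from_pretrained("albert-base-v2")
model = AlbertForTokenClassification.from_pretrained(
    "albert-base-v2", num_labels=len(labels))
model.eval()

enc = tok("Mutations in BRCA1 are linked to breast cancer.",
          return_tensors="pt")
with torch.no_grad():
    logits = model(**enc).logits                    # (1, seq_len, num_labels)
pred_ids = logits.argmax(-1).squeeze(0).tolist()
tokens = tok.convert_ids_to_tokens(enc["input_ids"].squeeze(0))
for t, i in zip(tokens, pred_ids):
    print(t, labels[i])
```

ALBERT's parameter-reduction tricks (factorized embeddings and cross-layer weight sharing) come for free with the architecture, which is what lets a model like bioALBERT cut memory usage and training time relative to a comparable BERT.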


Author(s):  
Yuan Chang ◽  
Lei Kong ◽  
Kejia Jia ◽  
Qinglei Meng

2020 · Vol. 10 (18) · pp. 6429
Author(s):  
SungMin Yang ◽  
SoYeop Yoo ◽  
OkRan Jeong

Along with studies on artificial intelligence technology, research is also being carried out actively in natural language processing, the field concerned with understanding and processing people's language. For computers to learn on their own, the ability to understand natural language is essential. The field covers a wide variety of tasks, but we focus on named entity recognition and relation extraction, which are considered the most important for understanding sentences. We propose DeNERT-KG, a model that extracts subjects, objects, and the relationships between them in order to grasp the meaning inherent in a sentence. Based on the BERT language model and a Deep Q-Network, a named entity recognition (NER) model extracts the subject and object, and a knowledge graph is applied for relation extraction. With the DeNERT-KG model, it is possible to extract the subject, the type of subject, the object, the type of object, and the relationship from a sentence; we verify the model through experiments.
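To make the model's output format concrete, here is an illustrative sketch, assuming only the networkx library, of typed (subject, relation, object) triples of the kind DeNERT-KG extracts being assembled into a small knowledge graph. The entities, types, and relation names are hand-written stand-ins, not output from the paper's models.

```python
# Sketch: storing typed entity/relation triples in a directed multigraph.
import networkx as nx

def add_triple(g, subj, subj_type, rel, obj, obj_type):
    g.add_node(subj, entity_type=subj_type)   # typed entity nodes
    g.add_node(obj, entity_type=obj_type)
    g.add_edge(subj, obj, relation=rel)       # labelled relation edge

kg = nx.MultiDiGraph()
# Hypothetical triple an NER + relation-extraction pipeline might emit:
add_triple(kg, "Marie Curie", "PER", "born_in", "Warsaw", "LOC")
print(kg.nodes(data=True))
print(list(kg.edges(data=True)))
```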

