Semi-Supervised Noisy Label Learning for Chinese Medical Named Entity Recognition

Abstract This paper describes our approach for the Chinese Medical named entity recognition(MER) task organized by the 2020 China conference on knowledge graph and semantic computing(CCKS) competition. In this task, we need to identify the entity boundary and category labels of six entities from Chinese electronic medical record(EMR). We construct a hybrid system composed of a semi-supervised noisy label learning model based on adversarial training and a rule postprocessing module. The core idea of the hybrid system is to reduce the impact of data noise by optimizing the model results. Besides, we use post-processing rules to correct three cases of redundant labeling, missing labeling, and wrong labeling in the model prediction results. Our method proposed in this paper achieved strict criteria of 0.9156 and relax criteria of 0.9660 on the final test set, ranking first.

Download Full-text

A Study on the Impact of Intradomain Finetuning of Deep Language Models for Legal Named Entity Recognition in Portuguese

Intelligent Systems - Lecture Notes in Computer Science ◽

10.1007/978-3-030-61377-8_46 ◽

2020 ◽

pp. 648-662

Author(s):

Luiz Henrique Bonifacio ◽

Paulo Arantes Vilela ◽

Gustavo Rocha Lobato ◽

Eraldo Rezende Fernandes

Keyword(s):

Named Entity Recognition ◽

Entity Recognition ◽

Language Models ◽

Named Entity ◽

The Impact

Download Full-text

Evaluating the impact of pre-annotation on annotation speed and potential bias: natural language processing gold standard development for clinical named entity recognition in clinical trial announcements

Journal of the American Medical Informatics Association ◽

10.1136/amiajnl-2013-001837 ◽

2014 ◽

Vol 21 (3) ◽

pp. 406-413 ◽

Cited By ~ 20

Author(s):

Todd Lingren ◽

Louise Deleger ◽

Katalin Molnar ◽

Haijun Zhai ◽

Jareen Meinzen-Derr ◽

...

Keyword(s):

Clinical Trial ◽

Natural Language Processing ◽

Language Processing ◽

Gold Standard ◽

Named Entity Recognition ◽

Entity Recognition ◽

Potential Bias ◽

Named Entity ◽

The Impact ◽

Standard Development

Download Full-text

The impact of near domain transfer on biomedical named entity recognition

10.3115/v1/w14-1103 ◽

2014 ◽

Cited By ~ 2

Author(s):

Nigel Collier ◽

Mai-vu Tran ◽

Ferdinand Paster

Keyword(s):

Named Entity Recognition ◽

Entity Recognition ◽

Named Entity ◽

Domain Transfer ◽

The Impact ◽

Biomedical Named Entity Recognition

Download Full-text

Assessing the Impact of Contextual Embeddings for Portuguese Named Entity Recognition

2019 8th Brazilian Conference on Intelligent Systems (BRACIS) ◽

10.1109/bracis.2019.00083 ◽

2019 ◽

Cited By ~ 5

Author(s):

Joaquim Santos ◽

Bernardo Consoli ◽

Cicero dos Santos ◽

Juliano Terra ◽

Sandra Collonini ◽

...

Keyword(s):

Named Entity Recognition ◽

Entity Recognition ◽

Named Entity ◽

The Impact

Download Full-text

Studying the impact of various features on the performance of Conditional Random Field-based Arabic Named Entity Recognition

2013 ACS International Conference on Computer Systems and Applications (AICCSA) ◽

10.1109/aiccsa.2013.6616423 ◽

2013 ◽

Cited By ~ 1

Author(s):

Alia Morsi ◽

Ahmed Rafea

Keyword(s):

Random Field ◽

Conditional Random Field ◽

Named Entity Recognition ◽

Entity Recognition ◽

Named Entity ◽

The Impact

Download Full-text

A Weak Supervision Approach with Adversarial Training for Named Entity Recognition

10.1007/978-3-030-89363-7_2 ◽

2021 ◽

pp. 17-30

Author(s):

Jianxuan Shao ◽

Chenyang Bu ◽

Shengwei Ji ◽

Xindong Wu

Keyword(s):

Named Entity Recognition ◽

Entity Recognition ◽

Weak Supervision ◽

Named Entity ◽

Adversarial Training

Download Full-text

The Impact of Domain-Specific Pre-Training on Named Entity Recognition Tasks in Materials Science

SSRN Electronic Journal ◽

10.2139/ssrn.3950755 ◽

2021 ◽

Author(s):

Nicholas Walker ◽

Amalie Trewartha ◽

Haoyan Huo ◽

Sanghoon Lee ◽

Kevin Cruse ◽

...

Keyword(s):

Materials Science ◽

Named Entity Recognition ◽

Entity Recognition ◽

Named Entity ◽

Domain Specific ◽

The Impact

Download Full-text

Named Entity Recognition for Chinese Social Media with Domain Adversarial Training and Language Modeling

Lecture Notes in Computer Science - Artificial Neural Networks and Machine Learning – ICANN 2019: Deep Learning ◽

10.1007/978-3-030-30484-3_54 ◽

2019 ◽

pp. 687-699

Author(s):

Yong Xu ◽

Qi Lu ◽

Muhua Zhu

Keyword(s):

Social Media ◽

Named Entity Recognition ◽

Language Modeling ◽

Entity Recognition ◽

Named Entity ◽

Adversarial Training ◽

Chinese Social Media

Download Full-text

WikiPathways: connecting communities

Nucleic Acids Research ◽

10.1093/nar/gkaa1024 ◽

2020 ◽

Vol 49 (D1) ◽

pp. D613-D621 ◽

Cited By ~ 2

Author(s):

Marvin Martens ◽

Ammar Ammar ◽

Anders Riutta ◽

Andra Waagmeester ◽

Denise N Slenter ◽

...

Keyword(s):

Named Entity Recognition ◽

Open Science ◽

Entity Recognition ◽

Biological Knowledge ◽

Pathway Database ◽

External Resources ◽

The Road ◽

Named Entity ◽

Pathway Models ◽

Core Idea

Abstract WikiPathways (https://www.wikipathways.org) is a biological pathway database known for its collaborative nature and open science approaches. With the core idea of the scientific community developing and curating biological knowledge in pathway models, WikiPathways lowers all barriers for accessing and using its content. Increasingly more content creators, initiatives, projects and tools have started using WikiPathways. Central in this growth and increased use of WikiPathways are the various communities that focus on particular subsets of molecular pathways such as for rare diseases and lipid metabolism. Knowledge from published pathway figures helps prioritize pathway development, using optical character and named entity recognition. We show the growth of WikiPathways over the last three years, highlight the new communities and collaborations of pathway authors and curators, and describe various technologies to connect to external resources and initiatives. The road toward a sustainable, community-driven pathway database goes through integration with other resources such as Wikidata and allowing more use, curation and redistribution of WikiPathways content.

Download Full-text

Biomedical Named Entity Recognition via Knowledge Guidance and Question Answering

ACM Transactions on Computing for Healthcare ◽

10.1145/3465221 ◽

2021 ◽

Vol 2 (4) ◽

pp. 1-24

Author(s):

Pratyay Banerjee ◽

Kuntal Kumar Pal ◽

Murthy Devarakonda ◽

Chitta Baral

Keyword(s):

Question Answering ◽

State Of The Art ◽

Named Entity Recognition ◽

Entity Recognition ◽

Neural Models ◽

Named Entity ◽

Input Text ◽

The Impact ◽

Biomedical Named Entity Recognition ◽

Entity Class

In this work, we formulated the named entity recognition (NER) task as a multi-answer knowledge guided question-answer task (KGQA) and showed that the knowledge guidance helps to achieve state-of-the-art results for 11 of 18 biomedical NER datasets. We prepended five different knowledge contexts—entity types, questions, definitions, and examples—to the input text and trained and tested BERT-based neural models on such input sequences from a combined dataset of the 18 different datasets. This novel formulation of the task (a) improved named entity recognition and illustrated the impact of different knowledge contexts, (b) reduced system confusion by limiting prediction to a single entity-class for each input token (i.e., B , I , O only) compared to multiple entity-classes in traditional NER (i.e., B entity 1, B entity 2, I entity 1, I , O ), (c) made detection of nested entities easier, and (d) enabled the models to jointly learn NER-specific features from a large number of datasets. We performed extensive experiments of this KGQA formulation on the biomedical datasets, and through the experiments, we showed when knowledge improved named entity recognition. We analyzed the effect of the task formulation, the impact of the different knowledge contexts, the multi-task aspect of the generic format, and the generalization ability of KGQA. We also probed the model to better understand the key contributors for these improvements.

Download Full-text