A Novel Approach for Named Entity Recognition on Hindi Language Using Residual BiLSTM Network

Growing demand for smart-health applications in Hindi has created a need for a health-related Named Entity Recognition (NER) system for the language. To that end, this research extracts tweets dated 1 October 2016 to 15 October 2017 from Patanjali, Dabur, and other Hindi-language health accounts on Twitter, and considers four named entity (NE) types: Person, Disease, Consumable, and Organization. To the best of the authors' knowledge, this Twitter dataset and set of NE types are among the first such resources for Hindi. This article introduces HinTwtNER, a three-stage NER system for Hindi tweets: a pre-processing stage; a machine learning stage (Hyperspace Analogue to Language (HAL) and Conditional Random Field (CRF)); and a post-processing stage. HinTwtNER uses binary features and achieves an overall F-score of 49.87%, which is comparable to Twitter-based NER systems for English and other languages.
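As a rough illustration of the HAL component named in the abstract, the sketch below builds distance-weighted co-occurrence counts over a sliding window. The abstract does not say how HinTwtNER configures HAL, so the function name `hal_weights`, the window size, and the dictionary output are assumptions made only for this example.

```python
from collections import defaultdict


def hal_weights(tokens, window=5):
    """Distance-weighted co-occurrence counts in the style of HAL.

    For each target token, every token that precedes it within `window`
    positions adds (window - distance + 1) to the (target, context) cell,
    so closer neighbours contribute more. The name and signature are
    hypothetical, not taken from the HinTwtNER paper.
    """
    weights = defaultdict(int)
    for i, target in enumerate(tokens):
        for distance in range(1, window + 1):
            j = i - distance
            if j < 0:
                break  # ran off the start of the token sequence
            weights[(target, tokens[j])] += window - distance + 1
    return dict(weights)
```

Calling `hal_weights(["a", "b", "c"], window=2)` gives weight 2 to adjacent pairs and weight 1 to the pair two positions apart. In a pipeline like the one described, rows of such a matrix could plausibly serve as word features for a CRF, although that wiring is an assumption here.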

Named entity recognition for Hindi language: A survey

Journal of Discrete Mathematical Sciences and Cryptography ◽

10.1080/09720529.2019.1637157 ◽

2019 ◽

Vol 22 (4) ◽

pp. 569-580 ◽

Cited By ~ 1

Author(s):

Richa Sharma ◽

Sudha Morwal ◽

Basant Agarwal

Keyword(s):

Named Entity Recognition ◽

Entity Recognition ◽

Named Entity ◽

Hindi Language

A Novel Approach for Protein-Named Entity Recognition and Protein-Protein Interaction Extraction

Mathematical Problems in Engineering ◽

10.1155/2015/942435 ◽

2015 ◽

Vol 2015 ◽

pp. 1-10 ◽

Cited By ~ 3

Author(s):

Meijing Li ◽

Tsendsuren Munkhdalai ◽

Xiuming Yu ◽

Keun Ho Ryu

Keyword(s):

Language Processing ◽

Protein Interaction ◽

Named Entity Recognition ◽

Entity Recognition ◽

Support Vector ◽

Protein Protein Interaction ◽

Named Entity ◽

Novel Approach ◽

Interaction Extraction ◽

Parsing Tree

Many researchers focus on developing protein-named entity recognition (Protein-NER) or protein-protein interaction (PPI) extraction systems. However, studies on these two topics have not been integrated well, and the Protein-NER component of existing PPI extraction systems still needs improvement. In this paper, we develop a PPI extraction system named PPIMiner, based on Support Vector Machine (SVM) and parsing tree. PPIMiner consists of three main models: a natural language processing (NLP) model, a Protein-NER model, and a PPI discovery model. The Protein-NER model, named ProNER, identifies protein names using two methods: a dictionary-based method and a machine learning-based method. ProNER is capable of identifying more proteins than the dictionary-based Protein-NER models in other existing systems. The final PPIs extracted by the PPI discovery model are represented in detail: we show the protein interaction types and the occurrence frequency through two different methods. In the experiments, our ProNER and PPI discovery model perform better than other existing tools. PPIMiner applies this Protein-NER approach and a parsing tree-based PPI extraction method to improve the performance of PPI extraction. We also provide an easy-to-use interface to access the PPI database and an online system for PPI extraction and Protein-NER.

Research Trends for Named Entity Recognition in Hindi Language

Data Visualization and Knowledge Engineering - Lecture Notes on Data Engineering and Communications Technologies ◽

10.1007/978-3-030-25797-2_10 ◽

2019 ◽

pp. 223-248 ◽

Cited By ~ 2

Author(s):

Arti Jain ◽

Devendra K. Tayal ◽

Divakar Yadav ◽

Anuja Arora

Keyword(s):

Named Entity Recognition ◽

Research Trends ◽

Entity Recognition ◽

Named Entity ◽

Hindi Language

HILNER: A Hindi Language Named Entity Recognition System Based on Hybrid Approach

Hybrid Intelligent Systems - Advances in Intelligent Systems and Computing ◽

10.1007/978-3-030-73050-5_34 ◽

2021 ◽

pp. 340-348

Author(s):

Shilpi Srivastava

Keyword(s):

Hybrid Approach ◽

Named Entity Recognition ◽

Recognition System ◽

Entity Recognition ◽

Named Entity ◽

Hindi Language

A deep neural network-based model for named entity recognition for Hindi language

Neural Computing and Applications ◽

10.1007/s00521-020-04881-z ◽

2020 ◽

Vol 32 (20) ◽

pp. 16191-16203

Author(s):

Richa Sharma ◽

Sudha Morwal ◽

Basant Agarwal ◽

Ramesh Chandra ◽

Mohammad S. Khan

Keyword(s):

Neural Network ◽

Deep Neural Network ◽

Named Entity Recognition ◽

Entity Recognition ◽

Named Entity ◽

Hindi Language

Combining Minimally-supervised Methods for Arabic Named Entity Recognition

Transactions of the Association for Computational Linguistics ◽

10.1162/tacl_a_00136 ◽

2015 ◽

Vol 3 ◽

pp. 243-255 ◽

Cited By ~ 4

Author(s):

Maha Althobaiti ◽

Udo Kruschwitz ◽

Massimo Poesio

Keyword(s):

High Performance ◽

Named Entity Recognition ◽

Entity Recognition ◽

Classifier Combination ◽

Named Entity ◽

Distant Learning ◽

Novel Approach ◽

Learning Techniques ◽

Supervised Methods ◽

Minimally Supervised

Supervised methods can achieve high performance on NLP tasks, such as Named Entity Recognition (NER), but new annotations are required for every new domain and/or genre change. This has motivated research in minimally supervised methods such as semi-supervised learning and distant learning, but neither technique has yet achieved performance levels comparable to those of supervised methods. Semi-supervised methods tend to have very high precision but comparatively low recall, whereas distant learning tends to achieve higher recall but lower precision. This complementarity suggests that better results may be obtained by combining the two types of minimally supervised methods. In this paper we present a novel approach to Arabic NER using a combination of semi-supervised and distant learning techniques. We trained a semi-supervised NER classifier and another one using distant learning techniques, and then combined them using a variety of classifier combination schemes, including the Bayesian Classifier Combination (BCC) procedure recently proposed for sentiment analysis. According to our results, the BCC model leads to an increase in performance of 8 percentage points over the best base classifiers.
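The complementarity described in the abstract (one classifier with high precision and low recall, another with higher recall and lower precision) can be illustrated with a toy calculation. The sketch below is not the Bayesian Classifier Combination procedure from the paper: it uses a plain set union as a stand-in combination scheme, and the entity sets are invented purely for illustration.

```python
def precision_recall(predicted, gold):
    """Return (precision, recall) for a set of predicted entities."""
    true_positives = len(predicted & gold)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    return precision, recall


# Hypothetical toy data, not results from the paper.
gold = {"Cairo", "Nile", "Ahmed", "UNESCO"}
semi_supervised = {"Cairo", "Ahmed"}                  # precise, misses entities
distant = {"Cairo", "Nile", "UNESCO", "river"}        # wider coverage, one error

# Simplest possible combination: accept an entity if either classifier found it.
combined = semi_supervised | distant

for name, predicted in [("semi-supervised", semi_supervised),
                        ("distant", distant),
                        ("union", combined)]:
    p, r = precision_recall(predicted, gold)
    print(f"{name}: precision={p:.2f} recall={r:.2f}")
```

On this toy data the union reaches full recall while keeping precision above that of the distant classifier alone. Real combination schemes weigh each classifier's reliability rather than accepting every prediction, which is the role BCC plays in the paper.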
