named entity extraction
Recently Published Documents


TOTAL DOCUMENTS

72
(FIVE YEARS 9)

H-INDEX

11
(FIVE YEARS 1)

2021 ◽  
pp. 724-732
Author(s):  
Zeqi Ma ◽  
Lingwei Ma ◽  
Dongmei Fu ◽  
Guangxuan Song ◽  
Dawei Zhang


Author(s):  
Asoke Nath ◽  
Debapriya Kandar ◽  
Rahul Gupta

In recent times, with the rise of the internet, everyone is being bombarded with tons of information and data from various sources like websites, blogs and articles, social media posts and comments, e-news portals etc. Now all these data are mostly unstructured. In this paper, the authors have tried to explore the efficiency of the cross-lingual BERT model i.e. M-BERT for text classification and named entity extraction on multilingual data. The authors have used datasets of three different languages namely: French, German and Portuguese to evaluate the model performance.





2020 ◽  
Vol 9 (6) ◽  
pp. 1-22
Author(s):  
Omar ASBAYOU

This article tries to explain our rule-based Arabic Named Entity recognition (NER) and classification system. It is based on lists of classified proper names (PN) and particularly on syntactico-semantic patterns resulting in fine classification of Arabic NE. These patterns use syntactico-semantic combination of morpho-syntactic and syntactic entities. It also uses lexical classification of trigger words and NE extensions. These linguistic data are essential not only to name entity extraction but also to the taxonomic classification and to determining the NE frontiers. Our method is also based on the contextualisation and on the notion of NE class attributes and values. Inspired from X-bar theory and immediate constituents, we built a rule-based NER system composed of five levels of syntactico-semantic combination. We also show how the fine NE annotations in our system output (XML database) is exploited in information retrieval and information extraction.



Author(s):  
Hsu Myat Mo ◽  
Khin Mar Soe

Myanmar language is a low-resource language and this is one of the main reasons why Myanmar Natural Language Processing lagged behind compared to other languages. Currently, there is no publicly available named entity corpus for Myanmar language. As part of this work, a very first manually annotated Named Entity tagged corpus for Myanmar language was developed and proposed to support the evaluation of named entity extraction. At present, our named entity corpus contains approximately 170,000 name entities and 60,000 sentences. This work also contributes the first evaluation of various deep neural network architectures on Myanmar Named Entity Recognition. Experimental results of the 10-fold cross validation revealed that syllable-based neural sequence models without additional feature engineering can give better results compared to baseline CRF model. This work also aims to discover the effectiveness of neural network approaches to textual processing for Myanmar language as well as to promote future research works on this understudied language.



IEEE Access ◽  
2020 ◽  
Vol 8 ◽  
pp. 32862-32881 ◽  
Author(s):  
Tareq Al-Moslmi ◽  
Marc Gallofre Ocana ◽  
Andreas L. Opdahl ◽  
Csaba Veres




Author(s):  
Wassim Swaileh ◽  
Thierry Paquet ◽  
Sébastien Adam ◽  
Andres Rojas Camacho


Sign in / Sign up

Export Citation Format

Share Document