NAMED ENTITY RECOGNITION IN BIOMEDICAL LITERATURE USING TWO-LAYER SUPPORT VECTOR MACHINES

We present a freely available named-entity recognizer for Greek texts that identifies temporal expressions, person, and organization names. For temporal expressions, it relies on semi-automatically produced patterns. For person and organization names, it employs an ensemble of Support Vector Machines that scan the input text in two passes. The ensemble is trained using active learning, whereby the system itself proposes candidate training instances to be annotated by a human during training. The recognizer was evaluated on both a general collection of newspaper articles and a more focussed, in terms of topics, collection of financial articles.

Download Full-text

Multilingual Named Entity Recognition on Spanish-English Code-switched Tweets using Support Vector Machines

10.18653/v1/w18-3218 ◽

2018 ◽

Author(s):

Daniel Claeser ◽

Samantha Kent ◽

Dennis Felske

Keyword(s):

Support Vector Machines ◽

Named Entity Recognition ◽

Entity Recognition ◽

Support Vector ◽

Named Entity ◽

Vector Machines

Download Full-text

Named Entity Recognition for Nepali Text Using Support Vector Machines

Intelligent Information Management ◽

10.4236/iim.2014.62004 ◽

2014 ◽

Vol 06 (02) ◽

pp. 21-29 ◽

Cited By ~ 4

Author(s):

Surya Bahadur Bam ◽

Tej Bahadur Shahi

Keyword(s):

Support Vector Machines ◽

Named Entity Recognition ◽

Entity Recognition ◽

Support Vector ◽

Named Entity ◽

Vector Machines

Download Full-text

ChemTok: A New Rule Based Tokenizer for Chemical Named Entity Recognition

BioMed Research International ◽

10.1155/2016/4248026 ◽

2016 ◽

Vol 2016 ◽

pp. 1-9 ◽

Cited By ~ 5

Author(s):

Abbas Akkasi ◽

Ekrem Varoğlu ◽

Nazife Dimililer

Keyword(s):

Conditional Random Fields ◽

Named Entity Recognition ◽

Classification Performance ◽

Entity Recognition ◽

Support Vector ◽

Learning Approaches ◽

Data Set ◽

Rule Based ◽

Named Entity ◽

Vector Machines

Named Entity Recognition (NER) from text constitutes the first step in many text mining applications. The most important preliminary step for NER systems using machine learning approaches is tokenization where raw text is segmented into tokens. This study proposes an enhanced rule based tokenizer, ChemTok, which utilizes rules extracted mainly from the train data set. The main novelty of ChemTok is the use of the extracted rules in order to merge the tokens split in the previous steps, thus producing longer and more discriminative tokens. ChemTok is compared to the tokenization methods utilized by ChemSpot and tmChem. Support Vector Machines and Conditional Random Fields are employed as the learning algorithms. The experimental results show that the classifiers trained on the output of ChemTok outperforms all classifiers trained on the output of the other two tokenizers in terms of classification performance, and the number of incorrectly segmented entities.

Download Full-text

Identifying interactions between chemical entities in biomedical text

Journal of Integrative Bioinformatics ◽

10.1515/jib-2014-247 ◽

2014 ◽

Vol 11 (3) ◽

pp. 1-16 ◽

Cited By ~ 6

Author(s):

Andre Lamurias ◽

João D. Ferreira ◽

Francisco M. Couto

Keyword(s):

Named Entity Recognition ◽

Relation Extraction ◽

Ensemble Classifier ◽

Entity Recognition ◽

Support Vector ◽

Biomedical Text ◽

Web Tool ◽

Named Entity ◽

Vector Machines ◽

Chemical Named Entity Recognition

Summary Interactions between chemical compounds described in biomedical text can be of great importance to drug discovery and design, as well as pharmacovigilance. We developed a novel system, “Identifying Interactions between Chemical Entities” (IICE), to identify chemical interactions described in text. Kernel-based Support Vector Machines first identify the interactions and then an ensemble classifier validates and classifies the type of each interaction. This relation extraction module was evaluated with the corpus released for the DDI Extraction task of SemEval 2013, obtaining results comparable to stateof- the-art methods for this type of task. We integrated this module with our chemical named entity recognition module and made the whole system available as a web tool at www.lasige.di.fc.ul.pt/webtools/iice.

Download Full-text