scholarly journals Co-occurrence graphs for word sense disambiguation in the biomedical domain

2018 ◽  
Vol 87 ◽  
pp. 9-19 ◽  
Author(s):  
Andres Duque ◽  
Mark Stevenson ◽  
Juan Martinez-Romo ◽  
Lourdes Araujo
2005 ◽  
Vol 12 (5) ◽  
pp. 554-565 ◽  
Author(s):  
Martijn J. Schuemie ◽  
Jan A. Kors ◽  
Barend Mons

2013 ◽  
pp. 1306-1316
Author(s):  
Wei Xiong ◽  
Min Song ◽  
Lori deVersterre

Word sense disambiguation is the problem of selecting a sense for a word from a set of predefined possibilities. This is a significant problem in the biomedical domain where a single word may be used to describe a gene, protein, or abbreviation. In this paper, we evaluate SENSATIONAL, a novel unsupervised WSD technique, in comparison with two popular learning algorithms: support vector machines (SVM) and K-means. Based on the accuracy measure, our results show that SENSATIONAL outperforms SVM and K-means by 2% and 17%, respectively. In addition, we develop a polysemy-based search engine and an experimental visualization application that utilizes SENSATIONAL’s clustering technique.


Doklady BGUIR ◽  
2019 ◽  
pp. 60-65
Author(s):  
A. V. Pashuk ◽  
A. B. Gurinovich ◽  
N. A. Volorova ◽  
A. P. Kuznetsov

2012 ◽  
Vol 2012 ◽  
pp. 1-8 ◽  
Author(s):  
Hisham Al-Mubaid ◽  
Sandeep Gungu

In the biomedical domain, word sense ambiguity is a widely spread problem with bioinformatics research effort devoted to it being not commensurate and allowing for more development. This paper presents and evaluates a learning-based approach for sense disambiguation within the biomedical domain. The main limitation with supervised methods is the need for a corpus of manually disambiguated instances of the ambiguous words. However, the advances in automatic text annotation and tagging techniques with the help of the plethora of knowledge sources like ontologies and text literature in the biomedical domain will help lessen this limitation. The proposed method utilizes the interaction model (mutual information) between the context words and the senses of the target word to induce reliable learning models for sense disambiguation. The method has been evaluated with the benchmark dataset NLM-WSD with various settings and in biomedical entity species disambiguation. The evaluation results showed that the approach is very competitive and outperforms recently reported results of other published techniques.


Author(s):  
Wei Xiong ◽  
Min Song ◽  
Lori deVersterre

Word sense disambiguation is the problem of selecting a sense for a word from a set of predefined possibilities. This is a significant problem in the biomedical domain where a single word may be used to describe a gene, protein, or abbreviation. In this paper, we evaluate SENSATIONAL, a novel unsupervised WSD technique, in comparison with two popular learning algorithms: support vector machines (SVM) and K-means. Based on the accuracy measure, our results show that SENSATIONAL outperforms SVM and K-means by 2% and 17%, respectively. In addition, we develop a polysemy-based search engine and an experimental visualization application that utilizes SENSATIONAL’s clustering technique.


Author(s):  
Torsten Schiemann ◽  
Ulf Leser ◽  
Jörg Hakenberg

Ambiguity is a common phenomenon in text, especially in the biomedical domain. For instance, it is frequently the case that a gene, a protein encoded by the gene, and a disease associated with the protein share the same name. Resolving this problem, that is, assigning to an ambiguous word in a given context its correct meaning is called word sense disambiguation (WSD). It is a pre-requisite for associating entities in text to external identifiers and thus to put the results from text mining into a larger knowledge framework. In this chapter, we introduce the WSD problem and sketch general approaches for solving it. The authors then describe in detail the results of a study in WSD using classification. For each sense of an ambiguous term, they collected a large number of exemplary texts automatically and used them to train an SVM-based classifier. This method reaches a median success rate of 97%. The authors also provide an analysis of potential sources and methods to obtain training examples, which proved to be the most difficult part of this study.


2012 ◽  
Vol 241-244 ◽  
pp. 3103-3106
Author(s):  
Jia Cong He ◽  
Kai Ren ◽  
Wei Jie Yu

The effectiveness of supervised word sense disambiguation approaches depend in part on the availability of initial dataset. In the paper, we use an improved method based on the simple K-means clustering function, the result indicates the improvement after using the method. The whole experiments are running on the NLM WSD set, which is emphasizing on the biomedical domain.


Sign in / Sign up

Export Citation Format

Share Document