Machine Learning Techniques for Myanmar Word-Sense Disambiguation

Author(s):  
Phyu Phyu Khaing ◽  
Than Nwe Aung
2018 ◽  
Vol 7 (3.20) ◽  
pp. 428
Author(s):  
Wafaa AL-Saiagh ◽  
Sabrina Tiun ◽  
Ahmed AL-Saffar ◽  
Suryanti Awang ◽  
A. S. Al-khaleefa

Word Sense Disambiguation (WSD) is the process of determining the exact sense of a particular word in accordance to the context in a computational manner. Such task plays an essential role in multiple fields of study such as Information Retrieval and Information Extraction. With the complexity of human language, WSD came up to solve the problem behind the ambiguity between senses in which a single word would yield different meaning. In this vein, determining the exact meaning of the certain word would facilitate the process of identifying the category of such text, accurate corresponding search results and providing an accurately summarized portion. Several approaches have been proposed for the WSD including statistical, semantic and machine learning techniques. This paper aims to provide a review of such approaches by tackling and categorizing the related works in accordance to the main types. 


Author(s):  
Marina Sokolova ◽  
Stan Szpakowicz

This chapter presents applications of machine learning techniques to traditional problems in natural language processing, including part-of-speech tagging, entity recognition and word-sense disambiguation. People usually solve such problems without difficulty or at least do a very good job. Linguistics may suggest labour-intensive ways of manually constructing rule-based systems. It is, however, the easy availability of large collections of texts that has made machine learning a method of choice for processing volumes of data well above the human capacity. One of the main purposes of text processing is all manner of information extraction and knowledge extraction from such large text. Machine learning methods discussed in this chapter have stimulated wide-ranging research in natural language processing and helped build applications with serious deployment potential.


2017 ◽  
Vol 21 (1) ◽  
pp. 515-522 ◽  
Author(s):  
Muhammad Abid ◽  
Asad Habib ◽  
Jawad Ashraf ◽  
Abdul Shahid

2002 ◽  
Vol 8 (4) ◽  
pp. 311-325 ◽  
Author(s):  
V. HOSTE ◽  
I. HENDRICKX ◽  
W. DAELEMANS ◽  
A. VAN DEN BOSCH

Various Machine Learning (ML) approaches have been demonstrated to produce relatively successful Word Sense Disambiguation (WSD) systems. There are still unexplained differences among the performance measurements of different algorithms, hence it is warranted to deepen the investigation into which algorithm has the right ‘bias’ for this task. In this paper, we show that this is not easy to accomplish, due to intricate interactions between information sources, parameter settings, and properties of the training data. We investigate the impact of parameter optimization on generalization accuracy in a memory-based learning approach to English and Dutch WSD. A ‘word-expert’ architecture was adopted, yielding a set of classifiers, each specialized in one single wordform. The experts consist of multiple memory-based learning classifiers, each taking different information sources as input, combined in a voting scheme. We optimized the architectural and parametric settings for each individual word-expert by performing cross-validation experiments on the learning material. The results of these experiments show that the variation of both the algorithmic parameters and the information sources available to the classifiers leads to large fluctuations in accuracy. We demonstrate that optimization per word-expert leads to an overall significant improvement in the generalization accuracies of the produced WSD systems.


Sign in / Sign up

Export Citation Format

Share Document