Random Walks for Knowledge-Based Word Sense Disambiguation

Word Sense Disambiguation (WSD) systems automatically choose the intended meaning of a word in context. In this article we present a WSD algorithm based on random walks over large Lexical Knowledge Bases (LKBs). We show that our algorithm performs better than other graph-based methods when run on a graph built from WordNet and eXtended WordNet. Our algorithm and LKB combination compares favorably to other knowledge-based approaches in the literature that use similar knowledge, on a variety of English data sets and on a Spanish data set. We include a detailed analysis of the factors that affect the algorithm. The algorithm and the LKBs used are publicly available, and the results are easily reproducible.
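The random-walk idea in the abstract above can be illustrated with a minimal Python sketch of personalized PageRank over a toy graph. The graph, the sense labels (such as bank#finance), the undirected edges, and the parameter values are all illustrative assumptions; they are not the article's LKB or its published settings.

```python
from collections import defaultdict


def personalized_pagerank(edges, seeds, damping=0.85, iterations=100):
    """Power iteration for PageRank whose teleport mass sits on `seeds`."""
    # Build an undirected adjacency map (an assumption made for simplicity).
    neighbors = defaultdict(set)
    for a, b in edges:
        neighbors[a].add(b)
        neighbors[b].add(a)

    # Teleport vector: uniform over the seed nodes present in the graph.
    seeds = set(seeds) & set(neighbors)
    teleport = {n: (1.0 / len(seeds) if n in seeds else 0.0) for n in neighbors}

    rank = dict(teleport)
    for _ in range(iterations):
        new_rank = {}
        for n in neighbors:
            # Each neighbor m spreads its rank evenly over its own edges.
            incoming = sum(rank[m] / len(neighbors[m]) for m in neighbors[n])
            new_rank[n] = (1 - damping) * teleport[n] + damping * incoming
        rank = new_rank
    return rank


# Hypothetical toy graph: word nodes link to their candidate senses,
# and sense nodes link to related senses.
EDGES = [
    ("bank", "bank#finance"),
    ("bank", "bank#river"),
    ("money", "money#currency"),
    ("bank#finance", "money#currency"),
    ("bank#river", "river#stream"),
]

# Context "bank ... money": the walk restarts from the context word nodes.
rank = personalized_pagerank(EDGES, seeds={"bank", "money"})
best = max(["bank#finance", "bank#river"], key=rank.get)
print(best)  # prints "bank#finance"
```

In this sketch the financial sense wins because it also receives rank from the sense of the other context word, which is the intuition behind concentrating the restart probability on the context.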

A Knowledge-Based Sense Disambiguation Method to Semantically Enhanced NL Question for Restricted Domain

Information ◽

10.3390/info12110452 ◽

2021 ◽

Vol 12 (11) ◽

pp. 452

Author(s):

Ammar Arbaaeen ◽

Asadullah Shah

Keyword(s):

Natural Language ◽

Language Processing ◽

Question Answering ◽

Word Sense Disambiguation ◽

Knowledge Bases ◽

Word Sense ◽

Intended Meaning ◽

Lexical Semantic ◽

Knowledge Based ◽

Sense Disambiguation

In question answering (QA) systems, the module most critical to overall performance is question analysis. Extracting the lexical semantics of a Natural Language (NL) question presents challenges at the syntactic and semantic levels for most QA systems. This is due to the mismatch between the words used by a user and the terms stored in the knowledge bases. Many studies have achieved encouraging results in lexical semantic resolution on the topic of word sense disambiguation (WSD), and several other works consider these challenges in the context of QA applications. Additionally, few scholars have examined the role of WSD in returning potential answers corresponding to particular questions. However, natural language processing (NLP) still faces several challenges in resolving various ambiguities to determine the precise meaning. Therefore, the motivation of this work is to propose a novel knowledge-based sense disambiguation (KSD) method for resolving the problem of lexical ambiguity associated with questions posed in QA systems. The major contribution is the proposed method, which incorporates multiple knowledge sources, including the question’s metadata (date/GPS), context knowledge, and a domain ontology, into shallow NLP. The proposed KSD method is developed into a tool for a mobile QA application that aims to determine the intended meaning of questions expressed by pilgrims. The experimental results reveal that our method achieved accuracy comparable to or better than the baselines in the context of the pilgrimage domain.

A Weakly supervised word sense disambiguation for Polish using rich lexical resources

Poznan Studies in Contemporary Linguistics ◽

10.1515/psicl-2019-0013 ◽

2019 ◽

Vol 55 (2) ◽

pp. 339-365

Author(s):

Arkadiusz Janz ◽

Maciej Piasecki

Keyword(s):

Language Processing ◽

Word Sense Disambiguation ◽

Knowledge Bases ◽

Semantic Knowledge ◽

Training Data ◽

Word Sense ◽

Lexical Resources ◽

Knowledge Based ◽

Wide Range ◽

Sense Disambiguation

Automatic word sense disambiguation (WSD) has proven to be an important technique in many natural language processing tasks. For many years the problem of sense disambiguation has been approached with a wide range of methods; however, it is still a challenging problem, especially in the unsupervised setting. Among the well-known and successful approaches to WSD are knowledge-based methods leveraging lexical knowledge resources such as wordnets. As the knowledge-based approaches mostly do not use any labelled training data, their performance strongly relies on the structure and the quality of the knowledge sources used. However, a pure knowledge base such as a wordnet cannot reflect all the semantic knowledge necessary to correctly disambiguate word senses in text. In this paper we explore various expansions to plWordNet as knowledge bases for WSD. Semantic links extracted from a large valency lexicon (Walenty), glosses and usage examples, Wikipedia articles and the SUMO ontology are combined with plWordNet and tested in a PageRank-based WSD algorithm. In addition, we also analyse the influence of lexical semantics vector models extracted with the help of distributional semantics methods. Several new Polish test data sets for WSD are also introduced. All the resources, methods and tools are available on open licences.

Enhancing Word Sense Disambiguation Using A Hybrid Knowledge-Based Technique

Natural Language Processing and Cognitive Science ◽

10.1515/9781501501289.15 ◽

2015 ◽

Author(s):

Eniafe Festus Ayetiran ◽

Guido Boella ◽

Luigi Di Caro ◽

Livio Robaldo

Keyword(s):

Word Sense Disambiguation ◽

Word Sense ◽

Knowledge Based ◽

Sense Disambiguation ◽

Hybrid Knowledge

Knowledge-based biomedical word sense disambiguation: an evaluation and application to clinical document classification

Journal of the American Medical Informatics Association ◽

10.1136/amiajnl-2012-001350 ◽

2013 ◽

Vol 20 (5) ◽

pp. 882-886 ◽

Cited By ~ 15

Author(s):

Vijay N Garla ◽

Cynthia Brandt

Keyword(s):

Word Sense Disambiguation ◽

Document Classification ◽

Word Sense ◽

Knowledge Based ◽

Clinical Document ◽

Sense Disambiguation

Knowledge-Based Biomedical Word Sense Disambiguation with Neural Concept Embeddings

2017 IEEE 17th International Conference on Bioinformatics and Bioengineering (BIBE) ◽

10.1109/bibe.2017.00-61 ◽

2017 ◽

Cited By ~ 5

Author(s):

Akm Sabbir ◽

Antonio Jimeno-Yepes ◽

Ramakanth Kavuluru

Keyword(s):

Word Sense Disambiguation ◽

Word Sense ◽

Knowledge Based ◽

Sense Disambiguation

Word Sense Disambiguation

Emerging Applications of Natural Language Processing ◽

10.4018/978-1-4666-2169-5.ch002 ◽

2013 ◽

pp. 22-51

Author(s):

Pushpak Bhattacharyya ◽

Mitesh Khapra

Keyword(s):

State Of The Art ◽

Word Sense Disambiguation ◽

Current Trend ◽

General Purpose ◽

Word Sense ◽

Domain Specific ◽

Knowledge Based ◽

Current State ◽

Sense Disambiguation ◽

State Of Affairs

This chapter discusses the basic concepts of Word Sense Disambiguation (WSD) and the approaches to solving this problem. Both general-purpose WSD and domain-specific WSD are presented. The first part of the discussion focuses on existing approaches for WSD, including knowledge-based, supervised, semi-supervised, unsupervised, hybrid, and bilingual approaches. The accuracy of general-purpose WSD currently seems to be pegged at around 65%. This has motivated investigations into domain-specific WSD, which is the current trend in the field. In the latter part of the chapter, we present a greedy, neural-network-inspired algorithm for domain-specific WSD and compare its performance with other state-of-the-art algorithms for WSD. Our experiments suggest that for domain-specific WSD, simply selecting the most frequent sense of a word does as well as any state-of-the-art algorithm.
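The most-frequent-sense baseline mentioned at the end of this abstract can be sketched in a few lines of Python. The function names, the tiny sense-tagged sample, and the sense labels are hypothetical and serve only to illustrate the idea; they do not come from the chapter.

```python
from collections import Counter, defaultdict


def train_mfs(tagged_tokens):
    """Map each word to the sense it carries most often in the tagged data."""
    counts = defaultdict(Counter)
    for word, sense in tagged_tokens:
        counts[word][sense] += 1
    return {word: c.most_common(1)[0][0] for word, c in counts.items()}


def predict_mfs(model, word, fallback=None):
    """Return the stored most frequent sense, or `fallback` for unseen words."""
    return model.get(word, fallback)


# Hypothetical sense-tagged sample from a single domain.
TRAIN = [
    ("bank", "bank#finance"),
    ("bank", "bank#finance"),
    ("bank", "bank#river"),
    ("interest", "interest#money"),
]

model = train_mfs(TRAIN)
print(predict_mfs(model, "bank"))  # prints "bank#finance"
```

The baseline ignores context entirely, which is why it tends to be strong when a domain skews each word toward one dominant sense.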

deepBioWSD: effective deep neural word sense disambiguation of biomedical text data

Journal of the American Medical Informatics Association ◽

10.1093/jamia/ocy189 ◽

2019 ◽

Vol 26 (5) ◽

pp. 438-446 ◽

Cited By ~ 3

Author(s):

Ahmad Pesaranghader ◽

Stan Matwin ◽

Marina Sokolova ◽

Ali Pesaranghader

Keyword(s):

Language Processing ◽

Short Term Memory ◽

Word Sense Disambiguation ◽

Training Data ◽

Biomedical Text ◽

Word Sense ◽

Vocabulary Size ◽

Unified Medical Language System ◽

Knowledge Based ◽

Sense Disambiguation

Objective: In biomedicine, there is a wealth of information hidden in unstructured narratives such as research articles and clinical reports. To exploit these data properly, a word sense disambiguation (WSD) algorithm prevents downstream difficulties in the natural language processing applications pipeline. Supervised WSD algorithms largely outperform un- or semisupervised and knowledge-based methods; however, they train 1 separate classifier for each ambiguous term, necessitating a large amount of expert-labeled training data, an unattainable goal in medical informatics. To alleviate this need, a single model that shares statistical strength across all instances and scales well with the vocabulary size is desirable. Materials and Methods: Built on recent advances in deep learning, our deepBioWSD model leverages 1 single bidirectional long short-term memory network that makes sense predictions for any ambiguous term. In the model, the Unified Medical Language System sense embeddings are first computed using their text definitions; then, after the network is initialized with these embeddings, it is trained on all (available) training data collectively. This method also considers a novel technique for automatic collection of training data from PubMed to (pre)train the network in an unsupervised manner. Results: We use the MSH WSD dataset to compare WSD algorithms, with macro and micro accuracies employed as evaluation metrics. deepBioWSD outperforms existing models in biomedical text WSD by achieving the state-of-the-art performance of 96.82% for macro accuracy. Conclusions: Apart from the disambiguation improvement and unsupervised training, deepBioWSD depends on considerably fewer expert-labeled data, as it learns the target and the context terms jointly. These merits make deepBioWSD conveniently deployable in real-time biomedical applications.
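The two evaluation metrics named in this abstract differ in how they weight ambiguous terms. The Python sketch below assumes the common definitions: micro accuracy pools all test instances, while macro accuracy averages the per-term accuracies. The function name and the sample records are hypothetical and are not taken from the MSH WSD dataset.

```python
from collections import defaultdict


def micro_macro_accuracy(records):
    """records: iterable of (term, gold_sense, predicted_sense) tuples."""
    per_term = defaultdict(lambda: [0, 0])  # term -> [correct, total]
    for term, gold, pred in records:
        per_term[term][0] += int(gold == pred)
        per_term[term][1] += 1

    total_correct = sum(correct for correct, _ in per_term.values())
    total = sum(count for _, count in per_term.values())

    micro = total_correct / total  # every instance weighs the same
    macro = sum(c / t for c, t in per_term.values()) / len(per_term)  # every term weighs the same
    return micro, macro


# Hypothetical predictions: "cold" has 4 instances (3 correct), "BAT" has 1 (correct).
RECORDS = [
    ("cold", "temperature", "temperature"),
    ("cold", "temperature", "temperature"),
    ("cold", "illness", "illness"),
    ("cold", "illness", "temperature"),
    ("BAT", "animal", "animal"),
]

micro, macro = micro_macro_accuracy(RECORDS)
print(micro, macro)  # prints 0.8 0.875
```

The two numbers diverge whenever terms have unequal instance counts, so a reported macro accuracy should not be compared directly against a micro accuracy from another system.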

An approach to knowledge-based Word Sense Disambiguation using semantic trees built on a WordNet lexicon network

2011 6th Conference on Speech Technology and Human-Computer Dialogue (SpeD) ◽

10.1109/sped.2011.5940744 ◽

2011 ◽

Author(s):

Andrei Minca ◽

Stefan Diaconescu

Keyword(s):

Word Sense Disambiguation ◽

Word Sense ◽

Knowledge Based ◽

Sense Disambiguation

Knowledge Based Approaches To Nepali Word Sense Disambiguation

International Journal on Natural Language Computing ◽

10.5121/ijnlc.2014.3305 ◽

2014 ◽

Vol 3 (3) ◽

pp. 51-63 ◽

Cited By ~ 7

Author(s):

Arindam Roy ◽

Sunita Sarkar ◽

Bipul Syam Purkayastha

Keyword(s):

Word Sense Disambiguation ◽

Word Sense ◽

Knowledge Based ◽

Sense Disambiguation

A Novel Approach to Word Sense Disambiguation Based on Topical and Semantic Association

The Scientific World Journal ◽

10.1155/2013/586327 ◽

2013 ◽

Vol 2013 ◽

pp. 1-8 ◽

Cited By ~ 2

Author(s):

Xin Wang ◽

Wanli Zuo ◽

Ying Wang

Keyword(s):

Language Processing ◽

Fundamental Problem ◽

Word Sense Disambiguation ◽

Ambiguous Word ◽

Semantic Features ◽

Word Sense ◽

Semantic Association ◽

Data Set ◽

Novel Approach ◽

Sense Disambiguation

Word sense disambiguation (WSD) is a fundamental problem in natural language processing, the objective of which is to identify the most appropriate sense for an ambiguous word in a given context. Although WSD has been researched over the years, the performance of existing algorithms in terms of accuracy and recall is still unsatisfactory. In this paper, we propose a novel approach to word sense disambiguation based on topical and semantic association. For a given document, supposing that its topic category is accurately discriminated, the correct sense of the ambiguous term is identified through the corresponding topic and semantic contexts. We first extract topic-discriminative terms from the document and construct a topical graph based on topic span intervals to implement topic identification. We then exploit syntactic features, topic span features, and semantic features to disambiguate nouns and verbs in the context of the ambiguous word. Finally, we conduct experiments on the standard data set SemCor to evaluate the performance of the proposed method, and the results indicate that our approach achieves relatively better performance than existing approaches.
