Word Sense Clustering and Clusterability

2016 ◽  
Vol 42 (2) ◽  
pp. 245-275 ◽  
Author(s):  
Diana McCarthy ◽  
Marianna Apidianaki ◽  
Katrin Erk

Word sense disambiguation and the related field of automated word sense induction traditionally assume that the occurrences of a lemma can be partitioned into senses. But this seems to be a much easier task for some lemmas than others. Our work builds on recent work that proposes describing word meaning in a graded fashion rather than through a strict partition into senses; in this article we argue that not all lemmas may need the more complex graded analysis, depending on their partitionability. Although there is plenty of evidence from previous studies and from the linguistics literature that there is a spectrum of partitionability of word meanings, this is the first attempt to measure the phenomenon and to couple the machine learning literature on clusterability with word usage data used in computational linguistics. We propose to operationalize partitionability as clusterability, a measure of how easy the occurrences of a lemma are to cluster. We test two ways of measuring clusterability: (1) existing measures from the machine learning literature that aim to measure the goodness of optimal k-means clusterings, and (2) the idea that if a lemma is more clusterable, two clusterings based on two different “views” of the same data points will be more congruent. The two views that we use are two different sets of manually constructed lexical substitutes for the target lemma, on the one hand monolingual paraphrases, and on the other hand translations. We apply automatic clustering to the manual annotations. We use manual annotations because we want the representations of the instances that we cluster to be as informative and “clean” as possible. We show that when we control for polysemy, our measures of clusterability tend to correlate with partitionability, in particular some of the type-(1) clusterability measures, and that these measures outperform a baseline that relies on the amount of overlap in a soft clustering.
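
The two clusterability ideas lend themselves to a compact illustration. Below is a minimal sketch, not the authors' code: it measures type-(1) goodness of a k-means clustering with the silhouette score, and type-(2) congruence between clusterings of two views (paraphrase-based and translation-based instance vectors) with the adjusted Rand index. The vector encoding of the substitute annotations is an assumption.

```python
# Sketch of two clusterability measures over occurrence vectors of one lemma.
from sklearn.cluster import KMeans
from sklearn.metrics import silhouette_score, adjusted_rand_score

def kmeans_goodness(X, k):
    """Type-(1) measure: how cleanly the occurrences split into k clusters."""
    labels = KMeans(n_clusters=k, n_init=10, random_state=0).fit_predict(X)
    return silhouette_score(X, labels)

def view_congruence(X_paraphrase, X_translation, k):
    """Type-(2) measure: agreement between clusterings of the same
    occurrences under two different substitute-based views."""
    p = KMeans(n_clusters=k, n_init=10, random_state=0).fit_predict(X_paraphrase)
    t = KMeans(n_clusters=k, n_init=10, random_state=0).fit_predict(X_translation)
    return adjusted_rand_score(p, t)
```

On either measure, higher values would mark the lemma as more clusterable, and hence more partitionable.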

2013 ◽  
Vol 39 (3) ◽  
pp. 511-554 ◽  
Author(s):  
Katrin Erk ◽  
Diana McCarthy ◽  
Nicholas Gaylord

Word sense disambiguation (WSD) is an old and important task in computational linguistics that still remains challenging, to machines as well as to human annotators. Recently there have been several proposals for representing word meaning in context that diverge from the traditional use of a single best sense for each occurrence. They represent word meaning in context through multiple paraphrases, as points in vector space, or as distributions over latent senses. New methods of evaluating and comparing these different representations are needed. In this paper we propose two novel annotation schemes that characterize word meaning in context in a graded fashion. In WSsim annotation, the applicability of each dictionary sense is rated on an ordinal scale. Usim annotation directly rates the similarity of pairs of usages of the same lemma, again on a scale. We find that the novel annotation schemes show good inter-annotator agreement, as well as a strong correlation with traditional single-sense annotation and with annotation of multiple lexical paraphrases. Annotators make use of the whole ordinal scale, and give very fine-grained judgments that “mix and match” senses for each individual usage. We also find that the Usim ratings obey the triangle inequality, justifying models that treat usage similarity as metric. There has recently been much work on grouping senses into coarse-grained groups. We demonstrate that graded WSsim and Usim ratings can be used to analyze existing coarse-grained sense groupings to identify sense groups that may not match intuitions of untrained native speakers. In the course of the comparison, we also show that the WSsim ratings are not subsumed by any static sense grouping.
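
Since the paper reports that Usim ratings obey the triangle inequality, that property is easy to sketch as a check (a toy illustration with hypothetical ratings, not the paper's data): similarities on a 1-5 scale are converted to distances, and every triple of usages is tested.

```python
from itertools import combinations

def count_triangle_violations(usim, max_rating=5.0):
    """Check whether Usim-style similarity ratings behave metrically.
    `usim` maps frozenset({usage_a, usage_b}) -> mean similarity rating
    on a 1..max_rating scale; ratings are turned into distances first."""
    dist = {pair: max_rating - sim for pair, sim in usim.items()}
    usages = sorted({u for pair in usim for u in pair})
    violations = 0
    for a, b, c in combinations(usages, 3):
        ab, bc, ac = (dist.get(frozenset(p))
                      for p in ((a, b), (b, c), (a, c)))
        if None in (ab, bc, ac):
            continue  # skip triples with an unrated pair
        # d(a,c) <= d(a,b) + d(b,c), checked in all three orientations
        if ac > ab + bc or ab > ac + bc or bc > ab + ac:
            violations += 1
    return violations

# Toy ratings for three usages of one lemma (1 = unrelated, 5 = same meaning).
ratings = {frozenset((1, 2)): 4.8, frozenset((1, 3)): 1.5, frozenset((2, 3)): 1.7}
print(count_triangle_violations(ratings))  # -> 0: these ratings are metric
```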


Author(s):  
Raymond J. Mooney

This chapter introduces symbolic machine learning in which decision trees, rules, or case-based classifiers are induced from supervised training examples. It describes the representation of knowledge assumed by each of these approaches and reviews basic algorithms for inducing such representations from annotated training examples and using the acquired knowledge to classify future instances. It also briefly reviews unsupervised learning, in which new concepts are formed from unannotated examples by clustering them into coherent groups. These techniques can be applied to learn knowledge required for a variety of problems in computational linguistics ranging from part-of-speech tagging and syntactic parsing to word sense disambiguation and anaphora resolution. Applications to a variety of these problems are reviewed.
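
As a concrete illustration of the supervised setting described here, the following minimal sketch (toy data, hypothetical context features) induces a decision tree from annotated examples of the ambiguous noun "bank" and classifies a future instance:

```python
# Toy sketch: induce a decision tree from supervised training examples
# and use it to classify a new occurrence. Features and data are invented.
from sklearn.feature_extraction import DictVectorizer
from sklearn.tree import DecisionTreeClassifier

train = [({"prev": "river", "next": "of"}, "shore"),
         ({"prev": "savings", "next": "account"}, "finance"),
         ({"prev": "central", "next": "raised"}, "finance"),
         ({"prev": "muddy", "next": "eroded"}, "shore")]
X_dicts, y = zip(*train)
vec = DictVectorizer()                       # one-hot encode context features
X = vec.fit_transform(X_dicts)
tree = DecisionTreeClassifier(random_state=0).fit(X, list(y))
new = vec.transform([{"prev": "river", "next": "flooded"}])
print(tree.predict(new))                     # likely 'shore', via the prev=river cue
```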


2019 ◽  
Author(s):  
Jennifer M Rodd

This chapter focuses on the process by which stored knowledge about a word’s form (orthographic or phonological) maps onto stored knowledge about its meaning. This mapping is made challenging by the ambiguity that is ubiquitous in natural language: most familiar words can refer to multiple different concepts. This one-to-many mapping from form to meaning within the lexicon is a core feature of word-meaning access. Fluent, accurate word-meaning access requires that comprehenders integrate multiple cues in order to determine which of a word’s possible semantic features are relevant in the current context. Specifically, word-meaning access is guided by (i) distributional information about the a priori relative likelihoods of different word meanings and (ii) a wide range of contextual cues that indicate which meanings are most likely in the current context.
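
One way to make the interplay of these two cue types concrete, purely as an illustration and not a claim about the chapter's model, is Bayes' rule: the posterior over a word's meanings is proportional to the a priori meaning frequency times the likelihood of the observed contextual cues. All numbers below are hypothetical.

```python
# Hedged illustration: combine prior meaning frequency with contextual cues.
def meaning_posterior(prior, cue_likelihood, cues):
    """prior: P(meaning); cue_likelihood: P(cue | meaning); cues: observed cues."""
    scores = {}
    for m, p in prior.items():
        score = p
        for cue in cues:
            score *= cue_likelihood[m].get(cue, 1e-6)  # smooth unseen cues
        scores[m] = score
    z = sum(scores.values())
    return {m: s / z for m, s in scores.items()}

# "bark": a dominant animal-sound sense vs. a subordinate tree sense.
prior = {"dog-sound": 0.7, "tree-covering": 0.3}
likelihood = {"dog-sound": {"loud": 0.4, "dog": 0.5},
              "tree-covering": {"rough": 0.4, "oak": 0.5}}
print(meaning_posterior(prior, likelihood, ["oak"]))  # tree sense dominates
```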


2015 ◽  
Vol 54 ◽  
pp. 83-122 ◽  
Author(s):  
Ruben Izquierdo ◽  
Armando Suarez ◽  
German Rigau

As empirically demonstrated by the Word Sense Disambiguation (WSD) tasks of the last SensEval/SemEval exercises, assigning the appropriate meaning to words in context has so far resisted all attempts at a fully successful solution. Many authors argue that one possible reason could be the use of inappropriate sets of word meanings. In particular, WordNet has been used as a de facto standard repository of word meanings in most of these tasks. Thus, instead of using the word senses defined in WordNet, some approaches have derived semantic classes representing groups of word senses. However, the meanings represented by WordNet have only been used for WSD at a very fine-grained sense level or at a very coarse-grained semantic class level (also called SuperSenses). We suspect that an appropriate level of abstraction could lie between these two levels. The contributions of this paper are manifold. First, we propose a simple method to automatically derive semantic classes at intermediate levels of abstraction covering all nominal and verbal WordNet meanings. Second, we empirically demonstrate that our automatically derived semantic classes outperform classical approaches based on word senses and more coarse-grained sense groupings. Third, we demonstrate that our supervised WSD system benefits from using these new semantic classes as additional semantic features while reducing the number of training examples needed. Finally, we demonstrate the robustness of our supervised semantic class-based WSD system when tested on an out-of-domain corpus.
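
One plausible way to realize such intermediate abstraction levels, sketched below as an illustration rather than the authors' exact method, is to cut each sense's WordNet hypernym path at a fixed depth, yielding classes that sit between fine-grained senses and the coarse SuperSenses:

```python
# Sketch: map each WordNet sense to its ancestor at a fixed hypernym depth.
from nltk.corpus import wordnet as wn  # requires: nltk.download('wordnet')

def semantic_class(synset, depth=4):
    """Return the ancestor at `depth` on the synset's first hypernym path
    (or the synset itself if its path is shorter)."""
    path = synset.hypernym_paths()[0]          # root ... -> synset
    return path[min(depth, len(path) - 1)]

for s in wn.synsets("bank", pos=wn.NOUN)[:3]:
    print(s.name(), "->", semantic_class(s).name())
```

Varying `depth` moves the derived classes between the two extremes the paper describes: sense-level granularity at large depths, SuperSense-like coarseness near the root.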


Author(s):  
Marina Sokolova ◽  
Stan Szpakowicz

This chapter presents applications of machine learning techniques to traditional problems in natural language processing, including part-of-speech tagging, entity recognition, and word-sense disambiguation. People usually solve such problems without difficulty, or at least do a very good job. Linguistics may suggest labour-intensive ways of manually constructing rule-based systems. It is, however, the easy availability of large collections of texts that has made machine learning the method of choice for processing volumes of data well above human capacity. One of the main purposes of text processing is all manner of information and knowledge extraction from such large collections. The machine learning methods discussed in this chapter have stimulated wide-ranging research in natural language processing and helped build applications with serious deployment potential.


2017 ◽  
Vol 21 (1) ◽  
pp. 515-522 ◽  
Author(s):  
Muhammad Abid ◽  
Asad Habib ◽  
Jawad Ashraf ◽  
Abdul Shahid

Author(s):  
Marwah Alian ◽  
Arafat Awajan

The process of selecting the appropriate meaning of an ambiguous word according to its context is known as word sense disambiguation. In this research, we generate a number of Arabic sense inventories based on an unsupervised approach and different pre-trained embeddings, such as Aravec, fastText, and Arabic-News embeddings. The resulting inventories are evaluated to investigate their efficiency in Arabic word sense disambiguation and sentence similarity. The sense inventories are generated using an unsupervised approach based on a graph-based word sense induction algorithm. Results show that the Aravec-Twitter inventory achieves the best accuracy, 0.47, with 50 neighbors; it comes close to the fastText inventory with 200 neighbors and provides similar accuracy to the Arabic-News inventory with 100 neighbors. Replacing ambiguous words with their sense vectors is also tested for sentence similarity using all sense inventories, and the results show that the Aravec-Twitter sense inventory provides the best correlation values.
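
A hedged sketch of the graph-based induction step is given below; the embedding file, neighbor count, and similarity threshold are illustrative stand-ins, not the paper's settings. The idea: collect the target's nearest neighbors in the embedding space, link neighbors that are mutually similar, and read each connected component off as one induced sense.

```python
# Sketch of graph-based word sense induction over a pre-trained embedding.
import networkx as nx
from gensim.models import KeyedVectors

def induce_senses(kv, target, n_neighbors=50, sim_threshold=0.5):
    neighbors = [w for w, _ in kv.most_similar(target, topn=n_neighbors)]
    g = nx.Graph()
    g.add_nodes_from(neighbors)
    for i, u in enumerate(neighbors):
        for v in neighbors[i + 1:]:
            if kv.similarity(u, v) >= sim_threshold:
                g.add_edge(u, v)
    # Each connected component approximates one sense of the target word.
    return [sorted(c) for c in nx.connected_components(g)]

# kv = KeyedVectors.load_word2vec_format("aravec.bin", binary=True)  # hypothetical path
# print(induce_senses(kv, "عين"))  # Arabic "eye"/"spring", an ambiguous word
```

An instance of the target could then be assigned to the induced sense whose member words best match its context, and its sense vector (e.g., the centroid of the component) substituted in for sentence-similarity experiments.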

