Gold standard datasets for evaluating word sense disambiguation programs

Adam Kilgarriff

doi:10.1006/csla.1998.0108

Exemplification Modeling: Can You Give Me an Example, Please?

Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2021/520 ◽

2021 ◽

Author(s):

Edoardo Barba ◽

Luigi Procopio ◽

Caterina Lacerra ◽

Tommaso Pasini ◽

Roberto Navigli

Keyword(s):

Gold Standard ◽

State Of The Art ◽

Word Sense Disambiguation ◽

Full Range ◽

Training Data ◽

Training Procedure ◽

Word Sense ◽

The Novel ◽

Current State ◽

Sense Disambiguation

Recently, generative approaches have been used effectively to provide definitions of words in their context. However, the opposite, i.e., generating a usage example given one or more words along with their definitions, has not yet been investigated. In this work, we introduce the novel task of Exemplification Modeling (ExMod), along with a sequence-to-sequence architecture and a training procedure for it. Starting from a set of (word, definition) pairs, our approach is capable of automatically generating high-quality sentences which express the requested semantics. As a result, we can drive the creation of sense-tagged data which cover the full range of meanings in any inventory of interest, and their interactions within sentences. Human annotators agree that the sentences generated are as fluent and semantically-coherent with the input definitions as the sentences in manually-annotated corpora. Indeed, when employed as training data for Word Sense Disambiguation, our examples enable the current state of the art to be outperformed, and higher results to be achieved than when using gold-standard datasets only. We release the pretrained model, the dataset and the software at https://github.com/SapienzaNLP/exmod.

Download Full-text

Building Synonym Set for Indonesian WordNet using Commutative Method and Hierarchical Clustering

JURNAL MEDIA INFORMATIKA BUDIDARMA ◽

10.30865/mib.v4i3.2254 ◽

2020 ◽

Vol 4 (3) ◽

pp. 778

Author(s):

Valentino Rossi Fierdaus ◽

Moch Arif Bijaksana ◽

Widi Astuti

Keyword(s):

Hierarchical Clustering ◽

Gold Standard ◽

Word Sense Disambiguation ◽

Word Sense ◽

Agglomerative Hierarchical Clustering ◽

Clustering Method ◽

Sense Disambiguation ◽

Performance And Evaluation ◽

F Measure

WordNet is a compilation of Synonyms Set (synset), which consists of the words that have the same synonymous. The development of Indonesian WordNet has a goal to build an application that can accommodate and exhibit the relation of words. Synonym Set is a set composed of one or more words that have a similar meaning or synonym relation originated from the Indonesian Thesaurus. In previous studies, the establishment of synsets were transmitted with several approaches, one of which was the cluster ring to produce synsets and WSD (Word Sense Disambiguation). In this research, research is held up to discover the semantic similarities between words in the Indonesian Thesaurus automatically, and also to know the performance of the Agglomerative Hierarchical Clustering method for the development of Indonesian synsets. To calculate performance and evaluation, this research is using the F-measure method involving the gold standard

Download Full-text

Chinese Word Sense Disambiguation Based on Maximum Entropy Model with Feature Selection

Journal of Software ◽

10.3724/sp.j.1001.2010.03591 ◽

2010 ◽

Vol 21 (6) ◽

pp. 1287-1295 ◽

Cited By ~ 7

Author(s):

Jing-Zhou HE ◽

Hou-Feng WANG

Keyword(s):

Feature Selection ◽

Maximum Entropy ◽

Word Sense Disambiguation ◽

Word Sense ◽

Chinese Word ◽

Maximum Entropy Model ◽

Entropy Model ◽

Sense Disambiguation

Download Full-text

Word Sense Disambiguation Based on Dependency Fitness with Automatic Knowledge Acquisition

Journal of Software ◽

10.3724/sp.j.1001.2013.04373 ◽

2014 ◽

Vol 24 (10) ◽

pp. 2300-2311 ◽

Cited By ~ 2

Author(s):

Wen-Peng LU ◽

He-Yan HUANG

Keyword(s):

Knowledge Acquisition ◽

Word Sense Disambiguation ◽

Word Sense ◽

Sense Disambiguation ◽

Automatic Knowledge Acquisition

Download Full-text

Graph Based Word Sense Disambiguation Method Using Distance Between Words

Journal of Software ◽

10.3724/sp.j.1001.2012.04116 ◽

2012 ◽

Vol 23 (4) ◽

pp. 776-785 ◽

Cited By ~ 2

Author(s):

Zhi-Zhuo YANG ◽

He-Yan HUANG

Keyword(s):

Word Sense Disambiguation ◽

Word Sense ◽

Sense Disambiguation

Download Full-text

Margin perceptron for word sense disambiguation

Proceedings of the 2010 Symposium on Information and Communication Technology - SoICT '10 ◽

10.1145/1852611.1852625 ◽

2010 ◽

Cited By ~ 3

Author(s):

Kiem-Hieu Nguyen ◽

Cheol-Young Ock

Keyword(s):

Word Sense Disambiguation ◽

Word Sense ◽

Sense Disambiguation

Download Full-text

Multimodal Word Sense Disambiguation in Creative Practice

2020 19th IEEE International Conference on Machine Learning and Applications (ICMLA) ◽

10.1109/icmla51294.2020.00055 ◽

2020 ◽

Author(s):

Manuel Ladron de Guevara ◽

Christopher George ◽

Akshat Gupta ◽

Daragh Byrne ◽

Ramesh Krishnamurti

Keyword(s):

Word Sense Disambiguation ◽

Word Sense ◽

Creative Practice ◽

Sense Disambiguation

Download Full-text

Enhancing Word Sense Disambiguation Using A Hybrid Knowledge-Based Technique

Natural Language Processing and Cognitive Science ◽

10.1515/9781501501289.15 ◽

2015 ◽

Author(s):

Eniafe Festus Ayetiran ◽

Guido Boella ◽

Luigi Di Caro ◽

Livio Robaldo

Keyword(s):

Word Sense Disambiguation ◽

Word Sense ◽

Knowledge Based ◽

Sense Disambiguation ◽

Hybrid Knowledge

Download Full-text

Spreading semantic information by Word Sense Disambiguation

Knowledge-Based Systems ◽

10.1016/j.knosys.2017.06.013 ◽

2017 ◽

Vol 132 ◽

pp. 47-61 ◽

Cited By ~ 5

Author(s):

Yoan Gutiérrez ◽

Sonia Vázquez ◽

Andrés Montoyo

Keyword(s):

Semantic Information ◽

Word Sense Disambiguation ◽

Word Sense ◽

Sense Disambiguation

Download Full-text

Word Sense Disambiguation in the Biomedical Domain: An Overview

Journal of Computational Biology ◽

10.1089/cmb.2005.12.554 ◽

2005 ◽

Vol 12 (5) ◽

pp. 554-565 ◽

Cited By ~ 40

Author(s):

Martijn J. Schuemie ◽

Jan A. Kors ◽

Barend Mons

Keyword(s):

Word Sense Disambiguation ◽

Biomedical Domain ◽

Word Sense ◽

Sense Disambiguation

Download Full-text