CO-graph: A new graph-based technique for cross-lingual word sense disambiguation

AbstractIn this paper, we present a new method based on co-occurrence graphs for performing Cross-Lingual Word Sense Disambiguation (CLWSD). The proposed approach comprises the automatic generation of bilingual dictionaries, and a new technique for the construction of a co-occurrence graph used to select the most suitable translations from the dictionary. Different algorithms that combine both the dictionary and the co-occurrence graph are then used for performing this selection of the final translations: techniques based on sub-graphs (communities) containing clusters of words with related meanings, based on distances between nodes representing words, and based on the relative importance of each node in the whole graph. The initial output of the system is enhanced with translation probabilities, provided by a statistical bilingual dictionary. The system is evaluated using datasets from two competitions: task 3 of SemEval 2010, and task 10 of SemEval 2013. Results obtained by the different disambiguation techniques are analysed and compared to those obtained by the systems participating in the competitions. Our system offers the best results in comparison with other unsupervised systems in most of the experiments, and even overcomes supervised systems in some cases.

Download Full-text

Choosing the best dictionary for Cross-Lingual Word Sense Disambiguation

Knowledge-Based Systems ◽

10.1016/j.knosys.2015.02.007 ◽

2015 ◽

Vol 81 ◽

pp. 65-75 ◽

Cited By ~ 5

Author(s):

Andres Duque ◽

Juan Martinez-Romo ◽

Lourdes Araujo

Keyword(s):

Word Sense Disambiguation ◽

Word Sense ◽

Sense Disambiguation ◽

Cross Lingual

Download Full-text

Cross-Lingual Word Sense Disambiguation for Languages with Scarce Resources

Advances in Artificial Intelligence - Lecture Notes in Computer Science ◽

10.1007/978-3-642-21043-3_42 ◽

2011 ◽

pp. 347-358 ◽

Cited By ~ 3

Author(s):

Bahareh Sarrafzadeh ◽

Nikolay Yakovets ◽

Nick Cercone ◽

Aijun An

Keyword(s):

Word Sense Disambiguation ◽

Word Sense ◽

Scarce Resources ◽

Sense Disambiguation ◽

Cross Lingual

Download Full-text

Ontology Matching using BabelNet Dictionary and Word Sense Disambiguation Algorithms

Indonesian Journal of Electrical Engineering and Computer Science ◽

10.11591/ijeecs.v5.i1.pp196-205 ◽

2017 ◽

Vol 5 (1) ◽

pp. 196 ◽

Cited By ~ 5

Author(s):

Mohamed Biniz ◽

Rachid El Ayachi ◽

Mohamed Fakir

Keyword(s):

Natural Language Processing ◽

Language Processing ◽

Word Sense Disambiguation ◽

Similarity Measures ◽

Ontology Matching ◽

Word Sense ◽

Sense Disambiguation ◽

Lesk Algorithm ◽

Reference Ontology ◽

Selection Of

<p>Ontology matching is a discipline that means two things: first, the process of discovering correspondences between two different ontologies, and second is the result of this process, that is to say the expression of correspondences. This discipline is a crucial task to solve problems merging and evolving of heterogeneous ontologies in applications of the Semantic Web. This domain imposes several challenges, among them, the selection of appropriate similarity measures to discover the correspondences. In this article, we are interested to study algorithms that calculate the semantic similarity by using Adapted Lesk algorithm, Wu & Palmer Algorithm, Resnik Algorithm, Leacock and Chodorow Algorithm, and similarity flooding between two ontologies and BabelNet as reference ontology, we implement them, and compared experimentally. Overall, the most effective methods are Wu & Palmer and Adapted Lesk, which is widely used for Word Sense Disambiguation (WSD) in the field of Automatic Natural Language Processing (NLP).</p>

Download Full-text

SBFC: An Efficient Feature Frequency-Based Approach to Tackle Cross-Lingual Word Sense Disambiguation

Text, Speech and Dialogue - Lecture Notes in Computer Science ◽

10.1007/978-3-642-32790-2_30 ◽

2012 ◽

pp. 248-255

Author(s):

Dieter Mourisse ◽

Els Lefever ◽

Nele Verbiest ◽

Yvan Saeys ◽

Martine De Cock ◽

...

Keyword(s):

Word Sense Disambiguation ◽

Word Sense ◽

Sense Disambiguation ◽

Cross Lingual ◽

Feature Frequency

Download Full-text

Improving selection of synsets from WordNet for domain-specific word sense disambiguation

Computer Speech & Language ◽

10.1016/j.csl.2016.06.003 ◽

2017 ◽

Vol 41 ◽

pp. 128-145 ◽

Cited By ~ 8

Author(s):

Ivan Lopez-Arevalo ◽

Victor J. Sosa-Sosa ◽

Franco Rojas-Lopez ◽

Edgar Tello-Leal

Keyword(s):

Word Sense Disambiguation ◽

Word Sense ◽

Domain Specific ◽

Sense Disambiguation ◽

Selection Of

Download Full-text

UoB_UK at SemEval 2021 Task 2: Zero-Shot and Few-Shot Learning for Multi-lingual and Cross-lingual Word Sense Disambiguation.

10.18653/v1/2021.semeval-1.97 ◽

2021 ◽

Author(s):

Wei Li ◽

Harish Tayyar Madabushi ◽

Mark Lee

Keyword(s):

Word Sense Disambiguation ◽

Word Sense ◽

Sense Disambiguation ◽

Cross Lingual

Download Full-text

Ontology-Supported Text Classification Based on Cross-Lingual Word Sense Disambiguation

Applications of Fuzzy Sets Theory - Lecture Notes in Computer Science ◽

10.1007/978-3-540-73400-0_56 ◽

2007 ◽

pp. 447-455 ◽

Cited By ~ 2

Author(s):

Dan Tufiş ◽

Svetla Koeva

Keyword(s):

Text Classification ◽

Word Sense Disambiguation ◽

Word Sense ◽

Sense Disambiguation ◽

Cross Lingual

Download Full-text

IXA at CLEF 2008 Robust-WSD Task: Using Word Sense Disambiguation for (Cross Lingual) Information Retrieval

Lecture Notes in Computer Science - Evaluating Systems for Multilingual and Multimodal Information Access ◽

10.1007/978-3-642-04447-2_14 ◽

2009 ◽

pp. 118-125 ◽

Cited By ~ 3

Author(s):

Eneko Agirre ◽

Arantxa Otegi ◽

German Rigau

Keyword(s):

Information Retrieval ◽

Word Sense Disambiguation ◽

Word Sense ◽

Sense Disambiguation ◽

Cross Lingual

Download Full-text

A Naïve Bayes Approach to Cross-Lingual Word Sense Disambiguation and Lexical Substitution

Advances in Pattern Recognition - Lecture Notes in Computer Science ◽

10.1007/978-3-642-15992-3_37 ◽

2010 ◽

pp. 352-361 ◽

Cited By ~ 1

Author(s):

David Pinto ◽

Darnes Vilariño ◽

Carlos Balderas ◽

Mireya Tovar ◽

Beatriz Beltrán

Keyword(s):

Naive Bayes ◽

Word Sense Disambiguation ◽

Naïve Bayes ◽

Word Sense ◽

Sense Disambiguation ◽

Lexical Substitution ◽

Cross Lingual ◽

Bayes Approach

Download Full-text

Automatic Wordnet Development for Low-Resource Languages using Cross-Lingual WSD

Journal of Artificial Intelligence Research ◽

10.1613/jair.4968 ◽

2016 ◽

Vol 56 ◽

pp. 61-87 ◽

Cited By ~ 5

Author(s):

Nasrin Taghizadeh ◽

Hesham Faili

Keyword(s):

Language Processing ◽

Semantic Processing ◽

Large Scale ◽

Word Sense Disambiguation ◽

Expectation Maximization Algorithm ◽

Word Sense ◽

Low Resource ◽

Persian Language ◽

Sense Disambiguation ◽

Cross Lingual

‎Wordnets are an effective resource for natural language processing and information retrieval‎, ‎especially for semantic processing and meaning related tasks‎. ‎So far‎, ‎wordnets have been constructed for many languages‎. ‎However‎, ‎the automatic development of wordnets for low-resource languages has not been well studied‎. ‎In this paper‎, ‎an Expectation-Maximization algorithm is used to create high quality and large scale wordnets for poor-resource languages‎. ‎The proposed method benefits from possessing cross-lingual word sense disambiguation and develops a wordnet by only using a bi-lingual dictionary and a mono-lingual corpus‎. ‎The proposed method has been executed with Persian language and the resulting wordnet has been evaluated through several experiments‎. ‎The results show that the induced wordnet has a precision score of 90% and a recall score of 35%‎.

Download Full-text