Word Similarity Computing Based on HowNet and Synonymy Thesaurus

The role of cross-linguistic stress pattern frequency and word similarity on the acquisition of English stress pattern by native speakers of Brazilian Portuguese

Revista Diadorim ◽

10.35520/diadorim.2012.v12n0a3978 ◽

2012 ◽

Vol 12 ◽

Author(s):

Amanda Post Silveira

Keyword(s):

Second Language ◽

Native Speakers ◽

Mother Tongue ◽

American English ◽

Brazilian Portuguese ◽

Stress Pattern ◽

Target Language ◽

English As Second Language ◽

Word Similarity ◽

L1 Transfer

This is a preliminary study in which we investigate the acquisition of English as second language (L2[1]) word stress by native speakers of Brazilian Portuguese (BP, L1[2]). In this paper, we show results of a multiple choice forced choice perception test in which native speakers of American English and native speakers of Dutch judged the production of English words bearing pre-final stress that were both cognates and non-cognates with BP words. The tokens were produced by native speakers of American English and by Brazilians that speak English as a second language. The results have shown that American and Dutch listeners were consistent in their judgments on native and non-native stress productions and both speakers' groups produced variation in stress in relation to the canonical pattern. However, the variability found in American English points to the prosodic patterns of English and the variability found in Brazilian English points to the stress patterns of Portuguese. It occurs especially in words whose forms activate neighboring similar words in the L1. Transfer from the L1 appears both at segmental and prosodic levels in BP English. [1] L2 stands for second language, foreign language, target language. [2] L1 stands for first language, mother tongue, source language.

Download Full-text

Word Similarity Calculation by Using the Edit Distance Metrics with Consonant Normalization

Journal of Information Processing Systems ◽

10.3745/jips.04.0018 ◽

2015 ◽

Keyword(s):

Edit Distance ◽

Distance Metrics ◽

Word Similarity ◽

Similarity Calculation

Download Full-text

Analysis Accuracy of Similar Word Based Clustering (EWSB) Algorithm on Machine Translator Bahasa Indonesia-Minang

Kinetik Game Technology Information System Computer Network Computing Electronics and Control ◽

10.22219/kinetik.v3i3.241 ◽

2018 ◽

Vol 3 (3) ◽

Author(s):

Herry Sujaini

Keyword(s):

Machine Translation ◽

Clustering Algorithm ◽

Statistical Machine Translation ◽

Target Language ◽

Word Similarity ◽

Similar Word ◽

Word Clustering ◽

Translation Accuracy ◽

Bahasa Indonesia

Extended Word Similarity Based (EWSB) Clustering is a word clustering algorithm based on the value of words similarity obtained from the computation of a corpus. One of the benefits of clustering with this algorithm is to improve the translation of a statistical machine translation. Previous research proved that EWSB algorithm could improve the Indonesian-English translator, where the algorithm was applied to Indonesian language as target language.This paper discusses the results of a research using EWSB algorithm on a Indonesian to Minang statistical machine translator, where the algorithm is applied to Minang language as the target language. The research obtained resulted that the EWSB algorithm is quite effective when used in Minang language as the target language. The results of this study indicate that EWSB algorithm can improve the translation accuracy by 6.36%.

Download Full-text

Cyberbullying Detection, Based on the FastText and Word Similarity Schemes

ACM Transactions on Asian and Low-Resource Language Information Processing ◽

10.1145/3398191 ◽

2020 ◽

Vol 20 (1) ◽

pp. 1-15

Author(s):

Kun Wang ◽

Yanpeng Cui ◽

Jianwei Hu ◽

Yu Zhang ◽

Wei Zhao ◽

...

Keyword(s):

Word Similarity ◽

Cyberbullying Detection

Download Full-text

Intelligent recognition of semantic relationships based on antonymy

Multiagent and Grid Systems ◽

10.3233/mgs-200332 ◽

2020 ◽

Vol 16 (3) ◽

pp. 263-290

Author(s):

Hui Guan ◽

Chengzhen Jia ◽

Hongji Yang

Keyword(s):

Semantic Similarity ◽

New Approach ◽

Word Similarity ◽

Semantic Relationships ◽

Proposed Model ◽

Path Distance ◽

The Hierarchical Structure ◽

Thinking Process ◽

Similarity Measuring ◽

Intelligent Recognition

Since computing semantic similarity tends to simulate the thinking process of humans, semantic dissimilarity must play a part in this process. In this paper, we present a new approach for semantic similarity measuring by taking consideration of dissimilarity into the process of computation. Specifically, the proposed measures explore the potential antonymy in the hierarchical structure of WordNet to represent the dissimilarity between concepts and then combine the dissimilarity with the results of existing methods to achieve semantic similarity results. The relation between parameters and the correlation value is discussed in detail. The proposed model is then applied to different text granularity levels to validate the correctness on similarity measurement. Experimental results show that the proposed approach not only achieves high correlation value against human ratings but also has effective improvement to existing path-distance based methods on the word similarity level, in the meanwhile effectively correct existing sentence similarity method in some cases in Microsoft Research Paraphrase Corpus and SemEval-2014 date set.

Download Full-text

An Improved Word Similarity Algorithm Based on Semantic

2010 Third International Symposium on Information Science and Engineering ◽

10.1109/isise.2010.33 ◽

2010 ◽

Author(s):

Dongsen Si ◽

Cao Boyan ◽

Zengzhi Li ◽

Gao Qiang

Keyword(s):

Word Similarity ◽

Similarity Algorithm

Download Full-text

The Research of Word Similarity in Semantic Retrieval

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.701-702.413 ◽

2014 ◽

Vol 701-702 ◽

pp. 413-417

Author(s):

Jie Ran ◽

Ji Ya Huang ◽

Zu Xiao

Keyword(s):

Information Processing ◽

High Efficiency ◽

Processing Technology ◽

Semantic Retrieval ◽

Word Similarity ◽

Crucial Question ◽

Computing Method

Word similarity computing is a crucial question in information processing technology. In this paper, an integrated word similarity computing method is proposed by analyzed morpheme's similarity, word order's similarity and word length's similarity, and parameters of the method are decided by experiments. The experiments show that this method has high efficiency.

Download Full-text

Topic segmentation of news speech using word similarity

Proceedings of the eighth ACM international conference on Multimedia - MULTIMEDIA '00 ◽

10.1145/354384.376354 ◽

2000 ◽

Cited By ~ 3

Author(s):

Seiichi Takao ◽

Jun Ogata ◽

Yasuo Ariki

Keyword(s):

Topic Segmentation ◽

Word Similarity

Download Full-text

Semantic Word Similarity Learned from Heterogenous Knowledge Bases

Springer Proceedings in Complexity - Semantic Web and Web Science ◽

10.1007/978-1-4614-6880-6_26 ◽

2013 ◽

pp. 299-309

Author(s):

Yiling Liu ◽

Yangsheng Ji ◽

Chong Gu ◽

Shouling Cui ◽

Jiangtao Jia

Keyword(s):

Knowledge Bases ◽

Word Similarity

Download Full-text

Learning Lexical Subspaces in a Distributional Vector Space

Transactions of the Association for Computational Linguistics ◽

10.1162/tacl_a_00316 ◽

2020 ◽

Vol 8 ◽

pp. 311-329

Author(s):

Kushal Arora ◽

Aishik Chakraborty ◽

Jackie C. K. Cheung

Keyword(s):

Vector Space ◽

Semantic Relations ◽

Distributional Semantics ◽

Word Embeddings ◽

Word Similarity ◽

Lexical Semantic ◽

Novel Approach ◽

Classification Tasks

In this paper, we propose LexSub, a novel approach towards unifying lexical and distributional semantics. We inject knowledge about lexical-semantic relations into distributional word embeddings by defining subspaces of the distributional vector space in which a lexical relation should hold. Our framework can handle symmetric attract and repel relations (e.g., synonymy and antonymy, respectively), as well as asymmetric relations (e.g., hypernymy and meronomy). In a suite of intrinsic benchmarks, we show that our model outperforms previous approaches on relatedness tasks and on hypernymy classification and detection, while being competitive on word similarity tasks. It also outperforms previous systems on extrinsic classification tasks that benefit from exploiting lexical relational cues. We perform a series of analyses to understand the behaviors of our model. 1 Code available at https://github.com/aishikchakraborty/LexSub .

Download Full-text