Graph and Centroid-based Word Clustering

Author(s):  
Santipong Thaiprayoon ◽  
Herwig Unger ◽  
Mario Kubek
Keyword(s):  
Author(s):  
Herry Sujaini

Extended Word Similarity Based (EWSB) Clustering is a word clustering algorithm based on the value of words similarity obtained from the computation of a corpus. One of the benefits of clustering with this algorithm is to improve the translation of a statistical machine translation. Previous research proved that EWSB algorithm could improve the Indonesian-English translator, where the algorithm was applied to Indonesian language as target language.This paper discusses the results of a research using EWSB algorithm on a Indonesian to Minang statistical machine translator, where the algorithm is applied to Minang language as the target language. The research obtained resulted that the EWSB algorithm is quite effective when used in Minang language as the target language. The results of this study indicate that EWSB algorithm can improve the translation accuracy by 6.36%.


2016 ◽  
Vol 10 (4) ◽  
pp. 103-110 ◽  
Author(s):  
Shuai Yuan ◽  
Huan Huang ◽  
Linjing Wu

Informatica ◽  
2004 ◽  
Vol 15 (4) ◽  
pp. 565-580 ◽  
Author(s):  
Airenas Vaičiūnas ◽  
Vytautas Kaminskas ◽  
Gailius Raškinis

Author(s):  
Olga Mitrofanova ◽  
Anton Mukhin ◽  
Polina Panicheva ◽  
Vyacheslav Savitsky
Keyword(s):  

Sign in / Sign up

Export Citation Format

Share Document