N-gram approach for a URL similarity measure

2016 1st India International Conference on Information Processing (IICIP) ◽

10.1109/iicip.2016.7975313 ◽

2016 ◽

Author(s):

Neetu Singh ◽

Narendra S. Chaudhari

Keyword(s):

Similarity Measure ◽

Download Full-text

A novel method for protein 3D-structure similarity measure based on n-gram modeling

2008 8th IEEE International Conference on BioInformatics and BioEngineering ◽

10.1109/bibe.2008.4696719 ◽

2008 ◽

Author(s):

Jafar Razmara ◽

Safaai B. Deris

Keyword(s):

Similarity Measure ◽

3D Structure ◽

Protein 3D Structure ◽

Novel Method ◽

Structure Similarity ◽

Download Full-text

IMPLEMENTASI JACCARD INDEX DAN N-GRAM PADA REKAYASA APLIKASI KOREKSI KATA BERBAHASA INDONESIA

Sebatik ◽

10.46984/sebatik.v22i2.314 ◽

2018 ◽

Vol 22 (2) ◽

pp. 95-101

Author(s):

Aida Indriani ◽

Muhammad Muhammad ◽

Suprianto Suprianto ◽

Hadriansa Hadriansa

Keyword(s):

Text Mining ◽

Similarity Measure ◽

Jaccard Index ◽

Banyaknya informasi diberbagai media, membuat pengguna harus jeli dalam mencari informasi yang benar. Informasi yang dikatakan benar bukan hanya dilihat dari sumber terpercaya, tetapi dalam penulisan tidak boleh terjadi kesalahan ejaan kata (typo) yang dapat mengakibatkan kesalahpahaman makna informasi yang dibaca. Untuk meminimalkan kesalahan ejaan kata dibutuhkan peran editor dengan melakukan koreksi kata secara satu per satu. Tujuan dari penelitian ini adalah untuk membuat aplikasi koreksi kata secara otomatis, dengan memanfaatkan teknik text mining yaitu set based similarity measure. Teknik yang digunakan yaitu jaccard index dan menggunakan bantuan fitur N-gram sebanyak 3 yaitu Bi-gram, Tri-gram dan Quad-gram. Selain itu, penelitian ini bertujuan untuk menentukan fitur N-gram yang tepat dalam melakukan koreksi kata. Dengan adanya aplikasi koreksi kata ini diharapkan dapat membantu tim editor dalam melakukan pengecekan kata sebelum dipubikasikan ke umum. Untuk analisa fitur N-gram yang tepat untuk melakukan koreksi kata adalah fitur Bi-gram.

Download Full-text

Handling WSD using Hierarchical Clustering Algorithm with sentences

International Journal of Scientific Research in Science Engineering and Technology ◽

10.32628/ijsrset1841120 ◽

2018 ◽

pp. 83-88

Author(s):

Mohana Priya K ◽

Pooja Ragavi S ◽

Krishna Priya G

Keyword(s):

Hierarchical Clustering ◽

Similarity Measure ◽

Clustering Algorithm ◽

Clustering Algorithms ◽

Cosine Similarity Measure ◽

Hierarchical Clustering Algorithm ◽

Multiple Levels ◽

Sentence Clustering ◽

Clustering is the process of grouping objects into subsets that have meaning in the context of a particular problem. It does not rely on predefined classes. It is referred to as an unsupervised learning method because no information is provided about the "right answer" for any of the objects. Many clustering algorithms have been proposed and are used based on different applications. Sentence clustering is one of best clustering technique. Hierarchical Clustering Algorithm is applied for multiple levels for accuracy. For tagging purpose POS tagger, porter stemmer is used. WordNet dictionary is utilized for determining the similarity by invoking the Jiang Conrath and Cosine similarity measure. Grouping is performed with respect to the highest similarity measure value with a mean threshold. This paper incorporates many parameters for finding similarity between words. In order to identify the disambiguated words, the sense identification is performed for the adjectives and comparison is performed. semcor and machine learning datasets are employed. On comparing with previous results for WSD, our work has improvised a lot which gives a percentage of 91.2%

Download Full-text

The Extended-Average Common Submatrix Similarity Measure with Application to Handwritten Character Images

Informatica ◽

10.15388/informatica.2018.173 ◽

2018 ◽

Vol 29 (3) ◽

pp. 399-420

Author(s):

Alessia Amelio ◽

Darko Brodić ◽

Radmila Janković

Keyword(s):

Similarity Measure ◽

Handwritten Character

Download Full-text

A New Similarity Measure of Interval-Valued Intuitionistic Fuzzy Sets and its Application in Commodity Recommendation

International Review on Computers and Software (IRECOS) ◽

10.15866/irecos.v13i1.16864 ◽

2018 ◽

Vol 13 (1) ◽

pp. 28

Author(s):

Peng Luo ◽

Yongli Li ◽

Chong Wu

Keyword(s):

Similarity Measure ◽

Intuitionistic Fuzzy Sets ◽

Intuitionistic Fuzzy ◽

Interval Valued

Download Full-text

A Modified Similarity Measure for Improving Accuracy of User-Based Collaborative Filtering

Iraqi Journal of Science ◽

10.24996/ijs.2018.59.2b.15 ◽

2018 ◽

Vol 59 (2B) ◽

Keyword(s):

Collaborative Filtering ◽

Similarity Measure ◽

Improving Accuracy

Download Full-text

N-gram based Language Model for the QWERTY Keyboard Input Errors in a Touch Screen Environment

Korean Institute of Smart Media ◽

10.30693/smj.2018.7.2.54 ◽

2018 ◽

Vol 7 (2) ◽

pp. 54-59

Author(s):

Yoon Gee Ong ◽

◽

Seung Shik Kang ◽

Keyword(s):

Language Model ◽

Touch Screen ◽

Keyboard Input ◽

Download Full-text

CoSimRank: A Flexible and Efficient Graph-Theoretic Similarity Measure

10.3115/v1/p14-1131 ◽

2014 ◽

Author(s):

Sascha Rothe ◽

Hinrich Schütze

Keyword(s):

Similarity Measure ◽

Graph Theoretic

Download Full-text

A Semantic Similarity Measure between Ontological Concepts

ACTA AUTOMATICA SINICA ◽

10.3724/sp.j.1004.2012.00229 ◽

2012 ◽

Vol 38 (2) ◽

pp. 229-235 ◽

Author(s):

Wen-Qing LI ◽

Xin SUN ◽

Chang-You ZHANG ◽

Ye FENG

Keyword(s):

Semantic Similarity ◽

Similarity Measure ◽

Semantic Similarity Measure

Download Full-text

SIMILARITY MEASURE USING SIGN DISTANCE

Advances in Mathematics: Scientific Journal ◽

10.37418/amsj.10.1.19 ◽

2020 ◽

Vol 10 (1) ◽

pp. 193-197

Author(s):

D. Stephen Dinagar ◽

E. Fany Helena

Keyword(s):

Similarity Measure

Download Full-text