Hierarchical and Non-Hierarchical Medoid Clustering Using Asymmetric Similarity Measures

AbstractDistributional word similarity is most commonly perceived as a symmetric relation. Yet, directional relations are abundant in lexical semantics and in many Natural Language Processing (NLP) settings that require lexical inference, making symmetric similarity measures less suitable for their identification. This paper investigates the nature of directional (asymmetric) similarity measures that aim to quantify distributional feature inclusion. We identify desired properties of such measures for lexical inference, specify a particular measure based on Average Precision that addresses these properties, and demonstrate the empirical benefit of directional measures for two different NLP datasets.

Download Full-text

Agglomerative Hierarchical Clustering Without Reversals on Dendrograms Using Asymmetric Similarity Measures

Journal of Advanced Computational Intelligence and Intelligent Informatics ◽

10.20965/jaciii.2012.p0807 ◽

2012 ◽

Vol 16 (7) ◽

pp. 807-813

Author(s):

Satoshi Takumi ◽

◽

Sadaaki Miyamoto

Keyword(s):

Hierarchical Clustering ◽

Similarity Measures ◽

Real Data ◽

Agglomerative Hierarchical Clustering ◽

Average Linkage ◽

Linkage Methods ◽

Asymmetric Similarity

Algorithms of agglomerative hierarchical clustering using asymmetric similarity measures are studied. Two different measures between two clusters are proposed, one of which generalizes the average linkage for symmetric similarity measures. Asymmetric dendrogram representation is considered after foregoing studies. It is proved that the proposed linkage methods for asymmetric measures have no reversals in the dendrograms. Examples based on real data show how the methods work.

Download Full-text

Min and max hierarchical clustering using asymmetric similarity measures

Psychometrika ◽

10.1007/bf02291174 ◽

1973 ◽

Vol 38 (1) ◽

pp. 63-72 ◽

Cited By ~ 70

Author(s):

Lawrence Hubert

Keyword(s):

Hierarchical Clustering ◽

Similarity Measures ◽

Asymmetric Similarity

Download Full-text

Defining and combining symmetric and asymmetric similarity measures

Lecture Notes in Computer Science - Advances in Case-Based Reasoning ◽

10.1007/bfb0056321 ◽

1998 ◽

pp. 52-63 ◽

Cited By ~ 7

Author(s):

Derek G. Bridge

Keyword(s):

Similarity Measures ◽

Asymmetric Similarity

Download Full-text

Assumed similarity measures as predictors of team effectiveness in surveying. (Tech. Rep. No. 6.).

PsycEXTRA Dataset ◽

10.1037/e420962004-001 ◽

1953 ◽

Author(s):

Fred E. Fiedler

Keyword(s):

Team Effectiveness ◽

Similarity Measures ◽

Assumed Similarity

Download Full-text

DISTANCE AND SIMILARITY MEASURES FOR INTUITIONISTIC MULTIPLICATIVE PREFERENCE RELATION AND ITS APPLICATIONS

International Journal for Uncertainty Quantification ◽

10.1615/int.j.uncertaintyquantification.2017018981 ◽

2017 ◽

Vol 7 (2) ◽

pp. 117-133 ◽

Cited By ~ 46

Author(s):

Harish Garg

Keyword(s):

Preference Relation ◽

Similarity Measures

Download Full-text

Faculty Opinions recommendation of Exploiting disjointness axioms to improve semantic similarity measures.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature ◽

10.3410/f.722317980.793528331 ◽

2017 ◽

Author(s):

Sebastian Köhler

Keyword(s):

Semantic Similarity ◽

Similarity Measures

Download Full-text

Non-Metric Similarity Measures

10.21236/ada622081 ◽

2015 ◽

Author(s):

Kai M. Ting

Keyword(s):

Similarity Measures

Download Full-text

MATHURA (MBI) - A NOVEL IMPUTATION MEASURE FOR IMPUTATION OF MISSING VALUES IN MEDICAL DATASETS

Recent Advances in Computer Science and Communications ◽

10.2174/2666255813666191216123352 ◽

2019 ◽

Vol 13 ◽

Author(s):

B. Mathura Bai ◽

N. Mangathayaru ◽

B. Padmaja Rani ◽

Shadi Aljawarneh

Keyword(s):

Similarity Measure ◽

Medical Records ◽

Missing Values ◽

Similarity Measures ◽

Common Problems ◽

Experiment Analysis

: Missing attribute values in medical datasets are one of the most common problems faced when mining medical datasets. Estimation of missing values is a major challenging task in pre-processing of datasets. Any wrong estimate of missing attribute values can lead to inefficient and improper classification thus resulting in lower classifier accuracies. Similarity measures play a key role during the imputation process. The use of an appropriate and better similarity measure can help to achieve better imputation and improved classification accuracies. This paper proposes a novel imputation measure for finding similarity between missing and non-missing instances in medical datasets. Experiments are carried by applying both the proposed imputation technique and popular benchmark existing imputation techniques. Classification is carried using KNN, J48, SMO and RBFN classifiers. Experiment analysis proved that after imputation of medical records using proposed imputation technique, the resulting classification accuracies reported by the classifiers KNN, J48 and SMO have improved when compared to other existing benchmark imputation techniques.

Download Full-text