A Fingerprint Indexing Approach Using Multiple Similarity Measures and Spectral Clustering

Author(s):  
Ntethelelo A. Mngenge ◽  
Linda Mthembu ◽  
Fulufhelo V. Nelwamondo ◽  
Cynthia H. Ngejane
2021 ◽  
Vol 7 ◽  
pp. e450
Author(s):  
Wenna Huang ◽  
Yong Peng ◽  
Yuan Ge ◽  
Wanzeng Kong

The Kmeans clustering and spectral clustering are two popular clustering methods for grouping similar data points together according to their similarities. However, the performance of Kmeans clustering might be quite unstable due to the random initialization of the cluster centroids. Generally, spectral clustering methods employ a two-step strategy of spectral embedding and discretization postprocessing to obtain the cluster assignment, which easily lead to far deviation from true discrete solution during the postprocessing process. In this paper, based on the connection between the Kmeans clustering and spectral clustering, we propose a new Kmeans formulation by joint spectral embedding and spectral rotation which is an effective postprocessing approach to perform the discretization, termed KMSR. Further, instead of directly using the dot-product data similarity measure, we make generalization on KMSR by incorporating more advanced data similarity measures and call this generalized model as KMSR-G. An efficient optimization method is derived to solve the KMSR (KMSR-G) model objective whose complexity and convergence are provided. We conduct experiments on extensive benchmark datasets to validate the performance of our proposed models and the experimental results demonstrate that our models perform better than the related methods in most cases.


Author(s):  
B. Mathura Bai ◽  
N. Mangathayaru ◽  
B. Padmaja Rani ◽  
Shadi Aljawarneh

: Missing attribute values in medical datasets are one of the most common problems faced when mining medical datasets. Estimation of missing values is a major challenging task in pre-processing of datasets. Any wrong estimate of missing attribute values can lead to inefficient and improper classification thus resulting in lower classifier accuracies. Similarity measures play a key role during the imputation process. The use of an appropriate and better similarity measure can help to achieve better imputation and improved classification accuracies. This paper proposes a novel imputation measure for finding similarity between missing and non-missing instances in medical datasets. Experiments are carried by applying both the proposed imputation technique and popular benchmark existing imputation techniques. Classification is carried using KNN, J48, SMO and RBFN classifiers. Experiment analysis proved that after imputation of medical records using proposed imputation technique, the resulting classification accuracies reported by the classifiers KNN, J48 and SMO have improved when compared to other existing benchmark imputation techniques.


2021 ◽  
Vol 40 (1) ◽  
pp. 235-250
Author(s):  
Liuxin Chen ◽  
Nanfang Luo ◽  
Xiaoling Gou

In the real multi-criteria group decision making (MCGDM) problems, there will be an interactive relationship among different decision makers (DMs). To identify the overall influence, we define the Shapley value as the DM’s weight. Entropy is a measure which makes it better than similarity measures to recognize a group decision making problem. Since we propose a relative entropy to measure the difference between two systems, which improves the accuracy of the distance measure.In this paper, a MCGDM approach named as TODIM is presented under q-rung orthopair fuzzy information.The proposed TODIM approach is developed for correlative MCGDM problems, in which the weights of the DMs are calculated in terms of Shapley values and the dominance matrices are evaluated based on relative entropy measure with q-rung orthopair fuzzy information.Furthermore, the efficacy of the proposed Gq-ROFWA operator and the novel TODIM is demonstrated through a selection problem of modern enterprises risk investment. A comparative analysis with existing methods is presented to validate the efficiency of the approach.


Sensors ◽  
2019 ◽  
Vol 19 (23) ◽  
pp. 5176
Author(s):  
Guannan Li ◽  
Ying Li ◽  
Bingxin Liu ◽  
Peng Wu ◽  
Chen Chen

Polarimetric synthetic aperture radar is an important tool in the effective detection of marine oil spills. In this study, two cases of Radarsat-2 Fine mode quad-polarimetric synthetic aperture radar datasets are exploited to detect a well-known oil seep area that collected over the Gulf of Mexico using the same research area, sensor, and time. A novel oil spill detection scheme based on a multi-polarimetric features model matching method using spectral pan-similarity measure (SPM) is proposed. A multi-polarimetric features curve is generated based on optimal polarimetric features selected using Jeffreys–Matusita distance considering its ability to discriminate between thick and thin oil slicks and seawater. The SPM is used to search for and match homogeneous unlabeled pixels and assign them to a class with the highest similarity to their spectral vector size, spectral curve shape, and spectral information content. The superiority of the SPM for oil spill detection compared to traditional spectral similarity measures is demonstrated for the first time based on accuracy assessments and computational complexity analysis by comparing with four traditional spectral similarity measures, random forest (RF), support vector machine (SVM), and decision tree (DT). Experiment results indicate that the proposed method has better oil spill detection capability, with a higher average accuracy and kappa coefficient (1.5–7.9% and 1–25% higher, respectively) than the four traditional spectral similarity measures under the same computational complexity operations. Furthermore, in most cases, the proposed method produces valuable and acceptable results that are better than the RF, SVM, and DT in terms of accuracy and computational complexity.


Sign in / Sign up

Export Citation Format

Share Document