Neighborhood rough sets with distance metric learning for feature selection

Data collection plays an important role in business agility; data can prove valuable and provide insights for important features. However, conventional data collection methods can be costly and time-consuming. This paper proposes a hybrid system R-EDML that combines a sequential feature selection performed by Reinforcement Learning (RL) with the evolutionary feature prioritization of Evolutionary Distance Metric Learning (EDML) in a clustering process. The goal is to reduce the features while maintaining or increasing the accuracy leading to less time complexity and future data collection time and cost reduction. In this method, features represented by the diagonal elements of EDML matrices are prioritized using a differential evolution algorithm. Further, a selection control strategy using RL is learned by sequentially inserting and evaluating the prioritized elements. The outcome offers the best accuracy R-EDML matrix with the least number of elements. Diagonal R-EDML focusing on the diagonal elements is compared with EDML and conventional feature selection. Full Matrix R-EDML focusing on the diagonal and non-diagonal elements is tested and compared with Information-Theoretic Metric Learning. Moreover, R-EDML policy is tested for each EDML generation and across all generations. Results show a significant decrease in the number of features while maintaining or increasing accuracy.

Download Full-text

Distance metric learning for graph structured data

Machine Learning ◽

10.1007/s10994-021-06009-3 ◽

2021 ◽

Author(s):

Tomoki Yoshida ◽

Ichiro Takeuchi ◽

Masayuki Karasuyama

Keyword(s):

Metric Learning ◽

Structured Data ◽

Distance Metric Learning ◽

Distance Metric

Download Full-text

Statistical Distance Metric Learning for Image Set Retrieval

ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp39728.2021.9413393 ◽

2021 ◽

Author(s):

Ting-Yao Hu ◽

Alexander G. Hauptmann

Keyword(s):

Metric Learning ◽

Distance Metric Learning ◽

Distance Metric ◽

Image Set ◽

Statistical Distance

Download Full-text

Eigenvector-Based Distance Metric Learning for Image Classification and Retrieval

ACM Transactions on Multimedia Computing Communications and Applications ◽

10.1145/3340262 ◽

2019 ◽

Vol 15 (3) ◽

pp. 1-19 ◽

Cited By ~ 2

Author(s):

Zhangcheng Wang ◽

Ya Li ◽

Richang Hong ◽

Xinmei Tian

Keyword(s):

Image Classification ◽

Metric Learning ◽

Distance Metric Learning ◽

Distance Metric

Download Full-text

Semi-supervised LDA Based Method for Similarity Distance Metric Learning

2021 The 4th International Conference on Information Science and Systems ◽

10.1145/3459955.3460606 ◽

2021 ◽

Author(s):

Ren Deng ◽

Yaxuan Chen ◽

Ruilin Han ◽

Han Xiao ◽

Xijie Li

Keyword(s):

Metric Learning ◽

Distance Metric Learning ◽

Distance Metric ◽

Similarity Distance

Download Full-text

Application of Distance Metric Learning to Automated Malware Detection

IEEE Access ◽

10.1109/access.2021.3094064 ◽

2021 ◽

pp. 1-1

Author(s):

Martin Jurecek ◽

Robert Lorencz

Keyword(s):

Metric Learning ◽

Malware Detection ◽

Distance Metric Learning ◽

Distance Metric

Download Full-text

Chi-Squared Distance Metric Learning for Histogram Data

Mathematical Problems in Engineering ◽

10.1155/2015/352849 ◽

2015 ◽

Vol 2015 ◽

pp. 1-12 ◽

Cited By ~ 2

Author(s):

Wei Yang ◽

Luhui Xu ◽

Xiaopan Chen ◽

Fengbin Zheng ◽

Yang Liu

Keyword(s):

Nearest Neighbor ◽

State Of The Art ◽

Metric Learning ◽

Nearest Neighbors ◽

Distance Metric Learning ◽

Distance Metric ◽

Projected Gradient Method ◽

Proper Distance ◽

Chi Squared ◽

Real World Datasets

Learning a proper distance metric for histogram data plays a crucial role in many computer vision tasks. The chi-squared distance is a nonlinear metric and is widely used to compare histograms. In this paper, we show how to learn a general form of chi-squared distance based on the nearest neighbor model. In our method, the margin of sample is first defined with respect to the nearest hits (nearest neighbors from the same class) and the nearest misses (nearest neighbors from the different classes), and then the simplex-preserving linear transformation is trained by maximizing the margin while minimizing the distance between each sample and its nearest hits. With the iterative projected gradient method for optimization, we naturally introduce thel2,1norm regularization into the proposed method for sparse metric learning. Comparative studies with the state-of-the-art approaches on five real-world datasets verify the effectiveness of the proposed method.

Download Full-text