Pattern and Feature Selection by Genetic Algorithms in Nearest Neighbor Classification

Author(s):
Hisao Ishibuchi, Tomoharu Nakashima

This paper proposes a genetic-algorithm-based approach for finding a compact reference set in nearest neighbor classification. The reference set is designed by selecting a small number of reference patterns from a large number of training patterns using a genetic algorithm. The genetic algorithm also removes unnecessary features. The reference set in our nearest neighbor classification consists of selected patterns with selected features. A binary string is used for representing the inclusion (or exclusion) of each pattern and feature in the reference set. Our goal is to minimize the number of selected patterns, to minimize the number of selected features, and to maximize the classification performance of the reference set. Computer simulations on commonly used data sets examine the effectiveness of our approach.
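As a rough illustration of the encoding described above, the sketch below evolves a single binary string covering both patterns and features, scoring each string by 1-NN accuracy on the training data (excluding self-matches) minus penalties on the numbers of selected patterns and features. It is a minimal sketch, not the authors' algorithm: the fitness weights, tournament selection, uniform crossover, and mutation rate are all illustrative assumptions.

```python
# Minimal GA sketch for joint pattern/feature selection in 1-NN classification.
# Illustrative only: weights, operators, and parameters are assumptions, not
# the exact settings of the paper.
import numpy as np

rng = np.random.default_rng(0)

def fitness(bits, X, y, w_acc=10.0, w_pat=1.0, w_feat=1.0):
    """Weighted fitness: accuracy reward minus penalties for the sizes of
    the selected pattern and feature subsets (weights are assumptions)."""
    n, d = X.shape
    pat = bits[:n].astype(bool)           # which patterns join the reference set
    feat = bits[n:].astype(bool)          # which features are retained
    if pat.sum() == 0 or feat.sum() == 0:
        return -np.inf                    # degenerate reference set
    R, yR = X[pat][:, feat], y[pat]
    correct = 0
    for i in range(n):                    # classify every training pattern
        dists = np.sum((R - X[i, feat]) ** 2, axis=1)
        if pat[i]:                        # a reference pattern must not
            dists[np.flatnonzero(pat) == i] = np.inf  # match itself
        correct += (yR[np.argmin(dists)] == y[i])
    return w_acc * correct / n - w_pat * pat.mean() - w_feat * feat.mean()

def ga_select(X, y, pop_size=30, gens=50, p_mut=0.02):
    """Evolve binary strings of length n_patterns + n_features."""
    n, d = X.shape
    L = n + d
    pop = rng.integers(0, 2, size=(pop_size, L))
    for _ in range(gens):
        fits = np.array([fitness(ind, X, y) for ind in pop])
        new_pop = [pop[fits.argmax()].copy()]          # elitism
        while len(new_pop) < pop_size:
            a = rng.integers(0, pop_size, 2)           # binary tournament
            b = rng.integers(0, pop_size, 2)
            p1, p2 = pop[a[np.argmax(fits[a])]], pop[b[np.argmax(fits[b])]]
            mask = rng.integers(0, 2, L).astype(bool)  # uniform crossover
            child = np.where(mask, p1, p2)
            flip = rng.random(L) < p_mut               # bit-flip mutation
            new_pop.append(np.where(flip, 1 - child, child))
        pop = np.array(new_pop)
    fits = np.array([fitness(ind, X, y) for ind in pop])
    return pop[fits.argmax()]
```

Calling ga_select(X, y) returns the best string found; its first n bits index the selected reference patterns and the remaining d bits the retained features.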

Hierarchical Distance Metric Learning for Large Margin Nearest Neighbor Classification

Author(s):
Shiliang Sun, Qiaona Chen

Distance metric learning is a powerful tool for improving performance in classification, clustering and regression tasks. Many techniques have been proposed for distance metric learning based on convex programming, kernel learning, dimension reduction and large margins. The recently proposed large margin nearest neighbor classification (LMNN) improves the performance of k-nearest neighbor (k-NN) classification with a learned global distance metric. However, it does not consider the locality of data distributions. We propose a novel local distance metric learning method, hierarchical distance metric learning (HDM), which first builds a hierarchical structure by grouping data points according to overlapping ratios that we define, and then learns distance metrics sequentially. In this paper, we combine HDM with LMNN and further propose a new method, hierarchical distance metric learning for large margin nearest neighbor classification (HLMNN). Experiments are performed on many artificial and real-world data sets. Comparisons with the traditional k-NN and the state-of-the-art LMNN show the effectiveness of the proposed HLMNN.
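To make the group-then-learn structure concrete, here is a minimal Python sketch under simplifying assumptions: k-means stands in for the paper's overlap-ratio grouping, and whitening each group's covariance stands in for a learned LMNN metric. It illustrates the two-stage idea (partition the data, then classify with a local Mahalanobis metric), not HLMNN itself.

```python
# Illustrative sketch of local (per-group) metric learning for nearest-neighbor
# classification. Stand-ins, not the paper's method: k-means replaces the
# overlap-ratio hierarchy, covariance whitening replaces LMNN.
import numpy as np
from sklearn.cluster import KMeans
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split

def whitening_metric(X, eps=1e-3):
    """Mahalanobis transform L with d(x, y) = ||L (x - y)||, built from the
    group's covariance (a simple stand-in for a learned metric)."""
    cov = np.cov(X, rowvar=False) + eps * np.eye(X.shape[1])
    vals, vecs = np.linalg.eigh(cov)
    return vecs @ np.diag(vals ** -0.5) @ vecs.T

X, y = load_iris(return_X_y=True)
Xtr, Xte, ytr, yte = train_test_split(X, y, test_size=0.3, random_state=0)

# Stage 1: group the training data (here: plain k-means).
km = KMeans(n_clusters=3, n_init=10, random_state=0).fit(Xtr)
groups = km.labels_

# Stage 2: learn one metric per group, sequentially.
metrics = {g: whitening_metric(Xtr[groups == g]) for g in np.unique(groups)}

def predict(x):
    g = km.predict(x[None, :])[0]                 # route to the nearest group
    Xg, yg, L = Xtr[groups == g], ytr[groups == g], metrics[g]
    d = np.linalg.norm((Xg - x) @ L.T, axis=1)    # local Mahalanobis distance
    return yg[np.argmin(d)]                       # 1-NN under the local metric

acc = np.mean([predict(x) == t for x, t in zip(Xte, yte)])
print(f"local-metric 1-NN accuracy: {acc:.3f}")
```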


Entropy, 2021, Vol. 23(2), 149
Author(s):  
Stephen Whitelam

A conceptually simple way to classify images is to directly compare test-set data and training-set data. The accuracy of this approach is limited by the method of comparison used, and by the extent to which the training-set data cover configuration space. Here we show that this coverage can be substantially increased using coarse-graining (replacing groups of images by their centroids) and stochastic sampling (using distinct sets of centroids in combination). We use the MNIST and Fashion-MNIST data sets to show that a principled coarse-graining algorithm can convert training images into fewer image centroids without loss of accuracy of classification of test-set images by nearest-neighbor classification. Distinct batches of centroids can be used in combination as a means of stochastically sampling configuration space, and can classify test-set data more accurately than can the unaltered training set. On the MNIST and Fashion-MNIST data sets this approach converts nearest-neighbor classification from a mid-ranking to an upper-ranking member of the set of classical machine-learning techniques.
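A minimal sketch of the two ingredients follows, with per-class k-means as a stand-in for the paper's principled coarse-graining algorithm and scikit-learn's digits data standing in for MNIST: each batch of class centroids yields a 1-NN classifier, and distinct batches (different random seeds) are combined by majority vote.

```python
# Illustrative sketch: centroid coarse-graining plus stochastic sampling for
# nearest-neighbor classification. Per-class k-means is a stand-in for the
# paper's coarse-graining rule; digits stands in for MNIST.
import numpy as np
from sklearn.cluster import KMeans
from sklearn.datasets import load_digits
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

X, y = load_digits(return_X_y=True)
Xtr, Xte, ytr, yte = train_test_split(X, y, test_size=0.3, random_state=0)

def centroid_batch(seed, per_class=10):
    """Coarse-grain: replace each class's images by k-means centroids."""
    C, labels = [], []
    for c in np.unique(ytr):
        km = KMeans(n_clusters=per_class, n_init=5, random_state=seed)
        km.fit(Xtr[ytr == c])
        C.append(km.cluster_centers_)
        labels.append(np.full(per_class, c))
    return np.vstack(C), np.concatenate(labels)

# Distinct centroid batches stochastically sample configuration space;
# combine their 1-NN predictions by majority vote.
votes = []
for seed in range(5):
    C, yC = centroid_batch(seed)
    knn = KNeighborsClassifier(n_neighbors=1).fit(C, yC)
    votes.append(knn.predict(Xte))
votes = np.array(votes)
majority = np.apply_along_axis(lambda v: np.bincount(v).argmax(), 0, votes)
print("ensemble accuracy:", np.mean(majority == yte))
```

Note that each batch here is far smaller than the training set (100 centroids versus roughly 1,250 images), so the ensemble trades memory for a modest number of extra distance computations.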

