Genetic Algorithm Based Feature Subset Selection for Fetal State Classification

Huge amount of data are available in the field of medicine which are used for diagnosing the diseases by analyzing them. Presently, prediction of diseases are made easier and accurate by employing various data mining techniques to extract information from these medical data. This paper presents an improved method of classifying the cardiotocogram (CTG) data using Multiclass Support Vector Machine (MSVM) through an optimized feature subset produced by Genetic Algorithm (GA). Various performance metrics have been evaluated and the experimental results exhibit improved classification performance when using optimized feature set comparing to the full feature set.

Download Full-text

Feature Subset Selection for Hot Method Prediction using Genetic Algorithm wrapped with Support Vector Machines

Journal of Computer Science ◽

10.3844/jcssp.2011.707.714 ◽

2011 ◽

Vol 7 (5) ◽

pp. 707-714 ◽

Cited By ~ 1

Author(s):

Johnson

Keyword(s):

Genetic Algorithm ◽

Support Vector Machines ◽

Subset Selection ◽

Feature Subset Selection ◽

Support Vector ◽

Feature Subset ◽

Vector Machines ◽

Selection For

Download Full-text

Feature subset selection for support vector machines by incremental regularized risk minimization

2004 IEEE International Joint Conference on Neural Networks (IEEE Cat. No.04CH37541) ◽

10.1109/ijcnn.2004.1380930 ◽

2005 ◽

Cited By ~ 5

Author(s):

H. Frohlich ◽

A. Zell

Keyword(s):

Support Vector Machines ◽

Subset Selection ◽

Feature Subset Selection ◽

Support Vector ◽

Feature Subset ◽

Risk Minimization ◽

Vector Machines ◽

Selection For ◽

Regularized Risk Minimization

Download Full-text

A New Hybrid Feature Subset Selection Framework Based on Binary Genetic Algorithm and Information Theory

International Journal of Computational Intelligence and Applications ◽

10.1142/s1469026819500202 ◽

2019 ◽

Vol 18 (03) ◽

pp. 1950020 ◽

Cited By ~ 13

Author(s):

Alok Kumar Shukla ◽

Pradeep Singh ◽

Manu Vardhan

Keyword(s):

Genetic Algorithm ◽

Feature Selection ◽

Classification Accuracy ◽

B Cell Lymphoma ◽

Feature Subset Selection ◽

Classification Model ◽

Significant Feature ◽

Support Vector ◽

Feature Subset ◽

Binary Genetic Algorithm

The explosion of the high-dimensional dataset in the scientific repository has been encouraging interdisciplinary research on data mining, pattern recognition and bioinformatics. The fundamental problem of the individual Feature Selection (FS) method is extracting informative features for classification model and to seek for the malignant disease at low computational cost. In addition, existing FS approaches overlook the fact that for a given cardinality, there can be several subsets with similar information. This paper introduces a novel hybrid FS algorithm, called Filter-Wrapper Feature Selection (FWFS) for a classification problem and also addresses the limitations of existing methods. In the proposed model, the front-end filter ranking method as Conditional Mutual Information Maximization (CMIM) selects the high ranked feature subset while the succeeding method as Binary Genetic Algorithm (BGA) accelerates the search in identifying the significant feature subsets. One of the merits of the proposed method is that, unlike an exhaustive method, it speeds up the FS procedure without lancing of classification accuracy on reduced dataset when a learning model is applied to the selected subsets of features. The efficacy of the proposed (FWFS) method is examined by Naive Bayes (NB) classifier which works as a fitness function. The effectiveness of the selected feature subset is evaluated using numerous classifiers on five biological datasets and five UCI datasets of a varied dimensionality and number of instances. The experimental results emphasize that the proposed method provides additional support to the significant reduction of the features and outperforms the existing methods. For microarray data-sets, we found the lowest classification accuracy is 61.24% on SRBCT dataset and highest accuracy is 99.32% on Diffuse large B-cell lymphoma (DLBCL). In UCI datasets, the lowest classification accuracy is 40.04% on the Lymphography using k-nearest neighbor (k-NN) and highest classification accuracy is 99.05% on the ionosphere using support vector machine (SVM).

Download Full-text

On-line Signature Verification Based on GA-SVM

International Journal of Online Engineering (iJOE) ◽

10.3991/ijoe.v11i6.5122 ◽

2015 ◽

Vol 11 (6) ◽

pp. 49 ◽

Cited By ~ 1

Author(s):

Dong Huang ◽

Jian Gao

Keyword(s):

Genetic Algorithm ◽

Feature Subset Selection ◽

Signature Verification ◽

Support Vector ◽

Svm Classifier ◽

Support Vector Data Description ◽

Feature Subset ◽

Dynamic Features ◽

On Line ◽

One Class Classifier

With the development of pen-based mobile device, on-line signature verification is gradually becoming a kind of important biometrics verification. This thesis proposes a method of verification of on-line handwritten signatures using both Support Vector Data Description (SVM) and Genetic Algorithm (GA). A 27-parameter feature set including shape and dynamic features is extracted from the on-line signatures data. The genuine signatures of each subject are treated as target data to train the SVM classifier. As a kernel based one-class classifier, SVM can accurately describe the feature distribution of the genuine signatures and detect the forgeries. To improving the performance of the authentication method, genetic algorithm (GA) is used to optimise classifier parameters and feature subset selection. Signature data form the SVC2013 database is used to carry out verification experiments. The proposed method can achieve an average Equal Error Rate (EER) of 4.93% of the skill forgery database.

Download Full-text

Fisher score and Matthews correlation coefficient-based feature subset selection for heart disease diagnosis using support vector machines

Knowledge and Information Systems ◽

10.1007/s10115-018-1185-y ◽

2018 ◽

Vol 58 (1) ◽

pp. 139-167 ◽

Cited By ~ 17

Author(s):

Syed Muhammad Saqlain ◽

Muhammad Sher ◽

Faiz Ali Shah ◽

Imran Khan ◽

Muhammad Usman Ashraf ◽

...

Keyword(s):

Matthews Correlation Coefficient ◽

Subset Selection ◽

Disease Diagnosis ◽

Feature Subset Selection ◽

Support Vector ◽

Feature Subset ◽

Fisher Score ◽

Vector Machines ◽

Selection For ◽

Heart Disease Diagnosis

Download Full-text

DETERMINATION OF OPTIMUM CLASSIFICATION SYSTEM FOR HYPERSPECTRAL IMAGERY AND LIDAR DATA BASED ON BEES ALGORITHM

ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences ◽

10.5194/isprsarchives-xl-1-w5-651-2015 ◽

2015 ◽

Vol XL-1-W5 ◽

pp. 651-656

Author(s):

F. Samadzadega ◽

H. Hasani

Keyword(s):

Urban Area ◽

Hyperspectral Imagery ◽

Feature Space ◽

Classification Performance ◽

Feature Subset Selection ◽

Bees Algorithm ◽

Support Vector ◽

Svm Classifier ◽

Lidar Data ◽

Feature Subset

Hyperspectral imagery is a rich source of spectral information and plays very important role in discrimination of similar land-cover classes. In the past, several efforts have been investigated for improvement of hyperspectral imagery classification. Recently the interest in the joint use of LiDAR data and hyperspectral imagery has been remarkably increased. Because LiDAR can provide structural information of scene while hyperspectral imagery provide spectral and spatial information. The complementary information of LiDAR and hyperspectral data may greatly improve the classification performance especially in the complex urban area. In this paper feature level fusion of hyperspectral and LiDAR data is proposed where spectral and structural features are extract from both dataset, then hybrid feature space is generated by feature stacking. Support Vector Machine (SVM) classifier is applied on hybrid feature space to classify the urban area. In order to optimize the classification performance, two issues should be considered: SVM parameters values determination and feature subset selection. Bees Algorithm (BA) is powerful meta-heuristic optimization algorithm which is applied to determine the optimum SVM parameters and select the optimum feature subset simultaneously. The obtained results show the proposed method can improve the classification accuracy in addition to reducing significantly the dimension of feature space.

Download Full-text