On-line Signature Verification Based on GA-SVM

With the development of pen-based mobile device, on-line signature verification is gradually becoming a kind of important biometrics verification. This thesis proposes a method of verification of on-line handwritten signatures using both Support Vector Data Description (SVM) and Genetic Algorithm (GA). A 27-parameter feature set including shape and dynamic features is extracted from the on-line signatures data. The genuine signatures of each subject are treated as target data to train the SVM classifier. As a kernel based one-class classifier, SVM can accurately describe the feature distribution of the genuine signatures and detect the forgeries. To improving the performance of the authentication method, genetic algorithm (GA) is used to optimise classifier parameters and feature subset selection. Signature data form the SVC2013 database is used to carry out verification experiments. The proposed method can achieve an average Equal Error Rate (EER) of 4.93% of the skill forgery database.

Download Full-text

On-line signature verification based on support vector data description and genetic algorithm

2008 7th World Congress on Intelligent Control and Automation ◽

10.1109/wcica.2008.4593531 ◽

2008 ◽

Cited By ~ 1

Author(s):

Ming Meng ◽

Xugang Xi ◽

Zhizeng Luo

Keyword(s):

Genetic Algorithm ◽

Signature Verification ◽

Support Vector ◽

Support Vector Data Description ◽

Vector Data ◽

Data Description ◽

On Line

Download Full-text

A New Hybrid Feature Subset Selection Framework Based on Binary Genetic Algorithm and Information Theory

International Journal of Computational Intelligence and Applications ◽

10.1142/s1469026819500202 ◽

2019 ◽

Vol 18 (03) ◽

pp. 1950020 ◽

Cited By ~ 13

Author(s):

Alok Kumar Shukla ◽

Pradeep Singh ◽

Manu Vardhan

Keyword(s):

Genetic Algorithm ◽

Feature Selection ◽

Classification Accuracy ◽

B Cell Lymphoma ◽

Feature Subset Selection ◽

Classification Model ◽

Significant Feature ◽

Support Vector ◽

Feature Subset ◽

Binary Genetic Algorithm

The explosion of the high-dimensional dataset in the scientific repository has been encouraging interdisciplinary research on data mining, pattern recognition and bioinformatics. The fundamental problem of the individual Feature Selection (FS) method is extracting informative features for classification model and to seek for the malignant disease at low computational cost. In addition, existing FS approaches overlook the fact that for a given cardinality, there can be several subsets with similar information. This paper introduces a novel hybrid FS algorithm, called Filter-Wrapper Feature Selection (FWFS) for a classification problem and also addresses the limitations of existing methods. In the proposed model, the front-end filter ranking method as Conditional Mutual Information Maximization (CMIM) selects the high ranked feature subset while the succeeding method as Binary Genetic Algorithm (BGA) accelerates the search in identifying the significant feature subsets. One of the merits of the proposed method is that, unlike an exhaustive method, it speeds up the FS procedure without lancing of classification accuracy on reduced dataset when a learning model is applied to the selected subsets of features. The efficacy of the proposed (FWFS) method is examined by Naive Bayes (NB) classifier which works as a fitness function. The effectiveness of the selected feature subset is evaluated using numerous classifiers on five biological datasets and five UCI datasets of a varied dimensionality and number of instances. The experimental results emphasize that the proposed method provides additional support to the significant reduction of the features and outperforms the existing methods. For microarray data-sets, we found the lowest classification accuracy is 61.24% on SRBCT dataset and highest accuracy is 99.32% on Diffuse large B-cell lymphoma (DLBCL). In UCI datasets, the lowest classification accuracy is 40.04% on the Lymphography using k-nearest neighbor (k-NN) and highest classification accuracy is 99.05% on the ionosphere using support vector machine (SVM).

Download Full-text

A Parallel Genetic Algorithm Based Feature Selection and Parameter Optimization for Support Vector Machine

Scientific Programming ◽

10.1155/2016/2739621 ◽

2016 ◽

Vol 2016 ◽

pp. 1-10 ◽

Cited By ~ 13

Author(s):

Zhi Chen ◽

Tao Lin ◽

Ningjiu Tang ◽

Xin Xia

Keyword(s):

Genetic Algorithm ◽

Classification Accuracy ◽

Coarse Grained ◽

Support Vector ◽

Svm Classifier ◽

Feature Subset ◽

Parallel Genetic Algorithm ◽

Support Vectors ◽

Optimal Feature Subset ◽

Optimal Feature

The extensive applications of support vector machines (SVMs) require efficient method of constructing a SVM classifier with high classification ability. The performance of SVM crucially depends on whether optimal feature subset and parameter of SVM can be efficiently obtained. In this paper, a coarse-grained parallel genetic algorithm (CGPGA) is used to simultaneously optimize the feature subset and parameters for SVM. The distributed topology and migration policy of CGPGA can help find optimal feature subset and parameters for SVM in significantly shorter time, so as to increase the quality of solution found. In addition, a new fitness function, which combines the classification accuracy obtained from bootstrap method, the number of chosen features, and the number of support vectors, is proposed to lead the search of CGPGA to the direction of optimal generalization error. Experiment results on 12 benchmark datasets show that our proposed approach outperforms genetic algorithm (GA) based method and grid search method in terms of classification accuracy, number of chosen features, number of support vectors, and running time.

Download Full-text

DETERMINATION OF OPTIMUM CLASSIFICATION SYSTEM FOR HYPERSPECTRAL IMAGERY AND LIDAR DATA BASED ON BEES ALGORITHM

ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences ◽

10.5194/isprsarchives-xl-1-w5-651-2015 ◽

2015 ◽

Vol XL-1-W5 ◽

pp. 651-656

Author(s):

F. Samadzadega ◽

H. Hasani

Keyword(s):

Urban Area ◽

Hyperspectral Imagery ◽

Feature Space ◽

Classification Performance ◽

Feature Subset Selection ◽

Bees Algorithm ◽

Support Vector ◽

Svm Classifier ◽

Lidar Data ◽

Feature Subset

Hyperspectral imagery is a rich source of spectral information and plays very important role in discrimination of similar land-cover classes. In the past, several efforts have been investigated for improvement of hyperspectral imagery classification. Recently the interest in the joint use of LiDAR data and hyperspectral imagery has been remarkably increased. Because LiDAR can provide structural information of scene while hyperspectral imagery provide spectral and spatial information. The complementary information of LiDAR and hyperspectral data may greatly improve the classification performance especially in the complex urban area. In this paper feature level fusion of hyperspectral and LiDAR data is proposed where spectral and structural features are extract from both dataset, then hybrid feature space is generated by feature stacking. Support Vector Machine (SVM) classifier is applied on hybrid feature space to classify the urban area. In order to optimize the classification performance, two issues should be considered: SVM parameters values determination and feature subset selection. Bees Algorithm (BA) is powerful meta-heuristic optimization algorithm which is applied to determine the optimum SVM parameters and select the optimum feature subset simultaneously. The obtained results show the proposed method can improve the classification accuracy in addition to reducing significantly the dimension of feature space.

Download Full-text

Text Classification of Cornell Movie Data using Data Mining with Feature Selection

International Journal of Engineering and Advanced Technology - Regular Issue ◽

10.35940/ijeat.b2329.129219 ◽

2019 ◽

Vol 9 (2) ◽

pp. 2950-2955

Keyword(s):

Feature Selection ◽

Text Mining ◽

Text Classification ◽

Feature Subset Selection ◽

Support Vector ◽

Svm Classifier ◽

Feature Subset ◽

Chi Square ◽

Feature Selection Technique ◽

Data Set

Text Classification is branch of text mining through which we can analyze the sentiment of the movie data. In this research paper we have applied different preprocessing techniques to reduce the features from cornell movie data set. We have also applied the Correlation-based feature subset selection and chi-square feature selection technique for gathering most valuable words of each category in text mining processes. The new cornell movie data set formed after applying the preprocessing steps and feature selection techniques. We have classified the cornell movie data as positive or negative using various classifiers like Support Vector Machine (SVM), Multilayer Perceptron (MLP), Naive Bayes (NB), Bays Net (BN) and Random Forest (RF) classifier. We have also compared the classification accuracy among classifiers and achieved better accuracy i. e. 87% in case of SVM classifier with reduced number of features. The suggested classifier can be useful in opinion of movie review, analysis of any blog and documents etc.

Download Full-text

On-line signature verification based on template matching approach and support vector data description

2010 International Conference on Computer Application and System Modeling (ICCASM 2010) ◽

10.1109/iccasm.2010.5622460 ◽

2010 ◽

Author(s):

Liu Dong ◽

Ge Yun-Jian ◽

Zhang Xue-Yong

Keyword(s):

Template Matching ◽

Signature Verification ◽

Support Vector ◽

Support Vector Data Description ◽

Vector Data ◽

Data Description ◽

On Line

Download Full-text

HYBRID OF GENETIC ALGORITHM AND SIMULATED ANNEALING FOR SUPPORT VECTOR REGRESSION OPTIMIZATION IN RAINFALL FORECASTING

International Journal of Computational Intelligence and Applications ◽

10.1142/s1469026813500120 ◽

2013 ◽

Vol 12 (02) ◽

pp. 1350012 ◽

Cited By ~ 5

Author(s):

CHANGMING ZHU ◽

JIANSHENG WU

Keyword(s):

Genetic Algorithm ◽

Kernel Function ◽

Subset Selection ◽

Feature Subset Selection ◽

Support Vector ◽

Feature Subset ◽

Parameter Setting ◽

Rainfall Forecasting ◽

Input Feature ◽

Kernel Parameter

Accurate forecasting of rainfall has been one of the most important issues in hydrological research such as river training works and design of flood warning systems. Support vector regression (SVR) is a popular regression method in rainfall forecasting. Type of kernel function and kernel parameter setting in the SVR traing procedure, along with the input feature subset selection, significantly influence regression accuracy. In this paper, an effective hybrid optimization strategy by combining the strengths of genetic algorithm (GA) and simulated annealing (SA), is employed to simultaneously optimize the input feature subset selection, the type of kernel function and the kernel parameter setting of SVR, namely GASA–SVR. The developed GASA–SVR model is being applied for monthly rainfall forecasting in Guilin of Guangxi. The GA is carried out as a main frame of this hybrid algorithm while SA is used as a local search strategy to help GA jump out of local optima and avoid sinking into the local optimal solution early. Compared with SVR, pure GA–SVR and HGA–SVR, results show that the hybrid GASA–SVR model can correctly select the discriminating input features subset, successfully identify the optimal type of kernel function and all the optimal values of the parameters of SVR with the lowest prediction error values in rainfall forecasting, can also significantly improve the rainfall forecasting accuracy. Experimental results reveal that the predictions using the proposed approach are consistently better than those obtained using the other methods presented in this study in terms of the same measurements. Those results show that the proposed GASA–SVR model provides a promising alternative to monthly rainfall prediction.

Download Full-text

A Genetic Algorithm Based Support Vector Machine Model for Blood-Brain Barrier Penetration Prediction

BioMed Research International ◽

10.1155/2015/292683 ◽

2015 ◽

Vol 2015 ◽

pp. 1-13 ◽

Cited By ~ 5

Author(s):

Daqing Zhang ◽

Jianfeng Xiao ◽

Nannan Zhou ◽

Mingyue Zheng ◽

Xiaomin Luo ◽

...

Keyword(s):

Genetic Algorithm ◽

Support Vector Machine ◽

Blood Brain Barrier ◽

Subset Selection ◽

Feature Subset Selection ◽

Brain Barrier ◽

Support Vector ◽

Feature Subset ◽

Svm Model ◽

Kernel Parameters

Blood-brain barrier (BBB) is a highly complex physical barrier determining what substances are allowed to enter the brain. Support vector machine (SVM) is a kernel-based machine learning method that is widely used in QSAR study. For a successful SVM model, the kernel parameters for SVM and feature subset selection are the most important factors affecting prediction accuracy. In most studies, they are treated as two independent problems, but it has been proven that they could affect each other. We designed and implemented genetic algorithm (GA) to optimize kernel parameters and feature subset selection for SVM regression and applied it to the BBB penetration prediction. The results show that our GA/SVM model is more accurate than other currently available logBBmodels. Therefore, to optimize both SVM parameters and feature subset simultaneously with genetic algorithm is a better approach than other methods that treat the two problems separately. Analysis of our logBBmodel suggests that carboxylic acid group, polar surface area (PSA)/hydrogen-bonding ability, lipophilicity, and molecular charge play important role in BBB penetration. Among those properties relevant to BBB penetration, lipophilicity could enhance the BBB penetration while all the others are negatively correlated with BBB penetration.

Download Full-text

Feature Subset Selection for Hot Method Prediction using Genetic Algorithm wrapped with Support Vector Machines

Journal of Computer Science ◽

10.3844/jcssp.2011.707.714 ◽

2011 ◽

Vol 7 (5) ◽

pp. 707-714 ◽

Cited By ~ 1

Author(s):

Johnson

Keyword(s):

Genetic Algorithm ◽

Support Vector Machines ◽

Subset Selection ◽

Feature Subset Selection ◽

Support Vector ◽

Feature Subset ◽

Vector Machines ◽

Selection For

Download Full-text

Genetic Algorithm Based Feature Subset Selection for Fetal State Classification

Journal of Communications Technology Electronics and Computer Science ◽

10.22385/jctecs.v2i0.20 ◽

2015 ◽

Vol 2 ◽

pp. 13 ◽

Cited By ~ 6

Author(s):

Subha Velappan ◽

Murugan D ◽

Prabha S ◽

Manivanna Boopathi A

Keyword(s):

Genetic Algorithm ◽

Performance Metrics ◽

Classification Performance ◽

Feature Subset Selection ◽

Support Vector ◽

Feature Subset ◽

Huge Amount ◽

State Classification ◽

Multiclass Support Vector Machine ◽

Selection For

Huge amount of data are available in the field of medicine which are used for diagnosing the diseases by analyzing them. Presently, prediction of diseases are made easier and accurate by employing various data mining techniques to extract information from these medical data. This paper presents an improved method of classifying the cardiotocogram (CTG) data using Multiclass Support Vector Machine (MSVM) through an optimized feature subset produced by Genetic Algorithm (GA). Various performance metrics have been evaluated and the experimental results exhibit improved classification performance when using optimized feature set comparing to the full feature set.

Download Full-text