DETERMINATION OF OPTIMUM CLASSIFICATION SYSTEM FOR HYPERSPECTRAL IMAGERY AND LIDAR DATA BASED ON BEES ALGORITHM

Hyperspectral imagery is a rich source of spectral information and plays very important role in discrimination of similar land-cover classes. In the past, several efforts have been investigated for improvement of hyperspectral imagery classification. Recently the interest in the joint use of LiDAR data and hyperspectral imagery has been remarkably increased. Because LiDAR can provide structural information of scene while hyperspectral imagery provide spectral and spatial information. The complementary information of LiDAR and hyperspectral data may greatly improve the classification performance especially in the complex urban area. In this paper feature level fusion of hyperspectral and LiDAR data is proposed where spectral and structural features are extract from both dataset, then hybrid feature space is generated by feature stacking. Support Vector Machine (SVM) classifier is applied on hybrid feature space to classify the urban area. In order to optimize the classification performance, two issues should be considered: SVM parameters values determination and feature subset selection. Bees Algorithm (BA) is powerful meta-heuristic optimization algorithm which is applied to determine the optimum SVM parameters and select the optimum feature subset simultaneously. The obtained results show the proposed method can improve the classification accuracy in addition to reducing significantly the dimension of feature space.

Download Full-text

On-line Signature Verification Based on GA-SVM

International Journal of Online Engineering (iJOE) ◽

10.3991/ijoe.v11i6.5122 ◽

2015 ◽

Vol 11 (6) ◽

pp. 49 ◽

Cited By ~ 1

Author(s):

Dong Huang ◽

Jian Gao

Keyword(s):

Genetic Algorithm ◽

Feature Subset Selection ◽

Signature Verification ◽

Support Vector ◽

Svm Classifier ◽

Support Vector Data Description ◽

Feature Subset ◽

Dynamic Features ◽

On Line ◽

One Class Classifier

With the development of pen-based mobile device, on-line signature verification is gradually becoming a kind of important biometrics verification. This thesis proposes a method of verification of on-line handwritten signatures using both Support Vector Data Description (SVM) and Genetic Algorithm (GA). A 27-parameter feature set including shape and dynamic features is extracted from the on-line signatures data. The genuine signatures of each subject are treated as target data to train the SVM classifier. As a kernel based one-class classifier, SVM can accurately describe the feature distribution of the genuine signatures and detect the forgeries. To improving the performance of the authentication method, genetic algorithm (GA) is used to optimise classifier parameters and feature subset selection. Signature data form the SVC2013 database is used to carry out verification experiments. The proposed method can achieve an average Equal Error Rate (EER) of 4.93% of the skill forgery database.

Download Full-text

Text Classification of Cornell Movie Data using Data Mining with Feature Selection

International Journal of Engineering and Advanced Technology - Regular Issue ◽

10.35940/ijeat.b2329.129219 ◽

2019 ◽

Vol 9 (2) ◽

pp. 2950-2955

Keyword(s):

Feature Selection ◽

Text Mining ◽

Text Classification ◽

Feature Subset Selection ◽

Support Vector ◽

Svm Classifier ◽

Feature Subset ◽

Chi Square ◽

Feature Selection Technique ◽

Data Set

Text Classification is branch of text mining through which we can analyze the sentiment of the movie data. In this research paper we have applied different preprocessing techniques to reduce the features from cornell movie data set. We have also applied the Correlation-based feature subset selection and chi-square feature selection technique for gathering most valuable words of each category in text mining processes. The new cornell movie data set formed after applying the preprocessing steps and feature selection techniques. We have classified the cornell movie data as positive or negative using various classifiers like Support Vector Machine (SVM), Multilayer Perceptron (MLP), Naive Bayes (NB), Bays Net (BN) and Random Forest (RF) classifier. We have also compared the classification accuracy among classifiers and achieved better accuracy i. e. 87% in case of SVM classifier with reduced number of features. The suggested classifier can be useful in opinion of movie review, analysis of any blog and documents etc.

Download Full-text

Hybrid approaches to feature subset selection for data classification in high-dimensional feature space

Artificial Intelligence Research ◽

10.5430/air.v9n1p45 ◽

2020 ◽

Vol 9 (1) ◽

pp. 45

Author(s):

Maysa Ibrahem Almulla Khalaf ◽

John Q Gan

Keyword(s):

Dimensional Space ◽

Subset Selection ◽

Feature Space ◽

Feature Subset Selection ◽

High Dimensional ◽

Support Vector ◽

Feature Subset ◽

Linear Discriminant ◽

Hybrid Approaches ◽

Low Dimensional

This paper proposes two hybrid feature subset selection approaches based on the combination (union or intersection) of both supervised and unsupervised filter approaches before using a wrapper, aiming to obtain low-dimensional features with high accuracy and interpretability and low time consumption. Experiments with the proposed hybrid approaches have been conducted on seven high-dimensional feature datasets. The classifiers adopted are support vector machine (SVM), linear discriminant analysis (LDA), and K-nearest neighbour (KNN). Experimental results have demonstrated the advantages and usefulness of the proposed methods in feature subset selection in high-dimensional space in terms of the number of selected features and time spent to achieve the best classification accuracy.

Download Full-text

Genetic Algorithm Based Feature Subset Selection for Fetal State Classification

Journal of Communications Technology Electronics and Computer Science ◽

10.22385/jctecs.v2i0.20 ◽

2015 ◽

Vol 2 ◽

pp. 13 ◽

Cited By ~ 6

Author(s):

Subha Velappan ◽

Murugan D ◽

Prabha S ◽

Manivanna Boopathi A

Keyword(s):

Genetic Algorithm ◽

Performance Metrics ◽

Classification Performance ◽

Feature Subset Selection ◽

Support Vector ◽

Feature Subset ◽

Huge Amount ◽

State Classification ◽

Multiclass Support Vector Machine ◽

Selection For

Huge amount of data are available in the field of medicine which are used for diagnosing the diseases by analyzing them. Presently, prediction of diseases are made easier and accurate by employing various data mining techniques to extract information from these medical data. This paper presents an improved method of classifying the cardiotocogram (CTG) data using Multiclass Support Vector Machine (MSVM) through an optimized feature subset produced by Genetic Algorithm (GA). Various performance metrics have been evaluated and the experimental results exhibit improved classification performance when using optimized feature set comparing to the full feature set.

Download Full-text

Feature Subset Selection for Support Vector Machine Through Separability Assessment in Kernel-Defined Feature Space

Journal of Computational and Theoretical Nanoscience ◽

10.1166/jctn.2007.2434 ◽

2007 ◽

Vol 4 (7) ◽

pp. 1417-1425

Author(s):

Yaohua Tang ◽

Jinghuai Gao ◽

Cunxiang Yang ◽

Xiaoliang Yang

Keyword(s):

Support Vector Machine ◽

Subset Selection ◽

Feature Space ◽

Feature Subset Selection ◽

Support Vector ◽

Feature Subset ◽

Selection For

Download Full-text

Binary Spectrum Feature for Improved Classiﬁer Performance

10.36227/techrxiv.12993122 ◽

2020 ◽

Author(s):

Nalika Ulapane ◽

Karthick Thiyagarajan ◽

sarath kodagoda

Keyword(s):

Machine Learning ◽

Classification Performance ◽

Feature Reduction ◽

Sensor Data ◽

Machine Learning Techniques ◽

Support Vector ◽

Svm Classifier ◽

Monitoring Task ◽

Classifier Performance ◽

Spectrum Feature

<div>Classiﬁcation has become a vital task in modern machine learning and Artiﬁcial Intelligence applications, including smart sensing. Numerous machine learning techniques are available to perform classiﬁcation. Similarly, numerous practices, such as feature selection (i.e., selection of a subset of descriptor variables that optimally describe the output), are available to improve classiﬁer performance. In this paper, we consider the case of a given supervised learning classiﬁcation task that has to be performed making use of continuous-valued features. It is assumed that an optimal subset of features has already been selected. Therefore, no further feature reduction, or feature addition, is to be carried out. Then, we attempt to improve the classiﬁcation performance by passing the given feature set through a transformation that produces a new feature set which we have named the “Binary Spectrum”. Via a case study example done on some Pulsed Eddy Current sensor data captured from an infrastructure monitoring task, we demonstrate how the classiﬁcation accuracy of a Support Vector Machine (SVM) classiﬁer increases through the use of this Binary Spectrum feature, indicating the feature transformation’s potential for broader usage.</div><div><br></div>

Download Full-text

Shape-restricted support vector machine (SR-SVM): a SVM classifier taking supplementary shape information of input

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-202155 ◽

2021 ◽

Vol 40 (1) ◽

pp. 1481-1494

Author(s):

Geng Deng ◽

Yaoguo Xie ◽

Xindong Wang ◽

Qiang Fu

Keyword(s):

Support Vector Machine ◽

Classification Performance ◽

Research Literature ◽

Support Vector ◽

Svm Classifier ◽

Classification Problems ◽

Active Set ◽

Shape Information ◽

Convex Optimization Problem ◽

Shape Restrictions

Many classification problems contain shape information from input features, such as monotonic, convex, and concave. In this research, we propose a new classifier, called Shape-Restricted Support Vector Machine (SR-SVM), which takes the component-wise shape information to enhance classification accuracy. There exists vast research literature on monotonic classification covering monotonic or ordinal shapes. Our proposed classifier extends to handle convex and concave types of features, and combinations of these types. While standard SVM uses linear separating hyperplanes, our novel SR-SVM essentially constructs non-parametric and nonlinear separating planes subject to component-wise shape restrictions. We formulate SR-SVM classifier as a convex optimization problem and solve it using an active-set algorithm. The approach applies basis function expansions on the input and effectively utilizes the standard SVM solver. We illustrate our methodology using simulation and real world examples, and show that SR-SVM improves the classification performance with additional shape information of input.

Download Full-text

An Expert System Based on Fisher Score and LS-SVM for Cardiac Arrhythmia Diagnosis

Computational and Mathematical Methods in Medicine ◽

10.1155/2013/849674 ◽

2013 ◽

Vol 2013 ◽

pp. 1-6 ◽

Cited By ~ 19

Author(s):

Ersen Yılmaz

Keyword(s):

Expert System ◽

Cardiac Arrhythmia ◽

Feature Space ◽

Support Vector ◽

Feature Subset ◽

Fisher Score ◽

Data Set ◽

Second Stage ◽

Vector Machines ◽

Two Stages

An expert system having two stages is proposed for cardiac arrhythmia diagnosis. In the first stage, Fisher score is used for feature selection to reduce the feature space dimension of a data set. The second stage is classification stage in which least squares support vector machines classifier is performed by using the feature subset selected in the first stage to diagnose cardiac arrhythmia. Performance of the proposed expert system is evaluated by using an arrhythmia data set which is taken from UCI machine learning repository.

Download Full-text

A novel feature selection algorithm based on damping oscillation theory

PLoS ONE ◽

10.1371/journal.pone.0255307 ◽

2021 ◽

Vol 16 (8) ◽

pp. e0255307

Author(s):

Fujun Wang ◽

Xing Wang

Keyword(s):

Feature Selection ◽

Optimization Algorithm ◽

Euclidean Distance ◽

Oscillation Theory ◽

Feature Subset Selection ◽

Support Vector ◽

Data Sets ◽

Feature Subset ◽

Selection Algorithm ◽

Filter Model

Feature selection is an important task in big data analysis and information retrieval processing. It reduces the number of features by removing noise, extraneous data. In this paper, one feature subset selection algorithm based on damping oscillation theory and support vector machine classifier is proposed. This algorithm is called the Maximum Kendall coefficient Maximum Euclidean Distance Improved Gray Wolf Optimization algorithm (MKMDIGWO). In MKMDIGWO, first, a filter model based on Kendall coefficient and Euclidean distance is proposed, which is used to measure the correlation and redundancy of the candidate feature subset. Second, the wrapper model is an improved grey wolf optimization algorithm, in which its position update formula has been improved in order to achieve optimal results. Third, the filter model and the wrapper model are dynamically adjusted by the damping oscillation theory to achieve the effect of finding an optimal feature subset. Therefore, MKMDIGWO achieves both the efficiency of the filter model and the high precision of the wrapper model. Experimental results on five UCI public data sets and two microarray data sets have demonstrated the higher classification accuracy of the MKMDIGWO algorithm than that of other four state-of-the-art algorithms. The maximum ACC value of the MKMDIGWO algorithm is at least 0.5% higher than other algorithms on 10 data sets.

Download Full-text

MAPPING OF HIGH VALUE CROPS THROUGH AN OBJECT-BASED SVM MODEL USING LIDAR DATA AND ORTHOPHOTO IN AGUSAN DEL NORTE PHILIPPINES

ISPRS Annals of Photogrammetry Remote Sensing and Spatial Information Sciences ◽

10.5194/isprsannals-iii-7-165-2016 ◽

2016 ◽

Vol III-7 ◽

pp. 165-172 ◽

Cited By ~ 1

Author(s):

Rudolph Joshua Candare ◽

Michelle Japitana ◽

James Earl Cubillas ◽

Cherry Bryan Ramirez

Keyword(s):

Regularization Parameter ◽

Optimization Procedure ◽

Feature Space ◽

Resource Assessment ◽

Support Vector ◽

Lidar Data ◽

High Resolution Data ◽

Object Based ◽

Rule Sets ◽

Rule Set

This research describes the methods involved in the mapping of different high value crops in Agusan del Norte Philippines using LiDAR. This project is part of the Phil-LiDAR 2 Program which aims to conduct a nationwide resource assessment using LiDAR. Because of the high resolution data involved, the methodology described here utilizes object-based image analysis and the use of optimal features from LiDAR data and Orthophoto. Object-based classification was primarily done by developing rule-sets in eCognition. Several features from the LiDAR data and Orthophotos were used in the development of rule-sets for classification. Generally, classes of objects can't be separated by simple thresholds from different features making it difficult to develop a rule-set. To resolve this problem, the image-objects were subjected to Support Vector Machine learning. SVMs have gained popularity because of their ability to generalize well given a limited number of training samples. However, SVMs also suffer from parameter assignment issues that can significantly affect the classification results. More specifically, the regularization parameter C in linear SVM has to be optimized through cross validation to increase the overall accuracy. After performing the segmentation in eCognition, the optimization procedure as well as the extraction of the equations of the hyper-planes was done in Matlab. The learned hyper-planes separating one class from another in the multi-dimensional feature-space can be thought of as super-features which were then used in developing the classifier rule set in eCognition. In this study, we report an overall classification accuracy of greater than 90% in different areas.

Download Full-text