Hyperspectral Classification of Plants: A Review of Waveband Selection Generalisability

Hyperspectral sensing, measuring reflectance over visible to shortwave infrared wavelengths, has enabled the classification and mapping of vegetation at a range of taxonomic scales, often down to the species level. Classification with hyperspectral measurements, acquired by narrow band spectroradiometers or imaging sensors, has generally required some form of spectral feature selection to reduce the dimensionality of the data to a level suitable for the construction of a classification model. Despite the large number of hyperspectral plant classification studies, an in-depth review of feature selection methods and resultant waveband selections has not yet been performed. Here, we present a review of the last 22 years of hyperspectral vegetation classification literature that evaluates the overall waveband selection frequency, waveband selection frequency variation by taxonomic, structural, or functional group, and the influence of feature selection choice by comparing such methods as stepwise discriminant analysis (SDA), support vector machines (SVM), and random forests (RF). This review determined that all characteristics of hyperspectral plant studies influence the wavebands selected for classification. This includes the taxonomic, structural, and functional groups of the target samples, the methods and scale at which hyperspectral measurements are recorded, and the feature selection method used. Furthermore, these influences do not appear to be consistent. Moreover, the considerable variability in waveband selection caused by the feature selectors effectively masks the analysis of any variability between studies related to plant groupings. Additionally, questions are raised about the suitability of SDA as a feature selection method, as it produces waveband selections at odds with those of the other feature selectors. Caution is recommended when choosing a feature selector for hyperspectral plant classification: we recommend that multiple methods be performed. The resultant sets of selected spectral features can either be evaluated individually by multiple classification models or combined as an ensemble for evaluation by a single classifier. Additionally, we suggest caution when relying upon waveband recommendations from the literature to guide waveband selections or classifications for new plant discrimination applications, as such recommendations appear to be weakly generalizable between studies.
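
The recommendation to run several feature selectors and combine their outputs as an ensemble can be illustrated with a simple voting scheme. The sketch below is only one possible reading: the function name `ensemble_selection`, the `min_votes` threshold, and the waveband values are all hypothetical and are not taken from the review.

```python
from collections import Counter

def ensemble_selection(selections, min_votes=2):
    """Return the wavebands chosen by at least `min_votes` selectors, sorted."""
    # Count each waveband once per selector, even if a selector lists it twice.
    votes = Counter(band for bands in selections.values() for band in set(bands))
    return sorted(band for band, n in votes.items() if n >= min_votes)

# Hypothetical waveband picks (nm) from three selectors; illustrative values only.
selections = {
    "SDA": [450, 550, 680, 1450],
    "SVM": [550, 680, 720, 2200],
    "RF": [550, 720, 1450, 2200],
}

consensus = ensemble_selection(selections, min_votes=2)
unanimous = ensemble_selection(selections, min_votes=3)
print(consensus)
print(unanimous)
```

Raising `min_votes` trades coverage for agreement: a waveband kept only when every selector picks it is less likely to be an artefact of a single method.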


An Improved Network Traffic Classification Model Based on a Support Vector Machine

Symmetry ◽ 10.3390/sym12020301 ◽ 2020 ◽ Vol 12 (2) ◽ pp. 301 ◽ Cited By ~ 1

Author(s): Jie Cao ◽ Da Wang ◽ Zhaoyang Qu ◽ Hongyu Sun ◽ Bin Li ◽ ...

Keyword(s): Support Vector Machine ◽ Feature Selection ◽ Network Traffic ◽ Feature Selection Method ◽ Selection Method ◽ Classification Model ◽ Support Vector ◽ Traffic Classification ◽ Generalization Ability ◽ Network Traffic Classification

Network traffic classification based on machine learning is an important branch of pattern recognition in computer science. It is a key technology for dynamic intelligent network management and enhanced network controllability. However, traffic classification methods still face severe challenges: the optimal set of features is difficult to determine, and the classification method depends heavily on an effective combination of features. It is also important to balance the empirical risk and generalization ability of the classifier. In this paper, an improved network traffic classification model based on a support vector machine is proposed. First, a filter-wrapper hybrid feature selection method is proposed to prevent the false deletion of combined features caused by traditional feature selection methods. Second, to balance the empirical risk and generalization ability of the support vector machine (SVM) traffic classification model, an improved parameter optimization algorithm is proposed. The algorithm can dynamically adjust the quadratic search area, reduce the density of quadratic mesh generation, improve search efficiency, and prevent over-fitting while optimizing the parameters. The experiments show that the improved traffic classification model achieves higher classification accuracy, lower dimensionality, and shorter elapsed time, and performs significantly better than a traditional SVM and three other typical supervised ML algorithms.
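
The abstract does not spell out the parameter optimization algorithm. One common way to shrink a search area while keeping the mesh sparse is a coarse-to-fine grid search, sketched below under that assumption. The helper names, the halving schedule, and the toy `toy_score` function (standing in for cross-validated SVM accuracy over log2 C and log2 gamma) are all hypothetical.

```python
def grid(center, half_width, steps):
    """Evenly spaced points in [center - half_width, center + half_width]."""
    if steps == 1:
        return [center]
    step = 2 * half_width / (steps - 1)
    return [center - half_width + i * step for i in range(steps)]

def coarse_to_fine_search(score, center=(0.0, 0.0), half_width=8.0, steps=5, rounds=3):
    """Maximise score(log2_c, log2_gamma) on a grid that shrinks every round."""
    best, best_score = center, score(*center)
    for _ in range(rounds):
        cx, gx = best  # re-centre the grid on the best point found so far
        for c in grid(cx, half_width, steps):
            for g in grid(gx, half_width, steps):
                s = score(c, g)
                if s > best_score:
                    best, best_score = (c, g), s
        half_width /= 2  # halve the search area instead of densifying the mesh
    return best, best_score

def toy_score(log2_c, log2_gamma):
    """Hypothetical stand-in for cross-validated accuracy; peaks at (3, -5)."""
    return -((log2_c - 3) ** 2) - (log2_gamma + 5) ** 2

best_params, best_value = coarse_to_fine_search(toy_score)
print(best_params, best_value)
```

Each round evaluates the same small number of grid points, so the total cost grows linearly with the number of rounds rather than with the resolution of a single dense grid.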


A new Unsupervised Spectral Feature Selection Method for mixed data: A filter approach

Pattern Recognition ◽ 10.1016/j.patcog.2017.07.020 ◽ 2017 ◽ Vol 72 ◽ pp. 314-326 ◽ Cited By ~ 18

Author(s): Saúl Solorio-Fernández ◽ José Fco. Martínez-Trinidad ◽ J. Ariel Carrasco-Ochoa

Keyword(s): Feature Selection ◽ Spectral Feature ◽ Feature Selection Method ◽ Selection Method ◽ Mixed Data ◽ Filter Approach ◽ Spectral Feature Selection


A Supervised Filter Feature Selection method for mixed data based on Spectral Feature Selection and Information-theory redundancy analysis

Pattern Recognition Letters ◽ 10.1016/j.patrec.2020.07.039 ◽ 2020 ◽ Vol 138 ◽ pp. 321-328 ◽ Cited By ~ 1

Author(s): Saúl Solorio-Fernández ◽ José Fco. Martínez-Trinidad ◽ J. Ariel Carrasco-Ochoa

Keyword(s): Information Theory ◽ Feature Selection ◽ Redundancy Analysis ◽ Spectral Feature ◽ Feature Selection Method ◽ Selection Method ◽ Mixed Data ◽ Spectral Feature Selection


A fuzzy gaussian rank aggregation ensemble feature selection method for microarray data

International Journal of Knowledge-based and Intelligent Engineering Systems ◽ 10.3233/kes-190134 ◽ 2021 ◽ Vol 24 (4) ◽ pp. 289-301

Author(s): B. Venkatesh ◽ J. Anuradha

Keyword(s): Feature Selection ◽ Microarray Data ◽ Classification Accuracy ◽ Performance Metrics ◽ Feature Selection Method ◽ Selection Method ◽ Support Vector ◽ Svm Classifier ◽ Binary Particle Swarm Optimization ◽ Selection Methods

In microarray data, high classification accuracy is difficult to achieve because of high dimensionality and irrelevant, noisy data; such data sets also contain many gene expression values but few samples. To increase the classification accuracy and processing speed of the model, an optimal number of features needs to be extracted, which can be achieved by applying a feature selection method. In this paper, we propose a hybrid ensemble feature selection method. The proposed method has two phases: a filter phase and a wrapper phase. In the filter phase, an ensemble technique aggregates the feature ranks produced by the Relief, minimum Redundancy Maximum Relevance (mRMR), and Feature Correlation (FC) filter feature selection methods, using Fuzzy Gaussian membership function ordering to aggregate the ranks. In the wrapper phase, Improved Binary Particle Swarm Optimization (IBPSO) is used to select the optimal features, and an RBF kernel-based Support Vector Machine (SVM) classifier is used as the evaluator. The performance of the proposed model is compared with state-of-the-art feature selection methods on five benchmark datasets, using performance metrics such as Accuracy, Recall, Precision, and F1-Score. The experimental results show that the proposed method outperforms the other feature selection methods.
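
The abstract does not give the exact form of the fuzzy Gaussian aggregation, so the following is a rough sketch of one plausible reading: each filter's rank for a feature is mapped through a Gaussian membership function centred on rank 1, and the memberships are summed across filters. The function names, the `sigma` value, and the gene rankings are hypothetical.

```python
import math

def gaussian_membership(rank, sigma):
    """Membership in the 'top-ranked' fuzzy set; equals 1.0 at rank 1."""
    return math.exp(-((rank - 1) ** 2) / (2 * sigma ** 2))

def aggregate_ranks(rankings, sigma=2.0):
    """Combine best-first feature lists from several filters into one ordering."""
    scores = {}
    for ordered in rankings.values():
        for position, feature in enumerate(ordered, start=1):
            scores[feature] = scores.get(feature, 0.0) + gaussian_membership(position, sigma)
    # Highest total membership first; ties are broken by feature name.
    return sorted(scores, key=lambda f: (-scores[f], f))

# Hypothetical rankings from three filters (best feature first).
rankings = {
    "Relief": ["g1", "g2", "g3", "g4"],
    "mRMR": ["g2", "g1", "g4", "g3"],
    "FC": ["g2", "g3", "g1", "g4"],
}

aggregated = aggregate_ranks(rankings)
print(aggregated)
```

In a two-phase design such as the one described, the top of this aggregated list would be passed to the wrapper phase as its candidate feature pool.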


Feature Selection Method Based on Mutual Information and Support Vector Machine

International Journal of Pattern Recognition and Artificial Intelligence ◽ 10.1142/s021800142150021x ◽ 2021 ◽ pp. 2150021

Author(s): Gang Liu ◽ Chunlei Yang ◽ Sen Liu ◽ Chunbao Xiao ◽ Bin Song

Keyword(s): Support Vector Machine ◽ Feature Selection ◽ Mutual Information ◽ Classification Accuracy ◽ Feature Selection Method ◽ Selection Method ◽ Support Vector ◽ Svm Classifier ◽ Standard Data ◽ Feature Dimension

A feature selection method based on mutual information and support vector machine (SVM) is proposed in order to eliminate redundant features and improve classification accuracy. First, the local correlation between features and the overall correlation are calculated using mutual information. The correlation reflects the information inclusion relationship between features, so features are evaluated and redundant features are eliminated by analyzing the correlation. Subsequently, the concept of mean impact value (MIV) is defined, and the degree of influence of input variables on output variables for the SVM network is calculated based on MIV. The importance weights of the features described by MIV are sorted in descending order. Finally, the SVM classifier is used to implement feature selection according to the classification accuracy of feature combinations, taking the MIV order of the features as a reference. Simulation experiments are carried out on three standard UCI data sets, and the results show that this method not only effectively reduces the feature dimension and achieves high classification accuracy, but also ensures good robustness.
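
Mutual information between two discrete variables has a standard definition, I(X;Y) = Σ p(x,y) log2[p(x,y) / (p(x) p(y))], which can be estimated from co-occurrence counts. The sketch below shows only that estimate; it is not the paper's local/overall correlation procedure, and the toy feature vectors are hypothetical.

```python
import math
from collections import Counter

def mutual_information(xs, ys):
    """Estimate I(X;Y) in bits from paired samples of two discrete variables."""
    n = len(xs)
    px, py = Counter(xs), Counter(ys)
    pxy = Counter(zip(xs, ys))
    # p(x,y) * log2(p(x,y) / (p(x) p(y))), written in terms of raw counts.
    return sum((c / n) * math.log2(c * n / (px[x] * py[y]))
               for (x, y), c in pxy.items())

# Hypothetical discretised features.
feature_a = [0, 0, 1, 1]
feature_b = [0, 0, 1, 1]  # identical to feature_a, so fully redundant
feature_c = [0, 1, 0, 1]  # independent of feature_a in this sample

mi_redundant = mutual_information(feature_a, feature_b)
mi_independent = mutual_information(feature_a, feature_c)
print(mi_redundant, mi_independent)
```

A feature whose mutual information with an already selected feature is high carries little new information, which is the intuition behind using it to flag redundancy.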


A Hybrid Feature Selection Method Based on Symmetrical Uncertainty and Support Vector Machine for High-Dimensional Data Classification

Intelligent Information and Database Systems - Lecture Notes in Computer Science ◽ 10.1007/978-3-319-54472-4_67 ◽ 2017 ◽ pp. 721-727 ◽ Cited By ~ 2

Author(s): Yongjun Piao ◽ Keun Ho Ryu

Keyword(s): Support Vector Machine ◽ Feature Selection ◽ High Dimensional Data ◽ Feature Selection Method ◽ Data Classification ◽ Selection Method ◽ High Dimensional ◽ Support Vector ◽ Symmetrical Uncertainty


Spectral Feature Selection with Particle Swarm Optimization for Hyperspectral Classification

2012 International Conference on Industrial Control and Electronics Engineering ◽ 10.1109/icicee.2012.116 ◽ 2012 ◽ Cited By ~ 1

Author(s): Jun Li ◽ Sheng Ding

Keyword(s): Feature Selection ◽ Particle Swarm Optimization ◽ Spectral Feature ◽ Particle Swarm ◽ Swarm Optimization ◽ Spectral Feature Selection ◽ Hyperspectral Classification


Feature Selection Method for Hydraulic System Faults Diagnosis Based on GA-PLS

Applied Mechanics and Materials ◽ 10.4028/www.scientific.net/amm.44-47.1130 ◽ 2010 ◽ Vol 44-47 ◽ pp. 1130-1134

Author(s): Sheng Li ◽ Pei Lin Zhang ◽ Bing Li

Keyword(s): Feature Selection ◽ Hydraulic System ◽ Nearest Neighbor ◽ Feature Selection Method ◽ Original Data ◽ Selection Method ◽ Classification Model ◽ K Nearest Neighbor ◽ K Nearest Neighbor Algorithm ◽ Faults Diagnosis

Feature selection is a key step in hydraulic system fault diagnosis. Some of the collected features are unrelated to the classification model, and some are highly correlated with other features. These features are harmful when establishing a classification model. To solve this problem, genetic algorithm-partial least squares (GA-PLS) is proposed for selecting representative and optimal features. The K nearest neighbor algorithm (KNN) is used for diagnosing and classifying hydraulic system faults. To demonstrate the performance of GA-PLS, the original data of a model engineering hydraulic system are used, and the results of GA-PLS are compared with those obtained using all features and using GA. The experimental results show that the proposed feature selection method can diagnose and classify hydraulic system faults more efficiently while using fewer features.


Feature Selection Method for L1-norm Twin Support Vector Regression

10.1109/iccais52680.2021.9624538 ◽ 2021

Author(s): Fan Wang ◽ Qing Wu ◽ Yanlin Fu

Keyword(s): Feature Selection ◽ Support Vector Regression ◽ Feature Selection Method ◽ Selection Method ◽ Support Vector ◽ L1 Norm ◽ Twin Support Vector Regression


Computer-Aided Detection of Polyps in CT Colonography by Means of Feature Selection and Massive-Training Support Vector Regression

Machine Learning in Computer-Aided Diagnosis - Advances in Bioinformatics and Biomedical Engineering ◽ 10.4018/978-1-4666-0059-1.ch009 ◽ 2012 ◽ pp. 178-201

Author(s): Jian-Wu Xu ◽ Kenji Suzuki

Keyword(s): Feature Selection ◽ Support Vector Regression ◽ Ct Colonography ◽ Feature Selection Method ◽ Selection Method ◽ Support Vector ◽ Computer Aided Detection ◽ Training Time ◽ Computer Aided ◽ Wilks Lambda

One of the major challenges in current Computer-Aided Detection (CADe) of polyps in CT Colonography (CTC) is to improve the specificity without sacrificing the sensitivity. If a large number of False Positive (FP) detections of polyps are produced by the scheme, radiologists might lose their confidence in the use of CADe. In this chapter, the authors used a nonlinear regression model operating on image voxels and a nonlinear classification model with extracted image features based on Support Vector Machines (SVMs). They investigated the feasibility of a Support Vector Regression (SVR) in the massive-training framework, and the authors developed a Massive-Training SVR (MTSVR) in order to reduce the long training time associated with the Massive-Training Artificial Neural Network (MTANN) for reduction of FPs in CADe of polyps in CTC. In addition, the authors proposed a feature selection method directly coupled with an SVM classifier to maximize the CADe system performance. They compared the proposed feature selection method with the conventional stepwise feature selection based on Wilks’ lambda with a linear discriminant analysis classifier. The FP reduction system based on the proposed feature selection method was able to achieve a 96.0% by-polyp sensitivity with an FP rate of 4.1 per patient. The performance is better than that of the stepwise feature selection based on Wilks’ lambda (which yielded the same sensitivity with 18.0 FPs/patient). To test the performance of the proposed MTSVR, the authors compared it with the original MTANN in the distinction between actual polyps and various types of FPs in terms of the training time reduction and FP reduction performance. The authors’ CTC database consisted of 240 CTC datasets obtained from 120 patients in the supine and prone positions. With MTSVR, they reduced the training time by a factor of 190, while achieving a performance (by-polyp sensitivity of 94.7% with 2.5 FPs/patient) comparable to that of the original MTANN (which has the same sensitivity with 2.6 FPs/patient).
