Optimization Performance Comparison of Three Different Group Intelligence Algorithms on a SVM for Hyperspectral Imagery Classification

Group intelligence algorithms have been widely used in support vector machine (SVM) parameter optimization due to their obvious characteristics of strong parallel processing ability, fast optimization, and global optimization. However, few studies have made optimization performance comparisons of different group intelligence algorithms on SVMs, especially in terms of their application to hyperspectral remote sensing classification. In this paper, we compare the optimization performance of three different group intelligence algorithms that were run on a SVM in terms of five aspects by using three hyperspectral images (one each of the Indian Pines, University of Pavia, and Salinas): the stability to parameter settings, convergence rate, feature selection ability, sample size, and classification accuracy. Particle swarm optimization (PSO), genetic algorithms (GAs), and artificial bee colony (ABC) algorithms are the three group intelligence algorithms. Our results showed the influence of these three optimization algorithms on the C-parameter optimization of the SVM was less than their influence on the σ-parameter. The convergence rate, the number of selected features, and the accuracy of the three group intelligence algorithms were statistically significant different at the p = 0.01 level. The GA algorithm could compress more than 70% of the original data and it was the least affected by sample size. GA-SVM had the highest average overall accuracy (91.77%), followed by ABC-SVM (88.73%), and PSO-SVM (86.65%). Especially, in complex scenes (e.g., the Indian Pines image), GA-SVM showed the highest classification accuracy (87.34%, which was 8.23% higher than ABC-SVM and 16.42% higher than PSO-SVM) and the best stability (the standard deviation of its classification accuracy was 0.82%, which was 5.54% lower than ABC-SVM, and 21.63% lower than PSO-SVM). Therefore, when compared with the ABC and PSO algorithms, the GA had more advantages in terms of feature band selection, small sample size classification, and classification accuracy.

Download Full-text

Kernel Null Space Marginal Fisher Analysis for Face Recognition

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.889-890.1065 ◽

2014 ◽

Vol 889-890 ◽

pp. 1065-1068

Author(s):

Yu’e Lin ◽

Xing Zhu Liang ◽

Hua Ping Zhou

Keyword(s):

Feature Extraction ◽

Face Recognition ◽

Sample Size ◽

High Performance ◽

Null Space ◽

Small Sample Size ◽

Recognition Rate ◽

Original Data ◽

Small Sample ◽

Neighborhood Structure

In the recent years, the feature extraction algorithms based on manifold learning, which attempt to project the original data into a lower dimensional feature space by preserving the local neighborhood structure, have drawn much attention. Among them, the Marginal Fisher Analysis (MFA) achieved high performance for face recognition. However, MFA suffers from the small sample size problems and is still a linear technique. This paper develops a new nonlinear feature extraction algorithm, called Kernel Null Space Marginal Fisher Analysis (KNSMFA). KNSMFA based on a new optimization criterion is presented, which means that all the discriminant vectors can be calculated in the null space of the within-class scatter. KNSMFA not only exploits the nonlinear features but also overcomes the small sample size problems. Experimental results on ORL database indicate that the proposed method achieves higher recognition rate than the MFA method and some existing kernel feature extraction algorithms.

Download Full-text

A LAZY SUPPORT VECTOR REGRESSION MODEL FOR PREDICTION PROBLEMS WITH SMALL SAMPLE SIZE

Neural Network World ◽

10.14311/nnw.2019.29.003 ◽

2019 ◽

Vol 29 (1) ◽

pp. 33-44

Author(s):

Xuyun Fu ◽

Hui Luo ◽

Guangyao Zhang ◽

Shisheng Zhong

Keyword(s):

Regression Model ◽

Sample Size ◽

Support Vector Regression ◽

Small Sample Size ◽

Small Sample ◽

Support Vector ◽

Support Vector Regression Model ◽

Prediction Problems

Download Full-text

Stable Bagging Feature Selection on Medical Data

10.21203/rs.3.rs-50237/v2 ◽

2020 ◽

Author(s):

Salem Alelyani

Keyword(s):

Feature Selection ◽

Sample Size ◽

Classification Accuracy ◽

Variance Reduction ◽

Small Sample Size ◽

Small Sample ◽

Domain Experts ◽

Complex Dimensions ◽

The Stability ◽

Selection Algorithms

Abstract In the medical eld, distinguishing genes that are relevant to a specific disease, let's say colon cancer, is crucial to finding a cure and understanding its causes and subsequent complications. Usually, medical datasets are comprised of immensely complex dimensions with considerably small sample size. Thus, for domain experts, such as biologists, the task of identifying these genes have become a very challenging one, to say the least. Feature selection is a technique that aims to select these genes, or features in machine learning eld with respect to the disease. However, learning from a medical dataset to identify relevant features suers from the curse-of-dimensionality. Due to a large number of features with a small sample size, the selection usually returns a different subset each time a new sample is introduced into the dataset. This selection instability is intrinsically related to data variance. We assume that reducing data variance improves selection stability. In this paper, we propose an ensemble approach based on the bagging technique to improve feature selection stability in medical datasets via data variance reduction. We conducted an experiment using four microarray datasets each of which suers from high dimensionality and relatively small sample size. On each dataset, we applied five well-known feature selection algorithms to select varying number of features.The proposed technique shows a significant improvement in selection stability while at least maintaining the classification accuracy. The stability improvement ranges from 20 to 50 percent in all cases. This implies that the likelihood of selecting the same features increased 20 to 50 percent more. This is accompanied with the increase of classification accuracy in most cases, which signifies the stated results of stability.

Download Full-text

Stable bagging feature selection on medical data

Journal Of Big Data ◽

10.1186/s40537-020-00385-8 ◽

2021 ◽

Vol 8 (1) ◽

Author(s):

Salem Alelyani

Keyword(s):

Feature Selection ◽

Sample Size ◽

Classification Accuracy ◽

Variance Reduction ◽

Small Sample Size ◽

Small Sample ◽

Domain Experts ◽

Complex Dimensions ◽

The Stability ◽

Selection Algorithms

AbstractIn the medical field, distinguishing genes that are relevant to a specific disease, let’s say colon cancer, is crucial to finding a cure and understanding its causes and subsequent complications. Usually, medical datasets are comprised of immensely complex dimensions with considerably small sample size. Thus, for domain experts, such as biologists, the task of identifying these genes have become a very challenging one, to say the least. Feature selection is a technique that aims to select these genes, or features in machine learning field with respect to the disease. However, learning from a medical dataset to identify relevant features suffers from the curse-of-dimensionality. Due to a large number of features with a small sample size, the selection usually returns a different subset each time a new sample is introduced into the dataset. This selection instability is intrinsically related to data variance. We assume that reducing data variance improves selection stability. In this paper, we propose an ensemble approach based on the bagging technique to improve feature selection stability in medical datasets via data variance reduction. We conducted an experiment using four microarray datasets each of which suffers from high dimensionality and relatively small sample size. On each dataset, we applied five well-known feature selection algorithms to select varying number of features. The proposed technique shows a significant improvement in selection stability while at least maintaining the classification accuracy. The stability improvement ranges from 20 to 50 percent in all cases. This implies that the likelihood of selecting the same features increased 20 to 50 percent more. This is accompanied with the increase of classification accuracy in most cases, which signifies the stated results of stability.

Download Full-text

Faculty Opinions recommendation of Power failure: why small sample size undermines the reliability of neuroscience.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature ◽

10.3410/f.718002370.793476498 ◽

2013 ◽

Author(s):

Björn Brembs

Keyword(s):

Sample Size ◽

Small Sample Size ◽

Small Sample ◽

Power Failure

Download Full-text

Faculty Opinions recommendation of Power failure: why small sample size undermines the reliability of neuroscience.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature ◽

10.3410/f.718002370.793475357 ◽

2013 ◽

Author(s):

Wayne Hall ◽

Adrian Carter

Keyword(s):

Sample Size ◽

Small Sample Size ◽

Small Sample ◽

Power Failure

Download Full-text

Application of Ultrasound Elastography for Assessing Intestinal Fibrosis in Inflammatory Bowel Disease: Fiction or Reality?

Current Drug Targets ◽

10.2174/1389450121666201119142919 ◽

2020 ◽

Vol 21 ◽

Author(s):

Roberto Gabbiadini ◽

Eirini Zacharopoulou ◽

Federica Furfaro ◽

Vincenzo Craviotto ◽

Alessandra Zilli ◽

...

Keyword(s):

Inflammatory Bowel Disease ◽

Sample Size ◽

Bowel Disease ◽

Small Sample Size ◽

Shear Wave Elastography ◽

Small Sample ◽

Ultrasound Elastography ◽

Invasive Technique ◽

Intestinal Fibrosis ◽

Inflammatory Bowel

Background: Intestinal fibrosis and subsequent strictures represent an important burden in inflammatory bowel disease (IBD). The detection and evaluation of the degree of fibrosis in stricturing Crohn’s disease (CD) is important to address the best therapeutic strategy (medical anti-inflammatory therapy, endoscopic dilation, surgery). Ultrasound elastography (USE) is a non-invasive technique that has been proposed in the field of IBD for evaluating intestinal stiffness as a biomarker of intestinal fibrosis. Objective: The aim of this review is to discuss the ability and current role of ultrasound elastography in the assessment of intestinal fibrosis. Results and Conclusion: Data on USE in IBD are provided by pilot and proof-of-concept studies with small sample size. The first type of USE investigated was strain elastography, while shear wave elastography has been introduced lately. Despite the heterogeneity of the methods of the studies, USE has been proven to be able to assess intestinal fibrosis in patients with stricturing CD. However, before introducing this technique in current practice, further studies with larger sample size and homogeneous parameters, testing reproducibility, and identification of validated cut-off values are needed.

Download Full-text

Fukunaga-Koontz transform for small sample size problems

IEE Irish Signals and Systems Conference 2005 ◽

10.1049/cp:20050304 ◽

2005 ◽

Cited By ~ 4

Author(s):

A.A. Miranda

Keyword(s):

Sample Size ◽

Small Sample Size ◽

Small Sample

Download Full-text

An Integrated Feature Selection Algorithm for Cancer Classification using Gene Expression Data

Combinatorial Chemistry & High Throughput Screening ◽

10.2174/1386207322666181220124756 ◽

2019 ◽

Vol 21 (9) ◽

pp. 631-645 ◽

Cited By ~ 5

Author(s):

Saeed Ahmed ◽

Muhammad Kabir ◽

Zakir Ali ◽

Muhammad Arif ◽

Farman Ali ◽

...

Keyword(s):

Gene Expression ◽

Feature Selection ◽

Gene Expression Data ◽

Classification Accuracy ◽

Early Stage ◽

Small Sample Size ◽

Feature Selection Method ◽

Small Sample ◽

Expression Data ◽

Base Function

Aim and Objective: Cancer is a dangerous disease worldwide, caused by somatic mutations in the genome. Diagnosis of this deadly disease at an early stage is exceptionally new clinical application of microarray data. In DNA microarray technology, gene expression data have a high dimension with small sample size. Therefore, the development of efficient and robust feature selection methods is indispensable that identify a small set of genes to achieve better classification performance. Materials and Methods: In this study, we developed a hybrid feature selection method that integrates correlation-based feature selection (CFS) and Multi-Objective Evolutionary Algorithm (MOEA) approaches which select the highly informative genes. The hybrid model with Redial base function neural network (RBFNN) classifier has been evaluated on 11 benchmark gene expression datasets by employing a 10-fold cross-validation test. Results: The experimental results are compared with seven conventional-based feature selection and other methods in the literature, which shows that our approach owned the obvious merits in the aspect of classification accuracy ratio and some genes selected by extensive comparing with other methods. Conclusion: Our proposed CFS-MOEA algorithm attained up to 100% classification accuracy for six out of eleven datasets with a minimal sized predictive gene subset.

Download Full-text