Small Sample Size Problem: Latest Research Papers

Deep neural networks are successful learning tools for building nonlinear models. However, a robust deep learning-based classification model needs a large dataset, and such models are often unstable when trained on small datasets. To address this issue, which is particularly critical given the possible clinical applications of these predictive models, researchers have developed approaches such as virtual sample generation. Virtual sample generation significantly improves learning and classification performance when working with small samples. The main objective of this study is to evaluate the ability of the proposed virtual sample generation method to overcome the small sample size problem, which characterizes the automated detection of a neurodevelopmental disorder, namely autism spectrum disorder. Results show that our method raises diagnostic accuracy from 84% to 95% using virtual samples generated on the basis of five actual clinical samples. The present findings show the feasibility of using the proposed technique to improve classification performance even when clinical samples are of limited size. Given the concerns surrounding small sample sizes, our technique represents a meaningful step forward in pattern recognition methodology, particularly when applied to diagnostic classification of neurodevelopmental disorders. The proposed technique was also tested on other available benchmark datasets. The experimental outcomes showed that classification using virtual samples was more accurate than classification using the original training data alone.


A Novel Stacked Regression Algorithm Based on Slice Transform for Small Sample Size Problem in Spectroscopic Analysis

2018 5th International Conference on Information Science and Control Engineering (ICISCE) ◽

10.1109/icisce.2018.00026 ◽

2018 ◽

Author(s):

Yifan Wu ◽

Silong Peng ◽

Qiong Xie ◽

Quanjie Han

Keyword(s):

Sample Size ◽

Spectroscopic Analysis ◽

Small Sample Size ◽

Small Sample ◽

Small Sample Size Problem ◽

Size Problem


Classification of Dangerous Situations for Small Sample Size Problem in Maintenance Decision Support Systems

Communications in Computer and Information Science - Analysis of Images, Social Networks and Texts ◽

10.1007/978-3-319-52920-2_31 ◽

2017 ◽

pp. 338-345 ◽

Cited By ~ 1

Author(s):

Vladimir R. Milov ◽

Andrey V. Savchenko

Keyword(s):

Decision Support ◽

Sample Size ◽

Decision Support Systems ◽

Support Systems ◽

Small Sample Size ◽

Small Sample ◽

Small Sample Size Problem ◽

Size Problem ◽

Maintenance Decision


Imprecise reliability assessment for heavy numerical control machine tools against small sample size problem

Journal of Shanghai Jiaotong University (Science) ◽

10.1007/s12204-016-1770-8 ◽

2016 ◽

Vol 21 (5) ◽

pp. 605-610

Author(s):

Zheng Liu ◽

Yanfeng Li ◽

Hongzhong Huang

Keyword(s):

Sample Size ◽

Machine Tools ◽

Small Sample Size ◽

Reliability Assessment ◽

Small Sample ◽

Numerical Control ◽

Small Sample Size Problem ◽

Size Problem ◽

Control Machine


OPTIMAL WAVELENGTH SELECTION ON HYPERSPECTRAL DATA WITH FUSED LASSO FOR BIOMASS ESTIMATION OF TROPICAL RAIN FOREST

ISPRS Annals of Photogrammetry Remote Sensing and Spatial Information Sciences ◽

10.5194/isprsannals-iii-8-101-2016 ◽

2016 ◽

Vol III-8 ◽

pp. 101-108 ◽

Cited By ~ 3

Author(s):

T. Takayama ◽

A. Iwasaki

Keyword(s):

Rain Forest ◽

Tropical Rain Forest ◽

Prediction Accuracy ◽

Small Sample Size ◽

Small Sample ◽

Hyperspectral Data ◽

Biomass Estimation ◽

Lasso Regression ◽

Small Sample Size Problem ◽

Fused Lasso

Above-ground biomass prediction of tropical rain forest using remote sensing data is of paramount importance to continuous large-area forest monitoring. Hyperspectral data can provide rich spectral information for biomass prediction; however, the prediction accuracy is affected by the small-sample-size problem, which commonly appears as overfitting with high-dimensional data when the number of training samples is smaller than the dimensionality of the samples, owing to the limited time, budget, and human resources available for field surveys. A common approach to addressing this problem is to reduce the dimensionality of the dataset. In addition, acquired hyperspectral data usually have a low signal-to-noise ratio due to the narrow bandwidth, and show local or global shifts of peaks due to instrumental instability or small differences in practical measurement conditions. In this work, we propose a methodology based on fused lasso regression that selects optimal bands for the biomass prediction model by encouraging sparsity and grouping: sparsity addresses the small-sample-size problem through dimensionality reduction, and grouping addresses the noise and peak-shift problems. The prediction model achieved a root-mean-square error (RMSE) of 66.16 t/ha in cross-validation, more accurate than the other methods compared: multiple linear analysis, partial least squares regression, and lasso regression. Furthermore, fusing spectral information with spatial information derived from a texture index improved the prediction accuracy to an RMSE of 62.62 t/ha. This analysis demonstrates the effectiveness of fused lasso and image texture in biomass estimation of tropical forests.
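To make the "sparsity and grouping" idea in the abstract above concrete, here is a minimal sketch of the fused lasso objective in its standard textbook form: a least-squares term, an L1 penalty on the coefficients (sparsity), and an L1 penalty on differences between adjacent coefficients (grouping). The abstract does not give the authors' exact formulation, so the function names, the two weights `lam_sparsity` and `lam_fusion`, and the toy coefficient vectors are all assumptions made for illustration.

```python
from typing import Sequence


def fused_lasso_penalty(beta: Sequence[float],
                        lam_sparsity: float,
                        lam_fusion: float) -> float:
    """Standard fused lasso penalty (hypothetical helper, not the authors' code).

    lam_sparsity weights the sum of |beta_j|, pushing coefficients to zero.
    lam_fusion weights the sum of |beta_j - beta_{j-1}|, pushing adjacent
    coefficients (here: neighbouring spectral bands) toward equal values.
    """
    sparsity = sum(abs(b) for b in beta)
    fusion = sum(abs(beta[j] - beta[j - 1]) for j in range(1, len(beta)))
    return lam_sparsity * sparsity + lam_fusion * fusion


def fused_lasso_objective(X: Sequence[Sequence[float]],
                          y: Sequence[float],
                          beta: Sequence[float],
                          lam_sparsity: float,
                          lam_fusion: float) -> float:
    """Half the residual sum of squares plus the fused lasso penalty."""
    residual_ss = 0.0
    for row, target in zip(X, y):
        prediction = sum(x * b for x, b in zip(row, beta))
        residual_ss += (target - prediction) ** 2
    return 0.5 * residual_ss + fused_lasso_penalty(beta, lam_sparsity, lam_fusion)


# Two made-up coefficient vectors over six adjacent bands, same L1 norm (3.0).
grouped = [0.0, 1.0, 1.0, 1.0, 0.0, 0.0]    # one contiguous block of bands
scattered = [1.0, 0.0, 1.0, 0.0, 1.0, 0.0]  # isolated, non-adjacent bands

penalty_grouped = fused_lasso_penalty(grouped, 1.0, 1.0)
penalty_scattered = fused_lasso_penalty(scattered, 1.0, 1.0)
```

With equal weights, the contiguous selection is penalised less than the scattered one even though both use three bands, which is the sense in which the fusion term encourages grouped band selection. Minimising the objective requires a convex solver, which this sketch does not include.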


OPTIMAL WAVELENGTH SELECTION ON HYPERSPECTRAL DATA WITH FUSED LASSO FOR BIOMASS ESTIMATION OF TROPICAL RAIN FOREST

ISPRS Annals of Photogrammetry Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-annals-iii-8-101-2016 ◽

2016 ◽

Vol III-8 ◽

pp. 101-108 ◽

Cited By ~ 1

Author(s):

T. Takayama ◽

A. Iwasaki

Keyword(s):

Rain Forest ◽

Tropical Rain Forest ◽

Prediction Accuracy ◽

Small Sample Size ◽

Small Sample ◽

Hyperspectral Data ◽

Biomass Estimation ◽

Lasso Regression ◽

Small Sample Size Problem ◽

Fused Lasso


Locality Sensitive Proximal Classifier with Consistency for Small Sample Size Problem

2015 IEEE International Conference on Data Mining Workshop (ICDMW) ◽

10.1109/icdmw.2015.180 ◽

2015 ◽

Author(s):

Yuan-Hai Shao ◽

Zhen Wang ◽

Chun-Na Li ◽

Nai-Yang Deng

Keyword(s):

Sample Size ◽

Small Sample Size ◽

Small Sample ◽

Small Sample Size Problem ◽

Size Problem


small sample size problem
Recently Published Documents

An Improved Virtual Sample Generation Method Based on Quadrat Density Method and Quantile Regression for Small Sample Size Problem

Can Virtual Samples Solve Small Sample Size Problem of KISSME in Pedestrian Re-Identification of Smart Transportation?

A Mega-Trend-Diffusion and Monte Carlo based virtual sample generation method for small sample size problem

A Novel Virtual Sample Generation Method to Overcome the Small Sample Size Problem in Computer Aided Medical Diagnosing

A Novel Stacked Regression Algorithm Based on Slice Transform for Small Sample Size Problem in Spectroscopic Analysis

Classification of Dangerous Situations for Small Sample Size Problem in Maintenance Decision Support Systems

Imprecise reliability assessment for heavy numerical control machine tools against small sample size problem

OPTIMAL WAVELENGTH SELECTION ON HYPERSPECTRAL DATA WITH FUSED LASSO FOR BIOMASS ESTIMATION OF TROPICAL RAIN FOREST

OPTIMAL WAVELENGTH SELECTION ON HYPERSPECTRAL DATA WITH FUSED LASSO FOR BIOMASS ESTIMATION OF TROPICAL RAIN FOREST

Locality Sensitive Proximal Classifier with Consistency for Small Sample Size Problem
