Complexity Selection with Cross-validation for Lasso and Sparse Partial Least Squares Using High-Dimensional Data

Author(s):  
Anne-Laure Boulesteix ◽  
Adrian Richter ◽  
Christoph Bernau
2012 ◽  
Vol 31 (3pt4) ◽  
pp. 1345-1354 ◽  
Author(s):  
Jose Gustavo S. Paiva ◽  
William Robson Schwartz ◽  
Helio Pedrini ◽  
Rosane Minghim

2012 ◽  
Vol 468-471 ◽  
pp. 1762-1766 ◽  
Author(s):  
Dong Yan ◽  
Shao Wei Liu ◽  
Jian Tang

Feature selection for modeling the high dimensional data, such as the near-infrared spectrum (NIR) is very important. A novel modeling approach combined the adaptive genetic algorithm-kernel partial least squares (AGA-KPLS) is proposed. The KPLS algorithm is used to construct nonlinear models with the popular kernel based modeling technology. The AGA is used to select the optimal feature sub-set from the original high dimensional data, which also used to select the kernel parameters of the KPLS algorithm simultaneously. The experimental results based on the vibration frequency spectrum show that the proposed approach has better prediction performance than the normal GA-PLS method.


The Analyst ◽  
2018 ◽  
Vol 143 (15) ◽  
pp. 3526-3539 ◽  
Author(s):  
Loong Chuen Lee ◽  
Choong-Yeun Liong ◽  
Abdul Aziz Jemain

This review highlights and discusses critically various knowledge gaps in classification modelling using PLS-DA for high dimensional data.


Agronomy ◽  
2021 ◽  
Vol 11 (4) ◽  
pp. 666
Author(s):  
Rafael Font ◽  
Mercedes del Río-Celestino ◽  
Diego Luna ◽  
Juan Gil ◽  
Antonio de Haro-Bailón

The near-infrared spectroscopy (NIRS) combined with modified partial least squares (modified PLS) regression was used for determining the neutral detergent fiber (NDF) and the acid detergent fiber (ADF) fractions of the chickpea (Cicer arietinum L.) seed. Fifty chickpea accessions (24 desi and 26 kabuli types) and fifty recombinant inbred lines F5:6 derived from a kabuli × desi cross were evaluated for NDF and ADF, and scanned by NIRS. NDF and ADF values were regressed against different spectral transformations by modified partial least squares regression. The coefficients of determination in the cross-validation and the standard deviation from the standard error of cross-validation ratio were, for NDF, 0.91 and 3.37, and for ADF, 0.98 and 6.73, respectively, showing the high potential of NIRS to assess these components in chickpea for screening (NDF) or quality control (ADF) purposes. The spectral information provided by different chromophores existing in the chickpea seed highly correlated with the NDF and ADF composition of the seed, and, thus, those electronic transitions are highly influenced on model fitting for fiber.


Sign in / Sign up

Export Citation Format

Share Document