scholarly journals Automated Fitting Process Using Robust Reliable Weighted Average on Near Infrared Spectral Data Analysis

Symmetry ◽  
2020 ◽  
Vol 12 (12) ◽  
pp. 2099
Author(s):  
Divo Dharma Silalahi ◽  
Habshah Midi ◽  
Jayanthi Arasan ◽  
Mohd Shafie Mustafa ◽  
Jean-Pierre Caliman

With the complexity of Near Infrared (NIR) spectral data, the selection of the optimal number of Partial Least Squares (PLS) components in the fitted Partial Least Squares Regression (PLSR) model is very important. Selecting a small number of PLS components leads to under fitting, whereas selecting a large number of PLS components results in over fitting. Several methods exist in the selection procedure, and each yields a different result. However, so far no one has been able to determine the more superior method. In addition, the current methods are susceptible to the presence of outliers and High Leverage Points (HLP) in a dataset. In this study, a new automated fitting process method on PLSR model is introduced. The method is called the Robust Reliable Weighted Average—PLS (RRWA-PLS), and it is less sensitive to the optimum number of PLS components. The RRWA-PLS uses the weighted average strategy from multiple PLSR models generated by the different complexities of the PLS components. The method assigns robust procedures in the weighing schemes as an improvement to the existing Weighted Average—PLS (WA-PLS) method. The weighing schemes in the proposed method are resistant to outliers and HLP and thus, preserve the contribution of the most relevant variables in the fitted model. The evaluation was done by utilizing artificial data with the Monte Carlo simulation and NIR spectral data of oil palm (Elaeis guineensis Jacq.) fruit mesocarp. Based on the results, the method claims to have shown its superiority in the improvement of the weight and variable selection procedures in the WA-PLS. It is also resistant to the influence of outliers and HLP in the dataset. The RRWA-PLS method provides a promising robust solution for the automated fitting process in the PLSR model as unlike the classical PLS, it does not require the selection of an optimal number of PLS components.

2019 ◽  
Vol 11 (36) ◽  
pp. 4593-4599
Author(s):  
Shaohui Yu ◽  
Jing Liu

A weighted clustering and pruning of wavelength variables-partial least squares (WCPV-PLS) method was proposed.


2006 ◽  
Vol 39 (9) ◽  
pp. 2073-2083 ◽  
Author(s):  
Zhenqiang Su ◽  
Weida Tong ◽  
Leming Shi ◽  
Xueguang Shao ◽  
Wensheng Cai

2017 ◽  
Vol 38 (1) ◽  
pp. 590-594
Author(s):  
Chen Yueyang ◽  
Gao Zhishan ◽  
Yu Xiaohui ◽  
Zhu Dan ◽  
Chen Ming ◽  
...  

1993 ◽  
Vol 47 (11) ◽  
pp. 1747-1750 ◽  
Author(s):  
Raymond Lew ◽  
Stephen T. Balke

In this novel application of a multivariate method, partial least-squares (PLS) was used to generate mid-infrared (MIR) spectra (rather than selected concentrations) from near-infrared (NIR) spectra. The NIR spectra were obtained by in-line monitoring of a molten polymer blend of polyethylene with polypropylene during extrusion. Off-line MIR spectra of blends were used to calibrate the PLS method. Then PLS was used to generate the MIR absorbance spectrum of a 50:50-by-weight blend not included in the calibration set from its NIR spectrum. The synthesized MIR spectrum agreed very well with a directly measured one. The exception was absorbance peaks which were so strong that they apparently represented responses that were nonlinear with respect to concentration. Although more evaluation work has yet to be done, these results are encouraging, and they indicate that NIR interpretation may readily borrow the strengths of MIR interpretation both qualitatively and quantitatively.


2003 ◽  
Vol 57 (5) ◽  
pp. 551-556 ◽  
Author(s):  
Miryeong Sohn ◽  
Franklin E. Barton ◽  
Wiley H. Morrison ◽  
Douglas D. Archibald

Shive, the nonfiberous core portion of the stem, in flax fiber after retting is related to fiber quality. The objective of this study is to develop a standard calibration model for determining shive content in retted flax by using near-infrared reflectance spectroscopy. Calibration samples were prepared by manually mixing pure, ground shive and pure, ground fiber from flax retted by three different methods (water, dew, and enzyme retting) to provide a wide range of shive content from 0 to 100%. Partial least-squares (PLS) regression was used to generate a calibration model, and spectral data were processed using various pretreatments such as a multiplicative scatter correction (MSC), normalization, derivatives, and Martens' Uncertainty option to improve the calibration model. The calibration model developed with a single sample set resulted in a standard error of 1.8% with one factor. The best algorithm was produced from first-derivative processing of the spectral data. MSC was not effective processing for this model. However, a big bias was observed when independent sample sets were applied to this calibration model to predict shive content in flax fiber. The calibration model developed using a combination sample set showed a slightly higher standard error and number of factors compared to the model for a single sample set, but this model was sufficiently accurate to apply to each sample set. The best algorithm for the combination sample set was generated from second derivatives followed by MSC processing of spectral data and from Martens' Uncertainty option; it resulted in a standard error of 2.3% with 2 factors. The value of the digital second derivative centered at 1674 nm for these spectral data was highly correlated to shive content of flax and could form the basis for a simple, low-cost sensor for the shive or fiber content in retted flax.


2007 ◽  
Vol 15 (3) ◽  
pp. 153-159 ◽  
Author(s):  
Zou Xiaobo ◽  
Li Yanxiao ◽  
Zhao Jiewen

A near infrared (NIR) spectroscopy acquisition device was developed in this study using an apple as the test sample. With this device, the apple was rolled while collecting the NIR spectra. The feasibility of using efficient selection of wavelength regions in Fourier transform NIR for a rapid and conclusive determination of the inner qualities of fruit such as soluble solids content (SSC) of apples was investigated. Graphically-oriented local multivariate calibration modelling procedures called genetic algorithm interval partial least-squares (GA-iPLS) were applied to select efficient spectral regions that provide the lowest prediction error, in comparison to the full-spectrum model. The optimal SSC predictions were obtained from a seven-factor model using five intervals among 40 intervals selected by GA-iPLS. In the determination, a root mean square error of prediction of 0.42 °Brix for SSC of apples was obtained. The result demonstrated that the new method is a very useful and effective method for developing high precision PLS models based on optimal wavelength regions.


2000 ◽  
Vol 54 (3) ◽  
pp. 413-419 ◽  
Author(s):  
L. Nørgaard ◽  
A. Saudland ◽  
J. Wagner ◽  
J. P. Nielsen ◽  
L. Munck ◽  
...  

A new graphically oriented local modeling procedure called interval partial least-squares ( iPLS) is presented for use on spectral data. The iPLS method is compared to full-spectrum partial least-squares and the variable selection methods principal variables (PV), forward stepwise selection (FSS), and recursively weighted regression (RWR). The methods are tested on a near-infrared (NIR) spectral data set recorded on 60 beer samples correlated to original extract concentration. The error of the full-spectrum correlation model between NIR and original extract concentration was reduced by a factor of 4 with the use of iPLS ( r = 0.998, and root mean square error of prediction equal to 0.17% plato), and the graphic output contributed to the interpretation of the chemical system under observation. The other methods tested gave a comparable reduction in the prediction error but suffered from the interpretation advantage of the graphic interface. The intervals chosen by iPLS cover both the variables found by FSS and all possible combinations as well as the variables found by PV and RWR, and iPLS is still able to utilize the first-order advantage.


Agronomy ◽  
2021 ◽  
Vol 11 (4) ◽  
pp. 666
Author(s):  
Rafael Font ◽  
Mercedes del Río-Celestino ◽  
Diego Luna ◽  
Juan Gil ◽  
Antonio de Haro-Bailón

The near-infrared spectroscopy (NIRS) combined with modified partial least squares (modified PLS) regression was used for determining the neutral detergent fiber (NDF) and the acid detergent fiber (ADF) fractions of the chickpea (Cicer arietinum L.) seed. Fifty chickpea accessions (24 desi and 26 kabuli types) and fifty recombinant inbred lines F5:6 derived from a kabuli × desi cross were evaluated for NDF and ADF, and scanned by NIRS. NDF and ADF values were regressed against different spectral transformations by modified partial least squares regression. The coefficients of determination in the cross-validation and the standard deviation from the standard error of cross-validation ratio were, for NDF, 0.91 and 3.37, and for ADF, 0.98 and 6.73, respectively, showing the high potential of NIRS to assess these components in chickpea for screening (NDF) or quality control (ADF) purposes. The spectral information provided by different chromophores existing in the chickpea seed highly correlated with the NDF and ADF composition of the seed, and, thus, those electronic transitions are highly influenced on model fitting for fiber.


Sign in / Sign up

Export Citation Format

Share Document