Examination of Criteria for Local Model Principal Component Regression

1997 ◽  
Vol 51 (12) ◽  
pp. 1814-1822 ◽  
Author(s):  
Gregory A. Bakken ◽  
Dixie R. Long ◽  
John H. Kalivas

In analytical chemistry, principal component regression (PCR) is widely used as a method for calibration and prediction. The motivation behind PCR is to select factors associated with predictive information and eliminate those associated with noise. The classical approach, referred to as top-down selection, chooses sequential factors based on singular value magnitudes, and the same factors are used for all future unknown samples; i.e., a global model is formed. The number of factors needed is often determined through cross-validation on the calibration samples or with an external validation set. Alternatively, a model developed specific to an unknown sample, i.e., a local model or sample-dependent model, could offer improved accuracy. The idea behind sample-dependent PCR is that factors associated with small singular values not included in a top-down PCR model can still contain relevant predictive information. This paper shows that local models generated by selecting factors on a sample-by-sample basis often reduce prediction errors compared with those for the global top-down model. However, evidence is also provided that supports the use of global top-down models. Several criteria are proposed and examined for selecting factors on a sample-dependent basis. Observations and conclusions presented are based on two near-infrared data sets.

1992 ◽  
Vol 46 (11) ◽  
pp. 1685-1694 ◽  
Author(s):  
Tomas Isaksson ◽  
Charles E. Miller ◽  
Tormod Næs

In this work, the abilities of near-infrared diffuse reflectance (NIR) and transmittance (NIT) spectroscopy to noninvasively determine the protein, fat, and water contents of plastic-wrapped homogenized meat are evaluated. One hundred homogenized beef samples, ranging from 1 to 23% fat, wrapped in polyamide/polyethylene laminates, were used. Results of multivariate calibration and prediction for protein, fat, and water contents are presented. The optimal test set prediction errors (root mean square error of prediction, RMSEP), obtained with the use of the principal component regression method with NIR data, were 0.45, 0.29 and 0.50 weight % for protein, fat, and water, respectively, for plastic-wrapped meat (compared to 0.40, 0.28 and 0.45 wt % for unwrapped meat). The optimal prediction errors for the NIT method were 0.31, 0.52 and 0.42 wt % for protein, fat, and water, respectively, for plastic-wrapped meat samples (compared to 0.27, 0.38, and 0.37 wt % for unwrapped meat). We can conclude that the addition of the laminate only slightly reduced the abilities of the NIR and NIT method to predict protein, fat, and water contents in homogenized meat.


1988 ◽  
Vol 42 (7) ◽  
pp. 1273-1284 ◽  
Author(s):  
Tomas Isaksson ◽  
Tormod Næs

Near-infrared (NIR) reflectance spectra of five different food products were measured. The spectra were transformed by multiplicative scatter correction (MSC). Principal component regression (PCR) was performed, on both scatter-corrected and uncorrected spectra. Calibration and prediction were performed for four food constituents: protein, fat, water, and carbohydrates. All regressions gave lower prediction errors (7–68% improvement) by the use of MSC spectra than by the use of uncorrected absorbance spectra. One of these data sets was studied in more detail to clarify the effects of the MSC, by using PCR score, residual, and leverage plots. The improvement by using nonlinear regression methods is indicated.


2002 ◽  
Vol 56 (12) ◽  
pp. 1593-1599 ◽  
Author(s):  
Peter Snoer Jensen ◽  
Jimmy Bak

This study investigates the use of a dual-beam, optical null, FT-IR spectrometer to measure trace organic components in aqueous solutions in the combination band region 5000–4000 cm−1. The spectrometer may be used for both single- and dual-beam measurements, thereby facilitating comparison of these two modes of operation. The concentrations of aqueous solutions of urea and glucose in the ranges 0–40 mg/dL and 0–250 mg/dL, respectively, were determined by principal component regression using both modes. The dual-beam technique eliminated instrumental variations present in the single-beam measurements that must be taken into account when quantifying trace components from single-beam spectra. The data obtained with the dual-beam technique resulted in more stable calibration models based on principal component regression. These calibration models need fewer factors and yield lower prediction errors than those based on traditional single-beam data.


1992 ◽  
Vol 46 (1) ◽  
pp. 34-43 ◽  
Author(s):  
Tormod Næs ◽  
Tomas Isaksson

This paper presents an application of locally weighted regression (LWR) in diffuse near-infrared transmittance spectroscopy. The data are from beef and pork samples. The LWR method is based on the idea that a nonlinearity can be approximated by local linear equations. Different weight functions (for the samples) as well as different distance measures for “closeness” are tested. The LWR is compared to principal component regression and partial least-squares regression. The LWR with weighted principal components is shown to give the best results. The improvements with respect to linear regression are up to 15% of the prediction errors.


2017 ◽  
Vol 2 (4) ◽  
Author(s):  
Andika Boy Yuliansyah ◽  
Sitti Wajizah ◽  
Samadi Samadi

Abstrak.     Tujuan penelitian ini adalah untuk mengevaluasi akurasi metode analisis pakan dengan metode (Near Infrared Reflectance Sectroscopy) NIRS dalam memprediksi kandungan nutrisi limbah kulit kopi serta mengetahui panjang gelombangnya.  Penelitian ini dilakukan di Laboratorium Ilmu Nutrisi dan Teknologi Pakan, Univeritas Syiah Kuala, dari Agustus hingga September 2017.  Penelitian ini menggunakan 30 sampel limbah kulit kopi yang terdiri dari 2 varietas kopi yaitu kopi arabika (Coffea arabica) dan kopi robusta (Coffea canephora). Spektrum diukur dengan menggunakan yaitu FT-IR IPTEK T-1516 pada rentang wavelengrh 1000-2500 nm dan di kalibrasi dan validasi dengan menggunakan software The Unscrambler X version 10.4.  Pretreatment yang digunakan yaitu Multiplicative scatter analysis (MSC) dan DeTrending (DT) dengan metode regresi Principal Component Regression (PCR). Parameter nutrisi yang dianalisis yaitu bahan kering (BK), protein kasar (PK) dan serat kasar (SK).  Hasil penelitian memperlihatkan bahwa NIRS dengan model yang telah dibangun tidak dapat menprediksi bahan kering dengan baik. Hal ini ditunjukkan dengan nilai r, R2 dan RPD yang rendah (0.58, 0.34 dan 3.06) serta RMSEC yang tinggi (3.06). Metode NIRS dapat memprediksi kandungan PK dan SK dengan baik pada penggunaan pretreatment MSC (PK= r: 0.87, R2: 0.76, RMSEC: 0.45 dan RPD: 2.07; SK= r: 0.87, R2: 0.75, RMSEC: 2.83 dan RPD: 2.03). Prediksi kasar untuk PK dan SK didapatkan dengan menggunakan pretreatment DT (PK= r: 0.75, R2: 0.57, RMSEC: 0.60 dan RPD: 1.55; SK= r: 0.84, R2: 0.71, RMSEC: 3.06 dan RPD: 1.88). Analysis of Coffee Pulp (Coffea sp.) Nutrition Content Using Near Infrared Reflectance Spectroscopy (NIRS) Method Abstract.   The aim of present study was to evaluate the accuration of feed analysis method of Near infrared reflectance spectroscopy (NIRS) in predicting nutritional content of Coffee pulp and to know its wavelength.  The study was conducted in  nutrition science and feed technology Laboratory,   Department of Animal Husbandry,  Faculty of Agriculture,  Syiah Kuala University,  august until september, 2017.   As many as 30 coffee pulps  were used in this study and seperated to 2 specieses of coffee, arabica coffee (Coffea arabica) and robusta coffee (Coffea canephora).  The spectrum was scanned using. FT-IR IPTEK T-1516 at 1000 to 2500 nm wavelength and calibrated and validated using The Unscrambler X version 10.4 software. Pretreatment used in this study was Multiplicative scatter analysis (MSC) dan DeTrending (DT) with Principal component regression (PCR) calibration method. Nutrition parameters analyzed were dry matter (DM), crude protein (CP) and dietary fiber (DF). The results of study showed that NIRS with prediction models that have been build cannot predicted DM content in coffee pulp. This was shown with low value of r, R2 dan RPD (0.58, 0.34 dan 3.06) and high value of RMSEC (3.60). NIRS method can predicted CP and DF content quite well using MSC pretreatment (CP= r: 0.87, R2: 0.76, RMSEC: 0.45 dan RPD: 2.07; DF= r: 0.87, R2: 0.75, RMSEC: 2.83 dan RPD: 2.03). Rough prediction for CP and DM content was obtained by using DT pretreatment (CP= r: 0.75, R2: 0.57, RMSEC: 0.60 dan RPD: 1.55; DF= r: 0.84, R2: 0.71, RMSEC: 3.06 dan RPD: 1.88). 


1991 ◽  
Vol 71 (2) ◽  
pp. 385-392 ◽  
Author(s):  
G. B. Schaalje ◽  
H. -H. Mündel

The accuracy of estimates of plant properties based on near-infrared reflectance spectroscopy (NIRS) varies with many factors including the biological material in question and the method used to calibrate the NIRS instrument. This study investigated the accuracy, relative to Kjeldahl analysis, of NIRS analysis based on two calibration methods in estimating nitrogen concentration of four stages and/or parts of soybean (Glycine max (L.) Merr.) plants. Samples of whole top growth at anthesis, whole top growth at maturity, whole top growth at maturity excluding seeds, and seeds were obtained from two field trials and one phytotron experiment. Two Kjeldahl determinations of nitrogen concentration were obtained for each sample, as well as reflectance values at each of 19 infrared wavelengths, using a Technicon InfraAlyser 400R. Different subsets of the sample data were used for calibration and assessment of accuracy. The instrument was calibrated using stepwise multiple linear regression (SMLR) and principal component regression (PCR). The residual maximum likelihood procedure was useful in showing that NIRS estimates based on either SMLR or PCR were at least as accurate as Kjeldahl estimates for all stages and/or parts except whole top growth at maturity excluding seeds. Key words: Calibration, principal component regression, stepwise regression


2017 ◽  
Vol 25 (5) ◽  
pp. 324-329 ◽  
Author(s):  
Li Dan ◽  
Wu Yi-Hui

The aim of this research was to investigate the feasibility of Fourier transform near infrared spectroscopy combined with chemometric analysis to develop a rapid method for identification of different resin types which had been deemed similar by a preliminary visual examination. Principal component analysis was applied on spectral data to classify two types of epoxy resin samples and three types of phenolic resin samples. In this case, a total of two hundred and fifteen samples were used for the evaluation and validation of two types of epoxy resin samples (SY1342 and SY1346) and three types of phenolic resin samples (Y3567, Y2705 and Y2137). All were correctly differentiated by their respective models. Moreover, in the external validation, the prediction rate of samples correctly classified was also 100%. Such classifications are very important for the detection of adulterated samples and for quality control. Near infrared spectroscopy was shown to be a very reliable, accurate and useful tool to classify resin samples in a fast, clean and inexpensive way compared to classical analysis, and it will enable copper clad laminate manufacturers to detect and take early corrective actions that will ultimately save time and money while establishing a uniform quality.


2003 ◽  
Vol 11 (1) ◽  
pp. 55-70 ◽  
Author(s):  
Laila Stordrange ◽  
Olav M. Kvalheim ◽  
Per A. Hassel ◽  
Dick Malthe-Sørenssen ◽  
Fred Olav Libnau

Partial least squares (PLS) is a powerful tool for multivariate linear regression. But what if the data show a non-linear structure? Near infrared spectra from a pharmaceutical process were used as a case study. An ANOVA test revealed that the data are well described by a 2nd order polynomial. This work investigates the application of regression techniques that account for slightly non-linear data. The regression techniques investigated are: linearising data by applying transformations, local PLS, i.e. splitting of data, and quadratic PLS. These models were compared with ordinary PLS and principal component regression (PCR). The predictive ability of the models was tested on an independent data set acquired a year later. Using the knowledge of non-linear pattern and important spectral regions, simpler models with better predictive ability can be obtained.


Sign in / Sign up

Export Citation Format

Share Document