Application of Support Vector Regression for Simultaneous Modelling of near Infrared Spectra from Multiple Process Steps

2015 ◽  
Vol 23 (2) ◽  
pp. 75-84 ◽  
Author(s):  
Rikke Tange ◽  
Morten Arendt Rasmussen ◽  
Eizo Taira ◽  
Rasmus Bro

Near infrared (NIR) spectroscopy in combination with partial least-squares regression (PLS) is widely applied in process control for non-destructive measurement of quality parameters during production. PLS assumes an approximate linear relationship between the parameter to be estimated and the intensity of its absorption bands. Spectra, however, may contain non-linearities for various reasons such as differences in viscosity, temperature, pH, particle size and chemical composition of the sample matrix. In such cases, PLS might not predict the parameter of interest sufficiently well, and one must find other methods for the calibration task. Support vector machine regression (SVR) has been gaining interest within chemometrics in recent years. SVR is capable of modelling highly non-linear data, also when data are of very high dimensions. The aim of this study was to develop calibration models of NIR spectra from four different process steps in a raw-sugar factory. The models were intended for monitoring two quality parameters at the individual process steps. Our goal was to obtain global calibration models covering all four process steps in order to obtain simple calibration maintenance. SVR was used for the calibration task, since all of the above-mentioned sources of non-linearities were present. SVR results were benchmarked against PLS. SVR had a better prediction performance than PLS (1) for models built on individual process steps, (2) for global models covering all four process steps and (3) when the global models were evaluated on the individual process steps. Moreover, the majority of SVR models had prediction errors close to reference uncertainty and hence were close to being optimal. Finally, the global SVR models predicted the individual process steps better than the corresponding local PLS models. We conclude that the nonlinear modelling method SVR was able to model non-linearities caused by pooling NIR spectra from multiple different process steps. Implementation of the global SVR models would have several advantages over the local PLS models. First, they would allow simple calibration maintenance because only one model per quality parameter would have to be maintained. Second, they would allow more precise estimation of the quality parameters and therefore better process monitoring.

2021 ◽  
Author(s):  
Dongxue Zhao ◽  
Maryem Arshad ◽  
Jie Wang ◽  
John Triantafilis

<p>Due to high rate of nutrient removal by cotton plants, the productive cotton-growing soils of Australia is becoming depleted of exchangeable (exch.) cations. For long-term development, data on exch. calcium (Ca), magnesium (Mg), potassium (K) and sodium (Na) throughout the soil profile is required. However, traditional laboratory analysis is tedious. The visible-near-infrared (Vis-NIR) spectroscopy is an alternative; whereby, spectral libraries are built which couple soil data and Vis-NIR spectra using models. While various models have been used to predict exch. cations, their performance was seldom systematically compared. Moreover, most previous studies have focused on prediction of topsoil (0–0.3 m) exch. cations while the effects of depth on applicability of topsoil spectral libraries are rarely investigated. Our first aim was to determine which model (i.e. partial least squares regression (PLSR), Cubist, random forest (RF), or support vector machine regression (SVMR)) produces the best prediction of topsoil exch. Ca, Mg, K and Na. The second aim was to evaluate if the best topsoil model can be used to predict subsurface (0.3–0.6 m) and subsoil (0.9–1.2 m) cations. The third aim was to explore the effect of spiking on the prediction in subsurface and subsoil. The fourth aim was to see if combining all depths to build a profile spectral library improved prediction. Based on independent validation, PLSR was superior for topsoil exch. cations prediction, while Cubist outperformed PLSR in some cases when spiking was applied, and the profile spectral library was considered. Topsoil PLSR could be applied to predict exch. Ca and Mg in the subsurface and subsoil, while spiking improved prediction. Moreover, a profile spectral library achieved equivalent results with when topsoil samples coupled with spiking were considered. We, therefore, recommended to predict exch. Ca and Mg throughout the profile using topsoil spectral library coupled with spiking approach.</p>


2000 ◽  
Vol 8 (2) ◽  
pp. 109-116 ◽  
Author(s):  
V.H. Segtnan ◽  
T. Isaksson

Several techniques for measuring quality parameters in foods by the use of near infrared (NIR) technology have been reported. The aim of this experiment is to evaluate the main techniques in order to find the optimal measurement conditions for NIR analysis of carbohydrates in fluid food systems. Two different model systems were studied, each system containing 61 designed samples. The first system was designed to give scatter, and was based on a commercial orange juice. The other system was designed to be scatter-free, and was based on distilled water. To all samples were added the same total amounts of glucose, fructose and sucrose, and measured using the following NIR techniques: transmittance measurements in cuvettes, dry extract diffuse reflectance (DESIR), fibre-optic transflectance and fibre-optic transmittance. Calibration models were made by partial least squares regression in the spectral regions 780–2500nm for DESIR measurements, 1100–1315nm for 10mm pathlengths and 1100–1880+2130–2350nm for 1mm pathlengths. The models were fully cross-validated. Optimal prediction errors (Root Mean Square Error of Prediction, Cross-Validated) for DESIR measurements ranged from 0.020 to 0.030% (w/w), while 1mm cuvette values ranged from 0.008 to 0.012%. For these techniques there were only small differences between juice and water samples. Using fibre-optics, 1mm transmittance gave values in the range 0.068–0.081% for juice samples and 0.022–0.066% for water samples, while 1mm transflectance gave 0.044–0.051% for juice samples and 0.045–0.078% for water samples. 10mm pathlengths provided substantially higher prediction errors than 1mm for all techniques investigated. From these results, two main conclusions can be drawn. First, when measuring off-line, direct transmittance measurements in cuvettes gave better prediction results than DESIR. Second, when using fibre-optics, transflectance gave lower prediction errors than transmittance for scattering samples, while transmittance performed better than transflectance for non-scattering samples.


1997 ◽  
Vol 20 (5) ◽  
pp. 285-290 ◽  
Author(s):  
U.A. Müller ◽  
B. Mertes ◽  
C. Fischbacher ◽  
K.U. Jageman ◽  
K. Danzer

The feasibility of using near infrared reflection spectroscopy for non-invasive blood glucose monitoring is discussed. Spectra were obtained using a diode-array spectrometer with a fiberoptic measuring head with a wavelength ranging from 800 nm to 1350 nm. Calibration was performed using partial least-squares regression and radial basis function networks. The results of different methods used to evaluate the quality of the recorded spectra in order to improve the reliability of the calibration models, are presented.


2018 ◽  
Vol 2018 ◽  
pp. 1-12 ◽  
Author(s):  
Sylvio Barbon ◽  
Ana Paula Ayub da Costa Barbon ◽  
Rafael Gomes Mantovani ◽  
Douglas Fernandes Barbin

Identification of chicken quality parameters is often inconsistent, time-consuming, and laborious. Near-infrared (NIR) spectroscopy has been used as a powerful tool for food quality assessment. However, the near-infrared (NIR) spectra comprise a large number of redundant information. Determining wavelengths relevance and selecting subsets for classification and prediction models are mandatory for the development of multispectral systems. A combination of both attribute and wavelength selection for NIR spectral information of chicken meat samples was investigated. Decision Trees and Decision Table predictors exploit these optimal wavelengths for classification tasks according to different quality grades of poultry meat. The proposed methodology was conducted with a support vector machine algorithm (SVM) to compare the precision of the proposed model. Experiments were performed on NIR spectral information (1050 wavelengths), colour (CIEL∗a∗b∗, chroma, and hue), water holding capacity (WHC), and pH of each sample analyzed. Results show that the best method was the REPTree based on 12 wavelengths, allowing for classification of poultry samples according to quality grades with 77.2% precision. The selected wavelengths could lead to potential simple multispectral acquisition devices.


2020 ◽  
pp. 000370282097470
Author(s):  
Joshua M. Ottaway ◽  
J. Chance Carter ◽  
Kristl L Adams ◽  
Joseph Camancho ◽  
Barry Lavine ◽  
...  

The peroxide value (PV) of edible oils is a measure of the degree of oxidation, which directly relates to the freshness of the oil sample. Several studies previously reported in the literature have paired various spectroscopic techniques with multivariate analyses to rapidly determine PVs using field portable and process instrumentation; those efforts presented ‘best-case’ scenarios with oils from narrowly defined training and test sets. The purpose of this paper is to evaluate the use of near- and mid-infrared absorption and Raman scattering spectroscopies on oil samples from different oil classes, including seasonal and vendor variations, to determine which measurement technique, or combination thereof, is best for predicting PVs. Following PV assays of each oil class using an established titration-based method, global and global-subset calibration models were constructed from spectroscopic data collected on the 19 oil classes used in this study. Spectra from each optical technique were used to create partial least squares regression (PLSR) calibration models to predict the PV of unknown oil samples. A global PV model based on near-infrared (8 mm optical path length – OPL) oil measurements produced the lowest RMSEP (4.9), followed by 24 mm OPL near infrared (5.1), Raman (6.9) and 50 μm OPL mid-infrared (7.3). However, it was determined that the Raman RMSEP resulted from chance correlations. Global PV models based on low-level fusion of the NIR (8 and 24 mm OPL) data and all infrared data produced the same RMSEP of 5.1. Global subset models, based on any of the spectroscopies and olive oil training sets from any class (pure, extra light, extra virgin), all failed to extrapolate to the non-olive oils. However, the near-infrared global subset model built on extra virgin olive oil could extrapolate to test samples from other olive oil classes.


2004 ◽  
Vol 34 (1) ◽  
pp. 76-84 ◽  
Author(s):  
Mulualem Tigabu ◽  
Per Christer Odén ◽  
Tong Yun Shen

The use of near-infrared (NIR) spectroscopy to discriminate between uninfested seeds of Picea abies (L.) Karst and seeds infested with Plemeliella abietina Seitn (Hymenoptera, Torymidae) larva is sensitive to seed origin and year of collection. Five seed lots collected during different years from Sweden, Finland, and Belarus were used in this study. Initially, seeds were classified as infested or uninfested with X-radiography, and then, NIR spectra from single seeds were collected with a NIR spectrometer from 1100 to 2498 nm with a resolution of 2 nm. Discriminant models were derived by partial least squares regression using raw and orthogonal signal corrected spectra (OSC). The resulting OSC model developed on a pooled data set was more robust than the raw model and resulted in 100% classification accuracy. Once irrelevant spectral variations were removed by using OSC pretreatment, single-lot calibration models resulted in similar classification rates for the new samples irrespective of origin and year of collection. Dis criminant analyses performed with selected NIR absorption bands also gave nearly 100% classification rate for new samples. The origin of spectral differences between infested and uninfested seeds was attributed to storage lipids and proteins that were completely depleted in the former by the feeding larva.


2020 ◽  
Vol 12 (4) ◽  
pp. 1476 ◽  
Author(s):  
Lei Han ◽  
Rui Chen ◽  
Huili Zhu ◽  
Yonghua Zhao ◽  
Zhao Liu ◽  
...  

Soil arsenic (AS) contamination has attracted a great deal of attention because of its detrimental effects on environments and humans. AS and inorganic AS compounds have been classified as a class of carcinogens by the World Health Organization. In order to select a high-precision method for predicting the soil AS content using hyperspectral techniques, we collected 90 soil samples from six different land use types to obtain the soil AS content by chemical analysis and hyperspectral data based on an indoor hyperspectral experiment. A partial least squares regression (PLSR), a support vector regression (SVR), and a back propagation neural network (BPNN) were used to establish a relationship between the hyperspectral and the soil AS content to predict the soil AS content. In addition, the feasibility and modeling accuracy of different interval spectral resampling, different spectral pretreatment methods, feature bands, and full-band were compared and discussed to explore the best inversion method for estimating soil AS content by hyperspectral. The results show that 10 nm + second derivative (SD) + BPNN is the optimum method to predict soil AS content estimation; R v 2 is 0.846 and residual predictive deviation (RPD) is 2.536. These results can expand the representativeness and practicability of the model to a certain extent and provide a scientific basis and technical reference for soil pollution monitoring.


Molecules ◽  
2019 ◽  
Vol 24 (11) ◽  
pp. 2029 ◽  
Author(s):  
Marina D. G. Neves ◽  
Ronei J. Poppi ◽  
Heinz W. Siesler

Nowadays, near infrared (NIR) spectroscopy has experienced a rapid progress in miniaturization (instruments < 100 g are presently available), and the price for handheld systems has reached the < $500 level for high lot sizes. Thus, the stage is set for NIR spectroscopy to become the technique of choice for food and beverage testing, not only in industry but also as a consumer application. However, contrary to the (in our opinion) exaggerated claims of some direct-to-consumer companies regarding the performance of their “food scanners” with “cloud evaluation of big data”, the present publication will demonstrate realistic analytical data derived from the development of partial least squares (PLS) calibration models for six different nutritional parameters (energy, protein, fat, carbohydrates, sugar, and fiber) based on the NIR spectra of a broad range of different pasta/sauce blends recorded with a handheld instrument. The prediction performance of the PLS calibration models for the individual parameters was double-checked by cross-validation (CV) and test-set validation. The results obtained suggest that in the near future consumers will be able to predict the nutritional parameters of their meals by using handheld NIR spectroscopy under every-day life conditions.


Soil Research ◽  
2011 ◽  
Vol 49 (2) ◽  
pp. 166 ◽  
Author(s):  
Yongni Shao ◽  
Yong He

The aim of this study was to investigate the potential of the infrared spectroscopy technique for non-destructive measurement of soil properties. For the study, 280 soil samples were collected from several regions in Zhejiang, China. Data from near infrared (NIR, 800–2500 nm), mid infrared (MIR, 4000–400 cm–1), and the combined NIR–MIR regions were compared to determine which produced the best prediction of soil properties. Least-squares support vector machines (LS-SVM) were applied to construct calibration models for soil properties such as available nitrogen (N), phosphorus (P), and potassium (K). The results showed that both spectral regions contained substantial information on N, P, and K in the soils studied, and the combined NIR–MIR region did a little worse than either the NIR or MIR region. Optimal results were obtained through LS-SVM compared with the standard partial least-squares regression method, and the correlation coefficient of prediction (rp), root mean square error for prediction, and bias were, respectively, 0.90, 16.28 mg/kg, and 0.96 mg/kg for the prediction results of N in the NIR region; and 0.88, 41.62 mg/kg, and –2.28 mg/kg for the prediction results of P, and 0.89, 33.47 mg/kg, and 2.96 mg/kg for the prediction results of K, both in the MIR region. This work demonstrated the potential of LS-SVM coupled to infrared reflectance spectroscopy for more efficient soil analysis and the acquisition of soil information.


Sign in / Sign up

Export Citation Format

Share Document