Insect Protein Content Analysis in Handcrafted Fitness Bars by NIR Spectroscopy. Gaussian Process Regression and Data Fusion for Performance Enhancement of Miniaturized Cost-Effective Consumer-Grade Sensors

Future food supply will become increasingly dependent on edible material extracted from insects. The growing popularity of artisanal food products enhanced by insect proteins creates particular needs for establishing effective methods for quality control. This study focuses on developing rapid and efficient on-site quantitative analysis of protein content in handcrafted insect bars by miniaturized near-infrared (NIR) spectrometers. Benchtop (Büchi NIRFlex N-500) and three miniaturized (MicroNIR 1700 ES, Tellspec Enterprise Sensor and SCiO Sensor) in hyphenation to partial least squares regression (PLSR) and Gaussian process regression (GPR) calibration methods and data fusion concept were evaluated via test-set validation in performance of protein content analysis. These NIR spectrometers markedly differ by technical principles, operational characteristics and cost-effectiveness. In the non-destructive analysis of intact bars, the root mean square error of cross prediction (RMSEP) values were 0.611% (benchtop) and 0.545–0.659% (miniaturized) with PLSR, and 0.506% (benchtop) and 0.482–0.580% (miniaturized) with GPR calibration, while the analyzed total protein content was 19.3–23.0%. For milled samples, with PLSR the RMSEP values improved to 0.210% for benchtop spectrometer but remained in the inferior range of 0.525–0.571% for the miniaturized ones. GPR calibration improved the predictive performance of the miniaturized spectrometers, with RMSEP values of 0.230% (MicroNIR 1700 ES), 0.326% (Tellspec) and 0.338% (SCiO). Furthermore, Tellspec and SCiO sensors are consumer-oriented devices, and their combined use for enhanced performance remains a viable economical choice. With GPR calibration and test-set validation performed for fused (Tellspec + SCiO) data, the RMSEP values were improved to 0.517% (in the analysis of intact samples) and 0.295% (for milled samples).

Download Full-text

The CARMENES search for exoplanets around M dwarfs

Astronomy and Astrophysics ◽

10.1051/0004-6361/201834483 ◽

2019 ◽

Vol 623 ◽

pp. A24 ◽

Cited By ~ 5

Author(s):

B. Fuhrmeister ◽

S. Czesla ◽

J. H. M. M. Schmitt ◽

E. N. Johnson ◽

P. Schöfer ◽

...

Keyword(s):

Gaussian Process ◽

Near Infrared ◽

Rotation Period ◽

Gaussian Process Regression ◽

Process Models ◽

M Dwarfs ◽

String Length ◽

Phase Dispersion ◽

Rotation Periods ◽

Gaussian Process Models

We use spectra from CARMENES, the Calar Alto high-Resolution search for M dwarfs with Exo-earths with Near-infrared and optical Echelle Spectrographs, to search for periods in chromospheric indices in 16 M0–M2 dwarfs. We measure spectral indices in the Hα, the Ca II infrared triplet (IRT), and the Na I D lines to study which of these indices are best-suited to finding rotation periods in these stars. Moreover, we test a number of different period-search algorithms, namely the string length method, the phase dispersion minimisation, the generalized Lomb–Scargle periodogram, and the Gaussian process regression with quasi-periodic kernel. We find periods in four stars using Hα and in five stars using the Ca II IRT, two of which have not been found before. Our results show that both Hα and the Ca II IRT lines are well suited for period searches, with the Ca II IRT index performing slightly better than Hα. Unfortunately, the Na I D lines are strongly affected by telluric airglow, and we could not find any rotation period using this index. Further, different definitions of the line indices have no major impact on the results. Comparing the different search methods, the string length method and the phase dispersion minimisation perform worst, while Gaussian process models produce the smallest numbers of false positives and non-detections.

Download Full-text

Multifidelity Data Fusion via Gradient-Enhanced Gaussian Process Regression

Communications in Computational Physics ◽

10.4208/cicp.oa-2020-0151 ◽

2020 ◽

Vol 28 (5) ◽

pp. 1812-1837

Author(s):

Yixiang Deng

Keyword(s):

Data Fusion ◽

Gaussian Process ◽

Gaussian Process Regression

Download Full-text

A screening-based gradient-enhanced Gaussian process regression model for multi-fidelity data fusion

Advanced Engineering Informatics ◽

10.1016/j.aei.2021.101437 ◽

2021 ◽

Vol 50 ◽

pp. 101437

Author(s):

Quan Lin ◽

Dawei Hu ◽

Jiexiang Hu ◽

Yuansheng Cheng ◽

Qi Zhou

Keyword(s):

Data Fusion ◽

Regression Model ◽

Gaussian Process ◽

Gaussian Process Regression

Download Full-text

Patterned polycaprolactone-filled glass microfiber microfluidic devices for total protein content analysis

Talanta ◽

10.1016/j.talanta.2017.08.031 ◽

2018 ◽

Vol 176 ◽

pp. 589-594 ◽

Cited By ~ 12

Author(s):

Gayan C. Bandara ◽

Christopher A. Heist ◽

Vincent T. Remcho

Keyword(s):

Content Analysis ◽

Protein Content ◽

Total Protein ◽

Microfluidic Devices ◽

Total Protein Content

Download Full-text

Building global models for fat and total protein content in raw milk based on historical spectroscopic data in the visible and short-wave near infrared range

Food Chemistry ◽

10.1016/j.foodchem.2016.01.127 ◽

2016 ◽

Vol 203 ◽

pp. 190-198 ◽

Cited By ~ 19

Author(s):

Anastasiia Melenteva ◽

Vladislav Galyanin ◽

Elena Savenkova ◽

Andrey Bogomolov

Keyword(s):

Protein Content ◽

Total Protein ◽

Spectroscopic Data ◽

Near Infrared ◽

Raw Milk ◽

Short Wave ◽

Infrared Range ◽

Total Protein Content ◽

Global Models ◽

Near Infrared Range

Download Full-text

EFFECTS OF EXTERNAL γ-RADIATION ON THE TOTAL PROTEIN CONTENT IN LYMPHOCYTES AND THROMBOCYTES OF SHEEP

Sel skokhozyaistvennaya Biologiya ◽

10.15389/agrobiology.2013.4.115eng ◽

2013 ◽

pp. 115-120

Author(s):

T.S. Shevchenko ◽

Keyword(s):

Protein Content ◽

Total Protein ◽

Total Protein Content ◽

Γ Radiation

Download Full-text

Using Gaussian Process Regression to Integrate the Transition Structure Factor Curve for the Many-Body Correlation Energy

10.26226/morressier.5fa409874d4e91fe5c54b97a ◽

2020 ◽

Author(s):

Laura Weiler

Keyword(s):

Gaussian Process ◽

Structure Factor ◽

Correlation Energy ◽

Gaussian Process Regression ◽

Many Body ◽

Transition Structure ◽

The Many ◽

Structure Factor Curve ◽

Body Correlation

Download Full-text

Exchange Spin Coupling from Gaussian Process Regression

10.26434/chemrxiv.12589541.v3 ◽

2020 ◽

Author(s):

Marc Philipp Bahlke ◽

Natnael Mogos ◽

Jonny Proppe ◽

Carmen Herrmann

Keyword(s):

Machine Learning ◽

Gaussian Process ◽

Gaussian Process Regression ◽

Molecular Magnets ◽

Molecular Structures ◽

Spin Coupling ◽

Structure Property ◽

Data Set ◽

Uncertainty Estimates

Heisenberg exchange spin coupling between metal centers is essential for describing and understanding the electronic structure of many molecular catalysts, metalloenzymes, and molecular magnets for potential application in information technology. We explore the machine-learnability of exchange spin coupling, which has not been studied yet. We employ Gaussian process regression since it can potentially deal with small training sets (as likely associated with the rather complex molecular structures required for exploring spin coupling) and since it provides uncertainty estimates (“error bars”) along with predicted values. We compare a range of descriptors and kernels for 257 small dicopper complexes and find that a simple descriptor based on chemical intuition, consisting only of copper-bridge angles and copper-copper distances, clearly outperforms several more sophisticated descriptors when it comes to extrapolating towards larger experimentally relevant complexes. Exchange spin coupling is similarly easy to learn as the polarizability, while learning dipole moments is much harder. The strength of the sophisticated descriptors lies in their ability to linearize structure-property relationships, to the point that a simple linear ridge regression performs just as well as the kernel-based machine-learning model for our small dicopper data set. The superior extrapolation performance of the simple descriptor is unique to exchange spin coupling, reinforcing the crucial role of choosing a suitable descriptor, and highlighting the interesting question of the role of chemical intuition vs. systematic or automated selection of features for machine learning in chemistry and material science.

Download Full-text

SAMPL6 Challenge Results from pKa Predictions Based on a General Gaussian Process Model

10.26434/chemrxiv.6406505.v2 ◽

2018 ◽

Author(s):

Caitlin C. Bannan ◽

David Mobley ◽

A. Geoff Skillman

Keyword(s):

Gaussian Process ◽

Process Model ◽

Molecular Graph ◽

Gaussian Process Regression ◽

Ionization State ◽

Training Set ◽

Physiochemical Properties ◽

Quantile Plots ◽

Physical And Chemical ◽

Good Agreement

<div>A variety of fields would benefit from accurate pK<sub>a</sub> predictions, especially drug design due to the affect a change in ionization state can have on a molecules physiochemical properties.</div><div>Participants in the recent SAMPL6 blind challenge were asked to submit predictions for microscopic and macroscopic pK<sub>a</sub>s of 24 drug like small molecules.</div><div>We recently built a general model for predicting pK<sub>a</sub>s using a Gaussian process regression trained using physical and chemical features of each ionizable group.</div><div>Our pipeline takes a molecular graph and uses the OpenEye Toolkits to calculate features describing the removal of a proton.</div><div>These features are fed into a Scikit-learn Gaussian process to predict microscopic pK<sub>a</sub>s which are then used to analytically determine macroscopic pK<sub>a</sub>s.</div><div>Our Gaussian process is trained on a set of 2,700 macroscopic pK<sub>a</sub>s from monoprotic and select diprotic molecules.</div><div>Here, we share our results for microscopic and macroscopic predictions in the SAMPL6 challenge.</div><div>Overall, we ranked in the middle of the pack compared to other participants, but our fairly good agreement with experiment is still promising considering the challenge molecules are chemically diverse and often polyprotic while our training set is predominately monoprotic.</div><div>Of particular importance to us when building this model was to include an uncertainty estimate based on the chemistry of the molecule that would reflect the likely accuracy of our prediction. </div><div>Our model reports large uncertainties for the molecules that appear to have chemistry outside our domain of applicability, along with good agreement in quantile-quantile plots, indicating it can predict its own accuracy.</div><div>The challenge highlighted a variety of means to improve our model, including adding more polyprotic molecules to our training set and more carefully considering what functional groups we do or do not identify as ionizable. </div>

Download Full-text

Gaussian Process Regression for Estimating Wind Speed From X-band Marine Radar Images

OCEANS 2018 MTS/IEEE Charleston ◽

10.1109/oceans.2018.8604842 ◽

2018 ◽

Author(s):

Xinwei Chen ◽

Weimin Huang

Keyword(s):

Wind Speed ◽

Gaussian Process ◽

Gaussian Process Regression ◽

Radar Images ◽

X Band ◽

Marine Radar

Download Full-text