scholarly journals Insect Protein Content Analysis in Handcrafted Fitness Bars by NIR Spectroscopy. Gaussian Process Regression and Data Fusion for Performance Enhancement of Miniaturized Cost-Effective Consumer-Grade Sensors

Molecules ◽  
2021 ◽  
Vol 26 (21) ◽  
pp. 6390
Author(s):  
Krzysztof B. Beć ◽  
Justyna Grabska ◽  
Nicole Plewka ◽  
Christian W. Huck

Future food supply will become increasingly dependent on edible material extracted from insects. The growing popularity of artisanal food products enhanced by insect proteins creates particular needs for establishing effective methods for quality control. This study focuses on developing rapid and efficient on-site quantitative analysis of protein content in handcrafted insect bars by miniaturized near-infrared (NIR) spectrometers. Benchtop (Büchi NIRFlex N-500) and three miniaturized (MicroNIR 1700 ES, Tellspec Enterprise Sensor and SCiO Sensor) in hyphenation to partial least squares regression (PLSR) and Gaussian process regression (GPR) calibration methods and data fusion concept were evaluated via test-set validation in performance of protein content analysis. These NIR spectrometers markedly differ by technical principles, operational characteristics and cost-effectiveness. In the non-destructive analysis of intact bars, the root mean square error of cross prediction (RMSEP) values were 0.611% (benchtop) and 0.545–0.659% (miniaturized) with PLSR, and 0.506% (benchtop) and 0.482–0.580% (miniaturized) with GPR calibration, while the analyzed total protein content was 19.3–23.0%. For milled samples, with PLSR the RMSEP values improved to 0.210% for benchtop spectrometer but remained in the inferior range of 0.525–0.571% for the miniaturized ones. GPR calibration improved the predictive performance of the miniaturized spectrometers, with RMSEP values of 0.230% (MicroNIR 1700 ES), 0.326% (Tellspec) and 0.338% (SCiO). Furthermore, Tellspec and SCiO sensors are consumer-oriented devices, and their combined use for enhanced performance remains a viable economical choice. With GPR calibration and test-set validation performed for fused (Tellspec + SCiO) data, the RMSEP values were improved to 0.517% (in the analysis of intact samples) and 0.295% (for milled samples).

2019 ◽  
Vol 623 ◽  
pp. A24 ◽  
Author(s):  
B. Fuhrmeister ◽  
S. Czesla ◽  
J. H. M. M. Schmitt ◽  
E. N. Johnson ◽  
P. Schöfer ◽  
...  

We use spectra from CARMENES, the Calar Alto high-Resolution search for M dwarfs with Exo-earths with Near-infrared and optical Echelle Spectrographs, to search for periods in chromospheric indices in 16 M0–M2 dwarfs. We measure spectral indices in the Hα, the Ca II infrared triplet (IRT), and the Na I D lines to study which of these indices are best-suited to finding rotation periods in these stars. Moreover, we test a number of different period-search algorithms, namely the string length method, the phase dispersion minimisation, the generalized Lomb–Scargle periodogram, and the Gaussian process regression with quasi-periodic kernel. We find periods in four stars using Hα and in five stars using the Ca II IRT, two of which have not been found before. Our results show that both Hα and the Ca II IRT lines are well suited for period searches, with the Ca II IRT index performing slightly better than Hα. Unfortunately, the Na I D lines are strongly affected by telluric airglow, and we could not find any rotation period using this index. Further, different definitions of the line indices have no major impact on the results. Comparing the different search methods, the string length method and the phase dispersion minimisation perform worst, while Gaussian process models produce the smallest numbers of false positives and non-detections.


2021 ◽  
Vol 50 ◽  
pp. 101437
Author(s):  
Quan Lin ◽  
Dawei Hu ◽  
Jiexiang Hu ◽  
Yuansheng Cheng ◽  
Qi Zhou

2020 ◽  
Author(s):  
Marc Philipp Bahlke ◽  
Natnael Mogos ◽  
Jonny Proppe ◽  
Carmen Herrmann

Heisenberg exchange spin coupling between metal centers is essential for describing and understanding the electronic structure of many molecular catalysts, metalloenzymes, and molecular magnets for potential application in information technology. We explore the machine-learnability of exchange spin coupling, which has not been studied yet. We employ Gaussian process regression since it can potentially deal with small training sets (as likely associated with the rather complex molecular structures required for exploring spin coupling) and since it provides uncertainty estimates (“error bars”) along with predicted values. We compare a range of descriptors and kernels for 257 small dicopper complexes and find that a simple descriptor based on chemical intuition, consisting only of copper-bridge angles and copper-copper distances, clearly outperforms several more sophisticated descriptors when it comes to extrapolating towards larger experimentally relevant complexes. Exchange spin coupling is similarly easy to learn as the polarizability, while learning dipole moments is much harder. The strength of the sophisticated descriptors lies in their ability to linearize structure-property relationships, to the point that a simple linear ridge regression performs just as well as the kernel-based machine-learning model for our small dicopper data set. The superior extrapolation performance of the simple descriptor is unique to exchange spin coupling, reinforcing the crucial role of choosing a suitable descriptor, and highlighting the interesting question of the role of chemical intuition vs. systematic or automated selection of features for machine learning in chemistry and material science.


2018 ◽  
Author(s):  
Caitlin C. Bannan ◽  
David Mobley ◽  
A. Geoff Skillman

<div>A variety of fields would benefit from accurate pK<sub>a</sub> predictions, especially drug design due to the affect a change in ionization state can have on a molecules physiochemical properties.</div><div>Participants in the recent SAMPL6 blind challenge were asked to submit predictions for microscopic and macroscopic pK<sub>a</sub>s of 24 drug like small molecules.</div><div>We recently built a general model for predicting pK<sub>a</sub>s using a Gaussian process regression trained using physical and chemical features of each ionizable group.</div><div>Our pipeline takes a molecular graph and uses the OpenEye Toolkits to calculate features describing the removal of a proton.</div><div>These features are fed into a Scikit-learn Gaussian process to predict microscopic pK<sub>a</sub>s which are then used to analytically determine macroscopic pK<sub>a</sub>s.</div><div>Our Gaussian process is trained on a set of 2,700 macroscopic pK<sub>a</sub>s from monoprotic and select diprotic molecules.</div><div>Here, we share our results for microscopic and macroscopic predictions in the SAMPL6 challenge.</div><div>Overall, we ranked in the middle of the pack compared to other participants, but our fairly good agreement with experiment is still promising considering the challenge molecules are chemically diverse and often polyprotic while our training set is predominately monoprotic.</div><div>Of particular importance to us when building this model was to include an uncertainty estimate based on the chemistry of the molecule that would reflect the likely accuracy of our prediction. </div><div>Our model reports large uncertainties for the molecules that appear to have chemistry outside our domain of applicability, along with good agreement in quantile-quantile plots, indicating it can predict its own accuracy.</div><div>The challenge highlighted a variety of means to improve our model, including adding more polyprotic molecules to our training set and more carefully considering what functional groups we do or do not identify as ionizable. </div>


Sign in / Sign up

Export Citation Format

Share Document