scholarly journals National spectral data and learning algorithms for potentially toxic elements modelling in forest soil horizons

2020 ◽  
Author(s):  
Asa Gholizadeh ◽  
Mohammadmehdi Saberioon ◽  
Eyal Ben Dor ◽  
Raphael A. Viscarra Rossel ◽  
Lubos Boruvka

Forest ecosystems are among the main parts of the biosphere; however, they have been endangered from the significant elevation and harmful effects of air and soil pollutants, including potentially toxic elements (PTEs). The concentration of PTEs in forest soils varies not only laterally but also vertically with depth. Forest surface organic horizons are of particular interest in forest ecosystem monitoring due to their role as stable adsorbents of the deposited atmospheric substances. Therefore, the main purpose of this study was to conduct rapid examinations of forest soils PTEs (Cr, Cu, Pb, Zn, and Al), testing the capability of VIS--NIR spectroscopy coupled with machine learning (ML) techniques (partial least square regression (PLSR), support vector machine regression (SVMR), and random forest (RF)) and fully connected neural network (FNN), a deep learning (DL) approach, in forest organic horizons. One-thousand-and-eighty forested sites across the Czech Republic at two soil layers, defining the fragmented (F) and humus (H) organic horizons, were investigated (total 2160 samples). PTEs as well as total Fe and SOC, as auxiliary data, were conventionally and spectrally determined and modelled in the combined organic horizons (F + H) and in each individual horizon using the ML and DL algorithms. Results indicated that the concentration of all PTEs was higher in the horizon H compared to the F horizon. Although the spectral reflectance of samples tended to decrease with increased PTEs concentration. Strongly significant positive correlations between all PTEs and total Fe in all horizons were obtained, which were higher in the H and F + H horizons than the F horizon. The highest correlations of PTEs with the spectra were at 460--590~nm, which is mostly linked to the presence of Fe-oxide. These results show the importance of Fe for spectral prediction of PTEs. Cr and Al were the most accurately predicted elements, regardless of the applied learning technique. SVMR provided the best results in assessing the H horizon (e.g., R\(^2\) = 0.88 and root mean square error (RMSE) = 3.01~mg/kg, and R\(^2\) = 0.82 and RMSE = 1682.25~mg/kg for Cr and Al, respectively); however, FNN predicted the combined F + H horizons the best (R\(^2\) = 0.89 and RMSE = 2.95~mg/kg, and R\(^2\) = 0.86 and RMSE = 1593.64~mg/kg for Cr and Al, respectively) due to the larger number of samples. In the F horizon, almost no parameters were predicted adequately. This study shows that given the availability of larger sample sizes, FNN can be a more promising technique compared to ML methods for assessment of Cr and Al concentration based on national spectral data in the forests of the Czech Republic.

2020 ◽  
Author(s):  
Asa Gholizadeh ◽  
Raphael Viscarra Rossel ◽  
Mohammadmehdi Saberioon ◽  
Lubos Boruvka ◽  
Lenka Pavlu

<p>Any strategy to change Carbon (C) pool would have a substantial effect on functionality of numerous ecosystem functions, detachment of Soil Organic Carbon (SOC), atmospheric carbon dioxide (CO<sub>2</sub>) concentration, and climate change mitigation. As the largest amount of the world’s C is stored in forests soils, the importance of forest SOC management is highlighted. Total SOC in forest varies not only laterally but also vertically with depth; however, the SOC storage of lower soil horizons have not been investigated enough despite their potential to frame our understanding of soil functioning. Visible–Near Infrared (vis–NIR) reflectance spectroscopy enables rapid examinations of the horizontal distribution of forest SOC, overcoming limitations of traditional soil assessment. This study aims to evaluate the potential of vis–NIR spectroscopy for characterizing the SOC contents of organic and mineral horizons in forests. We investigated 1080 forested sites across the Czech Republic at five individual soil layers, representing the Litter (L), Fragmented (F), and Humus (H) organic horizons, and the A<sub>1</sub> (depth of 2–10 cm) and A<sub>2</sub> (depth of 10–40 cm) mineral horizons (total 5400 samples). We then used Support Vector Machine (SVM) to model the SOC contents of (i) the profile (all organic and mineral horizons together), (ii) the combined organic horizons, (iii) the combined mineral horizons, and (iv) each individual horizon separately. The models were validated using 10-repeated 10-fold cross validation. Results showed that there was at least more than seven times as much SOC in the combined organic horizons compared to the combined mineral horizons with more variation in deeper layers. All individual horizons’ SOC was successfully predicted with low error and R<sup>2</sup> values higher than 0.63; however, the prediction accuracy of F and A<sub>1</sub> was greater compared to others (R<sup>2</sup> > 0.70 and very low-biased spatial estimates). We have shown that modelling of SOC with vis–NIR spectra in different soil horizons of highly heterogeneous forests of the Czech Republic is practical.</p>


Author(s):  
Prince Chapman Agyeman ◽  
Samuel Kudjo Ahado ◽  
John Kingsley ◽  
Ndiye Michael Kebonye ◽  
James Kobina Mensah Biney ◽  
...  

2020 ◽  
Author(s):  
Asa Gholizadeh ◽  
Raphael A. Viscarra Rossel ◽  
Mohammadmehdi Saberioon ◽  
Josef Kratina ◽  
Lubos Boruvka ◽  
...  

Any strategy to change the Carbon (C) pool has a substantial effect on the functionality of numerous ecosystem functions, the detachment of Soil Organic Carbon (SOC), the atmospheric carbon dioxide (CO2) concentration, and climate change mitigation. As the largest amount of the world's C is stored in forests soils, the importance of forest SOC management is highlighted. The total SOC in a forest varies not only laterally, but also vertically (i.e., with depth). However, the SOC storage of different forest soil horizons has not been investigated in a national scale thoroughly, despite their potential to frame our understanding of soil function. Visible--Near Infrared (vis--NIR) reflectance spectroscopy enables rapid examination of the horizontal distribution of forest SOC, overcoming the limitations of traditional soil assessment methods. This study aims to evaluate the potential of vis--NIR spectroscopy in characterizing and predicting the SOC content of organic and mineral horizons in forests. We investigate 1080 forested sites across the Czech Republic at five individual soil layers, representing the Litter (L), Fragmented (F), and Humus (H) organic horizons, as well as the A1 (depth of 2--10 cm) and A2 (depth of 10--40 cm) mineral horizons (for a total of 5400 samples). We, then, use Support Vector Machines (SVMs) to classify the soil horizons based on their spectra and also to predict the SOC content of (i) the profile (all organic and mineral horizons together), (ii) the combined organic horizons, (iii) the combined mineral horizons, and (iv) each individual horizon separately. The models are validated using 10-repeated 10-fold cross validation. The results show that there is at least more than seven times as much SOC in the combined organic horizons, compared to the combined mineral horizons, with more variation in the deeper layers. The SVM with radial based kernel is a reliable classifier for classification of soil horizons, with Correct Classification Rate (CCR) of 70% and Kappa coefficient of 0.63. All individual horizon SOCs are successfully predicted with low error and with R2 values higher than 0.63. However, the prediction accuracies of the F and A1 models are greater, compared to others (R2~0.70 and very low-biased spatial estimates). We conclude that the modelling of SOC with vis--NIR spectra in different soil horizons of highly heterogeneous forests in the Czech Republic is practical. This study provides an example of how general pedological knowledge can be used to define depth functions of SOC for forested sites.


2021 ◽  
Vol 2021 ◽  
pp. 1-9
Author(s):  
Dan Peng ◽  
Yali Liu ◽  
Jiasheng Yang ◽  
Yanlan Bi ◽  
Jingnan Chen

The rapid and accurate detection of the moisture content is of great significance to the quality evaluation and oil extraction process of walnut kernel. Near-infrared (NIR) spectroscopy is an ideal method for measuring the moisture content in walnut kernel. In this study, a regression model for moisture content in walnut kernel was developed based on NIR diffuse reflectance spectroscopy using chemometric methods. The different spectral pretreatment methods were adopted to preprocess the original spectral data. The whole spectra band was divided into 5 subbands, 10 subbands, 15 subbands, and 20 subbands to screen specific wavelengths relevant to the walnut kernel moisture content. PLS (partial least square regression), MLR (multivariate linear regression), PCR (principle component regression), and SVR (support vector regression) were used to establish the relationship model between the spectral data and measurement values of the moisture content. In comparison, the optimized modeling conditions were determined as follows: detection wavelength 1349–1490 nm, SNV-FD (standard normal variate transformation and first derivative) preprocessing method, and PLS algorithm. Under these conditions, the square correlation coefficient (R2) and root mean square error of prediction (RMSEP) of the prediction model were 0.9865 and 0.0017, respectively. The results of this study provided a feasible method for the rapid detection of moisture content in walnut kernel. To improve the performance and applicability of the model, it is necessary to continuously expand the size of the sample set.


2020 ◽  
Vol 16 ◽  
Author(s):  
Linqi Liu ◽  
JInhua Luo ◽  
Chenxi Zhao ◽  
Bingxue Zhang ◽  
Wei Fan ◽  
...  

BACKGROUND: Measuring medicinal compounds to evaluate their quality and efficacy has been recognized as a useful approach in treatment. Rhubarb anthraquinones compounds (mainly including aloe-emodin, rhein, emodin, chrysophanol and physcion) are its main effective components as purgating drug. In the current Chinese Pharmacopoeia, the total anthraquinones content is designated as its quantitative quality and control index while the content of each compound has not been specified. METHODS: On the basis of forty rhubarb samples, the correlation models between the near infrared spectra and UPLC analysis data were constructed using support vector machine (SVM) and partial least square (PLS) methods according to Kennard and Stone algorithm for dividing the calibration/prediction datasets. Good models mean they have high correlation coefficients (R2) and low root mean squared error of prediction (RMSEP) values. RESULTS: The models constructed by SVM have much better performance than those by PLS methods. The SVM models have high R2 of 0.8951, 0.9738, 0.9849, 0.9779, 0.9411 and 0.9862 that correspond to aloe-emodin, rhein, emodin, chrysophanol, physcion and total anthraquinones contents, respectively. The corresponding RMSEPs are 0.3592, 0.4182, 0.4508, 0.7121, 0.8365 and 1.7910, respectively. 75% of the predicted results have relative differences being lower than 10%. As for rhein and total anthraquinones, all of the predicted results have relative differences being lower than 10%. CONCLUSION: The nonlinear models constructed by SVM showed good performances with predicted values close to the experimental values. This can perform the rapid determination of the main medicinal ingredients in rhubarb medicinal materials.


2021 ◽  
Vol 13 (4) ◽  
pp. 641
Author(s):  
Gopal Ramdas Mahajan ◽  
Bappa Das ◽  
Dayesh Murgaokar ◽  
Ittai Herrmann ◽  
Katja Berger ◽  
...  

Conventional methods of plant nutrient estimation for nutrient management need a huge number of leaf or tissue samples and extensive chemical analysis, which is time-consuming and expensive. Remote sensing is a viable tool to estimate the plant’s nutritional status to determine the appropriate amounts of fertilizer inputs. The aim of the study was to use remote sensing to characterize the foliar nutrient status of mango through the development of spectral indices, multivariate analysis, chemometrics, and machine learning modeling of the spectral data. A spectral database within the 350–1050 nm wavelength range of the leaf samples and leaf nutrients were analyzed for the development of spectral indices and multivariate model development. The normalized difference and ratio spectral indices and multivariate models–partial least square regression (PLSR), principal component regression, and support vector regression (SVR) were ineffective in predicting any of the leaf nutrients. An approach of using PLSR-combined machine learning models was found to be the best to predict most of the nutrients. Based on the independent validation performance and summed ranks, the best performing models were cubist (R2 ≥ 0.91, the ratio of performance to deviation (RPD) ≥ 3.3, and the ratio of performance to interquartile distance (RPIQ) ≥ 3.71) for nitrogen, phosphorus, potassium, and zinc, SVR (R2 ≥ 0.88, RPD ≥ 2.73, RPIQ ≥ 3.31) for calcium, iron, copper, boron, and elastic net (R2 ≥ 0.95, RPD ≥ 4.47, RPIQ ≥ 6.11) for magnesium and sulfur. The results of the study revealed the potential of using hyperspectral remote sensing data for non-destructive estimation of mango leaf macro- and micro-nutrients. The developed approach is suggested to be employed within operational retrieval workflows for precision management of mango orchard nutrients.


2021 ◽  
Vol 11 (2) ◽  
pp. 618
Author(s):  
Tanvir Tazul Islam ◽  
Md Sajid Ahmed ◽  
Md Hassanuzzaman ◽  
Syed Athar Bin Amir ◽  
Tanzilur Rahman

Diabetes is a chronic illness that affects millions of people worldwide and requires regular monitoring of a patient’s blood glucose level. Currently, blood glucose is monitored by a minimally invasive process where a small droplet of blood is extracted and passed to a glucometer—however, this process is uncomfortable for the patient. In this paper, a smartphone video-based noninvasive technique is proposed for the quantitative estimation of glucose levels in the blood. The videos are collected steadily from the tip of the subject’s finger using smartphone cameras and subsequently converted into a Photoplethysmography (PPG) signal. A Gaussian filter is applied on top of the Asymmetric Least Square (ALS) method to remove high-frequency noise, optical noise, and motion interference from the raw PPG signal. These preprocessed signals are then used for extracting signal features such as systolic and diastolic peaks, the time differences between consecutive peaks (DelT), first derivative, and second derivative peaks. Finally, the features are fed into Principal Component Regression (PCR), Partial Least Square Regression (PLS), Support Vector Regression (SVR) and Random Forest Regression (RFR) models for the prediction of glucose level. Out of the four statistical learning techniques used, the PLS model, when applied to an unbiased dataset, has the lowest standard error of prediction (SEP) at 17.02 mg/dL.


Metabolites ◽  
2022 ◽  
Vol 12 (1) ◽  
pp. 36
Author(s):  
Khushman Taunk ◽  
Priscilla Porto-Figueira ◽  
Jorge A. M. Pereira ◽  
Ravindra Taware ◽  
Nattane Luíza da Costa ◽  
...  

The urinary volatomic profiling of Indian cohorts composed of 28 lung cancer (LC) patients and 27 healthy subjects (control group, CTRL) was established using headspace solid phase microextraction technique combined with gas chromatography mass spectrometry methodology as a powerful approach to identify urinary volatile organic metabolites (uVOMs) to discriminate among LC patients from CTRL. Overall, 147 VOMs of several chemistries were identified in the intervention groups—including naphthalene derivatives, phenols, and organosulphurs—augmented in the LC group. In contrast, benzene and terpenic derivatives were found to be more prevalent in the CTRL group. The volatomic data obtained were processed using advanced statistical analysis, namely partial least square discriminative analysis (PLS-DA), support vector machine (SVM), random forest (RF), and multilayer perceptron (MLP) methods. This resulted in the identification of nine uVOMs with a higher potential to discriminate LC patients from CTRL subjects. These were furan, o-cymene, furfural, linalool oxide, viridiflorene, 2-bromo-phenol, tricyclazole, 4-methyl-phenol, and 1-(4-hydroxy-3,5-di-tert-butylphenyl)-2-methyl-3-morpholinopropan-1-one. The metabolic pathway analysis of the data obtained identified several altered biochemical pathways in LC mainly affecting glycolysis/gluconeogenesis, pyruvate metabolism, and fatty acid biosynthesis. Moreover, acetate and octanoic, decanoic, and dodecanoic fatty acids were identified as the key metabolites responsible for such deregulation. Furthermore, studies involving larger cohorts of LC patients would allow us to consolidate the data obtained and challenge the potential of the uVOMs as candidate biomarkers for LC.


Check List ◽  
2021 ◽  
Vol 17 (3) ◽  
pp. 979-983
Author(s):  
Will K. Reeves ◽  
Jeremy R. Shaw ◽  
Mark J. Wetzel

Cognettia sphagnetorum (Vejdovský, 1878), a common inhabitant of forest soils and bogs in northern Europe, is a model organism in soil biology. We report the first documented occurrence of C. sphagnetorum in North America, based on DNA sequencing from a Sphagnum bog in western Washington, USA. Sequences were identical to that of worms from Sweden and the Czech Republic.


Sign in / Sign up

Export Citation Format

Share Document