Soil Properties: Physics Inspired, Data Driven

Author(s):  
J. Carlos Santamarina ◽  
Junghee Park ◽  
Marco Terzariol ◽  
Alejandro Cardona ◽  
Gloria M. Castro ◽  
...  
Keyword(s):  
2021 ◽  
Author(s):  
Philipp Baumann ◽  
Anatol Helfenstein ◽  
Andreas Gubler ◽  
Armin Keller ◽  
Reto Giulio Meuli ◽  
...  

Abstract. Information on soils' composition and physical, chemical and biological properties is paramount to elucidate agroecosystem functioning in space and over time. For this purposes we developed a national Swiss soil spectral library (SSL; n = 4374) in the mid-infrared (mid-IR), calibrating 17 properties from legacy measurements on soils from the Swiss biodiversity monitoring program (n = 3778; 1094 sites) and the Swiss long-term monitoring network (n = 596; 71 sites). General models were trained with the interpretable rule-based learner CUBIST, testing combinations of {5, 10, 20, 50, 100} committees of rules and {2, 5, 7, 9} neighbors to localize predictions with repeated by location grouped ten-fold cross-validation. To evaluate the information in spectra to facilitate long-term soil monitoring at a plot-level, we conducted 71 model transfers for the NABO sites to induce locally relevant information from the SSL, using the data-driven sample selection method rs-local. Eleven soil properties were estimated with discrimination capacity suitable for screening (R2 > 0.6), out of which total carbon (C), organic C (OC), total N, organic matter content, pH, and clay showed accuracy eligible for accurate diagnostics (R2 > 0.8). Cubist and the spectra estimated total C accurately with RMSE = 0.84 % while the measured range was 0.1–⁠58.3 %, and OC with RMSE = 1.20 % (measured range 0.0–⁠27.3 %). Compared to general estimates of properties from Cubist, local modeling on average reduced the root mean square error of total C per site fourfold. We found that the selected SSL subsets were highly dissimilar in terms of both their spectral input space and the measured values. This suggests that data-driven selection with RS-LOCAL leverages chemical diversity in composition rather than similarity. Our results suggest that mid-IR soil estimates were sufficiently accurate to support many soil applications that require a large volume of input data, such as precision agriculture, soil C accounting and monitoring, and digital soil mapping. This SSL can be updated continuously, for example with samples from deeper profiles and organic soils, so that the measurement of key soil properties becomes even more accurate and efficient in the near future.


2020 ◽  
Author(s):  
Philipp Baumann ◽  
Anatol Helfenstein ◽  
Andreas Gubler ◽  
Reto Meuli ◽  
Armin Keller ◽  
...  

<p>Soil data at different scales are needed for assessments and monitoring of soil functions. Soil diffuse reflectance spectroscopy using visible–Near Infrared and mid-Infrared energies can be used to estimate a range of soil properties, rapidly and inexpensively. However the spectroscopic modeling is challenging because of the large soil diversity and its complex composition. We developed a National Soil Spectral library (SSL) (n = 4339) using samples from (i) the Swiss Soil Monitoring Network (NABO; 7 sampling campaigns at 71 agricultural locations since 1985, n = 592) and (ii) the National Biodiversity Monitoring (BDM) Program (n = 4295, 1094 locations across a 5x5 km grid). The SSL will provide spectroscopic models for estimation of functional soil properties at different scales (e.g. total carbon (C) and nitrogen, organic C, texture, pH and cation exchange capacity). We used a rule-based algorithm, Cubist, for the modelling. The models were tuned across full combinations of {5, 10, 20, 50, 100} committees and {2, 5, 7, 9} neighbors, using 5 times repeated 10-fold cross-validation grouped by location. Further, transfer learning with RS-LOCAL tuning was performed for each of the 71 monitoring sites separately by a hold out approach in order to select optimal instances from the remaining SSL. Total soil C in the reference data ranged from 0.1% to 58.3% C and the best Cubist model had a cross-validated RMSE of 0.82% C. The RS-LOCAL approach (RMSE<sub>mean</sub> = 0.14 %) was on average 2.5 times more accurate for the estimation of C over time at each of the 71 NABO sites compared to the general Cubist approach. Our results suggest that data-driven selection of SSL instances targeted to closely related soils produces less biased estimation of soil properties over time at smaller geographic extents. The general Cubist calibration models are useful when reference analyses in a new study area are scarce. In conclusion, the Swiss SSL models can be used to cost-efficiently estimate a range of soil properties for a diverse applications and purposes in Switzerland.</p>


SOIL ◽  
2021 ◽  
Vol 7 (2) ◽  
pp. 525-546
Author(s):  
Philipp Baumann ◽  
Anatol Helfenstein ◽  
Andreas Gubler ◽  
Armin Keller ◽  
Reto Giulio Meuli ◽  
...  

Abstract. Information on soils' composition and physical, chemical and biological properties is paramount to elucidate agroecosystem functioning in space and over time. For this purpose, we developed a national Swiss soil spectral library (SSL; n=4374) in the mid-infrared (mid-IR), calibrating 16 properties from legacy measurements on soils from the Swiss Biodiversity Monitoring program (BDM; n=3778; 1094 sites) and the Swiss long-term Soil Monitoring Network (NABO; n=596; 71 sites). General models were trained with the interpretable rule-based learner CUBIST, testing combinations of {5,10,20,50, and 100} ensembles of rules (committees) and {2, 5, 7, and 9} nearest neighbors used for local averaging with repeated 10-fold cross-validation grouped by location. To evaluate the information in spectra to facilitate long-term soil monitoring at a plot level, we conducted 71 model transfers for the NABO sites to induce locally relevant information from the SSL, using the data-driven sample selection method RS-LOCAL. In total, 10 soil properties were estimated with discrimination capacity suitable for screening (R2≥0.72; ratio of performance to interquartile distance (RPIQ) ≥ 2.0), out of which total carbon (C), organic C (OC), total nitrogen (N), pH and clay showed accuracy eligible for accurate diagnostics (R2>0.8; RPIQ ≥ 3.0). CUBIST and the spectra estimated total C accurately with the root mean square error (RMSE) = 8.4 g kg−1 and the RPIQ = 4.3, while the measured range was 1–583 g kg−1 and OC with RMSE = 9.3 g kg−1 and RPIQ = 3.4 (measured range 0–583 g kg−1). Compared to the general statistical learning approach, the local transfer approach – using two respective training samples – on average reduced the RMSE of total C per site fourfold. We found that the selected SSL subsets were highly dissimilar compared to validation samples, in terms of both their spectral input space and the measured values. This suggests that data-driven selection with RS-LOCAL leverages chemical diversity in composition rather than similarity. Our results suggest that mid-IR soil estimates were sufficiently accurate to support many soil applications that require a large volume of input data, such as precision agriculture, soil C accounting and monitoring and digital soil mapping. This SSL can be updated continuously, for example, with samples from deeper profiles and organic soils, so that the measurement of key soil properties becomes even more accurate and efficient in the near future.


Sign in / Sign up

Export Citation Format

Share Document