scholarly journals Measurement error-filtered machine learning in digital soil mapping

2021 ◽  
pp. 100572
Author(s):  
Stephan van der Westhuizen ◽  
Gerard B.M. Heuvelink ◽  
David P. Hofmeyr ◽  
Laura Poggio
2021 ◽  
Author(s):  
Stephan van der Westhuizen ◽  
Gerard Heuvelink ◽  
David Hofmeyr

<p>Digital soil mapping (DSM) may be defined as the use of a statistical model to quantify the relationship between a certain observed soil property at various geographic locations, and a collection of environmental covariates, and then using this relationship to predict the soil property at locations where the property was not measured. It is also important to quantify the uncertainty with regards to prediction of these soil maps. An important source of uncertainty in DSM is measurement error which is considered as the difference between a measured and true value of a soil property.</p><p>The use of machine learning (ML) models such as random forests (RF) has become a popular trend in DSM. This is because ML models tend to be capable of accommodating highly non-linear relationships between the soil property and covariates. However, it is not clear how to incorporate measurement error into ML models. In this presentation we will discuss how to incorporate measurement error into some popular ML models, starting with incorporating weights into the objective function of ML models that implicitly assume a Gaussian error. We will discuss the effect that these modifications have on prediction accuracy, with reference to simulation studies.</p>


Author(s):  
Martin Meier ◽  
Eliana de Souza ◽  
Marcio Rocha Francelino ◽  
Elpídio Inácio Fernandes Filho ◽  
Carlos Ernesto Gonçalves Reynaud Schaefer

2020 ◽  
Vol 210 ◽  
pp. 103359
Author(s):  
Alexandre M.J.-C. Wadoux ◽  
Budiman Minasny ◽  
Alex B. McBratney

2021 ◽  
Author(s):  
Ruhollah Taghizadeh-Mehrjardi ◽  
Razieh Sheikhpour ◽  
Norair Toomanian ◽  
Thomas Scholten

<p>The most critical aspect of application of digital soil mapping is its limited transferability. Modelling soil properties for regions where no or only sparse soil information is available is highly uncertain, when using the low-cost geo-spatial environmental covariates alone. To overcome this drawback, transfer learning has been introduced in different environmental sciences, including soil science. The general idea behind extrapolation of soil information with transfer learning in soil science is that the target area to transfer to is alike, e.g. in terms of soil-forming factors, and the same machine learning rules can be applied. Supervised machine learning, so far, has been used to transfer the soil information from the reference to the target areas with very similar environmental characteristics between both. Hence, it is unclear how machine learning can perform for other target regions with different environmental characteristics. Semi-supervised learning is an approach to machine learning that combines a small amount of labeled data (reference area) with a large amount of unlabeled data (target area) during training. In this study, we explored if semi-supervised learning could improve the transferability of digital soil mapping relative to supervised learning methods. Soil data for two arid regions and associated environmental covariates were obtained. Semi-supervised learning and supervised learning models were trained based on the data in the reference area and then tested based on the data in the target area. The results of this study indicated the higher power of semi-supervised learning for transferring soil information from one area to another in comparison to the supervised learning method.   </p>


2020 ◽  
Author(s):  
Daniel Zizala

<p>Previous studies have shown that remote sensing data can be very useful input into soil prediction models. This input usually represents reflectance from bare soils, which, however, make up only a small part of the total area in a given part of the year. For eliminating masking effect of vegetation time series of individual images (Žížala et al. 2019; Shabou et al. 2015; Demattê et al. 2016; Blasch et al. 2015a) or multitemporal composites of spectral data can be used. Exposed Soil Composite Mapping Processor (SCMaP) (Rogge et al. 2018), Geospatial Soil Sensing System (GEOS3) (Demattê et al. 2018), Bare Soil Composite Image (Gallo et al. 2018), and Barest Pixel Composite for Agricultural Areas (Diek et al. 2017), all developed from Landsat time series, multitemporal bare soil image developed from RapidEye time series (Blasch et al. 2015b), or bare soil mosaic (Loiseau et al. 2019) derived from Sentinel-2 data can serve as examples of such composites. However, only some of the composite products have been used yet to predict soil properties. Promising results were achieved; however, the potential of these spectral composites has not yet been tested in a relevant number of studies. Further research is needed for its evaluation.</p><p>Aims of this study are to analyze and to compare the prediction ability of models using different types of multitemporal bare soil composites derived from Sentinel-2 images and their applicability for mapping soil properties in large areas. The study was conducted on a regional scale in the soil heterogeneous region of central Czechia with dissected relief and variable soil properties, where data from 100 soil profiles with soil analytics were available. Sentinel-2 images from 2016-2019 were used for composite formation in the python numpy environment. Different methods of cloud masking, bare soil identification and data aggregation (both already used in previous studies and newly derived) have been tested to compare which is the most suitable for prediction of soil properties. The principles of digital soil mapping and machine learning algorithms (random forest and support vector machine multivariate methods) were used for prediction.</p><p>Results reveal that Sentinel-2 multitemporal bare soil composites can be successfully applied in the prediction of soil properties. The setting of basic parameters of composite creation is very complex and challenging and it requires to use exact algorithms for masking clouds and bare soil. Soil moisture and surface roughness also greatly affect spectral characteristics of bare soil and thus a very important aspect of compositing is finding appropriate statistics to derive final pixel values of reflectance (minimum, mean, median, ...). One possible way to minimize the effect of moisture and surface roughness may be incorporation radar backscatter information from Sentinel-1. However, it further complicates the processing of data and makes the composite creation more complex.</p><p>The research has been supported by the project no. QK1820389 " Production of actual detailed maps of soil properties in the Czech Republic based on database of Large-scale Mapping of Agricultural Soils in Czechoslovakia and application of digital soil mapping" funding by Ministry of Agriculture.</p>


Geoderma ◽  
2016 ◽  
Vol 265 ◽  
pp. 62-77 ◽  
Author(s):  
Brandon Heung ◽  
Hung Chak Ho ◽  
Jin Zhang ◽  
Anders Knudby ◽  
Chuck E. Bulmer ◽  
...  

2020 ◽  
Vol 71 (3) ◽  
pp. 352-368 ◽  
Author(s):  
Ruhollah Taghizadeh-Mehrjardi ◽  
Karsten Schmidt ◽  
Kamran Eftekhari ◽  
Thorsten Behrens ◽  
Mohammad Jamshidi ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document