Digital soil mapping of compositional particle-size fractions using proximal and remotely sensed ancillary data

Geophysics ◽  
2012 ◽  
Vol 77 (4) ◽  
pp. WB201-WB211 ◽  
Author(s):  
S. Buchanan ◽  
J. Triantafilis ◽  
I. O. A. Odeh ◽  
R. Subansinghe

The soil particle-size fractions (PSFs) are one of the most important attributes to influence soil physical (e.g., soil hydraulic properties) and chemical (e.g., cation exchange) processes. There is an increasing need, therefore, for high-resolution digital prediction of PSFs to improve our ability to manage agricultural land. Consequently, use of ancillary data to make cheaper high-resolution predictions of soil properties is becoming popular. This approach is known as “digital soil mapping.” However, most commonly employed techniques (e.g., multiple linear regression or MLR) do not consider the special requirements of a regionalized composition, namely PSF; (1) should be nonnegative (2) should sum to a constant at each location, and (3) estimation should be constrained to produce an unbiased estimation, to avoid false interpretation. Previous studies have shown that the use of the additive log-ratio transformation (ALR) is an appropriate technique to meet the requirements of a composition. In this study, we investigated the use of ancillary data (i.e., electromagnetic (EM), gamma-ray spectrometry, Landsat TM, and a digital elevation model to predict soil PSF using MLR and generalized additive models (GAM) in a standard form and with an ALR transformation applied to the optimal method (GAM-ALR). The results show that the use of ancillary data improved prediction precision by around 30% for clay, 30% for sand, and 7% for silt for all techniques (MLR, GAM, and GAM-ALR) when compared to ordinary kriging. However, the ALR technique had the advantage of adhering to the special requirements of a composition, with all predicted values nonnegative and PSFs summing to unity at each prediction point and giving more accurate textural prediction.

2021 ◽  
Author(s):  
Mo Zhang ◽  
Wenjiao Shi

Abstract. Digital soil mapping of soil particle-size fractions (PSFs) using log-ratio methods is a widely used technique. As a hybrid interpolator, regression kriging (RK) provides a way to improve prediction accuracy. However, there have been few comparisons with other techniques when RK is applied for compositional data, and it is not known if its performance based on different balances of isometric log-ratio (ILR) transformation is robust. Here, we compared the generalized linear model (GLM), random forest (RF), and their hybrid patterns (RK) using different transformed data based on three ILR balances, with 29 environmental covariables (ECs) for the prediction of soil PSFs in the upper reaches of the Heihe River Basin (HRB), China. The results showed that the RF performed best, with more accurate predictions, but the GLM produced a more unbiased prediction. As a hybrid interpolator, RK was recommended because it widened the data ranges of the prediction values, and modified the bias and accuracy of most models, especially the RF. The prediction maps generated from RK revealed more details of the soil sampling points than the other models. Different data distributions were produced for the three ILR balances. Using the most abundant component of the compositional data as the first component of the permutations was not considered to be the right choice because it produced the worst performance. Based on the relative abundance of the components, we recommend that the focus should be on data distribution. This study provides a reference for the mapping of soil PSFs combined with transformed data at the regional scale.


2021 ◽  
Vol 13 (7) ◽  
pp. 1392
Author(s):  
Chong Wang ◽  
Lin Zhao ◽  
Hongbing Fang ◽  
Lingxiao Wang ◽  
Zanpin Xing ◽  
...  

Spatial information of particle size fractions (PSFs) is primary for understanding the thermal state of permafrost in the Qinghai-Tibet Plateau (QTP) in response to climate change. However, the limitation of field observations and the tremendous spatial heterogeneity hamper the digital mapping of PSF. This study integrated log-ratio transformation approaches, variable searching methods, and machine learning techniques to map the surficial soil PSF distribution of two typical permafrost regions. Results showed that the Boruta technique identified different covariates but retained those covariates of vegetation and land surface temperature in both regions. Variable selection techniques effectively decreased the data redundancy and improved model performance. In addition, the spatial distribution of soil PSFs generated by four log-ratio models presented similar patterns. Isometric log-ratio random forest (ILR-RF) outperformed the other models in both regions (i.e., R2 ranged between 0.36 to 0.56, RMSE ranged between 0.02 and 0.10). Compared with three legacy datasets, our prediction better captured the spatial pattern of PSFs with higher accuracy. Although this study largely improved the accuracy of spatial distribution of soil PSFs, further endeavors should also be made to improve model accuracy and interpretability for a better understanding of the interaction and processes between environmental predictors and soil PSFs at permafrost regions.


2020 ◽  
Author(s):  
Mo Zhang ◽  
Wenjiao Shi

Abstract. Digital soil mapping of soil particle-size fractions (PSFs) using log-ratio methods has been widely used. As a hybrid interpolator, regression kriging (RK) is an alternative way to improve prediction accuracy. However, there is still a lack of systematic comparison and recommendation when RK was applied for compositional data. Whether performance based on different balances of isometric log-ratio (ILR) transformation is robust. Here, we systematically compared the generalized linear model (GLM), random forest (RF), and their hybrid pattern (RK) using different balances of ILR transformed data of soil PSFs with 29 environmental covariables for prediction of soil PSFs on the upper reaches of the Heihe River Basin. The results showed that RF had better performance with more accurate predictions, but GLM had a more unbiased prediction. For the hybrid interpolators, RK was recommended because it widened data ranges of the prediction results, and modified bias and accuracy for most models, especially for RF. The drawback, however, existed due to the data distributions and model algorithms. Moreover, prediction maps generated from RK demonstrated more details of soil sampling points. Three ILR transformed data based on sequential binary partitions (SBP) made different distributions, and it is not recommended to use the most abundant component of compositions as the first component of permutations. This study can reference spatial simulation of soil PSFs combined with environmental covariables and transformed data at a regional scale.


Sign in / Sign up

Export Citation Format

Share Document