scholarly journals Improved stratified sampling strategy for estimating mean soil moisture based on auxiliary variable spatial autocorrelation

2022 ◽  
Vol 215 ◽  
pp. 105212
Author(s):  
Jianhua Jin ◽  
Baozhong Zhang ◽  
Xiaomin Mao
2021 ◽  
Vol 13 (5) ◽  
pp. 907
Author(s):  
Theodora Lendzioch ◽  
Jakub Langhammer ◽  
Lukáš Vlček ◽  
Robert Minařík

One of the best preconditions for the sufficient monitoring of peat bog ecosystems is the collection, processing, and analysis of unique spatial data to understand peat bog dynamics. Over two seasons, we sampled groundwater level (GWL) and soil moisture (SM) ground truth data at two diverse locations at the Rokytka Peat bog within the Sumava Mountains, Czechia. These data served as reference data and were modeled with a suite of potential variables derived from digital surface models (DSMs) and RGB, multispectral, and thermal orthoimages reflecting topomorphometry, vegetation, and surface temperature information generated from drone mapping. We used 34 predictors to feed the random forest (RF) algorithm. The predictor selection, hyperparameter tuning, and performance assessment were performed with the target-oriented leave-location-out (LLO) spatial cross-validation (CV) strategy combined with forward feature selection (FFS) to avoid overfitting and to predict on unknown locations. The spatial CV performance statistics showed low (R2 = 0.12) to high (R2 = 0.78) model predictions. The predictor importance was used for model interpretation, where temperature had strong impact on GWL and SM, and we found significant contributions of other predictors, such as Normalized Difference Vegetation Index (NDVI), Normalized Difference Index (NDI), Enhanced Red-Green-Blue Vegetation Index (ERGBVE), Shape Index (SHP), Green Leaf Index (GLI), Brightness Index (BI), Coloration Index (CI), Redness Index (RI), Primary Colours Hue Index (HI), Overall Hue Index (HUE), SAGA Wetness Index (TWI), Plan Curvature (PlnCurv), Topographic Position Index (TPI), and Vector Ruggedness Measure (VRM). Additionally, we estimated the area of applicability (AOA) by presenting maps where the prediction model yielded high-quality results and where predictions were highly uncertain because machine learning (ML) models make predictions far beyond sampling locations without sampling data with no knowledge about these environments. The AOA method is well suited and unique for planning and decision-making about the best sampling strategy, most notably with limited data.


2011 ◽  
Vol 5 (3) ◽  
pp. 617-629 ◽  
Author(s):  
J. I. López-Moreno ◽  
S. R. Fassnacht ◽  
S. Beguería ◽  
J. B. P. Latron

Abstract. Snow depth variability over small distances can affect the representativeness of depth samples taken at the local scale, which are often used to assess the spatial distribution of snow at regional and basin scales. To assess spatial variability at the plot scale, intensive snow depth sampling was conducted during January and April 2009 in 15 plots in the Rio Ésera Valley, central Spanish Pyrenees Mountains. Each plot (10 × 10 m; 100 m2) was subdivided into a grid of 1 m2 squares; sampling at the corners of each square yielded a set of 121 data points that provided an accurate measure of snow depth in the plot (considered as ground truth). The spatial variability of snow depth was then assessed using sampling locations randomly selected within each plot. The plots were highly variable, with coefficients of variation up to 0.25. This indicates that to improve the representativeness of snow depth sampling in a given plot the snow depth measurements should be increased in number and averaged when spatial heterogeneity is substantial. Snow depth distributions were simulated at the same plot scale under varying levels of standard deviation and spatial autocorrelation, to enable the effect of each factor on snowpack representativeness to be established. The results showed that the snow depth estimation error increased markedly as the standard deviation increased. The results indicated that in general at least five snow depth measurements should be taken in each plot to ensure that the estimation error is <10 %; this applied even under highly heterogeneous conditions. In terms of the spatial configuration of the measurements, the sampling strategy did not impact on the snow depth estimate under lack of spatial autocorrelation. However, with a high spatial autocorrelation a smaller error was obtained when the distance between measurements was greater.


2020 ◽  
Vol 8 (2) ◽  
pp. 49-56
Author(s):  
Akan Anieting

In this article, a new estimator for population mean in two-phase stratified sampling in the presence of nonresponse using single auxiliary variable has been proposed. The bias and Mean Squared Error (MSE) of the proposed estimator has been given using large sample approximation. The empirical study shows that the MSE of the proposed estimator is more efficient than existing estimators. The optimum values of first and second phase sample have been determined.


Author(s):  
Min-Tang Li ◽  
Lee-Fang Chow ◽  
Fang Zhao ◽  
Shi-Chiang Li

A key feature in estimating and applying destination choice models with aggregate alternatives is to sample a set of nonchosen traffic analysis zones (TAZs), plus the one a trip maker chose, to construct a destination choice set. Computational complexity is reduced because the choice set would be too large if all study area TAZs were included in the calibration. Commonly, two types of sampling strategies are applied to draw subsets of alternatives from the universal choice set. The first, and simplest, approach is to select randomly a subset of nonchosen alternatives with uniform selection probabilities and then add the chosen alternative if it is not otherwise included. The approach, however, is not an efficient sampling scheme because most alternatives for a given trip maker may have small choice probabilities. The second approach, stratified importance sampling, draws samples with unequal selection probabilities determined on the basis of preliminary estimates of choice probabilities for every alternative in the universal choice set. The stratified sampling method assigns different selection probabilities to alternatives in different strata. Simple random sampling is applied to draw alternatives in each stratum. However, it is unclear how to divide the study area so that destination TAZs may be sampled effectively. The process of and findings from implementing a stratified sampling strategy in selecting alternative TAZs for calibrating aggregate destination choice models in a geographic information system (GIS) environment are described. In this stratified sampling analysis, stratum regions varied by spatial location and employment size in the adjacent area were defined for each study area TAZ. The sampling strategy is more effective than simple random sampling in regard to maximum log likelihood and goodness-of-fit values.


Author(s):  
Weijie Liu ◽  
Hui Qian ◽  
Chao Zhang ◽  
Zebang Shen ◽  
Jiahao Xie ◽  
...  

In this paper, a novel stratified sampling strategy is designed to accelerate the mini-batch SGD. We derive a new iteration-dependent surrogate which bound the stochastic variance from above. To keep the strata minimizing this surrogate with high probability, a stochastic stratifying algorithm is adopted in an adaptive manner, that is, in each iteration, strata are reconstructed only if an easily verifiable condition is met. Based on this novel sampling strategy, we propose an accelerated mini-batch SGD algorithm named SGD-RS. Our theoretical analysis shows that the convergence rate of SGD-RS is superior to the state-of-the-art. Numerical experiments corroborate our theory and demonstrate that SGD-RS achieves at least 3.48-times speed-ups compared to vanilla minibatch SGD.


2012 ◽  
Vol 610-613 ◽  
pp. 3732-3737 ◽  
Author(s):  
Ji Ping Zhang ◽  
Lin Bo Zhang ◽  
Bin Gong

This study combines the sampling technique, geographic information system and remote sensing technique to conduct a sampling survey on forest cover area of Jinggangshan National Nature Reserve in China on the basis of TM remote sensing image. The spatial simple random sampling, spatial stratified sampling and sandwich sampling model are respectively utilized to establish the sampling design. For the spatial simple random sampling model, the spatial autocorrelation analysis method is adopted to determine the spatial autocorrelation coefficient through calculating Moran's I index, while in the spatial stratified sampling and sandwich sampling model, the yearly maximum NDVI (Normalized Difference Vegetation Index) is utilized to conduct the spatial stratification. Through comparison of the sampling accuracy of three sampling models, a higher precision and more reasonable sampling method and sampling model is provided for remote sensing monitoring of forest cover area. The study results show that: sandwich sampling model is featured as the highest sampling accuracy, followed by the spatial stratified sampling and simple random sampling. Under the requirement of same precision, sandwich spatial sampling model can reduce quantity of the sampling points, and create all kinds of report units according to demands of different spatial area, so it is featured as the better suitability.


2010 ◽  
Vol 14 (6) ◽  
pp. 873-889 ◽  
Author(s):  
E. Zehe ◽  
T. Graeff ◽  
M. Morgner ◽  
A. Bauer ◽  
A. Bronstert

Abstract. This study presents an application of an innovative sampling strategy to assess soil moisture dynamics in a headwater of the Weißeritz in the German eastern Ore Mountains. A grassland site and a forested site were instrumented with two Spatial TDR clusters (STDR) that consist of 39 and 32 coated TDR probes of 60 cm length. Distributed time series of vertically averaged soil moisture data from both sites/ensembles were analyzed by statistical and geostatistical methods. Spatial variability and the spatial mean at the forested site were larger than at the grassland site. Furthermore, clustering of TDR probes in combination with long-term monitoring allowed identification of average spatial covariance structures at the small field scale for different wetness states. The correlation length of soil water content as well as the sill to nugget ratio at the grassland site increased with increasing average wetness and but, in contrast, were constant at the forested site. As soil properties at both the forested and grassland sites are extremely variable, this suggests that the correlation structure at the forested site is dominated by the pattern of throughfall and interception. We also found a very strong correlation between antecedent soil moisture at the forested site and runoff coefficients of rainfall-runoff events observed at gauge Rehefeld. Antecedent soil moisture at the forest site explains 92% of the variability in the runoff coefficients. By combining these results with a recession analysis we derived a first conceptual model of the dominant runoff mechanisms operating in this catchment. Finally, we employed a physically based hydrological model to shed light on the controls of soil- and plant morphological parameters on soil average soil moisture at the forested site and the grassland site, respectively. A homogeneous soil setup allowed, after fine tuning of plant morphological parameters, most of the time unbiased predictions of the observed average soil conditions observed at both field sites. We conclude that the proposed sampling strategy of clustering TDR probes is suitable to assess unbiased average soil moisture dynamics in critical functional units, in this case the forested site, which is a much better predictor for event scale runoff formation than pre-event discharge. Long term monitoring of such critical landscape elements could maybe yield valuable information for flood warning in headwaters. We thus think that STDR provides a good intersect of the advantages of permanent sampling and spatially highly resolved soil moisture sampling using mobile rods.


2009 ◽  
Vol 6 (6) ◽  
pp. 7503-7537 ◽  
Author(s):  
E. Zehe ◽  
T. Graeff ◽  
M. Morgner ◽  
A. Bauer ◽  
A. Bronstert

Abstract. This study presents an application of an innovative sampling strategy to assess soil moisture dynamics in a headwater of the Weißeritz in the German eastern Ore Mountains. A grassland site and a forested site were instrumented with two Spatial TDR clusters (STDR) that consist of 39 and 32 coated TDR probes of 60 cm length. Distributed time series of vertically averaged soil moisture data from both sites/ensembles were analyzed by statistical and geostatistical methods. Spatial variability and the spatial mean at the forested site were larger than at the grassland site. Furthermore, clustering of TDR probes in combination with long-term monitoring allowed identification of average spatial covariance structures at the small field scale for different wetness states. The correlation length of soil water content as well as the sill to nugget ratio at the grassland site increased with increasing average wetness and but, in contrast, were constant at the forested site. As soil properties at both the forested and grassland sites are extremely variable, this suggests that the correlation structure at the forested site is dominated by the pattern of throughfall and interception. We also found a strong correlation between average soil moisture dynamics and runoff coefficients of rainfall-runoff events observed at gauge Rehefeld, which explains almost as much variability in the runoff coefficients as pre-event discharge. By combining these results with a recession analysis we derived a first conceptual model of the dominant runoff mechanisms operating in this catchment. Finally, long term simulations with a physically based hydrological model were in good/acceptable accordance with the time series of spatial average soil water content observed at the forested site and the grassland site, respectively. Both simulations used a homogeneous soil setup that closely reproduces observed average soil conditions observed at the field sites. This corroborates the proposed sampling strategy of clustering TDR probes in typical functional units is a promising technique to explore the soil moisture control on runoff generation. Long term monitoring of such sites could maybe yield valuable information for flood warning. The sampling strategy helps furthermore to unravel different types of soil moisture variability.


Sign in / Sign up

Export Citation Format

Share Document