scholarly journals Empirical streamflow simulation for water resource management in data-scarce seasonal watersheds

2015 ◽  
Vol 12 (10) ◽  
pp. 11083-11127 ◽  
Author(s):  
J. E. Shortridge ◽  
S. D. Guikema ◽  
B. F. Zaitchik

Abstract. In the past decade, certain methods for empirical rainfall–runoff modeling have seen extensive development and been proposed as a useful complement to physical hydrologic models, particularly in basins where data to support process-based models is limited. However, the majority of research has focused on a small number of methods, such as artificial neural networks, despite the development of multiple other approaches for non-parametric regression in recent years. Furthermore, this work has generally evaluated model performance based on predictive accuracy alone, while not considering broader objectives such as model interpretability and uncertainty that are important if such methods are to be used for planning and management decisions. In this paper, we use multiple regression and machine-learning approaches to simulate monthly streamflow in five highly-seasonal rivers in the highlands of Ethiopia and compare their performance in terms of predictive accuracy, error structure and bias, model interpretability, and uncertainty when faced with extreme climate conditions. While the relative predictive performance of models differed across basins, data-driven approaches were able to achieve reduced errors when compared to physical models developed for the region. Methods such as random forests and generalized additive models may have advantages in terms of visualization and interpretation of model structure, which can be useful in providing insights into physical watershed function. However, the uncertainty associated with model predictions under climate change should be carefully evaluated, since certain models (especially generalized additive models and multivariate adaptive regression splines) became highly variable when faced with high temperatures.

2016 ◽  
Vol 20 (7) ◽  
pp. 2611-2628 ◽  
Author(s):  
Julie E. Shortridge ◽  
Seth D. Guikema ◽  
Benjamin F. Zaitchik

Abstract. In the past decade, machine learning methods for empirical rainfall–runoff modeling have seen extensive development and been proposed as a useful complement to physical hydrologic models, particularly in basins where data to support process-based models are limited. However, the majority of research has focused on a small number of methods, such as artificial neural networks, despite the development of multiple other approaches for non-parametric regression in recent years. Furthermore, this work has often evaluated model performance based on predictive accuracy alone, while not considering broader objectives, such as model interpretability and uncertainty, that are important if such methods are to be used for planning and management decisions. In this paper, we use multiple regression and machine learning approaches (including generalized additive models, multivariate adaptive regression splines, artificial neural networks, random forests, and M5 cubist models) to simulate monthly streamflow in five highly seasonal rivers in the highlands of Ethiopia and compare their performance in terms of predictive accuracy, error structure and bias, model interpretability, and uncertainty when faced with extreme climate conditions. While the relative predictive performance of models differed across basins, data-driven approaches were able to achieve reduced errors when compared to physical models developed for the region. Methods such as random forests and generalized additive models may have advantages in terms of visualization and interpretation of model structure, which can be useful in providing insights into physical watershed function. However, the uncertainty associated with model predictions under extreme climate conditions should be carefully evaluated, since certain models (especially generalized additive models and multivariate adaptive regression splines) become highly variable when faced with high temperatures.


Author(s):  
Marcos Samuel Matias Ribeiro ◽  
Lara de Melo Barbosa Andrade ◽  
Maria Helena Constantino Spyrides ◽  
Kellen Carla Lima ◽  
Pollyane Evangelista da Silva ◽  
...  

AbstractThe occurrence of environmental disasters affects different social segments, impacting health, education, housing, economy and the provision of basic services. Thus, the objective of this study was to estimate the relationship between the occurrence of disasters and extreme climate, sociosanitary and demographic conditions in the Northeast region of Brazil during the period from 1993 to 2013. Initially, we analyzed the spatial pattern of the incidence of events and, subsequently, generalized additive models for location, scale and shape were used in order to identify and estimate the magnitude of associations between factors. Results showed that droughts are the predominant disasters in the NEB representing 81.1% of the cases, followed by events triggered by excessive rainfall such as flash floods (11.1%) and floods (7.8%). Climate conditions presented statistically significant associations with the analyzed disasters, in which indicators of excess rainfall positively contributed to the occurrence of flash floods and floods, but negatively contributed to the occurrence of drought. Sociosanitary factors, such as percentage of households with inadequate sewage, waste collection and water supply, were also positively associated with the model’s estimations, i.e., contributing to an increase in the occurrence of events, with the exception of floods, which were not significantly influenced by sociosanitary parameters. A decrease of 19% in the risk of drought occurrence was estimated, on average. On the other hand, events caused by excessive rainfall increased by 40% and 57%, in the cases of flash floods and floods, respectively.


Author(s):  
Astor Toraño Caicoya ◽  
Hans Pretzsch

The Site Index (SI) has been widely used in forest management and silviculture. It relies on the assumption that the height of dominant trees in a stand is independent from the local density. However, research on climate change suggests that under certain moisture stress conditions, this may not hold. Here, based on 29 plots from 5 long-term research experiments, we have tested the effect of local stand density on the SI of Norway spruce (Picea abies (L.) H. Karst). With generalized additive models (GAMM), we analyzed the effect of stand structure and climate predictors on SI. The two evaluated models revealed that local stand density and age had a significant effect on SI (p≤0.001 ), showing a clear negative trend especially significant on sites with poor and dry soils, which may reduce the site index by a maximum of approximately 4 m for an increase in density between 400 and 600 trees/ha. We stress that the physiological characteristics of Norway spruce, flat-rooting system and xeromorphism, especially when growing in pure stands, may explain these effects. Thus, density control and growth in mixtures may help to reduce the water stress and losses in height growth under future climate conditions.


2020 ◽  
Vol 12 (10) ◽  
pp. 4006
Author(s):  
Fhumulani Mathivha ◽  
Caston Sigauke ◽  
Hector Chikoore ◽  
John Odiyo

Forecasting extreme hydrological events is critical for drought risk and efficient water resource management in semi-arid environments that are prone to natural hazards. This study aimed at forecasting drought conditions in a semi-arid region in north-eastern South Africa. The Standardized Precipitation Evaporation Index (SPEI) was used as a drought-quantifying parameter. Data for SPEI formulation for eight weather stations were obtained from South Africa Weather Services. Forecasting of the SPEI was achieved by using Generalized Additive Models (GAMs) at 1, 6, and 12 month timescales. Time series decomposition was done to reduce time series complexities, and variable selection was done using Lasso. Mild drought conditions were found to be more prevalent in the study area compared to other drought categories. Four models were developed to forecast drought in the Luvuvhu River Catchment (i.e., GAM, Ensemble Empirical Mode Decomposition (EEMD)-GAM, EEMD-Autoregressive Integrated Moving Average (ARIMA)-GAM, and Forecast Quantile Regression Averaging (fQRA)). At the first two timescales, fQRA forecasted the test data better than the other models, while GAMs were best at the 12 month timescale. Root Mean Square Error values of 0.0599, 0.2609, and 0.1809 were shown by fQRA and GAM at the 1, 6, and 12 month timescales, respectively. The study findings demonstrated the strength of GAMs in short- and medium-term drought forecasting.


2019 ◽  
Author(s):  
Adam B. Smith ◽  
Maria J. Santos

AbstractModels of species’ distributions and niches are frequently used to infer the importance of range- and niche-defining variables. However, the degree to which these models can reliably identify important variables and quantify their influence remains unknown. Here we use a series of simulations to explore how well models can 1) discriminate between variables with different influence and 2) calibrate the magnitude of influence relative to an “omniscient” model. To quantify variable importance, we trained generalized additive models (GAMs), Maxent, and boosted regression trees (BRTs) on simulated data and tested their sensitivity to permutations in each predictor. Importance was inferred by calculating the correlation between permuted and unpermuted predictions, and by comparing predictive accuracy of permuted and unpermuted predictions using AUC and the Continuous Boyce Index. In scenarios with one influential and one uninfluential variable, models were unable to discriminate reliably between variables in conditions that are normally challenging for generating accurate predictions: training occurrences <8-64; prevalence >0.5; small spatial extent; environmental data with coarse resolution when spatial autocorrelation is low; and correlation between environmental variables where |r| >0.7. When two variables influenced the distribution equally, importance was underestimated when species had narrow or intermediate niche breadth. Interactions between variables in how they shaped the niche did not affect inferences about their importance. When variables acted unequally, the effect of the stronger variable was overestimated. GAMs and Maxent discriminated between variables more reliably than BRTs, but no algorithm was consistently well-calibrated vis-à-vis the omniscient model. Algorithm-specific measures of importance like Maxent’s change-in-gain metric were less robust than the permutation test. Overall, high predictive accuracy did not connote robust inferential capacity. As a result, requirements for reliably measuring variable importance are likely more stringent than for creating models with high predictive accuracy.


2021 ◽  
Author(s):  
Cervantes - Martínez Karla ◽  
Riojas - Rodríguez Horacio ◽  
Díaz - Ávalos Carlos ◽  
Moreno - Macías Hortensia ◽  
López - Ridaura Ruy ◽  
...  

Abstract Epidemiological studies on the effects of air pollution in Mexico often use the environmental concentrations of monitors closest to the home as exposure proxies, yet this approach disregards the space gradients of pollutants and assumes that individuals have no intra-city mobility. Our aim was to develop high-resolution spatial and temporal models for predicting long-term exposure to PM2.5 and NO2 in a population of ~ 16 500 participants from the Mexican Teachers’ Cohort study. We geocoded the home and work addresses of participants. Using information from secondary sources on geographic and meteorological variables as well as other pollutants, we fitted two generalized additive models to predict monthly PM2.5 and NO2 concentrations in the 2004–2019 period. The models were evaluated through 10-fold cross validation. Both showed high predictive accuracy with out-of-sample data and no overfitting (CV RMSE = 0.102 for PM2.5 and CV RMSE = 4.497 for NO2). Participants were exposed to a monthly average of 24.38 (6.78) µg/m3 of PM2.5 and 28.21 (8.00) ppb of NO2 during the study period. These models offer a solid alternative for estimating PM2.5 and NO2 exposure with high spatio-temporal resolution for epidemiological studies in the Valle de México region.


Sign in / Sign up

Export Citation Format

Share Document