scholarly journals Genetic programming for hydrological applications: to model or forecast that is the question

Author(s):  
Herath Mudiyanselage Viraj Vidura Herath ◽  
Jayashree Chadalawada ◽  
Vladan Babovic

Abstract Genetic programming (GP) is a widely used machine learning (ML) algorithm that has been applied in water resources science and engineering since its conception in the early 1990s. However, similar to other ML applications, the GP algorithm is often used as a data fitting tool rather than as a model building instrument. We find this a gross underutilization of the GP capabilities. The most unique and distinct feature of GP that makes it distinctly different from the rest of ML techniques is its capability to produce explicit mathematical relationships between input and output variables. In the context of theory-guided data science (TGDS) which recently emerged as a new paradigm in ML with the main goal of blending the existing body of knowledge with ML techniques to induce physically sound models. Hence, TGDS has evolved into a popular data science paradigm, especially in scientific disciplines including water resources. Following these ideas, in our prior work, we developed two hydrologically informed rainfall-runoff model induction toolkits for lumped modelling and distributed modelling based on GP. In the current work, the two toolkits are applied using a different hydrological model building library. Here, the model building blocks are derived from the Sugawara TANK model template which represents the elements of hydrological knowledge. Results are compared against the traditional GP approach and suggest that GP as a rainfall-runoff model induction toolkit preserves the prediction power of the traditional GP short-term forecasting approach while benefiting to better understand the catchment runoff dynamics through the readily interpretable induced models.

2020 ◽  
Author(s):  
Herath Mudiyanselage Viraj Vidura Herath ◽  
Jayashree Chadalawada ◽  
Vladan Babovic

Abstract. Despite showing a great success of applications in many commercial fields, machine learning and data science models in general, show a limited use in scientific fields including hydrology. The approach is often criticized for lack of interpretability and physical consistency. This has led to the emergence of new paradigms, such as Theory Guided Data Science (TGDS) and physics informed machine learning. The motivation behind such approaches is to improve the physical meaningfulness of machine learning models by blending existing scientific knowledge with learning algorithms. Following the same principles, in our prior work (Chadalawada et al., 2020), a new model induction framework was founded on Genetic Programming (GP) namely Machine Learning Rainfall-Runoff Model Induction Toolkit (ML-RR-MI). ML-RR-MI is cable of developing fully-fledged lumped conceptual rainfall-runoff models for a watershed of interest using the building blocks of two flexible rainfall-runoff modelling frameworks (FUSE and SUPERFLEX). In this study, we extend ML-RR-MI towards inducing semi-distributed rainfall-runoff models. This effort is motivated by the desire to address the decreasing meaningfulness of lumped models which tend to particularly deteriorate within large catchments where the spatial heterogeneity of forcing variables and watershed properties are significant. Henceforth, our machine learning approach for rainfall-runoff modelling titled Machine Induction Knowledge-Augmented System Hydrologique Asiatique (MIKA-SHA) captures spatial variabilities and automatically induces rainfall-runoff models for the catchment of interest without any subjectivity in model selection. Currently, MIKA-SHA learns models utilizing the model building components of FUSE and SUPERFLEX. However, the proposed framework can be coupled with any internally coherent collection of building blocks. MIKA-SHA’s model induction capabilities have been tested on the Red Creek catchment near Vestry, Mississippi, United States. The resulted model architectures through MIKA-SHA are compatible with previously reported research findings and fieldwork insights of the watershed and are readily interpretable by hydrologists.


2018 ◽  
Vol 13 (2) ◽  
pp. 115-130 ◽  
Author(s):  
Radhika Radhika ◽  
Rendy Firmansyah ◽  
Waluyo Hatmoko

Information on water availability is vital in water resources management. Unfortunately, information on the condition of hydrological data, either river flow data, or rainfall data is very limited temporally and spatially. With the availability of satellite technology, rainfall in the tropics can be monitored and recorded for further analysis. This paper discusses the calculation of surface water availability based on rainfall data from TRMM satellite, and then Wflow, a distributed rainfall-runoff model generates monthly time runoff data from 2003 to 2015 for all river basin areas in Indonesia. It is concluded that the average surface water availability in Indonesia is 88.3 thousand m3/s or equivalent to 2.78 trillion m3/ year. This figure is lower than the study of Water Resources Research Center 2010 based on discharge at the post estimated water that produces 3.9 trillion m3/year, but very close to the study of Aquastat FAO of 2.79 trillion m3 / year. The main benefit of this satellite-based calculation is that at any location in Indonesia, potential surface water can be obtained by multiplying the area of the catchment and the runoff height.


Author(s):  
Hiroki Momiyama ◽  
Tomo'omi Kumagai ◽  
Tomohiro Egusa

In Japan, there has recently been an increasing call for forest thinning to conserve water resources from forested mountain catchments in terms of runoff during prolonged drought periods of the year. How their water balance and the resultant runoff are altered by forest thinning is examined using a combination of 8-year hydrological observations, 100-year meteorological data generator output, and a semi-process-based rainfall-runoff model. The rainfall-runoff model is developed based on TOPMODEL assuming that forest thinning has an impact on runoff primarily through an alteration in canopy interception. The main novelty in this analysis is that the availability of the generated 100-year meteorological data allows the investigations of the forest thinning impacts on mountain catchment water resources under the most severer drought conditions. The model is validated against runoff observations conducted at a forested mountain catchment in the Kanto region of Japan for the period 2010–2017. It is demonstrated that the model reproduces temporal variations in runoff and evapotranspiration at inter- and intra-annual time scales, resulting in well reproducing the observed flow duration curves. On the basis of projected flow duration curves for the 100-year, despite the large increase in an annual total runoff with ordinary intensifying thinning, low flow rates, i.e., water resources from the catchment in the drought period in the year, in both normal and drought years were impacted by the forest thinning to a lesser extent. Higher catchment water retention capacity appreciably enhanced the forest thinning effect on increasing available water resources.


2019 ◽  
Vol 27 (1) ◽  
pp. 14-24 ◽  
Author(s):  
Naser Mohammadzadeh ◽  
Bahman Jabbarian Amiri ◽  
Leila Eslami Endergoli ◽  
Shirin Karimi

Abstract With the aim of assessing the impact of climate change on surface water resources, a conceptual rainfall-runoff model (the tank model) was coupled with LARS-WG as a weather generator model. The downscaled daily rainfall, temperature, and evaporation from LARS-WG under various IPCC climate change scenarios were used to simulate the runoff through the calibrated Tank model. A catchment (4648 ha) located in the southern basin of the Caspian Sea was chosen for this research study. The results showed that this model has a reasonable predictive capability in simulating minimum and maximum temperatures at a level of 99%, rainfall at a level of 93%, and radiation at a level of 97% under various scenarios in agreement with the observed data. Moreover, the results of the rainfall-runoff model indicated an increase in the flow rate of about 108% under the A1B scenario, 101% under the A2 scenario, and 93% under the B1 scenario over the 30-year time period of the discharge prediction.


2021 ◽  
Vol 25 (8) ◽  
pp. 4373-4401
Author(s):  
Herath Mudiyanselage Viraj Vidura Herath ◽  
Jayashree Chadalawada ◽  
Vladan Babovic

Abstract. Despite showing great success of applications in many commercial fields, machine learning and data science models generally show limited success in many scientific fields, including hydrology (Karpatne et al., 2017). The approach is often criticized for its lack of interpretability and physical consistency. This has led to the emergence of new modelling paradigms, such as theory-guided data science (TGDS) and physics-informed machine learning. The motivation behind such approaches is to improve the physical meaningfulness of machine learning models by blending existing scientific knowledge with learning algorithms. Following the same principles in our prior work (Chadalawada et al., 2020), a new model induction framework was founded on genetic programming (GP), namely the Machine Learning Rainfall–Runoff Model Induction (ML-RR-MI) toolkit. ML-RR-MI is capable of developing fully fledged lumped conceptual rainfall–runoff models for a watershed of interest using the building blocks of two flexible rainfall–runoff modelling frameworks. In this study, we extend ML-RR-MI towards inducing semi-distributed rainfall–runoff models. The meaningfulness and reliability of hydrological inferences gained from lumped models may tend to deteriorate within large catchments where the spatial heterogeneity of forcing variables and watershed properties is significant. This was the motivation behind developing our machine learning approach for distributed rainfall–runoff modelling titled Machine Induction Knowledge Augmented – System Hydrologique Asiatique (MIKA-SHA). MIKA-SHA captures spatial variabilities and automatically induces rainfall–runoff models for the catchment of interest without any explicit user selections. Currently, MIKA-SHA learns models utilizing the model building components of two flexible modelling frameworks. However, the proposed framework can be coupled with any internally coherent collection of building blocks. MIKA-SHA's model induction capabilities have been tested on the Rappahannock River basin near Fredericksburg, Virginia, USA. MIKA-SHA builds and tests many model configurations using the model building components of the two flexible modelling frameworks and quantitatively identifies the optimal model for the watershed of concern. In this study, MIKA-SHA is utilized to identify two optimal models (one from each flexible modelling framework) to capture the runoff dynamics of the Rappahannock River basin. Both optimal models achieve high-efficiency values in hydrograph predictions (both at catchment and subcatchment outlets) and good visual matches with the observed runoff response of the catchment. Furthermore, the resulting model architectures are compatible with previously reported research findings and fieldwork insights of the watershed and are readily interpretable by hydrologists. MIKA-SHA-induced semi-distributed model performances were compared against existing lumped model performances for the same basin. MIKA-SHA-induced optimal models outperform the lumped models used in this study in terms of efficiency values while benefitting hydrologists with more meaningful hydrological inferences about the runoff dynamics of the Rappahannock River basin.


2021 ◽  
Author(s):  
Jamie Lee Stevenson ◽  
Christian Birkel ◽  
Aaron J. Neill ◽  
Doerthe Tetzlaff ◽  
Chris Soulsby

Sign in / Sign up

Export Citation Format

Share Document