Effect of input variables on rainfall-runoff modeling using a deep learning method

Author(s):  
Kazuki yokoo ◽  
Kei ishida ◽  
Takeyoshi nagasato ◽  
Ali Ercan

<p>In recent years, deep learning has been applied to various issues in natural science, including hydrology. These application results show its high applicability. There are some studies that performed rainfall-runoff modeling by means of a deep learning method, LSTM (Long Short-Term Memory). LSTM is a kind of RNN (Recurrent Neural Networks) that is suitable for modeling time series data with long-term dependence. These studies showed the capability of LSTM for rainfall-runoff modeling. However, there are few studies that investigate the effects of input variables on the estimation accuracy. Therefore, this study, investigated the effects of the selection of input variables on the accuracy of a rainfall-runoff model by means of LSTM. As the study watershed, this study selected a snow-dominated watershed, the Ishikari River basin, which is in the Hokkaido region of Japan. The flow discharge was obtained at a gauging station near the outlet of the river as the target data. For the input data to the model, Meteorological variables were obtained from an atmospheric reanalysis dataset, ERA5, in addition to the gridded precipitation dataset. The selected meteorological variables were air temperature, evaporation, longwave radiation, shortwave radiation, and mean sea level pressure. Then, the rainfall-runoff model was trained with several combinations of the input variables. After the training, the model accuracy was compared among the combinations. The use of meteorological variables in addition to precipitation and air temperature as input improved the model accuracy. In some cases, however, the model accuracy was worsened by using more variables as input. The results indicate the importance to select adequate variables as input for rainfall-runoff modeling by LSTM.</p>

2020 ◽  
Author(s):  
Frederik Kratzert ◽  
Daniel Klotz ◽  
Sepp Hochreiter ◽  
Grey S. Nearing

Abstract. A deep learning rainfall-runoff model can take multiple meteorological forcing products as inputs and learn to combine them in spatially and temporally dynamic ways. This is demonstrated using Long Short Term Memory networks (LSTMs) trained over basins in the continental US using the CAMELS data set. Using multiple precipitation products (NLDAS, Maurer, DayMet) in a single LSTM significantly improved simulation accuracy relative to using only individual precipitation products. A sensitivity analysis showed that the LSTM learned to utilize different precipitation products in different ways in different basins and for simulating different parts of the hydrograph in individual basins.


Author(s):  
Vahid Nourani ◽  
Masoud Mehrvand ◽  
Aida Hosseini Baghanam

In this study the performance of ANN with feed-forward neural network (FFNN) algorithm evaluated rainfall-runoff modeling in five gauging stations in Florida State. In addition, for investigating the performance of ANN in multi-station discharge prediction, self-organizing map (SOM) clustering tool employed in order to cluster the input data with similar patterns, due to the large amount of records in multiple stations. The main aim of study is to investigate capability and accuracy of ANN based methods in multi-station discharge prediction. In order to consider multiple stations effect on watershed outlet discharge, different combinations for precipitation and discharge data of all stations with antecedent values over the watershed have been taken into account. In this way, application of the representatives from each cluster led to significantly reduction in the numbers of the input variables so that the optimal ANN structure could be proposed. Therefore, ANN as a data-driven model was trained to predict daily runoff for the Peace River basin via recorded values from July 1995 to July 2011. Three scenarios conducted the aim of research; first scenario was an integrated ANN model trained by the data of rainfall and runoff at multiple stations. The second scenario was a sequential ANN model processed with upstream discharge records in addition to rainfall data as inputs and downstream discharge values as target. Finally, third scenario was a SOM-ANN model, in which rainfall and runoff data were clustered according the homogeneity of data via (SOM). The center of each cluster as the dominant component of each cluster was imposed to ANN in order to present an optimal rainfall-runoff model over the watershed. In all scenarios, different data sets at various time lags in both rainfall and stream flow data were applied as inputs in ANN-based model to predict stream flow. Results show that ANN model coupled with SOM is useful tools for forecasting multi-station discharge and precipitation event response in the watershed. Furthermore, the comparison of scenarios leads to select the most efficient and optimal inputs to ANN which subsequently, presents the optimal multi-station rainfall-runoff model over the watershed.


Author(s):  
Pavan Kumar Yeditha ◽  
Maheswaran Rathinasamy ◽  
Sai Sumanth Neelamsetty ◽  
Biswa Bhattacharya ◽  
Ankit Agarwal

Abstract Rainfall–runoff models are valuable tools for flood forecasting, management of water resources, and drought warning. With the advancement in space technology, a plethora of satellite precipitation products (SPPs) are available publicly. However, the application of the satellite data for the data-driven rainfall–runoff model is emerging and requires careful investigation. In this work, two satellite rainfall data sets, namely Global Precipitation Measurement-Integrated Multi-Satellite Retrieval Product V6 (GPM-IMERG) and Climate Hazards Group Infrared Precipitation with Station (CHIRPS), are evaluated for the development of rainfall–runoff models and the prediction of 1-day ahead streamflow. The accuracy of the data from the SPPs is compared to the India Meteorological Department (IMD)-gridded precipitation data set. Detection metrics showed that for light rainfall (1–10 mm), the probability of detection (POD) value ranges between 0.67 and 0.75 and with an increasing rainfall range, i.e., medium and heavy rainfall (10–50 mm and >50 mm), the POD values ranged from 0.24 to 0.45. These results indicate that the satellite precipitation performs satisfactorily with reference to the IMD-gridded data set. Using the daily precipitation data of nearly two decades (2000–2018) over two river basins in India's Eastern part, artificial neural network, extreme learning machine (ELM), and long short-time memory (LSTM) models are developed for rainfall–runoff modelling. One-day ahead runoff prediction using the developed rainfall–runoff modelling confirmed that both the SPPs are sufficient to drive the rainfall–runoff models with a reasonable accuracy estimated using the Nash–Sutcliffe Efficiency coefficient, correlation coefficient, and the root-mean-squared error. In particular, the 1-day streamflow forecasts for the Vamsadhara river basin (VRB) using LSTM with GPM-IMERG inputs resulted in NSC values of 0.68 and 0.67, while ELM models for Mahanadhi river basin (MRB) with the same input resulted in NSC values of 0.86 and 0.87, respectively, during training and validation stages. At the same time, the LSTM model with CHIRPS inputs for the VRB resulted in NSC values of 0.68 and 0.65, and the ELM model with CHIRPS inputs for the MRB resulted in NSC values of 0.89 and 0.88, respectively, in training and validation stages. These results indicated that both the SPPs could reliably be used with LSTM and ELM models for rainfall–runoff modelling and streamflow prediction. This paper highlights that deep learning models, such as ELM and LSTM, with the GPM-IMERG products can lead to a new horizon to provide flood forecasting in flood-prone catchments.


2022 ◽  
Author(s):  
Zhongrun Xiang ◽  
Ibrahim Demir

Recent studies using latest deep learning algorithms such as LSTM (Long Short-Term Memory) have shown great promise in time-series modeling. There are many studies focusing on the watershed-scale rainfall-runoff modeling or streamflow forecasting, often considering a single watershed with limited generalization capabilities. To improve the model performance, several studies explored an integrated approach by decomposing a large watershed into multiple sub-watersheds with semi-distributed structure. In this study, we propose an innovative physics-informed fully-distributed rainfall-runoff model, NRM-Graph (Neural Runoff Model-Graph), using Graph Neural Networks (GNN) to make full use of spatial information including the flow direction and geographic data. Specifically, we applied a time-series model on each grid cell for its runoff production. The output of each grid cell is then aggregated by a GNN as the final runoff at the watershed outlet. The case study shows that our GNN based model successfully represents the spatial information in predictions. NRM-Graph network has shown less over-fitting and a significant improvement on the model performance compared to the baselines with spatial information. Our research further confirms the importance of spatially distributed hydrological information in rainfall-runoff modeling using deep learning, and we encourage researchers to incorporate more domain knowledge in modeling.


2021 ◽  
Author(s):  
Jamie Lee Stevenson ◽  
Christian Birkel ◽  
Aaron J. Neill ◽  
Doerthe Tetzlaff ◽  
Chris Soulsby

Water ◽  
2021 ◽  
Vol 13 (9) ◽  
pp. 1226
Author(s):  
Pakorn Ditthakit ◽  
Sirimon Pinthong ◽  
Nureehan Salaeh ◽  
Fadilah Binnui ◽  
Laksanara Khwanchum ◽  
...  

Accurate monthly runoff estimation is crucial in water resources management, planning, and development, preventing and reducing water-related problems, such as flooding and droughts. This article evaluates the monthly hydrological rainfall-runoff model’s performance, the GR2M model, in Thailand’s southern basins. The GR2M model requires only two parameters: production store (X1) and groundwater exchange rate (X2). Moreover, no prior research has been reported on its application in this region. The 37 runoff stations, which are located in three sub-watersheds of Thailand’s southern region, namely; Thale Sap Songkhla, Peninsular-East Coast, and Peninsular-West Coast, were selected as study cases. The available monthly hydrological data of runoff, rainfall, air temperature from the Royal Irrigation Department (RID) and the Thai Meteorological Department (TMD) were collected and analyzed. The Thornthwaite method was utilized for the determination of evapotranspiration. The model’s performance was conducted using three statistical indices: Nash–Sutcliffe Efficiency (NSE), Correlation Coefficient (r), and Overall Index (OI). The model’s calibration results for 37 runoff stations gave the average NSE, r, and OI of 0.657, 0.825, and 0.757, respectively. Moreover, the NSE, r, and OI values for the model’s verification were 0.472, 0.750, and 0.639, respectively. Hence, the GR2M model was qualified and reliable to apply for determining monthly runoff variation in this region. The spatial distribution of production store (X1) and groundwater exchange rate (X2) values was conducted using the IDW method. It was susceptible to the X1, and X2 values of approximately more than 0.90, gave the higher model’s performance.


Sign in / Sign up

Export Citation Format

Share Document