Applying machine learning for drought prediction using a large ensemble of climate simulations
<p>There is strong scientific and social interest to understand the factors leading to extreme events in order to improve the management of risks associated with hazards like droughts. Recent events like the summer 2018 drought in Germany already had severe und unexpected impacts, e.g. forest fires and crop failures; in order to increase preparedness robust prediction tools are &#160;urgently required. In this study, machine learning methods are applied to predict the occurrence of a drought with lead times of one to three months. The approach takes into account a list of thirty atmospheric and soil variables<strong> </strong>as predictor input parameters from a single regional climate model initial condition large ensemble (CRCM5-LE). The data was produced the context of the ClimEx project by Ouranos with the Canadian Regional Climate Model (CRCM5) driven by 50 members of the Canadian Earth System Model (CanESM2) for the Bavarian and Quebec domains.</p><p>Drought occurrence was defined using the Standardized Precipitation Index. The training and test datasets were chosen from the current climatology (1955-2005) for the Munich and Lisbon subdomain within the CRCM5-LE. The best performing machine learning algorithms managed to obtain a correct classification of drought or no drought for a lead time of one month for around 60 % of the events of each class for the both domains. Explainable AI methods like feature importance and shapley values were applied to gain a better understanding of the trained algorithms. Physical variables like the North Atlantic Oscillation Index and air pressure one month before the event proved to be of high importance for the prediction. The study showed that better accuracies can be obtained for the Lisbon domain, due to the stronger influence of the North Atlantic Oscillation Index on Portugal&#8217;s climate.</p>