scholarly journals Density-based weighting for imbalanced regression

2021 ◽  
Author(s):  
Michael Steininger ◽  
Konstantin Kobs ◽  
Padraig Davidson ◽  
Anna Krause ◽  
Andreas Hotho

AbstractIn many real world settings, imbalanced data impedes model performance of learning algorithms, like neural networks, mostly for rare cases. This is especially problematic for tasks focusing on these rare occurrences. For example, when estimating precipitation, extreme rainfall events are scarce but important considering their potential consequences. While there are numerous well studied solutions for classification settings, most of them cannot be applied to regression easily. Of the few solutions for regression tasks, barely any have explored cost-sensitive learning which is known to have advantages compared to sampling-based methods in classification tasks. In this work, we propose a sample weighting approach for imbalanced regression datasets called DenseWeight and a cost-sensitive learning approach for neural network regression with imbalanced data called DenseLoss based on our weighting scheme. DenseWeight weights data points according to their target value rarities through kernel density estimation (KDE). DenseLoss adjusts each data point’s influence on the loss according to DenseWeight, giving rare data points more influence on model training compared to common data points. We show on multiple differently distributed datasets that DenseLoss significantly improves model performance for rare data points through its density-based weighting scheme. Additionally, we compare DenseLoss to the state-of-the-art method SMOGN, finding that our method mostly yields better performance. Our approach provides more control over model training as it enables us to actively decide on the trade-off between focusing on common or rare cases through a single hyperparameter, allowing the training of better models for rare data points.

2020 ◽  
Author(s):  
Corrado Camera ◽  
Adriana Bruggeman ◽  
George Zittis ◽  
Ioannis Sofokleous ◽  
Joël Arnault

Abstract. Few studies evaluate the hydrologic performance of coupled atmospheric-hydrologic models when forced with observed rainfall and even fewer when forced with modelled precipitation. This information is crucial for the study of floods and in general for the use of the models for water management purposes. This study's objectives were: (i) to calibrate the one-way coupled WRF-hydro model for simulating extreme events in Cyprus with observed precipitation; and (ii) to evaluate the model performance when forced with WRF-downscaled (1 × 1 km2) re-analysis precipitation data (ERA-Interim). Streamflow was modelled during extreme rainfall events that occurred in January 1989 and November 1994 over 22 mountain watersheds. In six watersheds, Nash-Sutcliffe Efficiencies (NSE) larger than 0.5 were obtained for both events. The WRF-modelled rainfall showed an average NSE of 0.83 for January 1989 and 0.49 for November 1994. Nevertheless, hydrologic simulations of the two events with the WRF-modelled rainfall and the calibrated WRF-Hydro returned negative streamflow NSE for 13 watersheds in January 1989 and for 18 watersheds in November 1994. These results indicate that small differences in amounts or shifts in time or space of modelled rainfall, in comparison with observed precipitation, can strongly modify the hydrologic response of small watersheds to extreme events. Thus, the calibration of WRF-Hydro for small watersheds depends on the availability of observed rainfall with high temporal and spatial resolution. However, the use of modelled precipitation input data will remain important for studying the effect of future extremes on flooding and water resources.


2019 ◽  
Vol 1 (1) ◽  
pp. 33
Author(s):  
M Welly

Many people in Indonesia calculate design rainfall before calculating the design flooddischarge. The design rainfall with a certain return period will eventually be convertedinto a design flood discharge by combining it with the characteristics of the watershed.However, the lack of a network of rainfall recording stations makes many areas that arenot hydrologically measured (ungauged basin), so it is quite difficult to know thecharacteristics of rain in the area concerned. This study aims to analyze thecharacteristics of design rainfall in Lampung Province. The focus of the analysis is toinvestigate whether geographical factors influence the design rainfall that occurs in theparticular area. The data used in this study is daily rainfall data from 15 rainfallrecording stations spread in Lampung Province. The method of frequency analysis usedin this study is the Gumbel method. The research shows that the geographical location ofan area does not have significant effect on extreme rainfall events. The effect of risingearth temperatures due to natural exploitation by humans tends to be stronger as a causeof extreme events such as extreme rainfall.Keywords: Influence, geographical, factors, extreme, rainfall.


2013 ◽  
Vol 31 (3) ◽  
pp. 413 ◽  
Author(s):  
André Becker Nunes ◽  
Gilson Carlos Da Silva

ABSTRACT. The eastern region of Santa Catarina State (Brazil) has an important history of natural disasters due to extreme rainfall events. Floods and landslides are enhancedby local features such as orography and urbanization: the replacement of natural surface coverage causing more surface runoff and, hence, flooding. Thus, studies of this type of events – which directly influence life in the towns – take on increasing importance. This work makes a quantitative analysis of occurrences of extreme rainfall events in the eastern and northern regions of Santa Catarina State in the last 60 years, through individual analysis, considering the history of floods ineach selected town, as well as an estimate through to the end of century following regional climate modeling. A positive linear trend, in most of the towns studied, was observed in the results, indicating greater frequency of these events in recent decades, and the HadRM3P climate model shows a heterogeneous increase of events for all towns in the period from 2071 to 2100.Keywords: floods, climate modeling, linear trend. RESUMO. A região leste do Estado de Santa Catarina tem um importante histórico de desastres naturais ocasionados por eventos extremos de precipitação. Inundações e deslizamentos de terra são potencializados pelo relevo acidentado e pela urbanização das cidades da região: a vegetação nativa vem sendo removida acarretando um maior escoamento superficial e, consequentemente, em inundações. Desta forma, torna-se de suma importância os estudos acerca deste tipo de evento que influencia diretamente a sociedade em geral. Neste trabalho é realizada uma análise quantitativa do número de eventos severos de precipitação ocorridos nas regiões leste e norte de Santa Catarina dos últimos 60 anos, por meio de uma análise pontual, considerandoo histórico de inundações de cada cidade selecionada, além de uma projeção para o fim do século de acordo com modelagem climática regional. Na análise dos resultados observou-se uma tendência linear positiva na maioria das cidades, indicando uma maior frequência deste tipo de evento nas últimas décadas, e o modelo climático HadRM3P mostra um aumento heterogêneo no número de eventos para todas as cidades no período de 2071 a 2100.Palavras-chave: inundações, modelagem climática, tendência linear.


2021 ◽  
Vol 25 (1) ◽  
Author(s):  
Arturo Ruiz-Luna ◽  
Claudia Martínez-Peralta ◽  
Patricia P. B. Eichler ◽  
Leonardo R. Teixeira ◽  
Montserrat Acosta-Morel ◽  
...  

2021 ◽  
Author(s):  
Anil Deo ◽  
Savin S. Chand ◽  
Hamish Ramsay ◽  
Neil J. Holbrook ◽  
Simon McGree ◽  
...  

AbstractSouthwest Pacific nations are among some of the worst impacted and most vulnerable globally in terms of tropical cyclone (TC)-induced flooding and accompanying risks. This study objectively quantifies the fractional contribution of TCs to extreme rainfall (hereafter, TC contributions) in the context of climate variability and change. We show that TC contributions to extreme rainfall are substantially enhanced during active phases of the Madden–Julian Oscillation and by El Niño conditions (particularly over the eastern southwest Pacific region); this enhancement is primarily attributed to increased TC activity during these event periods. There are also indications of increasing intensities of TC-induced extreme rainfall events over the past few decades. A key part of this work involves development of sophisticated Bayesian regression models for individual island nations in order to better understand the synergistic relationships between TC-induced extreme rainfall and combinations of various climatic drivers that modulate the relationship. Such models are found to be very useful for not only assessing probabilities of TC- and non-TC induced extreme rainfall events but also evaluating probabilities of extreme rainfall for cases with different underlying climatic conditions. For example, TC-induced extreme rainfall probability over Samoa can vary from ~ 95 to ~ 75% during a La Niña period, if it coincides with an active or inactive phase of the MJO, and can be reduced to ~ 30% during a combination of El Niño period and inactive phase of the MJO. Several other such cases have been assessed for different island nations, providing information that have potentially important implications for planning and preparing for TC risks in vulnerable Pacific Island nations.


2011 ◽  
Vol 24 (7) ◽  
pp. 1913-1921 ◽  
Author(s):  
Mateus da Silva Teixeira ◽  
Prakki Satyamurty

Abstract A new approach to define heavy and extreme rainfall events based on cluster analysis and area-average rainfall series is presented. The annual frequency of the heavy and extreme rainfall events is obtained for the southeastern and southern Brazil regions. In the 1960–2004 period, 510 (98) and 466 (77) heavy (extreme) rainfall events are identified in the two regions. Monthly distributions of the events closely follow the monthly climatological rainfall in the two regions. In both regions, annual heavy and extreme rainfall event frequencies present increasing trends in the 45-yr period. However, only in southern Brazil is the trend statistically significant. Although longer time series are necessary to ensure the existence of long-term trends, the positive trends are somewhat alarming since they indicate that climate changes, in terms of rainfall regimes, are possibly under way in Brazil.


Weather ◽  
2006 ◽  
Vol 61 (7) ◽  
pp. 211-211
Author(s):  
Nick Baker

2016 ◽  
Author(s):  
Pietro Sabatino ◽  
Giuseppe Fedele ◽  
Antonio Procopio ◽  
Francesco Chiaravalloti ◽  
Salvatore Gabriele

2008 ◽  
Vol 14 (7) ◽  
pp. 1600-1608 ◽  
Author(s):  
PHILIP A. FAY ◽  
DAWN M. KAUFMAN ◽  
JESSE B. NIPPERT ◽  
JONATHAN D. CARLISLE ◽  
CHRISTOPHER W. HARPER

Sign in / Sign up

Export Citation Format

Share Document