A validation scheme for homogenization techniques on a Swedish temperature network using artificial inhomogeneities (1950-2005)

Author(s):  
Giulio Nils Caroletti ◽  
Tommaso Caloiero ◽  
Magnus Joelsson ◽  
Roberto Coscarelli

<p>Homogenization techniques and missing value reconstruction have grown in importance in climatology given their relevance in establishing coherent data records over which climate signals can be correctly attributed, discarding apparent changes depending on instrument inhomogeneities, e.g., change in instrumentation, location, time of measurement.</p><p>However, it is not generally possible to assess homogenized results directly, as true data values are not known. Thus, to validate homogenization techniques, artificially inhomogeneous datasets, also called benchmark datasets, are created from known homogeneous datasets. Results from their homogenization can be assessed and used to rank, evaluate and/or validate techniques used.</p><p>Considering temperature data, the aims of this work are: i) to determine which metrics (bias, absolute error, factor of exceedance, root mean squared error, and Pearson’s correlation coefficient) can be meaningfully used to validate the best-performing homogenization technique in a region; ii) to evaluate through a Pearson correlation analysis if homogenization techniques’ performance depends on physical features of a station (i.e., latitude, altitude and distance from the sea) or on the nature of the inhomogeneities (i.e., the number of break points and missing data).</p><p>With this aims, a southern Sweden temperature database with homogeneous, maximum and minimum temperature data from 100 ground stations over the period 1950-2005 has been used. Starting from these data, inhomogeneous datasets were created introducing up to 7 artificial breaks for each ground station and an average of 107 missing data. Then, 3 homogenization techniques were applied, ACMANT (Adapted Caussinus-Mestre Algorithm for Networks of Temperature series), and two versions of HOMER (HOMogenization software in R): the standard, automated setup mode (Standard-HOMER) and a manual setup developed and performed at the Swedish Meteorological and Hydrological Institute (SMHI-HOMER).</p><p>Results showed that root mean square error, absolute bias and factor of exceedance were the most useful metrics to evaluate improvements in the homogenized datasets: for instance, RMSE for both variables was reduced from an average of 0.71-0.89K (corrupted dataset) to 0.50-0.60K (Standard-HOMER), 0.51-0.61K (SMHI-HOMER) and 0.46-0.53K (ACMANT), respectively.</p><p>Globally, HOMER performed better regarding the factor of exceedance, while ACMANT outperformed it with regard to root mean square error and absolute error. Regardless of the technique used, the homogenization quality anti-correlated meaningfully to the number of breaks. Missing data did not seem to have an impact on HOMER, while it negatively affected ACMANT, because this method does not fill-in missing data in the same drastic way.</p><p>In general, the nature of the datasets had a more important role in yielding good homogenization results than associated physical parameters: only for minimum temperature, distance from the sea and altitude showed a weak but significant correlation with the factor of exceedance and the root mean square error.</p><p>This study has been performed within the INDECIS Project, that is part of ERA4CS, an ERA-NET initiated by JPI Climate, and funded by FORMAS (SE), DLR (DE), BMWFW (AT), IFD (DK), MINECO (ES), ANR (FR) with co-funding by the European Union (Grant 690462).</p>


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Roberto Coscarelli ◽  
Giulio Nils Caroletti ◽  
Magnus Joelsson ◽  
Erik Engström ◽  
Tommaso Caloiero

AbstractIn order to correctly detect climate signals and discard possible instrumentation errors, establishing coherent data records has become increasingly relevant. However, since real measurements can be inhomogeneous, their use for assessing homogenization techniques is not directly possible, and the study of their performance must be done on homogeneous datasets subjected to controlled, artificial inhomogeneities. In this paper, considering two European temperature networks over the 1950–2005 period, up to 7 artificial breaks and an average of 107 missing data per station were introduced, in order to determine that mean square error, absolute bias and factor of exceedance can be meaningfully used to validate the best-performing homogenization technique. Three techniques were used, ACMANT and two versions of HOMER: the standard, automated setup mode and a manual setup. Results showed that the HOMER techniques performed better regarding the factor of exceedance, while ACMANT was best with regard to absolute error and root mean square error. Regardless of the technique used, it was also established that homogenization quality anti-correlated meaningfully to the number of breaks. On the other hand, as missing data are almost always replaced in the two HOMER techniques, only ACMANT performance is significantly, negatively affected by the amount of missing data.



2021 ◽  
Vol 13 (9) ◽  
pp. 1630
Author(s):  
Yaohui Zhu ◽  
Guijun Yang ◽  
Hao Yang ◽  
Fa Zhao ◽  
Shaoyu Han ◽  
...  

With the increase in the frequency of extreme weather events in recent years, apple growing areas in the Loess Plateau frequently encounter frost during flowering. Accurately assessing the frost loss in orchards during the flowering period is of great significance for optimizing disaster prevention measures, market apple price regulation, agricultural insurance, and government subsidy programs. The previous research on orchard frost disasters is mainly focused on early risk warning. Therefore, to effectively quantify orchard frost loss, this paper proposes a frost loss assessment model constructed using meteorological and remote sensing information and applies this model to the regional-scale assessment of orchard fruit loss after frost. As an example, this article examines a frost event that occurred during the apple flowering period in Luochuan County, Northwestern China, on 17 April 2020. A multivariable linear regression (MLR) model was constructed based on the orchard planting years, the number of flowering days, and the chill accumulation before frost, as well as the minimum temperature and daily temperature difference on the day of frost. Then, the model simulation accuracy was verified using the leave-one-out cross-validation (LOOCV) method, and the coefficient of determination (R2), the root mean square error (RMSE), and the normalized root mean square error (NRMSE) were 0.69, 18.76%, and 18.76%, respectively. Additionally, the extended Fourier amplitude sensitivity test (EFAST) method was used for the sensitivity analysis of the model parameters. The results show that the simulated apple orchard fruit number reduction ratio is highly sensitive to the minimum temperature on the day of frost, and the chill accumulation and planting years before the frost, with sensitivity values of ≥0.74, ≥0.25, and ≥0.15, respectively. This research can not only assist governments in optimizing traditional orchard frost prevention measures and market price regulation but can also provide a reference for agricultural insurance companies to formulate plans for compensation after frost.



2020 ◽  
Vol 30 (4) ◽  
pp. 249-257
Author(s):  
Reid J. Reale ◽  
Timothy J. Roberts ◽  
Khalil A. Lee ◽  
Justina L. Bonsignore ◽  
Melissa L. Anderson

We sought to assess the accuracy of current or developing new prediction equations for resting metabolic rate (RMR) in adolescent athletes. RMR was assessed via indirect calorimetry, alongside known predictors (body composition via dual-energy X-ray absorptiometry, height, age, and sex) and hypothesized predictors (race and maturation status assessed via years to peak height velocity), in a diverse cohort of adolescent athletes (n = 126, 77% male, body mass = 72.8 ± 16.6 kg, height = 176.2 ± 10.5 cm, age = 16.5 ± 1.4 years). Predictive equations were produced and cross-validated using repeated k-fold cross-validation by stepwise multiple linear regression (10 folds, 100 repeats). Performance of the developed equations was compared with several published equations. Seven of the eight published equations examined performed poorly, underestimating RMR in >75% to >90% of cases. Root mean square error of the six equations ranged from 176 to 373, mean absolute error ranged from 115 to 373 kcal, and mean absolute error SD ranged from 103 to 185 kcal. Only the Schofield equation performed reasonably well, underestimating RMR in 51% of cases. A one- and two-compartment model were developed, both r2 of .83, root mean square error of 147, and mean absolute error of 114 ± 26 and 117 ± 25 kcal for the one- and two-compartment model, respectively. Based on the models’ performance, as well as visual inspection of residual plots, the following model predicts RMR in adolescent athletes with better precision than previous models; RMR = 11.1 × body mass (kg) + 8.4 × height (cm) − (340 male or 537 female).



2020 ◽  
Vol 12 (3) ◽  
pp. 356 ◽  
Author(s):  
Hui Qiu ◽  
Shuanggen Jin

Mean sea surface height (MSSH) is an important parameter, which plays an important role in the analysis of the geoid gap and the prediction of ocean dynamics. Traditional measurement methods, such as the buoy and ship survey, have a small cover area, sparse data, and high cost. Recently, the Global Navigation Satellite System-Reflectometry (GNSS-R) and the spaceborne Cyclone Global Navigation Satellite System (CYGNSS) mission, which were launched on 15 December 2016, have provided a new opportunity to estimate MSSH with all-weather, global coverage, high spatial-temporal resolution, rich signal sources, and strong concealability. In this paper, the global MSSH was estimated by using the relationship between the waveform characteristics of the delay waveform (DM) obtained by the delay Doppler map (DDM) of CYGNSS data, which was validated by satellite altimetry. Compared with the altimetry CNES_CLS2015 product provided by AVISO, the mean absolute error was 1.33 m, the root mean square error was 2.26 m, and the correlation coefficient was 0.97. Compared with the sea surface height model DTU10, the mean absolute error was 1.20 m, the root mean square error was 2.15 m, and the correlation coefficient was 0.97. Furthermore, the sea surface height obtained from CYGNSS was consistent with Jason-2′s results by the average absolute error of 2.63 m, a root mean square error ( RMSE ) of 3.56 m and, a correlation coefficient ( R ) of 0.95.



2014 ◽  
Vol 7 (3) ◽  
pp. 1247-1250 ◽  
Author(s):  
T. Chai ◽  
R. R. Draxler

Abstract. Both the root mean square error (RMSE) and the mean absolute error (MAE) are regularly employed in model evaluation studies. Willmott and Matsuura (2005) have suggested that the RMSE is not a good indicator of average model performance and might be a misleading indicator of average error, and thus the MAE would be a better metric for that purpose. While some concerns over using RMSE raised by Willmott and Matsuura (2005) and Willmott et al. (2009) are valid, the proposed avoidance of RMSE in favor of MAE is not the solution. Citing the aforementioned papers, many researchers chose MAE over RMSE to present their model evaluation statistics when presenting or adding the RMSE measures could be more beneficial. In this technical note, we demonstrate that the RMSE is not ambiguous in its meaning, contrary to what was claimed by Willmott et al. (2009). The RMSE is more appropriate to represent model performance than the MAE when the error distribution is expected to be Gaussian. In addition, we show that the RMSE satisfies the triangle inequality requirement for a distance metric, whereas Willmott et al. (2009) indicated that the sums-of-squares-based statistics do not satisfy this rule. In the end, we discussed some circumstances where using the RMSE will be more beneficial. However, we do not contend that the RMSE is superior over the MAE. Instead, a combination of metrics, including but certainly not limited to RMSEs and MAEs, are often required to assess model performance.



2014 ◽  
Vol 7 (1) ◽  
pp. 1525-1534 ◽  
Author(s):  
T. Chai ◽  
R. R. Draxler

Abstract. Both the root mean square error (RMSE) and the mean absolute error (MAE) are regularly employed in model evaluation studies. Willmott and Matsuura (2005) have suggested that the RMSE is not a good indicator of average model performance and might be a misleading indicator of average error and thus the MAE would be a better metric for that purpose. Their paper has been widely cited and may have influenced many researchers in choosing MAE when presenting their model evaluation statistics. However, we contend that the proposed avoidance of RMSE and the use of MAE is not the solution to the problem. In this technical note, we demonstrate that the RMSE is not ambiguous in its meaning, contrary to what was claimed by Willmott et al. (2009). The RMSE is more appropriate to represent model performance than the MAE when the error distribution is expected to be Gaussian. In addition, we show that the RMSE satisfies the triangle inequality requirement for a distance metric.



2015 ◽  
Vol 143 (5) ◽  
pp. 1506-1516 ◽  
Author(s):  
Paul J. Roebber

Abstract Evolutionary program ensembles are developed and tested for minimum temperature forecasts at Chicago, Illinois, at forecast ranges of 36, 60, 84, 108, 132, and 156 h. For all forecast ranges examined, the evolutionary program ensemble outperforms the 21-member GFS model output statistics (MOS) ensemble when considering root-mean-square error and Brier skill score. The relative advantage in root-mean-square error widens with forecast range, from 0.18°F at 36 h to 1.53°F at 156 h while the probabilistic skill remains positive throughout. At all forecast ranges, probabilistic forecasts of abnormal conditions are particularly skillful compared to the raw GFS guidance. The evolutionary program reliance on particular forecast inputs is distinct from that obtained from considering multiple linear regression models, with less reliance on the GFS MOS temperature and more on alternative data such as upstream temperatures at the time of forecast issuance, time of year, and forecasts of wind speed, precipitation, and cloud cover. This weighting trends away from current observations and toward seasonal (climatological) measures as forecast range increases. Using two different forms of ensemble member subselection, a Bayesian model combination calibration is tested on both ensembles. This calibration had limited effect on evolutionary program ensemble skill but was able to improve MOS ensemble performance, reducing but not eliminating the skill gap between them. The largest skill differentials occurred at the longest forecast ranges, beginning at 132 h. A hybrid, calibrated ensemble was able to provide some further increase in skill.



2021 ◽  
Vol 12 (1) ◽  
pp. 95-104
Author(s):  
Firəngiz Sadıyeva ◽  

Məqalədə COVID-19 pandemiyasını proqnozlaşdırmaq üçün avtoreqressiv inteqrasiya edilmiş hərəkətli ortalama (ing. ARIMA. Autoregressive İntegrated Moving Average) modeli təklif edilmişdir. COVID-19 dünyada sürətlə yayılan və hazırda davam edən yeni növ pandemiyadır. Son dövrlərdə pandemiyaya yoluxanların sayı Azərbaycanda rekord həddə çatmışdır. Məhz bu səbəbdən COVID-19 pandemiyasının proqnozu məsələsinə baxılmışdır və bir neçə ayı əhatə edən real verilənlərlə eksperimentlərdə təklif edilmiş ARIMA modelinin COVID-19 zaman sıralarının proqnozlaşdırılması üçün müxtəlif parametrlərlə tətbiq edilmişdir. Verilənlər dedikdə, 22.01.2020 – 22.10.2020 tarixləri arasında Azərbaycan Respublikasının Səhiyyə Nazirliyi (www.sehiyye.gov.az) tərəfindən rəsmi olaraq qeydiyyata alınan gündəlik yoluxma hallarının sayı nəzərdə tutulur. Bu verilənlərdən istifadə etməklə, növbəti zaman aralığında ölkəmizdə baş verəcək yoluxma halları proqnoz edilmişdir. Bunun üçün ARIMA modelinə müxtəlif parametrlər verilmiş və uyğun olaraq hər bir modelin səhv dərəcəsi qiymətləndirilmişdir. Səhvin qiymətləndirilməsi üçün MAPE (Mean Absolute Persentace Error), MAE (Mean Absolute Error) və RMSE (Root Mean Square Error) funksiyaları istifadə edilib. Müqayisələr nəticəsində ən uyğun model seçilmişdir. Alınmış nəticələr ölkəmizdə pandemiya dövründə həm səhiyyə sistemi, həm də adi vətəndaşlar üçün vacib amildir. Əldə edilmiş nəticələr statistik metodların koronavirusa aid qeyri-stasionar zaman sıralarının proqnozlaşdırılmasının digər məsələlərə tətbiqində də məhsuldar ola biləcəyini təsdiqləyir.



2021 ◽  
Vol 20 (2) ◽  
pp. 113-119
Author(s):  
Khaled Ferkous ◽  
Farouk Chellali ◽  
Abdalah Kouzou ◽  
Belgacem Bekkar

Several methods have been used to predict daily solar radiation in recent years, such as artificial intelligence and hybrid models. In this paper, a Wavelet coupled Gaussian Process Regression (W-GPR) model was proposed to predict the daily solar radiation received on a horizontal surface in Ghardaia (Algeria). A statistical period of four years (2013 -2016) was used where the first three years (2013-2015) are used to train model and the last year (2016) to test the model for predicting daily total solar radiation. Different types of wave mother and different combinations of input data were evaluated based on the minimum air temperature, relative humidity and extraterrestrial solar radiation on a horizontal surface. The results demonstrated the effectiveness of the new hybrid model W-GPR compared to the classical GPR model in terms of Root Mean Square Error (RMSE), relative Root Mean Square Error (rRMSE), Mean Absolute Error (MAE) and determination coefficient (R2).



Sign in / Sign up

Export Citation Format

Share Document