scholarly journals Wavelet-Gaussian Process Regression Model for Regression Daily Solar Radiation in Ghardaia, Algeria

2021 ◽  
Vol 20 (2) ◽  
pp. 113-119
Author(s):  
Khaled Ferkous ◽  
Farouk Chellali ◽  
Abdalah Kouzou ◽  
Belgacem Bekkar

Several methods have been used to predict daily solar radiation in recent years, such as artificial intelligence and hybrid models. In this paper, a Wavelet coupled Gaussian Process Regression (W-GPR) model was proposed to predict the daily solar radiation received on a horizontal surface in Ghardaia (Algeria). A statistical period of four years (2013 -2016) was used where the first three years (2013-2015) are used to train model and the last year (2016) to test the model for predicting daily total solar radiation. Different types of wave mother and different combinations of input data were evaluated based on the minimum air temperature, relative humidity and extraterrestrial solar radiation on a horizontal surface. The results demonstrated the effectiveness of the new hybrid model W-GPR compared to the classical GPR model in terms of Root Mean Square Error (RMSE), relative Root Mean Square Error (rRMSE), Mean Absolute Error (MAE) and determination coefficient (R2).


Clean Energy ◽  
2021 ◽  
Vol 5 (2) ◽  
pp. 316-328
Author(s):  
Khaled Ferkous ◽  
Farouk Chellali ◽  
Abdalah Kouzou ◽  
Belgacem Bekkar

Abstract Forecasting solar radiation is fundamental to several domains related to renewable energy where several methods have been used to predict daily solar radiation, such as artificial intelligence and hybrid models. Recently, the Gaussian process regression (GPR) algorithm has been used successfully in remote sensing and Earth sciences. In this paper, a wavelet-coupled Gaussian process regression (W–GPR) model was proposed to predict the daily solar radiation received on a horizontal surface in Ghardaia (Algeria). For this purpose, 3 years of data (2013–15) have been used in model training while the data of 2016 were used to validate the model. In this work, different types of mother wavelets and different combinations of input data were evaluated based on the minimum air temperature, relative humidity and extraterrestrial solar radiation on a horizontal surface. The results demonstrated the effectiveness of the new hybrid W–GPR model compared with the classical GPR model in terms of root mean square error (RMSE), relative root mean square error (rRMSE), mean absolute error (MAE) and determination coefficient (R2).



Author(s):  
Derek Nevins ◽  
Lloyd Smith ◽  
Jeff Kensrud

In baseball and softball, batted-ball velocities depend strongly on bat speed prior to contact. The purpose of this study was to characterize the sensitivity of hit. ball speed to swing speed models. A pitching machine was used to project balls at varying speeds considered to be age appropriate for female fastpitch softball batters ranging from 9 to 24 years of age. Participants swung bats of varying moment of inertia and high-speed video was used to measure bat and ball kinematics for each hit. Several power law swing speed models were compared, including a generic model for all participants, age-specific models and participant-specific models. In addition, an exponential Gaussian process regression model was used. Estimates of batted-ball speeds were obtained by coupling the swing speed models to a model of the bat–ball collision. The root mean square error of model estimates ranged from 2.3 to 5.0 rad/s for swing speed models and 3.6 to 8.5 m/s for batted-ball speed estimates. The Gaussian process regression model performance was similar to the participant-specific models (swing speed root mean square error = 2.4 rad/s; batted-ball speed root mean square error = 3.7 m/s). The correlation of the Gaussian process regression model with field measurements was surprising given the limited number of inputs, which warrants further investigation. Agreement between all power law models and field measurements improved when the bat moment of inertia about its instantaneous center of rotation (rather than about the knob) was considered. This information will be of use to players and coaches to improve batting performance and bat selection.



2020 ◽  
Vol 30 (4) ◽  
pp. 249-257
Author(s):  
Reid J. Reale ◽  
Timothy J. Roberts ◽  
Khalil A. Lee ◽  
Justina L. Bonsignore ◽  
Melissa L. Anderson

We sought to assess the accuracy of current or developing new prediction equations for resting metabolic rate (RMR) in adolescent athletes. RMR was assessed via indirect calorimetry, alongside known predictors (body composition via dual-energy X-ray absorptiometry, height, age, and sex) and hypothesized predictors (race and maturation status assessed via years to peak height velocity), in a diverse cohort of adolescent athletes (n = 126, 77% male, body mass = 72.8 ± 16.6 kg, height = 176.2 ± 10.5 cm, age = 16.5 ± 1.4 years). Predictive equations were produced and cross-validated using repeated k-fold cross-validation by stepwise multiple linear regression (10 folds, 100 repeats). Performance of the developed equations was compared with several published equations. Seven of the eight published equations examined performed poorly, underestimating RMR in >75% to >90% of cases. Root mean square error of the six equations ranged from 176 to 373, mean absolute error ranged from 115 to 373 kcal, and mean absolute error SD ranged from 103 to 185 kcal. Only the Schofield equation performed reasonably well, underestimating RMR in 51% of cases. A one- and two-compartment model were developed, both r2 of .83, root mean square error of 147, and mean absolute error of 114 ± 26 and 117 ± 25 kcal for the one- and two-compartment model, respectively. Based on the models’ performance, as well as visual inspection of residual plots, the following model predicts RMR in adolescent athletes with better precision than previous models; RMR = 11.1 × body mass (kg) + 8.4 × height (cm) − (340 male or 537 female).



2020 ◽  
Vol 12 (3) ◽  
pp. 356 ◽  
Author(s):  
Hui Qiu ◽  
Shuanggen Jin

Mean sea surface height (MSSH) is an important parameter, which plays an important role in the analysis of the geoid gap and the prediction of ocean dynamics. Traditional measurement methods, such as the buoy and ship survey, have a small cover area, sparse data, and high cost. Recently, the Global Navigation Satellite System-Reflectometry (GNSS-R) and the spaceborne Cyclone Global Navigation Satellite System (CYGNSS) mission, which were launched on 15 December 2016, have provided a new opportunity to estimate MSSH with all-weather, global coverage, high spatial-temporal resolution, rich signal sources, and strong concealability. In this paper, the global MSSH was estimated by using the relationship between the waveform characteristics of the delay waveform (DM) obtained by the delay Doppler map (DDM) of CYGNSS data, which was validated by satellite altimetry. Compared with the altimetry CNES_CLS2015 product provided by AVISO, the mean absolute error was 1.33 m, the root mean square error was 2.26 m, and the correlation coefficient was 0.97. Compared with the sea surface height model DTU10, the mean absolute error was 1.20 m, the root mean square error was 2.15 m, and the correlation coefficient was 0.97. Furthermore, the sea surface height obtained from CYGNSS was consistent with Jason-2′s results by the average absolute error of 2.63 m, a root mean square error ( RMSE ) of 3.56 m and, a correlation coefficient ( R ) of 0.95.



2014 ◽  
Vol 7 (3) ◽  
pp. 1247-1250 ◽  
Author(s):  
T. Chai ◽  
R. R. Draxler

Abstract. Both the root mean square error (RMSE) and the mean absolute error (MAE) are regularly employed in model evaluation studies. Willmott and Matsuura (2005) have suggested that the RMSE is not a good indicator of average model performance and might be a misleading indicator of average error, and thus the MAE would be a better metric for that purpose. While some concerns over using RMSE raised by Willmott and Matsuura (2005) and Willmott et al. (2009) are valid, the proposed avoidance of RMSE in favor of MAE is not the solution. Citing the aforementioned papers, many researchers chose MAE over RMSE to present their model evaluation statistics when presenting or adding the RMSE measures could be more beneficial. In this technical note, we demonstrate that the RMSE is not ambiguous in its meaning, contrary to what was claimed by Willmott et al. (2009). The RMSE is more appropriate to represent model performance than the MAE when the error distribution is expected to be Gaussian. In addition, we show that the RMSE satisfies the triangle inequality requirement for a distance metric, whereas Willmott et al. (2009) indicated that the sums-of-squares-based statistics do not satisfy this rule. In the end, we discussed some circumstances where using the RMSE will be more beneficial. However, we do not contend that the RMSE is superior over the MAE. Instead, a combination of metrics, including but certainly not limited to RMSEs and MAEs, are often required to assess model performance.



2014 ◽  
Vol 7 (1) ◽  
pp. 1525-1534 ◽  
Author(s):  
T. Chai ◽  
R. R. Draxler

Abstract. Both the root mean square error (RMSE) and the mean absolute error (MAE) are regularly employed in model evaluation studies. Willmott and Matsuura (2005) have suggested that the RMSE is not a good indicator of average model performance and might be a misleading indicator of average error and thus the MAE would be a better metric for that purpose. Their paper has been widely cited and may have influenced many researchers in choosing MAE when presenting their model evaluation statistics. However, we contend that the proposed avoidance of RMSE and the use of MAE is not the solution to the problem. In this technical note, we demonstrate that the RMSE is not ambiguous in its meaning, contrary to what was claimed by Willmott et al. (2009). The RMSE is more appropriate to represent model performance than the MAE when the error distribution is expected to be Gaussian. In addition, we show that the RMSE satisfies the triangle inequality requirement for a distance metric.



2021 ◽  
Vol 12 (1) ◽  
pp. 95-104
Author(s):  
Firəngiz Sadıyeva ◽  

Məqalədə COVID-19 pandemiyasını proqnozlaşdırmaq üçün avtoreqressiv inteqrasiya edilmiş hərəkətli ortalama (ing. ARIMA. Autoregressive İntegrated Moving Average) modeli təklif edilmişdir. COVID-19 dünyada sürətlə yayılan və hazırda davam edən yeni növ pandemiyadır. Son dövrlərdə pandemiyaya yoluxanların sayı Azərbaycanda rekord həddə çatmışdır. Məhz bu səbəbdən COVID-19 pandemiyasının proqnozu məsələsinə baxılmışdır və bir neçə ayı əhatə edən real verilənlərlə eksperimentlərdə təklif edilmiş ARIMA modelinin COVID-19 zaman sıralarının proqnozlaşdırılması üçün müxtəlif parametrlərlə tətbiq edilmişdir. Verilənlər dedikdə, 22.01.2020 – 22.10.2020 tarixləri arasında Azərbaycan Respublikasının Səhiyyə Nazirliyi (www.sehiyye.gov.az) tərəfindən rəsmi olaraq qeydiyyata alınan gündəlik yoluxma hallarının sayı nəzərdə tutulur. Bu verilənlərdən istifadə etməklə, növbəti zaman aralığında ölkəmizdə baş verəcək yoluxma halları proqnoz edilmişdir. Bunun üçün ARIMA modelinə müxtəlif parametrlər verilmiş və uyğun olaraq hər bir modelin səhv dərəcəsi qiymətləndirilmişdir. Səhvin qiymətləndirilməsi üçün MAPE (Mean Absolute Persentace Error), MAE (Mean Absolute Error) və RMSE (Root Mean Square Error) funksiyaları istifadə edilib. Müqayisələr nəticəsində ən uyğun model seçilmişdir. Alınmış nəticələr ölkəmizdə pandemiya dövründə həm səhiyyə sistemi, həm də adi vətəndaşlar üçün vacib amildir. Əldə edilmiş nəticələr statistik metodların koronavirusa aid qeyri-stasionar zaman sıralarının proqnozlaşdırılmasının digər məsələlərə tətbiqində də məhsuldar ola biləcəyini təsdiqləyir.



2021 ◽  
Author(s):  
Giulio Nils Caroletti ◽  
Tommaso Caloiero ◽  
Magnus Joelsson ◽  
Roberto Coscarelli

<p>Homogenization techniques and missing value reconstruction have grown in importance in climatology given their relevance in establishing coherent data records over which climate signals can be correctly attributed, discarding apparent changes depending on instrument inhomogeneities, e.g., change in instrumentation, location, time of measurement.</p><p>However, it is not generally possible to assess homogenized results directly, as true data values are not known. Thus, to validate homogenization techniques, artificially inhomogeneous datasets, also called benchmark datasets, are created from known homogeneous datasets. Results from their homogenization can be assessed and used to rank, evaluate and/or validate techniques used.</p><p>Considering temperature data, the aims of this work are: i) to determine which metrics (bias, absolute error, factor of exceedance, root mean squared error, and Pearson’s correlation coefficient) can be meaningfully used to validate the best-performing homogenization technique in a region; ii) to evaluate through a Pearson correlation analysis if homogenization techniques’ performance depends on physical features of a station (i.e., latitude, altitude and distance from the sea) or on the nature of the inhomogeneities (i.e., the number of break points and missing data).</p><p>With this aims, a southern Sweden temperature database with homogeneous, maximum and minimum temperature data from 100 ground stations over the period 1950-2005 has been used. Starting from these data, inhomogeneous datasets were created introducing up to 7 artificial breaks for each ground station and an average of 107 missing data. Then, 3 homogenization techniques were applied, ACMANT (Adapted Caussinus-Mestre Algorithm for Networks of Temperature series), and two versions of HOMER (HOMogenization software in R): the standard, automated setup mode (Standard-HOMER) and a manual setup developed and performed at the Swedish Meteorological and Hydrological Institute (SMHI-HOMER).</p><p>Results showed that root mean square error, absolute bias and factor of exceedance were the most useful metrics to evaluate improvements in the homogenized datasets: for instance, RMSE for both variables was reduced from an average of 0.71-0.89K (corrupted dataset) to 0.50-0.60K (Standard-HOMER), 0.51-0.61K (SMHI-HOMER) and 0.46-0.53K (ACMANT), respectively.</p><p>Globally, HOMER performed better regarding the factor of exceedance, while ACMANT outperformed it with regard to root mean square error and absolute error. Regardless of the technique used, the homogenization quality anti-correlated meaningfully to the number of breaks. Missing data did not seem to have an impact on HOMER, while it negatively affected ACMANT, because this method does not fill-in missing data in the same drastic way.</p><p>In general, the nature of the datasets had a more important role in yielding good homogenization results than associated physical parameters: only for minimum temperature, distance from the sea and altitude showed a weak but significant correlation with the factor of exceedance and the root mean square error.</p><p>This study has been performed within the INDECIS Project, that is part of ERA4CS, an ERA-NET initiated by JPI Climate, and funded by FORMAS (SE), DLR (DE), BMWFW (AT), IFD (DK), MINECO (ES), ANR (FR) with co-funding by the European Union (Grant 690462).</p>



Sign in / Sign up

Export Citation Format

Share Document