scholarly journals Multistep-Ahead Solar Radiation Forecasting Scheme Based on the Light Gradient Boosting Machine: A Case Study of Jeju Island

2020 ◽  
Vol 12 (14) ◽  
pp. 2271 ◽  
Author(s):  
Jinwoong Park ◽  
Jihoon Moon ◽  
Seungmin Jung ◽  
Eenjun Hwang

Smart islands have focused on renewable energy sources, such as solar and wind, to achieve energy self-sufficiency. Because solar photovoltaic (PV) power has the advantage of less noise and easier installation than wind power, it is more flexible in selecting a location for installation. A PV power system can be operated more efficiently by predicting the amount of global solar radiation for solar power generation. Thus far, most studies have addressed day-ahead probabilistic forecasting to predict global solar radiation. However, day-ahead probabilistic forecasting has limitations in responding quickly to sudden changes in the external environment. Although multistep-ahead (MSA) forecasting can be used for this purpose, traditional machine learning models are unsuitable because of the substantial training time. In this paper, we propose an accurate MSA global solar radiation forecasting model based on the light gradient boosting machine (LightGBM), which can handle the training-time problem and provide higher prediction performance compared to other boosting methods. To demonstrate the validity of the proposed model, we conducted a global solar radiation prediction for two regions on Jeju Island, the largest island in South Korea. The experiment results demonstrated that the proposed model can achieve better predictive performance than the tree-based ensemble and deep learning methods.

Author(s):  
Napoleon Bezas ◽  
Christos Timplalexis ◽  
Athanasios I. Salamanis ◽  
Vasileios Karapatsias ◽  
Dimosthenis Ioannidis ◽  
...  

Residential load forecasting is one of the most important tasks of the overall supply management process in electrical grids, since it enables smart grid services such as demand response (DR). Hence, several approaches for accurate residential load forecasting have been proposed in the relevant literature. However, most of the existing methods focus on the forecasting performance and neglect other aspects of the problem like training time and model size (i.e. memory usage). In this paper, we introduce a new model for both short-term and day-ahead residential load forecasting. The model synthesizes an heterogeneous feature set, which is constituted by both automatically-selected lagged values from the load time series and manually-extracted temporal features. Then, the tree-based algorithm light gradient boosting machine (LGBM) is fed with the constructed feature set and used as a regression model. Finally, a data-lightweight strategy is used for retraining the proposed model, which leads to both high forecasting accuracy and low training times. The proposed model has been extensively evaluated on a large real-world residential load dataset. The experimental results indicate that the proposed model achieves both higher forecasting performance and lower training times and model sizes compared to state-of-the-art solutions.


Author(s):  
Christos S Ioakimidis ◽  
Napoleon Bezas ◽  
Christos Timplalexis ◽  
Athanasios I. Salamanis ◽  
Vasileios Karapatsias ◽  
...  

Residential load forecasting is one of the most important tasks of the overall supply management process in electrical grids, since it enables smart grid services such as demand response (DR). Hence, several approaches for accurate residential load forecasting have been proposed in the relevant literature. However, most of the existing methods focus on the forecasting performance and neglect other aspects of the problem like training time and model size (i.e. memory usage). In this paper, we introduce a new model for both short-term and day-ahead residential load forecasting. The model synthesizes an heterogeneous feature set, which is constituted by both automatically-selected lagged values from the load time series and manually-extracted temporal features. Then, the tree-based algorithm light gradient boosting machine (LGBM) is fed with the constructed feature set and used as a regression model. Finally, a data-lightweight strategy is used for retraining the proposed model, which leads to both high forecasting accuracy and low training times. The proposed model has been extensively evaluated on a large real-world residential load dataset. The experimental results indicate that the proposed model achieves both higher forecasting performance and lower training times and model sizes compared to state-of-the-art solutions.


Energies ◽  
2021 ◽  
Vol 14 (21) ◽  
pp. 7367
Author(s):  
Mohamed Chaibi ◽  
EL Mahjoub Benghoulam ◽  
Lhoussaine Tarik ◽  
Mohamed Berrada ◽  
Abdellah El Hmaidi

Machine learning (ML) models are commonly used in solar modeling due to their high predictive accuracy. However, the predictions of these models are difficult to explain and trust. This paper aims to demonstrate the utility of two interpretation techniques to explain and improve the predictions of ML models. We compared first the predictive performance of Light Gradient Boosting (LightGBM) with three benchmark models, including multilayer perceptron (MLP), multiple linear regression (MLR), and support-vector regression (SVR), for estimating the global solar radiation (H) in the city of Fez, Morocco. Then, the predictions of the most accurate model were explained by two model-agnostic explanation techniques: permutation feature importance (PFI) and Shapley additive explanations (SHAP). The results indicated that LightGBM (R2 = 0.9377, RMSE = 0.4827 kWh/m2, MAE = 0.3614 kWh/m2) provides similar predictive accuracy as SVR, and outperformed MLP and MLR in the testing stage. Both PFI and SHAP methods showed that extraterrestrial solar radiation (H0) and sunshine duration fraction (SF) are the two most important parameters that affect H estimation. Moreover, the SHAP method established how each feature influences the LightGBM estimations. The predictive accuracy of the LightGBM model was further improved slightly after re-examination of features, where the model combining H0, SF, and RH was better than the model with all features.


Entropy ◽  
2021 ◽  
Vol 23 (1) ◽  
pp. 116
Author(s):  
Xiangfa Zhao ◽  
Guobing Sun

Automatic sleep staging with only one channel is a challenging problem in sleep-related research. In this paper, a simple and efficient method named PPG-based multi-class automatic sleep staging (PMSS) is proposed using only a photoplethysmography (PPG) signal. Single-channel PPG data were obtained from four categories of subjects in the CAP sleep database. After the preprocessing of PPG data, feature extraction was performed from the time domain, frequency domain, and nonlinear domain, and a total of 21 features were extracted. Finally, the Light Gradient Boosting Machine (LightGBM) classifier was used for multi-class sleep staging. The accuracy of the multi-class automatic sleep staging was over 70%, and the Cohen’s kappa statistic k was over 0.6. This also showed that the PMSS method can also be applied to stage the sleep state for patients with sleep disorders.


2016 ◽  
Vol 2016 ◽  
pp. 1-7 ◽  
Author(s):  
Boluwaji M. Olomiyesan ◽  
Onyedi D. Oyedum

In this study, the performance of three global solar radiation models and the accuracy of global solar radiation data derived from three sources were compared. Twenty-two years (1984–2005) of surface meteorological data consisting of monthly mean daily sunshine duration, minimum and maximum temperatures, and global solar radiation collected from the Nigerian Meteorological (NIMET) Agency, Oshodi, Lagos, and the National Aeronautics Space Agency (NASA) for three locations in North-Western region of Nigeria were used. A new model incorporating Garcia model into Angstrom-Prescott model was proposed for estimating global radiation in Nigeria. The performances of the models used were determined by using mean bias error (MBE), mean percentage error (MPE), root mean square error (RMSE), and coefficient of determination (R2). Based on the statistical error indices, the proposed model was found to have the best accuracy with the least RMSE values (0.376 for Sokoto, 0.463 for Kaduna, and 0.449 for Kano) and highest coefficient of determination, R2 values of 0.922, 0.938, and 0.961 for Sokoto, Kano, and Kaduna, respectively. Also, the comparative study result indicates that the estimated global radiation from the proposed model has a better error range and fits the ground measured data better than the satellite-derived data.


Author(s):  
Nino Antulov-Fantulin ◽  
Tian Guo ◽  
Fabrizio Lillo

AbstractWe study the problem of the intraday short-term volume forecasting in cryptocurrency multi-markets. The predictions are built by using transaction and order book data from different markets where the exchange takes place. Methodologically, we propose a temporal mixture ensemble, capable of adaptively exploiting, for the forecasting, different sources of data and providing a volume point estimate, as well as its uncertainty. We provide evidence of the clear outperformance of our model with respect to econometric models. Moreover our model performs slightly better than Gradient Boosting Machine while having a much clearer interpretability of the results. Finally, we show that the above results are robust also when restricting the prediction analysis to each volume quartile.


2021 ◽  
Vol 3 (1) ◽  
Author(s):  
B. A Omodunbi

Diabetes mellitus is a health disorder that occurs when the blood sugar level becomes extremely high due to body resistance in producing the required amount of insulin. The aliment happens to be among the major causes of death in Nigeria and the world at large. This study was carried out to detect diabetes mellitus by developing a hybrid model that comprises of two machine learning model namely Light Gradient Boosting Machine (LGBM) and K-Nearest Neighbor (KNN). This research is aimed at developing a machine learning model for detecting the occurrence of diabetes in patients. The performance metrics employed in evaluating the finding for this study are Receiver Operating Characteristics (ROC) Curve, Five-fold Cross-validation, precision, and accuracy score. The proposed system had an accuracy of 91% and the area under the Receiver Operating Characteristic Curve was 93%. The experimental result shows that the prediction accuracy of the hybrid model is better than traditional machine learning


Sign in / Sign up

Export Citation Format

Share Document