scholarly journals Building Energy Consumption Raw Data Forecasting Using Data Cleaning and Deep Recurrent Neural Networks

Buildings ◽  
2019 ◽  
Vol 9 (9) ◽  
pp. 204 ◽  
Author(s):  
Yang ◽  
Tan ◽  
Santamouris ◽  
Lee

With the rising focus on building energy big data analysis, there lacks a framework for raw data preprocessing to answer the question of how to handle the missing data in the raw data set. This study presents a methodology and framework for building energy consumption raw data forecasting. A case building is used to forecast the energy consumption by using deep recurrent neural networks. Four different methodologies to impute missing data in the raw data set are compared and implemented. The question of sensitivity of gap size and available data percentage on the imputation accuracy was tested. The cleaned data were then used for building energy forecasting. While the existing studies explored only the use of small recurrent networks of 2 layers and less, the question of whether a deep network of more than 2 layers would be performing better for building energy consumption forecasting should be explored. In addition, the problem of overfitting has been cited as a significant problem in using deep networks. In this study, the deep recurrent neural network is then used to explore the use of deeper networks and their regularization in the context of an energy load forecasting task. The results show a mean absolute error of 2.1 can be achieved through the 2*32 gated neural network model. In applying regularization methods to overcome model overfitting, the study found that weights regularization did indeed delay the onset of overfitting.

Processes ◽  
2019 ◽  
Vol 7 (10) ◽  
pp. 731 ◽  
Author(s):  
Sanghyuk Lee ◽  
Jaehoon Cha ◽  
Moon Keun Kim ◽  
Kyeong Soo Kim ◽  
Van Huy Pham ◽  
...  

The importance of neural network (NN) modelling is evident from its performance benefits in a myriad of applications, where, unlike conventional techniques, NN modeling provides superior performance without relying on complex filtering and/or time-consuming parameter tuning specific to applications and their wider ranges of conditions. In this paper, we employ NN modelling with training data generation based on sensitivity analysis for the prediction of building energy consumption to improve performance and reliability. Unlike our previous work, where insignificant input variables are successively screened out based on their mean impact values (MIVs) during the training process, we use the receiver operating characteristic (ROC) plot to generate reliable data with a conservative or progressive point of view, which overcomes the issue of data insufficiency of the MIV method: By properly setting boundaries for input variables based on the ROC plot and their statistics, instead of completely screening them out as in the MIV-based method, we can generate new training data that maximize true positive and false negative numbers from the partial data set. Then a NN model is constructed and trained with the generated training data using Levenberg–Marquardt back propagation (LM-BP) to perform electricity prediction for commercial buildings. The performance of the proposed data generation methods is compared with that of the MIV method through experiments, whose results show that data generation using successive and cross pattern provides satisfactory performance, following energy consumption trends with good phase. Among the two options in data generation, i.e., successive and two data combination, the successive option shows lower root mean square error (RMSE) than the combination one by around 400~900 kWh (i.e., 30%~75%).


Geofluids ◽  
2021 ◽  
Vol 2021 ◽  
pp. 1-12
Author(s):  
Xuenan Zhang ◽  
Jinxin Zhang ◽  
Jinhua Zhang ◽  
YuChuan Zhang

As the energy consumption of residential building takes a large part in the building energy consumption, it is important to promote energy efficiency in residential building for green development. In order to evaluate the energy consumption of residential building more effectively, this paper proposes a combined prediction model based on random forest and BP neural network (RF-BPNN). To verify the prediction effect of the RF-BPNN combined model, experiments were performed by using the energy efficiency data set in the UCI database, and the model was evaluated with five indicators: mean absolute error, root mean square deviation, mean absolute percentage error, correlation coefficient, and coincidence index. Compared with the random forest, BP neural network model, and other existing models, respectively, it is proven by the experimental results that the RF-BPNN model possesses higher prediction accuracy and better stability.


1996 ◽  
Vol 35 (01) ◽  
pp. 12-18 ◽  
Author(s):  
M. Subotin ◽  
W. Marsh ◽  
J. McMichael ◽  
J. J. Fung ◽  
I. Dvorchik

AbstractA novel multisolutional clustering and quantization (MCO) algorithm has been developed that provides a flexible way to preprocess data. It was tested whether it would impact the neural network’s performance favorably and whether the employment of the proposed algorithm would enable neural networks to handle missing data. This was assessed by comparing the performance of neural networks using a well-documented data set to predict outcome following liver transplantation. This new approach to data preprocessing leads to a statistically significant improvement in network performance when compared to simple linear scaling. The obtained results also showed that coding missing data as zeroes in combination with the MCO algorithm, leads to a significant improvement in neural network performance on a data set containing missing values in 59.4% of cases when compared to replacement of missing values with either series means or medians.


Sign in / Sign up

Export Citation Format

Share Document