scholarly journals Evaluation of a Multiple Regression Model for Noisy and Missing Data

Author(s):  
Chanintorn Jittawiriyanukoon

<span>The standard data collection problems may involve noiseless data while on the other hand large organizations commonly experience noisy and missing data, probably concerning data collected from individuals. As noisy and missing data will be significantly worrisome for occasions of the vast data collection then the investigation of different filtering techniques for big data environment would be remarkable. A multiple regression model where big data is employed for experimenting will be presented. Approximation for datasets with noisy and missing data is also proposed. The statistical root mean squared error (RMSE) associated with correlation coefficient (COEF) will be analyzed to prove the accuracy of estimators. Finally, results predicted by massive online analysis (MOA) will be compared to those real data collected from the following different time. These theoretical predictions with noisy and missing data estimation by simulation, revealing consistency with the real data are illustrated. Deletion mechanism (DEL) outperforms with the lowest average percentage of error.</span>

Energies ◽  
2020 ◽  
Vol 13 (15) ◽  
pp. 3996 ◽  
Author(s):  
Marwen Elkamel ◽  
Lily Schleider ◽  
Eduardo L. Pasiliao ◽  
Ali Diabat ◽  
Qipeng P. Zheng

Predicting future energy demand will allow for better planning and operation of electricity providers. Suppliers will have an idea of what they need to prepare for, thereby preventing over and under-production. This can save money and make the energy industry more efficient. We applied a multiple regression model and three Convolutional Neural Networks (CNNs) in order to predict Florida’s future electricity use. The multiple regression model was a time series model that included all the variables and employed a regression equation. The univariant CNN only accounts for the energy consumption variable. The multichannel network takes into account all the time series variables. The multihead network created a CNN model for each of the variables and then combined them through concatenation. For all of the models, the dataset was split up into training and testing data so the predictions could be compared to the actual values in order to avoid overfitting and to provide an unbiased estimate of model accuracy. Historical data from January 2010 to December 2017 were used. The results for the multiple regression model concluded that the variables month, Cooling Degree Days, Heating Degree Days and GDP were significant in predicting future electricity demand. Other multiple regression models were formulated that utilized other variables that were correlated to the variables in the best-selected model. These variables included: number of visitors to the state, population, number of consumers and number of households. For the CNNs, the univariant predictions had more diverse and higher Root Mean Squared Error (RMSE) values compared to the multichannel and multihead network. The multichannel network performed the best out of the three CNNs. In summary, the multichannel model was found to be the best at predicting future electricity demand out of all the models considered, including the regression model based on the datasets employed.


Paradigm ◽  
2021 ◽  
Vol 25 (2) ◽  
pp. 181-193
Author(s):  
Nitya Garg

Banking sector is the backbone of any economy, so it is necessary to focus on its performance which is largely affected by its non-performing assets (NPAs). In the year 2018–2019, NPA of scheduled banks was Rs 355,076 Crore which is 3.7% of net advances. The purpose of this study is to identify the determinants based on analysis from previous literatures, and majorly macroeconomic and bank specific factors which are affecting NPAs using the relative weight analysis and to frame a model to predict future NPAs using multiple regression model using SPSS. The study also attempts to focus on actions and remedies that banks should make to control future NPAs. Findings of the study will act as a scaffolding for financial analysts and policymakers to prevent the conversion of its performing assets into NPAs and also help in proper management of banks and also in the recovery of economy.


2020 ◽  
Vol 12 (07) ◽  
pp. 527-544
Author(s):  
Assoué Kouakou Sylvestre Kouadio ◽  
Ouedraogo Moussa ◽  
Ismaïla Ouattara ◽  
Issiaka Savane

2014 ◽  
Vol 644-650 ◽  
pp. 5319-5324
Author(s):  
Tian Jiu Leng

In this paper, the relevant factors of PM2.5 and the degree of correlation between them were analyzed.The multiple regression model was established using stepwise regression analysis method and the temporal spatial evolution of PM2.5 was obtained by setting the initial and boundary conditions.


Sign in / Sign up

Export Citation Format

Share Document