scholarly journals Analysis of L2 Regularization Hyper Parameter for Stock Price Prediction

2021 ◽  
Vol 26 (1) ◽  
pp. 83-88
Author(s):  
Arjun Singh Saud ◽  
Subarna Shakya

Nowadays stock price prediction is an active area of research among machine learning researchers. One of the main problems with machine learning models is overfitting. Regularization techniques are widely used approaches to avoid over-fitted models. L2 regularization is one of the most popular and widely used regularization techniques. Regularization hyperparameter (ʎ) is one key parameter to be optimized for a well-generalized machine learning model. Hyperparameters can’t be learned by machine learning models during the learning process. We need to find their optimal value through experiments. This research work analyzed the L2 regularization hyperparameter used with a gated recurrent unit (GRU) network for stock price prediction. We experimented with five stocks from the Nepal Stock Exchange (NEPSE) and observed that stock price can be predicted with lower mean squared errors (MSEs) when the value of ʎ was around 0.0005. Therefore, this research paper recommended using ʎ=0.0005 with L2 regularization for stock price prediction.

2021 ◽  
Vol 9 (4) ◽  
pp. 769-788
Author(s):  
Shan Zhong ◽  
David Hitchcock

We summarized both common and novel predictive models used for stock price prediction and combined them with technical indices, fundamental characteristics and text-based sentiment data to predict S&P stock prices. A 66.18% accuracy in S&P 500 index directional prediction and 62.09% accuracy in individual stock directional prediction was achieved by combining different machine learning models such as Random Forest and LSTM together into state-of-the-art ensemble models. The data we use contains weekly historical prices, finance reports, and text information from news items associated with 518 different common stocks issued by current and former S&P 500 large-cap companies, from January 1, 2000 to December 31, 2019. Our study's innovation includes utilizing deep language models to categorize and infer financial news item sentiment; fusing different models containing different combinations of variables and stocks to jointly make predictions; and overcoming the insufficient data problem for machine learning models in time series by using data across different stocks.


2021 ◽  
Author(s):  
Sidra Mehtab ◽  
Jaydip Sen

Prediction of future movement of stock prices has been a subject matter of many research work. On one hand, we have proponents of the Efficient Market Hypothesis who claim that stock prices cannot be predicted, on the other hand, there are propositions illustrating that, if appropriately modelled, stock prices can be predicted with a high level of accuracy. There is also a gamut of literature on technical analysis of stock prices where the objective is to identify patterns in stock price movements and profit from it. In this work, we propose a hybrid approach for stock price prediction using machine learning and deep learning-based methods. We select the NIFTY 50 index values of the National Stock Exchange (NSE) of India, over a period of four years: 2015 – 2018. Based on the NIFTY data during 2015 – 2018, we build various predictive models using machine learning approaches, and then use those models to predict the “Close” value of NIFTY 50 for the year 2019, with a forecast horizon of one week, i.e., five days. For predicting the NIFTY index movement patterns, we use a number of classification methods, while for forecasting the actual “Close” values of NIFTY index, various regression models are built. We, then, augment our predictive power of the models by building a deep learning-based regression model using Convolutional Neural Network (CNN) with a walk-forward validation. The CNN model is fine-tuned for its parameters so that the validation loss stabilizes with increasing number of iterations, and the training and validation accuracies converge. We exploit the power of CNN in forecasting the future NIFTY index values using three approaches which differ in number of variables used in forecasting, number of sub-models used in the overall models and, size of the input data for training the models. Extensive results are presented on various metrics for all classification and regression models. The results clearly indicate that CNN-based multivariate forecasting model is the most effective and accurate in predicting the movement of NIFTY index values with a weekly forecast horizon.


2021 ◽  
Author(s):  
Jaydip Sen ◽  
Sidra Mehtab ◽  
Gourab Nath

Prediction of future movement of stock prices has been a subject matter of many research work. On one hand, we have proponents of the Efficient Market Hypothesis who claim that stock prices cannot be predicted, on the other hand, there are propositions illustrating that, if appropriately modeled, stock prices can be predicted with a high level of accuracy. There is also a gamut of literature on technical analysis of stock prices where the objective is to identify patterns in stock price movements and profit from it. In this work, we propose a hybrid approach for stock price prediction using five deep learning-based regression models. We select the NIFTY 50 index values of the National Stock Exchange (NSE) of India, over a period of December 29, 2014 to July 31, 2020. Based on the NIFTY data during December 29, 2014 to December 28, 2018, we build two regression models using <i>convolutional neural networks</i> (CNNs), and three regression models using <i>long-and-short-term memory</i> (LSTM) networks for predicting the <i>open</i> values of the NIFTY 50 index records for the period December 31, 2018 to July 31, 2020. We adopted a multi-step prediction technique with <i>walk-forward validation</i>. The parameters of the five deep learning models are optimized using the grid-search technique so that the validation losses of the models stabilize with an increasing number of epochs in the model training, and the training and validation accuracies converge. Extensive results are presented on various metrics for all the proposed regression models. The results indicate that while both CNN and LSTM-based regression models are very accurate in forecasting the NIFTY 50 <i>open</i> values, the CNN model that previous one week’s data as the input is the fastest in its execution. On the other hand, the encoder-decoder convolutional LSTM model uses the previous two weeks’ data as the input is found to be the most accurate in its forecasting results.


Data is the most crucial component of a successful ML system. Once a machine learning model is developed, it gets obsolete over time due to presence of new input data being generated every second. In order to keep our predictions accurate we need to find a way to keep our models up to date. Our research work involves finding a mechanism which can retrain the model with new data automatically. This research also involves exploring the possibilities of automating machine learning processes. We started this project by training and testing our model using conventional machine learning methods. The outcome was then compared with the outcome of those experiments conducted using the AutoML methods like TPOT. This helped us in finding an efficient technique to retrain our models. These techniques can be used in areas where people do not deal with the actual working of a ML model but only require the outputs of ML processes


2021 ◽  
Author(s):  
Jaydip Sen ◽  
Tamal Datta Chaudhuri

Prediction of future movement of stock prices has been the subject matter of many research work. On one hand, we have proponents of the Efficient Market Hypothesis who claim that stock prices cannot be predicted accurately. On the other hand, there are propositions that have shown that, if appropriately modelled, stock prices can be predicted fairly accurately. The latter have focused on choice of variables, appropriate functional forms and techniques of forecasting. This work proposes a granular approach to stock price prediction by combining statistical and machine learning methods with some concepts that have been advanced in the literature on technical analysis. The objective of our work is to take 5 minute daily data on stock prices from the National Stock Exchange (NSE) in India and develop a forecasting framework for stock prices. Our contention is that such a granular approach can model the inherent dynamics and can be fine-tuned for immediate forecasting. Six different techniques including three regression-based approaches and three classification-based approaches are applied to model and predict stock price movement of two stocks listed in NSE - Tata Steel and Hero Moto. Extensive results have been provided on the performance of these forecasting techniques for both the stocks.


2021 ◽  
Vol 19 (2) ◽  
pp. 9-15
Author(s):  
Arjun Singh Saud ◽  
Subarna Shakya

Stock price forecasting in the field of interest for many stock investors to earn more profit from stock trading. Nowadays, machine learning researchers are also involved in this research field so that fast, accurate and automatic stock price forecasting can be achieved. This research paper evaluated GRU network’s performance with weight decay reg-ularization techniques for predicting price of stocks listed NEPSE. Three weight decay regularization technique analyzed in this research work were (1) L1 regularization (2) L2 regularization and (3) L1_L2 regularization. In this research work, six randomly selected stocks from NEPSE were experimented. From the experimental results, we observed that L2 regularization could outperform L1 and L1_L2 reg-ularization techniques for all six stocks. The average MSE obtained with L2 regularization was 4.12% to 33.52% lower than the average MSE obtained with L1 regularization, and it was 10.92% to 37.1% lower than the average MSE obtained with L1_L2 regularization. Thus, we concluded that the L2 regularization is best choice among weight regularization for stock price prediction.


Sign in / Sign up

Export Citation Format

Share Document