scholarly journals Text Mining of Stocktwits Data for Predicting Stock Prices

2021 ◽  
Vol 4 (1) ◽  
pp. 13 ◽  
Author(s):  
Mukul Jaggi ◽  
Priyanka Mandal ◽  
Shreya Narang ◽  
Usman Naseem ◽  
Matloob Khushi

Stock price prediction can be made more efficient by considering the price fluctuations and understanding people’s sentiments. A limited number of models understand financial jargon or have labelled datasets concerning stock price change. To overcome this challenge, we introduced FinALBERT, an ALBERT based model trained to handle financial domain text classification tasks by labelling Stocktwits text data based on stock price change. We collected Stocktwits data for over ten years for 25 different companies, including the major five FAANG (Facebook, Amazon, Apple, Netflix, Google). These datasets were labelled with three labelling techniques based on stock price changes. Our proposed model FinALBERT is fine-tuned with these labels to achieve optimal results. We experimented with the labelled dataset by training it on traditional machine learning, BERT, and FinBERT models, which helped us understand how these labels behaved with different model architectures. Our labelling method’s competitive advantage is that it can help analyse the historical data effectively, and the mathematical function can be easily customised to predict stock movement.

2021 ◽  
Author(s):  
Sidra Mehtab ◽  
Jaydip Sen

Prediction of future movement of stock prices has been a subject matter of many research work. On one hand, we have proponents of the Efficient Market Hypothesis who claim that stock prices cannot be predicted, on the other hand, there are propositions illustrating that, if appropriately modelled, stock prices can be predicted with a high level of accuracy. There is also a gamut of literature on technical analysis of stock prices where the objective is to identify patterns in stock price movements and profit from it. In this work, we propose a hybrid approach for stock price prediction using machine learning and deep learning-based methods. We select the NIFTY 50 index values of the National Stock Exchange (NSE) of India, over a period of four years: 2015 – 2018. Based on the NIFTY data during 2015 – 2018, we build various predictive models using machine learning approaches, and then use those models to predict the “Close” value of NIFTY 50 for the year 2019, with a forecast horizon of one week, i.e., five days. For predicting the NIFTY index movement patterns, we use a number of classification methods, while for forecasting the actual “Close” values of NIFTY index, various regression models are built. We, then, augment our predictive power of the models by building a deep learning-based regression model using Convolutional Neural Network (CNN) with a walk-forward validation. The CNN model is fine-tuned for its parameters so that the validation loss stabilizes with increasing number of iterations, and the training and validation accuracies converge. We exploit the power of CNN in forecasting the future NIFTY index values using three approaches which differ in number of variables used in forecasting, number of sub-models used in the overall models and, size of the input data for training the models. Extensive results are presented on various metrics for all classification and regression models. The results clearly indicate that CNN-based multivariate forecasting model is the most effective and accurate in predicting the movement of NIFTY index values with a weekly forecast horizon.


2021 ◽  
Author(s):  
Jaydip Sen ◽  
Sidra Mehtab ◽  
Gourab Nath

Prediction of future movement of stock prices has been a subject matter of many research work. On one hand, we have proponents of the Efficient Market Hypothesis who claim that stock prices cannot be predicted, on the other hand, there are propositions illustrating that, if appropriately modeled, stock prices can be predicted with a high level of accuracy. There is also a gamut of literature on technical analysis of stock prices where the objective is to identify patterns in stock price movements and profit from it. In this work, we propose a hybrid approach for stock price prediction using five deep learning-based regression models. We select the NIFTY 50 index values of the National Stock Exchange (NSE) of India, over a period of December 29, 2014 to July 31, 2020. Based on the NIFTY data during December 29, 2014 to December 28, 2018, we build two regression models using <i>convolutional neural networks</i> (CNNs), and three regression models using <i>long-and-short-term memory</i> (LSTM) networks for predicting the <i>open</i> values of the NIFTY 50 index records for the period December 31, 2018 to July 31, 2020. We adopted a multi-step prediction technique with <i>walk-forward validation</i>. The parameters of the five deep learning models are optimized using the grid-search technique so that the validation losses of the models stabilize with an increasing number of epochs in the model training, and the training and validation accuracies converge. Extensive results are presented on various metrics for all the proposed regression models. The results indicate that while both CNN and LSTM-based regression models are very accurate in forecasting the NIFTY 50 <i>open</i> values, the CNN model that previous one week’s data as the input is the fastest in its execution. On the other hand, the encoder-decoder convolutional LSTM model uses the previous two weeks’ data as the input is found to be the most accurate in its forecasting results.


2021 ◽  
Author(s):  
Jaydip Sen ◽  
Sidra Mehtab ◽  
Abhishek Dutta

Prediction of stock prices has been an important area of research for a long time. While supporters of the <i>efficient market hypothesis</i> believe that it is impossible to predict stock prices accurately, there are formal propositions demonstrating that accurate modeling and designing of appropriate variables may lead to models using which stock prices and stock price movement patterns can be very accurately predicted. Researchers have also worked on technical analysis of stocks with a goal of identifying patterns in the stock price movements using advanced data mining techniques. In this work, we propose an approach of hybrid modeling for stock price prediction building different machine learning and deep learning-based models. For the purpose of our study, we have used NIFTY 50 index values of the National Stock Exchange (NSE) of India, during the period December 29, 2014 till July 31, 2020. We have built eight regression models using the training data that consisted of NIFTY 50 index records from December 29, 2014 till December 28, 2018. Using these regression models, we predicted the <i>open</i> values of NIFTY 50 for the period December 31, 2018 till July 31, 2020. We, then, augment the predictive power of our forecasting framework by building four deep learning-based regression models using long-and short-term memory (LSTM) networks with a novel approach of walk-forward validation. Using the grid-searching technique, the hyperparameters of the LSTM models are optimized so that it is ensured that validation losses stabilize with the increasing number of epochs, and the convergence of the validation accuracy is achieved. We exploit the power of LSTM regression models in forecasting the future NIFTY 50 <i>open</i> values using four different models that differ in their architecture and in the structure of their input data. Extensive results are presented on various metrics for all the regression models. The results clearly indicate that the LSTM-based univariate model that uses one-week prior data as input for predicting the next week's <i>open</i> value of the NIFTY 50 time series is the most accurate model.


2021 ◽  
Author(s):  
Alexandre Heiden ◽  
Rafael Stubs Parpinelli

Financial news has been proven to be valuable source of information for the evaluation of stock market volatility. Most of the attention has been given to social media platforms, while news from vehicles such as newspapers are not as widely explored. Newspapers provide, although in a smaller volume, more reliable information than social media platforms. In this context, this research aims to examine the influence of financial news within the stock price prediction problem, by using the VADER sentiment analysis model to process the news and feed the sentiments as a feature into a LSTM-based stock price prediction model, along with the historical data of the assets. Experiments indicate that the model has better results when the news’ sentiments are considered, and the model demonstrates potential to accurately predict stock prices up to around 60 days into the future.


2019 ◽  
Vol 61 ◽  
pp. 01006 ◽  
Author(s):  
Jakub Horák ◽  
Tomáš Krulický

Accurate stock price prediction is very difficult in today's economy. Accurate prediction plays an important role in helping investors improve return on equity. As a result, a number of new approaches and technologies have logically evolved in recent years to predict stock prices. One is also the method of artificial neural networks, which have many advantages over conventional methods. The aim of this paper is to compare a method of exponential time series alignment and time series alignment using artificial neural networks as tools for predicting future stock price developments on the example of the company Unipetrol. Time series alignment is performed using artificial neural networks, exponential alignment of time series, and then a comparison of time series of predictions of future stock price trends predicted using the most successful neural network and price prediction calculated by exponential time series alignment is performed. Predictions for 62 business days were obtained. The realistic picture of further possible development is surprisingly given based on the exponential alignment of time series.


Author(s):  
Lin Sun ◽  
Wenzheng Xu ◽  
Jimin Liu

Using hierarchical CNN, the company's multiple news is characterized as three levels: sentence vectors, chapter vectors, and enterprise sentiment vectors. By combining the stock price data with the news lyric data at the same time, the influence of news on price is used to achieve correlation analysis of news information and stock prices. A two-channel attention mechanism fusion model based on CNN-LSTM is proposed. After the dual-channel feature extraction, the attention layer fusion layer is used to convert the weighted values of LSTM hidden variables, so the stock price can be predicted with the news text.


2019 ◽  
Vol 24 (2) ◽  
pp. 17-21
Author(s):  
Arjun Singh Saud ◽  
Subarna Shakya

The stock price is the cost of purchasing a security or stock in a stock exchange. The stock price prediction has been the aim of investors since the beginning of the stock market. It is the act of forecasting the future price of a company's stock. Nowadays, deep learning techniques are widely used for identifying the stock trends from large amounts of past data. This research has experimented two big and robust commercial banks listed in the Nepal Stock Exchange (NEPSE) and compared stock price prediction performance of GRU with three widely used gradient descent optimization techniques: Momentum, RMSProp, and Adam. GRU with Adam is more accurate and consistent approach for predicting stock prices from the present study.


2021 ◽  
Author(s):  
Jaydip Sen ◽  
Tamal Datta Chaudhuri

Prediction of future movement of stock prices has been the subject matter of many research work. On one hand, we have proponents of the Efficient Market Hypothesis who claim that stock prices cannot be predicted accurately. On the other hand, there are propositions that have shown that, if appropriately modelled, stock prices can be predicted fairly accurately. The latter have focused on choice of variables, appropriate functional forms and techniques of forecasting. This work proposes a granular approach to stock price prediction by combining statistical and machine learning methods with some concepts that have been advanced in the literature on technical analysis. The objective of our work is to take 5 minute daily data on stock prices from the National Stock Exchange (NSE) in India and develop a forecasting framework for stock prices. Our contention is that such a granular approach can model the inherent dynamics and can be fine-tuned for immediate forecasting. Six different techniques including three regression-based approaches and three classification-based approaches are applied to model and predict stock price movement of two stocks listed in NSE - Tata Steel and Hero Moto. Extensive results have been provided on the performance of these forecasting techniques for both the stocks.


Author(s):  
Tihana Škrinjarić ◽  
Mirjana Čižmešija

This chapter examines the possibilities of utilizing the results of Grey Models (GM) in the portfolio selection. Namely, stock price prediction represents one of the most important steps in the portfolio management. Many different models and methods have been developed for this purpose over the decades. The GM models could be utilized for such purpose. However, this approach is still relatively unknown today although research in the Far East has shown that applications of GM approach have good forecasting capabilities. That is why this chapter aims to popularize the GM approach of modeling stock prices and to combine the estimation results with the portfolio performance measurement. The benefits of using GM models within the portfolio management are empirically confirmed using daily data on the stock market index CROBEX from Zagreb Stock Exchange during the period from September 2, 2019, until February 7, 2020. The GM(2,1) model is the best performing one with respect to out of sample forecasts and based on portfolio performance measures important to the investor.


Sign in / Sign up

Export Citation Format

Share Document