Multiple Kernel Learning on Time Series Data and Social Networks for Stock Price Prediction

Author(s):  
S. Deng ◽  
T. Mitsubuchi ◽  
K. Shioda ◽  
T. Shimada ◽  
A. Sakurai
2016 ◽  
Vol 2016 ◽  
pp. 1-7 ◽  
Author(s):  
Chun-xue Nie ◽  
Xue-bo Jin

A stock price is a typical but complex type of time series data. We used the effective prediction of long-term time series data to schedule an investment strategy and obtain higher profit. Due to economic, environmental, and other factors, it is very difficult to obtain a precise long-term stock price prediction. The exponentially segmented pattern (ESP) is introduced here and used to predict the fluctuation of different stock data over five future prediction intervals. The new feature of stock pricing during the subinterval, named the interval slope, can characterize fluctuations in stock price over specific periods. The cumulative distribution function (CDF) of MSE was compared to those of MMSE-BC and SVR. We concluded that the interval slope developed here can capture more complex dynamics of stock price trends. The mean stock price can then be predicted over specific time intervals relatively accurately, in which multiple mean values over time intervals are used to express the time series in the long term. In this way, the prediction of long-term stock price can be more precise and prevent the development of cumulative errors.


2021 ◽  
Vol 5 (1) ◽  
pp. 55-72
Author(s):  
Xuan Ji ◽  
Jiachen Wang ◽  
Zhijun Yan

Purpose Stock price prediction is a hot topic and traditional prediction methods are usually based on statistical and econometric models. However, these models are difficult to deal with nonstationary time series data. With the rapid development of the internet and the increasing popularity of social media, online news and comments often reflect investors’ emotions and attitudes toward stocks, which contains a lot of important information for predicting stock price. This paper aims to develop a stock price prediction method by taking full advantage of social media data. Design/methodology/approach This study proposes a new prediction method based on deep learning technology, which integrates traditional stock financial index variables and social media text features as inputs of the prediction model. This study uses Doc2Vec to build long text feature vectors from social media and then reduce the dimensions of the text feature vectors by stacked auto-encoder to balance the dimensions between text feature variables and stock financial index variables. Meanwhile, based on wavelet transform, the time series data of stock price is decomposed to eliminate the random noise caused by stock market fluctuation. Finally, this study uses long short-term memory model to predict the stock price. Findings The experiment results show that the method performs better than all three benchmark models in all kinds of evaluation indicators and can effectively predict stock price. Originality/value In this paper, this study proposes a new stock price prediction model that incorporates traditional financial features and social media text features which are derived from social media based on deep learning technology.


2019 ◽  
Vol 5 (01) ◽  
pp. 47-54
Author(s):  
Wigid Hariadi

Abstract. Intervention analysis is used to evaluate the effect of external events on a time series data. Sea-highway program is one of the leading programs Joko Widodo-Jusuf Kalla in presidential election 2014. So the author want to modeling the effect from Sea-highway programs on stock price movement in the shipping sector, TMAS.JK (Pelayaran Tempuran Emas tbk). After analyzing, proven that it has happened intervention on movement of daily stock price TMAS.JK caused by Sea-highway programs. Intervention I, on 11 August 2014, which was efect as a result of the election of the Joko Widodo-Jusuf kalla pair as President and vice President Republic of Indonesia on 22 july 2014. Intervention II, on 10 november 2014, president Joko Widodo speech in APEC about Sea-highway Program, and offering investment in port construction to foreign country. So that the model of time series analysis that right is intervention analysis model multi input step function, where the model is ARIMA (2,1,0), StepI (b=0, s=2, r=1), StepII (b=3, s=0, r=1).  Keywords: Intervention Analysis, Multi Input, Step Function, Sea-highway.    Abstrak. Analisis intervensi digunakan untuk mengevaluasi efek dari peristiwa eksternal pada suatu data time series. Program Tol-Laut merupakan salah satu program unggulan pasangan Joko Widodo-Jusuf Kalla dalam pemilu 2014. sehingga, penulis ingin memodelkan efek dari Program Tol-Laut terhadap pergerakan harga saham dibidang pelayaran, TMAS.JK (Pelayaran Tempuran Emas tbk). Setelah dilakukan analisis data, terbukti bahwa terjadi intervensi pada pergerakan harga saham harian TMAS.JK yang disebabkan oleh efek dari program Tol-Laut. Dimana intervensi I, pada tanggal 11 Agustus 2014, yang diduga sebagai dampak dari terpilihnya pasangan Joko widodo-Jusuf Kalla sebagai presiden dan wakil presiden Republik Indonesia pada tanggal 22 Juli 2014. Intervensi II, pada tanggal 10 November 2014, pidato Presiden Joko Widodo di forum APEC mengenai program  tol  laut, dan  menawarkan investasi dibidang pembangunan pelabuhan  kepada bangsa asing. Sehingga model analisis time series yang tepat adalah model analisis intervensi multi input fungsi step, dimana modelnya adalah ARIMA (2,1,0), StepI (b=0, s=2, r=1), StepII (b=3, s=0, r=1). Kata kunci: Analisis intervensi, Multi Input, fungsi step, Tol-Laut.


Stock market prediction through time series is a challenging as well as an interesting research areafor the finance domain, through which stock traders and investors can find the right time to buy/sell stocks. However, various algorithms have been developed based on the statistical approach to forecast the time series for stock data, but due to the volatile nature and different price ranges of the stock price one particular algorithm is not enough to visualize the prediction. This study aims to propose a model that will choose the preeminent algorithm for that particular company’s stock that can forecastthe time series with minimal error. This model can assist a trader/investor with or without expertise in the stock market to achieve profitable investments. We have used the Stock data from Stock Exchange Bangladesh, which covers 300+ companies to train and test our system. We have classified those companies based on the stock price range and then applied our model to identify which algorithm suites most for a particular range of stock price. Comparative forecasting results of all algorithms in diverse price ranges have been presented to show the usefulness of this Predictive Meta Model


Author(s):  
Ping Zhang ◽  
Jia-Yao Yang ◽  
Hao Zhu ◽  
Yue-Jie Hou ◽  
Yi Liu ◽  
...  

In the era of artificial intelligence, machine learning methods are successfully used in various fields. Machine learning has attracted extensive attention from investors in the financial market, especially in stock price prediction. However, one argument for the machine learning methods used in stock price prediction is that they are black-box models which are difficult to interpret. In this paper, we focus on the future stock price prediction with the historical stock price by machine learning and deep learning methods, such as support vector machine (SVM), random forest (RF), Bayesian classifier (BC), decision tree (DT), multilayer perceptron (MLP), convolutional neural network (CNN), bi-directional long-short term memory (BiLSTM), the embedded CNN, and the embedded BiLSTM. Firstly, we manually design several financial time series where the future price correlates with the historical stock prices in pre-designed modes, namely the curve-shape-feature (CSF) and the non-curve-shape-feature (NCSF) modes. In the CSF mode, the future prices can be extracted from the curve shapes of the historical stock prices. Conversely, in the NCSF mode, they can’t. Secondly, we apply various algorithms to those pre-designed and real financial time series. We find that the existing machine learning and deep learning algorithms fail in stock price prediction because in the real financial time series, less information of future prices is contained in the CSF mode, and perhaps more information is contained in the NCSF. Various machine learning and deep learning algorithms are good at handling the CSF in historical data, which are successfully applied in image recognition and natural language processing. However, they are inappropriate for stock price prediction on account of the NCSF. Therefore, accurate stock price prediction is the key to successful investment, and new machine learning algorithms handling the NCSF series are needed.


2021 ◽  
Author(s):  
Armin Lawi ◽  
Hendra Mesra ◽  
Supri Amir

Abstract Stocks are an attractive investment option since they can generate large profits compared to other businesses. The movement of stock price patterns on the stock market is very dynamic; thus it requires accurate data modeling to forecast stock prices with a low error rate. Forecasting models using Deep Learning are believed to be able to accurately predict stock price movements using time-series data, especially the Long Short-Term Memory (LSTM) and Gated Recurrent Unit (GRU) algorithms. However, several previous implementation studies have not been able to obtain convincing accuracy results. This paper proposes the implementation of the forecasting method by classifying the movement of time-series data on company stock prices into three groups using LSTM and GRU. The accuracy of the built model is evaluated using loss functions of Rooted Mean Squared Error (RMSE) and Mean Absolute Percentage Error (MAPE). The results showed that the performance evaluation of both architectures is accurate in which GRU is always superior to LSTM. The highest validation for GRU was 98.73% (RMSE) and 98.54% (MAPE), while the LSTM validation was 98.26% (RMSE) and 97.71% (MAPE).


Author(s):  
Soo-Tai Nam ◽  
Chan-Yong Jin ◽  
Seong-Yoon Shin

Big data is a large set of structured or unstructured data that can collect, store, manage, and analyze data with existing database management tools. And it means the technique of extracting value from these data and interpreting the results. Big data has three characteristics: The size of existing data and other data (volume), the speed of data generation (velocity), and the variety of information forms (variety). The time series data are obtained by collecting and recording the data generated in accordance with the flow of time. If the analysis of these time series data, found the characteristics of the data implies that feature helps to understand and analyze time series data. The concept of distance is the simplest and the most obvious in dealing with the similarities between objects. The commonly used and widely known method for measuring distance is the Euclidean distance. This study is the result of analyzing the similarity of stock price flow using 793,800 closing prices of 1,323 companies in Korea. Visual studio and Excel presented calculate the Euclidean distance using an analysis tool. We selected “000100” as a target domestic company and prepared for big data analysis. As a result of the analysis, the shortest Euclidean distance is the code “143860” company, and the calculated value is “11.147”. Therefore, based on the results of the analysis, the limitations of the study and theoretical implications are suggested.


Sign in / Sign up

Export Citation Format

Share Document