Pattern Discovery from Time Series Using Growing Hierarchical Self-Organizing Map

Author(s):  
Shiyuan Liu ◽  
Li Lu ◽  
Guanglan Liao ◽  
Jianping Xuan
2021 ◽  
Vol 9 (2) ◽  
pp. 70-80
Author(s):  
M. Kushnir ◽  
K. Tokarieva

The paper investigates methods of artificial intelligence in the prognostication and analysis of financial data time series. It is uncovered that scholars and practitioners face some difficulties in modelling complex system such as the stock market because it is nonlinear, chaotic, multi- dimensional, and spatial in nature, making forecasting a complex process. Models estimating nonstationary financial time series may include noise and errors. The relationship between the input and output parameters of the models is essentially non-linear, where stock prices include higher-level variables, which complicates stock market modeling and forecasting. It is also revealed that financial time series are multidimensional and they are influenced by many factors, such as economics, politics, environment and so on. Analysis and evaluation of multi- dimensional systems and their forecasting should be carried out by machine learning models. The problem of forecasting the stock market and obtaining quality forecasts is an urgent task, and the methods and models of machine learning should be the main mathematical tools in solving the above problems. First, we proposed to use self-organizing map, which is used to visualize multidimensional data by configuring neurons to quantize or cluster the input space in the topological structure. These characteristics of this algorithm make it attractive in solving many problems, including clustering, especially for forecasting stock prices. In addition, the methods discussed, encourage us to apply this cluster approach to present a different data structure for forecasting. Thus, models of adaptive neuro-fuzzy inference system combine the characteristics of both neural networks and fuzzy logic. Given the fact that the rule of hybrid learning and the theory of logic is a clear advantage of adaptive neuro-fuzzy inference system, which has computational advantages over other methods of parameter identification, we propose a new hybrid algorithm for integrating self-organizing map with adaptive fuzzy inference system to forecast stock index prices. This algorithm is well suited for estimating the relationship between historical prices in stock markets. The proposed hybrid method demonstrated reduced errors and higher overall accuracy.


2017 ◽  
Vol 4 (2) ◽  
pp. 198
Author(s):  
Fatma Agus Setyanngsih

<p><em>The prediction to determine the rainfall in Pontianak is much needed. One of them is using a neural network algorithm using SOM (Self Organizing Maping) with the data used in January 2010-2013. The purpose of this study was to determine the rainfall prediction in the city of Pontianak with parameters of air temperature, relative humidity, air pressure and wind speed. The results showed that the value of MSE is obtained when studying the data network prediction in January of 2010 until 2013 using the Neural Network-SOM learning process with the amount of 1 neuron and using 124 datas, with MSE value 0,0148.</em><strong> </strong></p><p><strong><em>Keywords</em></strong><em>: </em><em>Rainfall, Neural Network, Time Series, Self Organizing Map</em></p><p><em>Prediksi untuk mengetahui curah hujan yang terjadi di Pontianak sangat dibutuhkan salah satunya yaitu menggunakan algoritma jaringan syaraf tiruan dengan pengelompokkannya menggunakan SOM (Self Organizing Map) dengan data yang digunakan adalah data di bulan januari tahun 2010-2013. Tujuan dari penelitian ini adalah untuk mengetahui prediksi curah hujan di kota Pontianak dengan parameter suhu udara, kelembababn relative, tekanan udara dan kecepatan angin. Hasil penelitian menunjukkan bahwa nilai MSE ini didapatkan saat jaringan mempelajari data prediksi pada bulan januari di tahun 2010 sampai tahun 2013 dengan menggunakan proses pembelajaran JST SOM dengan jumlah neuron 1 dan menggunakan 124 data, dengan nilai MSE 0,0148. </em></p><p><em></em><em><strong><em>Kata kunci</em></strong><strong><em>:</em></strong><em> </em><em>Curah Hujan, Jaringan Syaraf Tiruan, Time Series, Self Organizing Map</em></em></p>


2008 ◽  
Vol 5 (6) ◽  
pp. 3517-3555 ◽  
Author(s):  
M. Herbst ◽  
H. V. Gupta ◽  
M. C. Casper

Abstract. Hydrological model evaluation and identification essentially depends on the extraction of information from model time series and its processing. However, the type of information extracted by statistical measures has only very limited meaning because it does not relate to the hydrological context of the data. To overcome this inadequacy we exploit the diagnostic evaluation concept of Signature Indices, in which model performance is measured using theoretically relevant characteristics of system behaviour. In our study, a Self-Organizing Map (SOM) is used to process the Signatures extracted from Monte-Carlo simulations generated by a distributed conceptual watershed model. The SOM creates a hydrologically interpretable mapping of overall model behaviour, which immediately reveals deficits and trade-offs in the ability of the model to represent the different functional behaviours of the watershed. Further, it facilitates interpretation of the hydrological functions of the model parameters and provides preliminary information regarding their sensitivities. Most notably, we use this mapping to identify the set of model realizations (among the Monte-Carlo data) that most closely approximate the observed discharge time series in terms of the hydrologically relevant characteristics, and to confine the parameter space accordingly. Our results suggest that Signature Index based SOMs could potentially serve as tools for decision makers inasmuch as model realizations with specific Signature properties can be selected according to the purpose of the model application. Moreover, given that the approach helps to represent and analyze multi-dimensional distributions, it could be used to form the basis of an optimization framework that uses SOMs to characterize the model performance response surface. As such it provides a powerful and useful way to conduct model identification and model uncertainty analyses.


2021 ◽  
Author(s):  
Takashi Abe ◽  
Ryuki Furukawa ◽  
Yuki Iwasaki ◽  
Toshimichi Ikemura

To confront the global threat of coronavirus disease 2019, a massive number of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) genome sequences have been decoded, with the results promptly released through the GISAID database. Based on variant types, eight clades have already been defined in GISAID, but the diversity can be far greater. Owing to the explosive increase in available sequences, it is important to develop new technologies that can easily grasp the whole picture of the big-sequence data and support efficient knowledge discovery. An ability to efficiently clarify the detailed time-series changes in genome-wide mutation patterns will enable us to promptly identify and characterize dangerous variants that rapidly increase their population frequency. Here, we collectively analyzed over 150,000 SARS-CoV-2 genomes to understand their overall features and time-dependent changes using a batch-learning self-organizing map (BLSOM) for oligonucleotide composition, which is an unsupervised machine learning method. BLSOM can separate clades defined by GISAID with high precision, and each clade is subdivided into clusters, which shows a differential increase/decrease pattern based on geographic region and time. This allowed us to identify prevalent strains in each region and to show the commonality and diversity of the prevalent strains. Comprehensive characterization of the oligonucleotide composition of SARS-CoV-2 and elucidation of time-series trends of the population frequency of variants can clarify the viral adaptation processes after invasion into the human population and the time-dependent trend of prevalent epidemic strains across various regions, such as continents.


2007 ◽  
Vol 4 (6) ◽  
pp. 3953-3978 ◽  
Author(s):  
M. Herbst ◽  
M. C. Casper

Abstract. The reduction of information contained in model time series through the use of aggregating statistical measures is very high compared to the amount of information that one would like to draw from it for model identification and calibration purposes. Applied within a model identification context, aggregating statistical performance measures are inadequate to capture details on time series characteristics. It has been readily shown that this loss of information on the residuals imposes important limitations on model identification and -diagnostics and thus constitutes an element of the overall model uncertainty. In this contribution we present an approach using a Self-Organizing Map (SOM) to circumvent the identifiability problem induced by the low discriminatory power of aggregating performance measures. Instead, a Self-Organizing Map is used to differentiate the spectrum of model realizations, obtained from Monte-Carlo simulations with a distributed conceptual watershed model, based on the recognition of different patterns in time series. Further, the SOM is used instead of a classical optimization algorithm to identify the model realizations among the Monte-Carlo simulations that most closely approximate the pattern of the measured discharge time series. The results are analyzed and compared with the manually calibrated model as well as with the results of the Shuffled Complex Evolution algorithm (SCE-UA).


Sign in / Sign up

Export Citation Format

Share Document