A novel Fibonacci windows model for finding emerging patterns over online data stream

A First Attempt on Online Data Stream Classifier Using Context

Data Mining and Big Data - Lecture Notes in Computer Science

DOI: 10.1007/978-3-319-40973-3_50

2016, pp. 497-504

Cited By ~ 1

Author(s): Michał Woźniak, Bogusław Cyganek

Keyword(s): Data Stream, Online Data

An Ensemble of Adaptive Neuro-Fuzzy Kohonen Networks for Online Data Stream Fuzzy Clustering

International Journal of Modern Education and Computer Science

DOI: 10.5815/ijmecs.2016.05.02

2016, Vol 8 (5), pp. 12-18

Cited By ~ 6

Author(s): Zhengbing Hu, Yevgeniy V. Bodyanskiy, Oleksii K. Tyshchenko, Olena O. Boiko

Keyword(s): Fuzzy Clustering, Data Stream, Online Data, Neuro Fuzzy, Kohonen Networks

ESA-Stream: Efficient Self-Adaptive Online Data Stream Clustering

IEEE Transactions on Knowledge and Data Engineering

DOI: 10.1109/tkde.2020.2990196

2020, pp. 1-1

Author(s): Yanni Li, Hui Li, Zhi Wang, Bing Liu, Jiangtao Cui, ...

Keyword(s): Data Stream, Online Data, Stream Clustering, Data Stream Clustering, Self Adaptive

An ensemble based on neural networks with random weights for online data stream regression

Soft Computing

DOI: 10.1007/s00500-019-04499-x

2019, Vol 24 (13), pp. 9835-9855

Cited By ~ 3

Author(s): Ricardo de Almeida, Yee Mey Goh, Radmehr Monfared, Maria Teresinha Arns Steiner, Andrew West

Keyword(s): Data Stream, Prediction Accuracy, Concept Drift, Learning Algorithms, Data Distribution, Machine Learning Algorithms, Computational Time, Online Data, Data Prediction, Random Weights

Abstract: Most information sources in the current technological world generate data sequentially and rapidly, in the form of data streams. The evolving nature of the underlying processes often causes changes in the data distribution, known as concept drift, which is difficult to detect and causes a loss of accuracy in supervised learning algorithms. As a consequence, online machine learning algorithms that can actively update themselves in response to possible changes in the data distribution are required. Although many strategies have been developed to tackle this problem, most of them are designed for classification problems. Therefore, in the domain of regression problems, there is a need for accurate algorithms with dynamic updating mechanisms that can operate in a computational time compatible with today’s demanding market. In this article, the authors propose a new bagging ensemble approach based on neural networks with random weights for online data stream regression. The proposed method improves the data prediction accuracy and reduces the required computational time compared to a recent algorithm for online data stream regression from the literature. The experiments are carried out using four synthetic datasets, to evaluate the algorithm’s response to concept drift, along with four benchmark datasets from different industries. The results indicate improved data prediction accuracy, effective handling of concept drift, and much faster updating times compared to the existing approach. Additionally, the use of design of experiments as an effective tool for hyperparameter tuning is demonstrated.
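To make the idea of an online bagging ensemble of random-weight networks more concrete, here is a minimal Python sketch. It is not the authors' algorithm: the abstract does not specify the update rule or the drift mechanism, so this sketch assumes Poisson(1) resampling for online bagging and a normalized least-mean-squares (NLMS) update of the output weights. All class and function names (`RandomWeightNet`, `OnlineBaggingRegressor`, `run_stream`) and the synthetic drifting stream are hypothetical.

```python
import math
import random


class RandomWeightNet:
    """Single-hidden-layer net: hidden weights are random and fixed;
    only the linear output weights are trained (assumed here: NLMS)."""

    def __init__(self, n_inputs, n_hidden, rng, lr=0.5):
        self.w = [[rng.uniform(-1, 1) for _ in range(n_inputs)]
                  for _ in range(n_hidden)]
        self.b = [rng.uniform(-1, 1) for _ in range(n_hidden)]
        self.beta = [0.0] * (n_hidden + 1)  # output weights plus bias
        self.lr = lr

    def _hidden(self, x):
        h = [math.tanh(sum(wi * xi for wi, xi in zip(row, x)) + b)
             for row, b in zip(self.w, self.b)]
        h.append(1.0)  # constant feature acting as the output bias
        return h

    def predict(self, x):
        return sum(bj * hj for bj, hj in zip(self.beta, self._hidden(x)))

    def update(self, x, y):
        h = self._hidden(x)
        err = y - sum(bj * hj for bj, hj in zip(self.beta, h))
        norm = sum(hj * hj for hj in h)  # >= 1 thanks to the bias feature
        step = self.lr * err / norm
        self.beta = [bj + step * hj for bj, hj in zip(self.beta, h)]


def poisson1(rng):
    """Draw from Poisson(1) with Knuth's multiplication method."""
    limit, k, p = math.exp(-1.0), 0, rng.random()
    while p > limit:
        k += 1
        p *= rng.random()
    return k


class OnlineBaggingRegressor:
    def __init__(self, n_inputs, n_members=10, n_hidden=20, seed=0):
        self.rng = random.Random(seed)
        self.members = [RandomWeightNet(n_inputs, n_hidden, self.rng)
                        for _ in range(n_members)]

    def predict(self, x):
        return sum(m.predict(x) for m in self.members) / len(self.members)

    def learn_one(self, x, y):
        # Online bagging: each member sees the sample k ~ Poisson(1) times.
        for m in self.members:
            for _ in range(poisson1(self.rng)):
                m.update(x, y)


def run_stream(n_samples=4000, drift_at=2000, seed=0):
    """Test-then-train run on a synthetic stream with one abrupt drift."""
    rng = random.Random(seed)
    model = OnlineBaggingRegressor(n_inputs=1, seed=seed)
    errors = []
    for t in range(n_samples):
        x = [rng.random()]
        y = 2.0 * x[0] if t < drift_at else 1.0 - 2.0 * x[0]
        errors.append(abs(y - model.predict(x)))  # predict first
        model.learn_one(x, y)                     # then learn
    return errors
```

Calling `run_stream()` returns the per-sample absolute errors, which can be averaged over windows before and after the drift point to see how quickly the ensemble recovers. The sketch adapts only through continued updating; it contains no explicit drift detector.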

Highly efficient incremental estimation of Gaussian mixture models for online data stream clustering

DOI: 10.1117/12.601724

2005

Cited By ~ 41

Author(s): Mingzhou Song, Hongbin Wang

Keyword(s): Mixture Models, Data Stream, Gaussian Mixture Models, Gaussian Mixture, Online Data, Stream Clustering, Highly Efficient, Data Stream Clustering

ESA-Stream: Efficient Self-Adaptive Online Data Stream Clustering (Extended Abstract)

2021 IEEE 37th International Conference on Data Engineering (ICDE)

DOI: 10.1109/icde51399.2021.00250

2021

Author(s): Yanni Li, Hui Li, Zhi Wang, Bing Liu, Jiangtao Cui, ...

Keyword(s): Data Stream, Online Data, Stream Clustering, Data Stream Clustering, Self Adaptive

Critical evaluation of classifiers in data stream mining

International Journal of Engineering & Technology

DOI: 10.14419/ijet.v7i2.18.10819

2018, Vol 7 (4), pp. 2166

Author(s): Lalit Agrawal, Dattatraya Adane

Keyword(s): Data Stream, Forest Cover, Critical Evaluation, High Volume, Data Stream Mining, Volume Data, Stream Mining, Full Data, Online Data, Benchmark Datasets

Over the past decade there has been a significant increase in the volume of online data. Extracting meaningful knowledge from this high-volume data is considered an important aspect of research. Because the data arrives perpetually, storing it in full is very difficult. Therefore, analysis is needed while the “data is moving”. This moving data is known as a data stream, and analyzing it without storing it completely is termed data stream mining. In recent years, many new techniques have been proposed to overcome the challenges of data stream mining. In this paper, we review the operation of popular streaming algorithms, highlighting their strengths and weaknesses. We also evaluate the classifiers used in these algorithms against two popular benchmark datasets, namely (a) Forest Cover (forest) and (b) German Credit, both available from the UCI repository. Finally, we present our critical observations and draw conclusions on the basis of our analysis.

Efficiently Mining Recent Frequent Patterns over Online Transactional Data Streams

International Journal of Software Engineering and Knowledge Engineering

DOI: 10.1142/s0218194009004325

2009, Vol 19 (05), pp. 707-725

Cited By ~ 1

Author(s): Hui Chen

Keyword(s): Data Mining, Data Stream, Frequent Patterns, Stream Data, Data Stream Management, Online Data, Stream Data Mining, Network Traffic Analysis, Stream Management, Performance Results

Recent emerging applications, such as network traffic analysis, web click stream mining, power consumption measurement, sensor network data analysis, and dynamic tracing of stock fluctuations, call for the study of a new kind of data: stream data. Many data stream management systems, prototype systems and software components have been developed to manage streams or extract knowledge from stream data. Mining frequent patterns is a foundational task in data mining and knowledge discovery. This paper proposes an algorithm for mining the recent frequent patterns over an online data stream. The method uses an RFP-tree to compactly store the recent frequent patterns of a stream. The content of each transaction is incrementally merged into the pattern tree upon its arrival, so the stream is scanned only once. Moreover, a conservative computation strategy and a time-decaying model are used to ensure the correctness of the mining results. Finally, extensive simulations show that the method reduces the average processing time per stream data element and outperforms comparable algorithms.
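The single-pass, time-decaying counting described in this abstract can be illustrated with a small Python sketch. It is only an illustration of the general idea, not the paper's RFP-tree: patterns are kept in a plain dictionary rather than a tree, only itemsets up to a fixed length are counted, and nothing is pruned. The class name `DecayedPatternCounter` and its parameters (`decay`, `max_len`) are hypothetical.

```python
from itertools import combinations


class DecayedPatternCounter:
    """One-pass, time-decayed support counts for small itemsets."""

    def __init__(self, decay=0.8, max_len=2):
        self.decay = decay      # weight multiplier applied per new transaction
        self.max_len = max_len  # longest itemset that is tracked
        self.t = 0              # number of transactions seen so far
        self.total = 0.0        # decayed number of transactions
        self.counts = {}        # pattern -> (decayed count, time of last update)

    def add(self, transaction):
        """Fold one arriving transaction into the counts (single scan)."""
        self.t += 1
        self.total = self.total * self.decay + 1.0
        items = sorted(set(transaction))
        for k in range(1, self.max_len + 1):
            for pattern in combinations(items, k):
                count, last = self.counts.get(pattern, (0.0, self.t))
                # Lazy decay: age the old count only when the pattern recurs.
                aged = count * self.decay ** (self.t - last)
                self.counts[pattern] = (aged + 1.0, self.t)

    def support(self, pattern):
        """Decayed support of a pattern relative to the decayed stream size."""
        if self.total == 0.0:
            return 0.0
        key = tuple(sorted(pattern))
        count, last = self.counts.get(key, (0.0, self.t))
        return count * self.decay ** (self.t - last) / self.total

    def frequent(self, min_support):
        """All tracked patterns whose decayed support meets the threshold."""
        return {p for p in self.counts if self.support(p) >= min_support}


counter = DecayedPatternCounter(decay=0.8)
for _ in range(20):
    counter.add(["a", "b"])  # older part of the stream
for _ in range(20):
    counter.add(["c", "d"])  # recent part of the stream
recent = counter.frequent(min_support=0.5)
```

After this run the older pattern `("a", "b")` has decayed to a support of roughly 0.01, while `("c", "d")` sits near 0.99, so only the recent items and their pair pass the 0.5 threshold. A practical implementation would also evict patterns whose decayed count falls below some bound in order to keep memory limited.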

estWin: Online data stream mining of recent frequent itemsets by sliding window method

Journal of Information Science

DOI: 10.1177/0165551505050785

2005, Vol 31 (2), pp. 76-90

Cited By ~ 40

Author(s): Joong Hyuk Chang, Won Suk Lee

Keyword(s): Data Stream, Sliding Window, Frequent Itemsets, Data Stream Mining, Stream Mining, Online Data, Window Method, Sliding Window Method

Online data stream Mining of Recent Frequent Itemsets based on Sliding Window model

2008 International Conference on Machine Learning and Cybernetics

DOI: 10.1109/icmlc.2008.4620420

2008

Author(s): Jia-dong Ren, Ke Li

Keyword(s): Data Stream, Sliding Window, Frequent Itemsets, Data Stream Mining, Stream Mining, Online Data
