A Novel Approach for Finding Frequent Itemsets in Data Stream

According to the features of data streams and combined sliding window, a new algorithm A-MFI which is based on self-adjusting and orderly-compound policy for mining maximal frequent itemsets in data stream is proposed. This algorithm which is based on basic window updates information from data stream flow fragments and scans the stream only once to gain and store it in frequent itemsets list when the data stream flows. The core idea of this algorithm: construct self-adjusting and orderly-compound FP-tree, use mixed subset pruning techniques to reduce the search space, merge nodes which has equal minsup in the same branch and compress to generate the orderly-compound FP-tree to avoid superset checking when mining maximal frequent itemsets. The experimental results show that the algorithm has higher efficiency in time and space, and also has good scalability.

Download Full-text

Exploring Calendar-Based Pattern Mining in Data Streams

Complex Data Warehousing and Knowledge Discovery for Advanced Retrieval Development ◽

10.4018/978-1-60566-748-5.ch016 ◽

2010 ◽

pp. 342-360

Author(s):

Rodrigo Salvador Monteiro ◽

Geraldo Zimbrão ◽

Holger Schwarz ◽

Bernhard Mitschang ◽

Jano Moreira de Souza

Keyword(s):

Data Warehouse ◽

Data Streams ◽

Data Stream ◽

Pattern Mining ◽

A Priori ◽

Frequent Itemsets ◽

Detailed Data ◽

Series Of Experiments ◽

Working Day

Calendar-based pattern mining aims at identifying patterns on specific calendar partitions. Potential calendar partitions are for example: every Monday, every first working day of each month, every holiday. Providing flexible mining capabilities for calendar-based partitions is especially challenging in a data stream scenario. The calendar partitions of interest are not known a priori and at each point in time only a subset of the detailed data is available. The authors show how a data warehouse approach can be applied to this problem. The data warehouse that keeps track of frequent itemsets holding on different partitions of the original stream has low storage requirements. Nevertheless, it allows to derive sets of patterns that are complete and precise. Furthermore, the authors demonstrate the effectiveness of their approach by a series of experiments.

Download Full-text

Mining Frequent Itemsets in Large Data Warehouses: A Novel Approach Proposed for Sparse Data Sets

Intelligent Data Engineering and Automated Learning - IDEAL 2007 - Lecture Notes in Computer Science ◽

10.1007/978-3-540-77226-2_53 ◽

2007 ◽

pp. 517-526 ◽

Cited By ~ 1

Author(s):

S. M. Fakhrahmad ◽

M. Zolghadri Jahromi ◽

M. H. Sadreddini

Keyword(s):

Large Data ◽

Sparse Data ◽

Frequent Itemsets ◽

Data Sets ◽

Data Warehouses ◽

Novel Approach ◽

Sparse Data Sets ◽

Mining Frequent Itemsets

Download Full-text

A novel approach for mining probabilistic frequent itemsets over uncertain data streams

International Journal of Applied Decision Sciences ◽

10.1504/ijads.2018.092794 ◽

2018 ◽

Vol 11 (3) ◽

pp. 302

Author(s):

Tianlai Li ◽

Fangai Liu ◽

Xinhua Wang

Keyword(s):

Data Streams ◽

Uncertain Data ◽

Frequent Itemsets ◽

Novel Approach ◽

Uncertain Data Streams

Download Full-text

A Novel Approach For Mining Probabilistic Frequent Itemsets Over Uncertain Data Streams

International Journal of Applied Decision Sciences ◽

10.1504/ijads.2018.10010708 ◽

2018 ◽

Vol 11 (1) ◽

pp. 1

Author(s):

Tianlai Li ◽

Fangai Liu ◽

Xinhua Wang

Keyword(s):

Data Streams ◽

Uncertain Data ◽

Frequent Itemsets ◽

Novel Approach ◽

Uncertain Data Streams

Download Full-text

A novel approach for data stream clustering using artificial bee colony algorithm

International Journal of Wireless and Mobile Computing ◽

10.1504/ijwmc.2015.066755 ◽

2015 ◽

Vol 8 (1) ◽

pp. 59 ◽

Cited By ~ 2

Author(s):

Chong Huan Xu

Keyword(s):

Data Stream ◽

Artificial Bee Colony Algorithm ◽

Artificial Bee Colony ◽

Stream Clustering ◽

Bee Colony ◽

Novel Approach ◽

Data Stream Clustering

Download Full-text

Maximal Frequent Itemsets in Data Stream Mining Based on Orderly-Compound Policy

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.26-28.113 ◽

2010 ◽

Vol 26-28 ◽

pp. 113-117

Author(s):

Pei Shuai Chen ◽

Chong Huan Xu

Keyword(s):

Data Stream ◽

Frequent Itemsets ◽

Space Efficiency ◽

Time Space ◽

Algorithm Construct ◽

Closed Itemsets ◽

Maximal Frequent Itemsets ◽

Mining Frequent Itemsets ◽

Basic Window ◽

Pruning Technique

Mining maximal frequent itemsets get the advantage of a relatively small number of itemsets. Compared to mining frequent itemsets and mining frequent closed itemsets, such algorithm has higher time and space efficiency. According to the features of data streams and combined sliding window, a new algorithm E-FPMFI which is based on orderly-compound policy for mining maximal frequent itemsets in data stream is proposed. The algorithm based on basic window updates information from data stream flow fragment and scans the stream only once to gain and store it in frequent itemsets list. The algorithm construct FP-tree, then compress orderly FP-tree by merging nodes which has equal minsup in same branch, also uses subset mix pruning technique, avoid superset checking. The experimental results show the algorithm has higher time, space efficiency and good scalability.

Download Full-text

Discovering Frequent Itemsets Reflected User Characteristics Using Weighted Batch based on Data Stream

The Journal of the Korea Contents Association ◽

10.5392/jkca.2011.11.1.056 ◽

2011 ◽

Vol 11 (1) ◽

pp. 56-64

Author(s):

Bok-Il Seo ◽

Jae-In Kim ◽

Bu-Hyun Hwang

Keyword(s):

Data Stream ◽

Frequent Itemsets ◽

User Characteristics

Download Full-text