A Novel Approach for Finding Frequent Itemsets in Data Stream

2013 ◽ Vol 28 (3) ◽ pp. 217-241
Author(s): B. Chandra ◽ Shalini Bhaskar

2010 ◽ Vol 26-28 ◽ pp. 118-122
Author(s): Chong Huan Xu ◽ Chun Hua Ju

Based on the characteristics of data streams and a sliding-window model, this paper proposes A-MFI, a new algorithm for mining maximal frequent itemsets in data streams that relies on a self-adjusting and orderly-compound policy. Working on basic windows, the algorithm updates its information from fragments of the stream as they arrive and scans the stream only once, storing the results in a frequent itemsets list. The core idea is to construct a self-adjusting, orderly-compound FP-tree, use mixed subset pruning techniques to reduce the search space, and merge nodes that have equal minsup in the same branch, compressing the tree into the orderly-compound FP-tree so that superset checking is avoided when mining maximal frequent itemsets. The experimental results show that the algorithm has higher efficiency in time and space and good scalability.
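To make the sliding-window setting concrete, the following is a minimal brute-force sketch in Python. It is not A-MFI: it uses no FP-tree and no pruning, and it performs exactly the superset check that the abstract says A-MFI avoids. The class name `SlidingWindowMFI` and its parameters are hypothetical, and exhaustive subset counting is only practical for very short transactions.

```python
from collections import Counter, deque
from itertools import combinations


class SlidingWindowMFI:
    """Brute-force maximal frequent itemsets over a sliding window.

    The sliding window is a fixed number of basic windows (batches of
    transactions). Hypothetical helper for illustration only.
    """

    def __init__(self, num_basic_windows, min_support):
        self.num_basic_windows = num_basic_windows
        self.min_support = min_support  # relative support threshold in [0, 1]
        self.batches = deque()          # one Counter of itemset counts per basic window
        self.sizes = deque()            # number of transactions per basic window
        self.totals = Counter()         # itemset counts over the whole sliding window

    def add_basic_window(self, transactions):
        # Each arriving transaction is read exactly once.
        counts = Counter()
        for transaction in transactions:
            items = sorted(set(transaction))
            for k in range(1, len(items) + 1):
                for subset in combinations(items, k):
                    counts[frozenset(subset)] += 1
        self.batches.append(counts)
        self.sizes.append(len(transactions))
        self.totals.update(counts)
        # Expire the oldest basic window once the sliding window is full.
        if len(self.batches) > self.num_basic_windows:
            self.totals.subtract(self.batches.popleft())
            self.sizes.popleft()

    def maximal_frequent_itemsets(self):
        threshold = self.min_support * sum(self.sizes)
        frequent = [s for s, c in self.totals.items() if c > 0 and c >= threshold]
        # An itemset is maximal if no frequent proper superset exists.
        return [s for s in frequent if not any(s < other for other in frequent)]
```

With two basic windows holding the transactions `abc`, `ab`, `ac`, `bc` and a relative support of 0.5, the frequent itemsets are the three single items and the three pairs, so only the pairs are maximal.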


Author(s): Rodrigo Salvador Monteiro ◽ Geraldo Zimbrão ◽ Holger Schwarz ◽ Bernhard Mitschang ◽ Jano Moreira de Souza

Calendar-based pattern mining aims at identifying patterns on specific calendar partitions. Examples of calendar partitions include every Monday, every first working day of each month, and every holiday. Providing flexible mining capabilities for calendar-based partitions is especially challenging in a data stream scenario: the calendar partitions of interest are not known a priori, and at each point in time only a subset of the detailed data is available. The authors show how a data warehouse approach can be applied to this problem. The data warehouse, which keeps track of the frequent itemsets holding on different partitions of the original stream, has low storage requirements. Nevertheless, it allows the derivation of pattern sets that are complete and precise. Furthermore, the authors demonstrate the effectiveness of their approach in a series of experiments.
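The idea of choosing a calendar partition only at query time can be illustrated with a small Python sketch. It is not the authors' data warehouse design: it simply keeps per-day itemset counts (up to an assumed size limit) and rolls them up for whichever days a predicate selects, so it makes no attempt at the low storage footprint described above. The name `DailyItemsetStore` and all parameters are hypothetical.

```python
from collections import Counter, defaultdict
from datetime import date
from itertools import combinations


class DailyItemsetStore:
    """Per-day itemset counts that can be rolled up by calendar partition."""

    def __init__(self, max_size=2):
        self.max_size = max_size             # largest itemset size that is counted
        self.counts = defaultdict(Counter)   # day -> itemset -> count
        self.sizes = Counter()               # day -> number of transactions

    def add(self, day, transaction):
        items = sorted(set(transaction))
        for k in range(1, min(self.max_size, len(items)) + 1):
            for subset in combinations(items, k):
                self.counts[day][frozenset(subset)] += 1
        self.sizes[day] += 1

    def frequent(self, in_partition, min_support):
        # Aggregate only the days that belong to the requested partition.
        total = Counter()
        n = 0
        for day, day_counts in self.counts.items():
            if in_partition(day):
                total.update(day_counts)
                n += self.sizes[day]
        return {s: c for s, c in total.items() if n and c / n >= min_support}


store = DailyItemsetStore()
store.add(date(2024, 1, 1), ["milk", "bread"])   # a Monday
store.add(date(2024, 1, 8), ["milk", "eggs"])    # a Monday
store.add(date(2024, 1, 2), ["eggs"])            # a Tuesday
store.add(date(2024, 1, 2), ["eggs"])
every_monday = store.frequent(lambda d: d.weekday() == 0, min_support=1.0)
```

Because the partition is passed as a predicate, the same stored counts can answer "every Monday" or "every first day of the month" without the partition having been known when the data arrived.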


2010 ◽ Vol 26-28 ◽ pp. 113-117
Author(s): Pei Shuai Chen ◽ Chong Huan Xu

Mining maximal frequent itemsets has the advantage of producing a relatively small number of itemsets. Compared with mining all frequent itemsets or frequent closed itemsets, such algorithms have higher time and space efficiency. Based on the characteristics of data streams and a sliding-window model, this paper proposes E-FPMFI, a new algorithm for mining maximal frequent itemsets in data streams that relies on an orderly-compound policy. Working on basic windows, the algorithm updates its information from fragments of the stream and scans the stream only once, storing the results in a frequent itemsets list. The algorithm constructs an FP-tree, compresses the ordered FP-tree by merging nodes that have equal minsup in the same branch, and uses a mixed subset pruning technique to avoid superset checking. The experimental results show that the algorithm has higher time and space efficiency and good scalability.
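The node-merging step can be sketched in Python under one stated assumption: "equal minsup" is read here as "equal support count", so a node is merged with its only child when both carry the same count. This is an illustrative reading, not the E-FPMFI implementation; `Node`, `build_prefix_tree`, and `compress` are hypothetical names, and header tables and pruning are omitted.

```python
class Node:
    """Prefix-tree node; after compression it may hold several items."""

    def __init__(self, items=(), count=0):
        self.items = tuple(items)
        self.count = count
        self.children = {}  # first item of the child -> child node


def build_prefix_tree(transactions, order):
    """Insert transactions with their items sorted by a fixed global order."""
    rank = {item: i for i, item in enumerate(order)}
    root = Node()
    for transaction in transactions:
        node = root
        items = [item for item in set(transaction) if item in rank]
        for item in sorted(items, key=rank.get):
            child = node.children.setdefault(item, Node((item,)))
            child.count += 1
            node = child
    return root


def compress(node):
    """Merge each node with its only child while both have the same count."""
    for child in node.children.values():
        while len(child.children) == 1:
            (only,) = child.children.values()
            if only.count != child.count:
                break
            child.items += only.items
            child.children = only.children
        compress(child)
    return node


tree = compress(build_prefix_tree(
    [["a", "b", "c"], ["a", "b", "c"], ["a", "d"]],
    order=["a", "b", "c", "d"],
))
merged = tree.children["a"].children["b"]
```

In this example the nodes for `b` and `c` both have count 2 on the same branch, so they collapse into one compound node, while `a` (count 3, two children) is left as it is.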

