An Efficient Algorithm for Distributed Incremental Updating of Frequent Item-Sets on Massive Database

Author(s):  
Jiangtao Qiu ◽  
Changjie Tang ◽  
Lei Duan ◽  
Chuan Li ◽  
Shaojie Qiao ◽  
...  
2019 ◽  
Vol 8 (2) ◽  
pp. 3885-3889

Closed item sets are frequent itemsets that uniquely determines the exact frequency of frequent item sets. Closed Item sets reduces the massive output to a smaller magnitude without redundancy. In this paper, we present PSS-MCI, an efficient candidate generate based approach for mining all closed itemsets. It enumerates closed item sets using hash tree, candidate generation, super-set and sub-set checking. It uses partitioned based strategy to avoid unnecessary computation for the itemsets which are not useful. Using an efficient algorithm, it determines all closed item sets from a single scan over the database. However, several unnecessary item sets are being hashed in the buckets. To overcome the limitations, heuristics are enclosed with algorithm PSS-MCI. Empirical evaluation and results show that the PSS-MCI outperforms all candidate generate and other approaches. Further, PSS-MCI explores all closed item sets.


2008 ◽  
Vol 4 (8) ◽  
pp. 638-645 ◽  
Author(s):  
A.M.J. Md. Zubair Rahman ◽  
P. Balasubram

Author(s):  
Nibedita Panigrahi ◽  
P.K. Pattnaik ◽  
S.K. Padhi

Data mining is a part of know ledge Discovery in database process (KDD). As technology advances, floods of data can be produced and shared in many appliances such as wireless Sensor networks or Web click streams. This calls for extracting useful information and knowledge from streams of data. In this paper, We have proposed an efficient algorithm, where, at any time the current frequencies of all frequent item sets can be immediately produced. The current frequency of an item set in a stream is defined as its maximal frequency over all possible windows in the stream from any point in the past until the current state. The experimental result shows the proposed algorithm not only maintains a small summery of information for one item set but also consumes less memory then existing algorithms for mining frequent item sets over recent data streams.


2001 ◽  
Vol 61 (3) ◽  
pp. 350-371 ◽  
Author(s):  
Ramesh C. Agarwal ◽  
Charu C. Aggarwal ◽  
V.V.V. Prasad

2012 ◽  
Vol 263-266 ◽  
pp. 2179-2184 ◽  
Author(s):  
Zhen Yun Liao ◽  
Xiu Fen Fu ◽  
Ya Guang Wang

The first step of the association rule mining algorithm Apriori generate a lot of candidate item sets which are not frequent item sets, and all of these item sets cost a lot of system spending. To solve this problem,this paper presents an improved algorithm based on Apriori algorithm to improve the Apriori pruning step. Using this method, the large number of useless candidate item sets can be reduced effectively and it can also reduce the times of judge whether the item sets are frequent item sets. Experimental results show that the improved algorithm has better efficiency than classic Apriori algorithm.


Sign in / Sign up

Export Citation Format

Share Document