The Benefits of Using Prefix Tree Data Structure in Multi-Level Frequent Pattern Mining

The Frequent Pattern Tree (FP-Tree) is a compact data structure for representing frequent itemsets. Constructing the FP-Tree is an essential step before mining frequent patterns. However, few efforts have focused specifically on constructing the FP-Tree from anything other than its original database. Typical FP-Tree construction requires, besides prior knowledge of the support threshold, two database scans: the first to build and sort the frequent patterns, and the second to build the prefix paths. This double scan is a major limitation of FP-Tree construction. This paper therefore proposes a scalable Trie Transformation Technique Algorithm (T3A) that converts our predefined tree data structure, the Disorder Support Trie Itemset (DOSTrieIT), into an FP-Tree. Experimental results on two UCI benchmark datasets show that the proposed T3A generates the FP-Tree up to three orders of magnitude faster than the benchmarked FP-Growth.
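
For orientation, the conventional two-scan FP-Tree construction that the abstract describes as the baseline can be sketched as follows. This is a minimal illustration only, not the T3A or DOSTrieIT method from the paper; the names `FPNode` and `build_fp_tree`, the in-memory list of transactions, and the alphabetical tie-break between items of equal support are all assumptions made for the example.

```python
from collections import Counter


class FPNode:
    """One node of the prefix tree: an item, its count, and its children."""

    def __init__(self, item, parent=None):
        self.item = item
        self.count = 0
        self.parent = parent
        self.children = {}  # item -> FPNode


def build_fp_tree(transactions, min_support):
    """Build an FP-Tree with the classic two passes over the data.

    `transactions` is assumed to be a re-iterable collection (e.g. a list)
    of item lists; `min_support` is an absolute count threshold.
    """
    # Scan 1: count the support of every item and keep the frequent ones.
    support = Counter(item for t in transactions for item in set(t))
    frequent = {i: c for i, c in support.items() if c >= min_support}

    root = FPNode(None)
    # Scan 2: insert each transaction's frequent items, sorted by
    # descending support (ties broken alphabetically, an assumption),
    # so that common prefixes share a path.
    for t in transactions:
        items = sorted(
            (i for i in set(t) if i in frequent),
            key=lambda i: (-frequent[i], i),
        )
        node = root
        for item in items:
            child = node.children.get(item)
            if child is None:
                child = FPNode(item, node)
                node.children[item] = child
            child.count += 1
            node = child
    return root, frequent
```

The second loop cannot start until the first has finished, because the insertion order of each transaction depends on the global item supports. That dependency is the reason the standard construction needs two full passes over the database.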

Cost Analysis of the Prefix Tree Data Structure

Production Systems and Information Engineering ◽

10.32968/psaie.2019.003 ◽

2019 ◽

Vol 8 ◽

pp. 39-49

Author(s):

Edit Csizmás ◽

László Kovács

Keyword(s):

Data Structure ◽

Cost Analysis ◽

Prefix Tree ◽

Tree Data ◽

Tree Data Structure

Maintenance of Frequent Patterns

Post-Mining of Association Rules ◽

10.4018/978-1-60566-404-0.ch014 ◽

2009 ◽

pp. 273-293 ◽

Cited By ~ 1

Author(s):

Mengling Feng ◽

Jinyan Li ◽

Guozhu Dong ◽

Limsoon Wong

Keyword(s):

Pattern Mining ◽

Frequent Pattern Mining ◽

Frequent Pattern ◽

Frequent Patterns ◽

Research Opportunities ◽

Prefix Tree ◽

Emerging Trends ◽

Maintenance Problem

This chapter surveys the maintenance of frequent patterns in transaction datasets. It is written to be accessible to researchers familiar with the field of frequent pattern mining. The frequent pattern maintenance problem is summarized with a study of how the space of frequent patterns evolves in response to data updates. The chapter focuses on incremental and decremental maintenance. Four major types of maintenance algorithms are studied: Apriori-based, partition-based, prefix-tree-based, and concise-representation-based algorithms. The authors study the advantages and limitations of these algorithms from both theoretical and experimental perspectives, and propose possible solutions to certain limitations. Potential research opportunities and emerging trends in frequent pattern maintenance are also discussed.
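
To make incremental and decremental maintenance concrete, here is a minimal sketch of a prefix tree that is updated as transactions are added and removed. It does not reproduce any specific algorithm surveyed in the chapter; the class names `TrieNode` and `TransactionTrie`, the use of plain sorted item order as the canonical order, and the assumption that only previously added transactions are removed are all choices made for this example.

```python
class TrieNode:
    def __init__(self):
        self.count = 0      # transactions whose sorted items pass through here
        self.children = {}  # item -> TrieNode


class TransactionTrie:
    """Prefix tree over transactions stored in a fixed (sorted) item order."""

    def __init__(self):
        self.root = TrieNode()  # root.count = number of stored transactions

    def add(self, transaction):
        """Incremental update: one walk along the transaction's path."""
        node = self.root
        node.count += 1
        for item in sorted(set(transaction)):
            node = node.children.setdefault(item, TrieNode())
            node.count += 1

    def remove(self, transaction):
        """Decremental update; assumes the transaction was added earlier."""
        path, node = [], self.root
        for item in sorted(set(transaction)):
            child = node.children.get(item)
            if child is None:
                raise KeyError(f"transaction not stored: {transaction!r}")
            path.append((node, item, child))
            node = child
        self.root.count -= 1
        for parent, item, child in path:
            child.count -= 1
            if child.count == 0:
                del parent.children[item]  # prune the now-empty subtree
                break

    def support(self, itemset):
        """Count stored transactions that contain every item of `itemset`."""
        target = sorted(set(itemset))

        def walk(node, idx):
            if idx == len(target):
                return node.count
            total = 0
            for item, child in node.children.items():
                if item == target[idx]:
                    total += walk(child, idx + 1)
                elif item < target[idx]:
                    total += walk(child, idx)  # target item may lie deeper
            return total

        return walk(self.root, 0)
```

Because the item order is fixed and independent of the current supports, an update touches only the path of the affected transaction and never forces the tree to be reordered. The trade-off is that such a tree is usually less compact than one ordered by descending support.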

DMP-tree: A dynamic M-way prefix tree data structure for strings matching

Computers & Electrical Engineering ◽

10.1016/j.compeleceng.2008.04.011 ◽

2010 ◽

Vol 36 (5) ◽

pp. 818-834 ◽

Cited By ~ 5

Author(s):

Nasser Yazdani ◽

Hossein Mohammadi

Keyword(s):

Data Structure ◽

Prefix Tree ◽

Tree Data ◽

Tree Data Structure

Efficient Dynamic Weighted Frequent Pattern Mining by using a Prefix-Tree

The KIPS Transactions PartD ◽

10.3745/kipstd.2010.17d.4.253 ◽

2010 ◽

Vol 17D (4) ◽

pp. 253-258 ◽

Cited By ~ 2

Author(s):

Byeong-Soo Jeong ◽

Ahmed Farhan

Keyword(s):

Pattern Mining ◽

Frequent Pattern Mining ◽

Frequent Pattern ◽

Prefix Tree

A COMPREHENSIVE STUDY OF MAJOR TECHNIQUES OF MULTI LEVEL FREQUENT PATTERN MINING: A SURVEY

International Journal of Research in Engineering and Technology ◽

10.15623/ijret.2013.0204017 ◽

2013 ◽

Vol 02 (04) ◽

pp. 511-515

Author(s):

Syed Zishan Ali

Keyword(s):

Pattern Mining ◽

Frequent Pattern Mining ◽

Frequent Pattern ◽

Multi Level ◽

Comprehensive Study

Novel Approach for Frequent Pattern Algorithm for Maximizing Frequent Patterns in Effective Time

INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY ◽

10.24297/ijct.v3i2b.2876 ◽

2012 ◽

Vol 3 (2) ◽

pp. 279-283

Author(s):

Rahul Sharma ◽

Dr. Manish Manoria

Keyword(s):

Data Structure ◽

Large Datasets ◽

Experimental Results ◽

Frequent Pattern ◽

Frequent Patterns ◽

Effective Time ◽

Novel Approach ◽

Tree Data ◽

Improved Performance ◽

Tree Data Structure

The essential step in mining association rules is mining the frequent patterns. Because of the inherent difficulty of the problem, it is impossible to mine the complete set of frequent patterns from a dense database. The FP-growth algorithm has been implemented using an array-based structure, known as the FP-tree, which stores compressed frequency information. Numerous experimental results have demonstrated that the algorithm performs extremely well. However, FP-growth needs two traversals of the FP-tree to construct each new conditional FP-tree. In this paper we present a novel Array Based Without Scanning Frequent Pattern (ABWSFP) tree technique that greatly reduces the need to traverse FP-trees, thus obtaining significantly improved performance for FP-tree-based algorithms. The technique works especially well for large datasets. We then present a new algorithm that uses the QFP-tree data structure in combination with the FP-tree. Experimental results show that the new algorithm outperforms other algorithms not only in speed, but also in CPU consumption and scalability.