Closed frequent itemsets mining based on It-Tree

Youssef Fakir; Chaima Ahle Touateb; Rachid Elayachi

doi:10.18844/gjcs.v11i1.4912

Closed Frequent Itemsets Mining Based on It-Tree

Journal of Medical Informatics and Decision Making ◽

10.14302/issn.2641-5526.jmid-20-3424 ◽

2020 ◽

Vol 1 (2) ◽

pp. 44-52

Author(s):

Youssef Fakir ◽

Chaima Ahle Touate ◽

Rachid Elayachi ◽

Mohamed Fakir

Keyword(s):

Data Mining ◽

Association Rule ◽

Computing Time ◽

Frequent Itemsets ◽

Closed Frequent Itemsets ◽

Hidden Knowledge ◽

Closed Itemsets ◽

Frequent Itemsets Mining ◽

Direct Counting ◽

Very High

In the last decade, the amount of collected data, in various computer science applications, has grown considerably. These large volumes of data need to be analysed in order to extract useful hidden knowledge. This work focuses on association rule extraction. This technique is one of the most popular in data mining. Nevertheless, the number of extracted association rules is often very high, and many of them are redundant. In this paper, we propose an algorithm, for mining closed itemsets, with the construction of an it-tree. This algorithm is compared with the DCI (direct counting & intersect) algorithm based on min support and computing time. CHARM is not memery-efficient. It needs to store all closed itemsets in the memory. The lower min-sup is, the more frequent closed itemsets there are so that the amounts of memory used by CHARM are increasing.

Download Full-text

Study of Various Parallel Implementations of Association Rule Mining Algorithm

American Journal of Advanced Computing ◽

10.15864/ajac.1305 ◽

2020 ◽

Vol 1 (3) ◽

pp. 1-7

Author(s):

Sarbani Dasgupta ◽

Banani Saha

Keyword(s):

Association Rule ◽

Association Rule Mining ◽

Parallel Implementation ◽

Rule Learning ◽

Frequent Itemsets ◽

Rule Mining ◽

Large Dataset ◽

Transactional Databases ◽

Frequent Itemsets Mining ◽

Sequential Implementation

In data mining, Apriori technique is generally used for frequent itemsets mining and association rule learning over transactional databases. The frequent itemsets generated by the Apriori technique provides association rules which are used for finding trends in the database. As the size of the database increases, sequential implementation of Apriori technique will take a lot of time and at one point of time the system may crash. To overcome this problem, several algorithms for parallel implementation of Apriori technique have been proposed. This paper gives a comparative study on various parallel implementation of Apriori technique .It also focuses on the advantages of using the Map Reduce technology, the latest technology used in parallelization of large dataset mining.

Download Full-text

A General Temporal Association Rule Frequent Itemsets Mining Algorithm

International Journal of Advancements in Computing Technology ◽

10.4156/ijact.vol3.issue11.9 ◽

2011 ◽

Vol 3 (11) ◽

pp. 63-71 ◽

Cited By ~ 2

Author(s):

Yan Hai ◽

Xiuli Li

Keyword(s):

Association Rule ◽

Frequent Itemsets ◽

Temporal Association ◽

Mining Algorithm ◽

Temporal Association Rule ◽

Frequent Itemsets Mining

Download Full-text

Improved BVBUC Algorithm to Discover Closed Itemsets in Long Biological Datasets

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.892.157 ◽

2019 ◽

Vol 892 ◽

pp. 157-167

Author(s):

Fatimah Audah Md Zaki ◽

Nurul Fariza Zulkurnain

Keyword(s):

Frequent Itemsets ◽

Suitable Method ◽

Closed Frequent Itemsets ◽

Closed Itemsets ◽

Synthetic Datasets

The task in mining closed frequent itemsets requires the algorithm to mine the frequent ones then determine its closure. The efficiency of closure computation is very important as it will determine the total mining time and the required memory. Over the years, many closure computation methods have been proposed to achieve these goals. However, to the best of our knowledge, there is no suitable method that can be adapted for algorithms that enumerate the rowset lattice, which is effective for biological datasets. Therefore, this paper proposed a method for computing closure compare with the method used in BVBUC algorithm method. Finally, BVBUC_I is proposed and the performances of these algorithms were evaluated using two synthetic datasets and three real datasets. The results of these tests proved the efficiency of the proposed method.

Download Full-text

Application of improved time series Apriori algorithm by frequent itemsets in association rule data mining based on temporal constraint

Evolutionary Intelligence ◽

10.1007/s12065-019-00234-5 ◽

2019 ◽

Vol 13 (1) ◽

pp. 39-49 ◽

Cited By ~ 4

Author(s):

Chunxia Wang ◽

Xiaoyue Zheng

Keyword(s):

Data Mining ◽

Time Series ◽

Association Rule ◽

Frequent Itemsets ◽

Apriori Algorithm ◽

Temporal Constraint

Download Full-text

Research on Association Rule Mining Algorithm Based on Distributed Data

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.998-999.899 ◽

2014 ◽

Vol 998-999 ◽

pp. 899-902 ◽

Cited By ~ 1

Author(s):

Cheng Luo ◽

Ying Chen

Keyword(s):

Data Mining ◽

Association Rule ◽

Association Rule Mining ◽

Large Scale ◽

Frequent Itemsets ◽

Network Communication ◽

Data Mining Algorithm ◽

Distributed Data ◽

Rule Mining ◽

Mining Algorithm

Existing data miming algorithms have mostly implemented data mining under centralized environment, but the large-scale database exists in the distributed form. According to the existing problem of the distributed data mining algorithm FDM and its improved algorithms, which exist the problem that the frequent itemsets are lost and network communication cost too much. This paper proposes a association rule mining algorithm based on distributed data (ARADD). The mapping marks the array mechanism is included in the ARADD algorithm, which can not only keep the integrity of the frequent itemsets, but also reduces the cost of network communication. The efficiency of algorithm is proved in the experiment.

Download Full-text

Efficient Data Streams Based Closed Frequent Itemsets Mining Algorithm

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.256-259.2910 ◽

2012 ◽

Vol 256-259 ◽

pp. 2910-2913

Author(s):

Jun Tan

Keyword(s):

Data Streams ◽

Sliding Window ◽

Frequent Itemsets ◽

Streaming Data ◽

Efficient Data ◽

Closed Itemsets ◽

Frequent Itemsets Mining ◽

Synthetic Datasets ◽

Online Mining ◽

Mining Data Streams

Online mining of frequent closed itemsets over streaming data is one of the most important issues in mining data streams. In this paper, we proposed a novel sliding window based algorithm. The algorithm exploits lattice properties to limit the search to frequent close itemsets which share at least one item with the new transaction. Experiments results on synthetic datasets show that our proposed algorithm is both time and space efficient.

Download Full-text

Improved Algorithm for Mining Maximum Frequent Patterns Based on FP-Tree

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.756-759.3692 ◽

2013 ◽

Vol 756-759 ◽

pp. 3692-3695 ◽

Cited By ~ 1

Author(s):

Nai Li Liu ◽

Lei Ma

Keyword(s):

Data Mining ◽

Fast Algorithm ◽

Association Rule ◽

Frequent Patterns ◽

Memory Space ◽

The Cost ◽

Mining Association Rule ◽

Very High ◽

Improved Algorithm

Mining association rule is an important matter in data mining, in which mining maximum frequent patterns is a key problem. Many of the previous algorithms mine maximum frequent patterns by producing candidate patterns firstly, then pruning. But the cost of producing candidate patterns is very high, especially when there exists long patterns. In this paper, the structure of a FP-tree is improved, we propose a fast algorithm based on FP-Tree for mining maximum frequent patterns, the algorithm does not produce maximum frequent candidate patterns and is more effectively than other improved algorithms. The new FP-Tree is a one-way tree and only retains pointers to point its father in each node, so at least one third of memory is saved. Experiment results show that the algorithm is efficient and saves memory space.

Download Full-text

Research into the Algorithm of Frequent Pattern Mining Based on across Linker

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.195-196.984 ◽

2012 ◽

Vol 195-196 ◽

pp. 984-986

Author(s):

Ming Ru Zhao ◽

Yuan Sun ◽

Jian Guo ◽

Ping Ping Dong

Keyword(s):

Data Mining ◽

Pattern Mining ◽

Frequent Pattern Mining ◽

Frequent Itemsets ◽

Frequent Pattern ◽

Apriori Algorithm ◽

Important Data ◽

Classical Algorithm ◽

Frequent Itemsets Mining ◽

Mining Frequent Itemsets

Frequent itemsets mining is an important data mining task and a focused theme in data mining research. Apriori algorithm is one of the most important algorithm of mining frequent itemsets. However, the Apriori algorithm scans the database too many times, so its efficiency is relatively low. The paper has therefore conducted a research on the mining frequent itemsets algorithm based on a across linker. Through comparing with the classical algorithm, the improved algorithm has obvious advantages.

Download Full-text

The Novel Model of Construct Materials Science and Information Based on Association Rule Mining

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.327.197 ◽

2013 ◽

Vol 327 ◽

pp. 197-200

Author(s):

Guo Fang Kuang ◽

Ying Cun Cao

Keyword(s):

Experimental Data ◽

Data Mining ◽

Association Rule ◽

Association Rule Mining ◽

Materials Science ◽

Frequent Itemsets ◽

Data Sets ◽

The Novel ◽

Rule Mining ◽

Novel Model

The material is used by humans to manufacture the machines, components, devices and other products of substances. Association rules originated in the field of data mining, people use it to find large amounts of data between itemsets of the association. Apriori is a breadth-first algorithm to obtain the support is greater than the minimum support of frequent itemsets by repeatedly scanning the database. This paper presents the construction of materials science and information model based on association rule mining. Experimental data sets prove that the proposed algorithm is effective and reasonable.

Download Full-text