Efficient Mining of Frequent Itemsets and a Measure of Interest for Association Rule Mining

Association rule mining is an important research topic in data mining. Association rule mining consists of two steps: finding frequent itemsets and then extracting interesting rules from the frequent itemsets. In the first step, efficiency is important since discovering frequent itemsets is computationally time consuming. In the second step, unbiased assessment is important for good decision making. In this paper, we deal with both the efficiency of the mining algorithm and the measure of interest of the resulting rules. First, we present an algorithm for finding frequent itemsets that uses a vertical database. We also introduce a modified vertical data format to reduce the size of the database and an itemset reordering strategy to reduce the size of the intermediate tidsets. Second, we present a new measure to evaluate the interest of the resulting association rules. Our performance analysis shows that our proposed algorithm reduces the size of the intermediate tidsets that are generated during the mining process. The smaller tidsets make intersection operations faster. Using our interest-measuring test helps to avoid the discovery of misleading rules.

Download Full-text

Algorithm of Mining Association Rule Based on Matrix

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.513-517.786 ◽

2014 ◽

Vol 513-517 ◽

pp. 786-791

Author(s):

Zi Zhi Lin ◽

Si Hui Shu ◽

Yun Ding

Keyword(s):

Association Rule ◽

Association Rule Mining ◽

Frequent Itemsets ◽

Rule Mining ◽

Rule Based ◽

Data Mining Algorithms ◽

Transaction Database ◽

Temporal And Spatial ◽

Mining Algorithms ◽

Mining Association Rule

Association rule mining is one of the most important techniques of data mining. Algorithms based on matrix are efficient due to only scanning the transaction database for one time. In this paper, an algorithm of association rule mining based on the compression matrix is given. It mainly compresses the transaction matrix by integrating various strategies and fleetly finds frequent itemsets. The new algorithm optimizes the known algorithms of mining association rule based on matrix given by some researchers in recent years, which greatly reduces the temporal and spatial complexity, and highly promotes the efficiency of finding frequent itemsets.

Download Full-text

Study of Various Parallel Implementations of Association Rule Mining Algorithm

American Journal of Advanced Computing ◽

10.15864/ajac.1305 ◽

2020 ◽

Vol 1 (3) ◽

pp. 1-7

Author(s):

Sarbani Dasgupta ◽

Banani Saha

Keyword(s):

Association Rule ◽

Association Rule Mining ◽

Parallel Implementation ◽

Rule Learning ◽

Frequent Itemsets ◽

Rule Mining ◽

Large Dataset ◽

Transactional Databases ◽

Frequent Itemsets Mining ◽

Sequential Implementation

In data mining, Apriori technique is generally used for frequent itemsets mining and association rule learning over transactional databases. The frequent itemsets generated by the Apriori technique provides association rules which are used for finding trends in the database. As the size of the database increases, sequential implementation of Apriori technique will take a lot of time and at one point of time the system may crash. To overcome this problem, several algorithms for parallel implementation of Apriori technique have been proposed. This paper gives a comparative study on various parallel implementation of Apriori technique .It also focuses on the advantages of using the Map Reduce technology, the latest technology used in parallelization of large dataset mining.

Download Full-text

Construction Search Engine Based on Formal Concept Analysis and Association Rule Mining

Advanced Engineering Forum ◽

10.4028/www.scientific.net/aef.6-7.625 ◽

2012 ◽

Vol 6-7 ◽

pp. 625-630 ◽

Cited By ~ 1

Author(s):

Hong Sheng Xu

Keyword(s):

Search Engine ◽

Association Rule ◽

Formal Concept Analysis ◽

Association Rule Mining ◽

High Efficiency ◽

Concept Lattice ◽

Concept Analysis ◽

Frequent Itemsets ◽

Formal Concept ◽

Rule Mining

In the form of background in the form of concept partial relation to the corresponding concept lattice, concept lattice is the core data structure of formal concept analysis. Association rule mining process includes two phases: first find all the frequent itemsets in data collection, Second it is by these frequent itemsets to generate association rules. This paper analyzes the association rule mining algorithms, such as Apriori and FP-Growth. The paper presents the construction search engine based on formal concept analysis and association rule mining. Experimental results show that the proposed algorithm has high efficiency.

Download Full-text

An Efficient Compression Technique for Vertical Mining Methods

Research and Trends in Data Mining Technologies and Applications ◽

10.4018/978-1-59904-271-8.ch006 ◽

2007 ◽

pp. 143-173

Author(s):

Mafruz Ashrafi ◽

David Taniar ◽

Kate Smith

Keyword(s):

Association Rule ◽

Association Rule Mining ◽

Frequent Itemsets ◽

Main Memory ◽

Rule Mining ◽

Compression Technique ◽

Efficient Manner ◽

Handling Technique ◽

Mining Methods ◽

Mining Algorithms

Association rule mining is one of the most widely used data mining techniques. To achieve a better performance, many efficient algorithms have been proposed. Despite these efforts, many of these algorithms require a large amount of main memory to enumerate all frequent itemsets, especially when the dataset is large or the user-specified support is low. Thus, it becomes apparent that we need to have an efficient main memory handling technique, which allows association rule mining algorithms to handle larger datasets in the main memory. To achieve this goal, in this chapter we propose an algorithm for vertical association rule mining that compresses a vertical dataset in an efficient manner, using bit vectors. Our performance evaluations show that the compression ratio attained by our proposed technique is better than those of the other well-known techniques.

Download Full-text

Support-Less Association Rule Mining Using Tuple Count Cube

Journal of Information & Knowledge Management ◽

10.1142/s0219649207001846 ◽

2007 ◽

Vol 06 (04) ◽

pp. 271-280

Author(s):

Qin Ding ◽

William Perrizo

Keyword(s):

Association Rule ◽

Association Rule Mining ◽

Data Cube ◽

Apriori Algorithm ◽

Rule Mining ◽

High Confidence ◽

Support Level ◽

Vertical Data ◽

Frequent Item Sets ◽

High Support

Association rule mining is one of the important tasks in data mining and knowledge discovery (KDD). The traditional task of association rule mining is to find all the rules with high support and high confidence. In some applications, we are interested in finding high confidence rules even though the support may be low. This type of problem differs from the traditional association rule mining problem; hence, it is called support-less association rule mining. Existing algorithms for association rule mining, such as the Apriori algorithm, cannot be used efficiently for support-less association rule mining since those algorithms mostly rely on identifying frequent item-sets with high support. In this paper, we propose a new model to perform support-less association rule mining, i.e., to derive high confidence rules regardless of their support level. A vertical data structure, the Peano Count Tree (P-tree), is used in our model to represent all the information we need. Based on the P-tree structure, we build a special data cube, called the Tuple Count Cube (T-cube), to derive high confidence rules. Data cube operations, such as roll-up, on T-cube, provide efficient ways to calculate the count information needed for support-less association rule mining.

Download Full-text

Research on Association Rule Mining Algorithm Based on Distributed Data

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.998-999.899 ◽

2014 ◽

Vol 998-999 ◽

pp. 899-902 ◽

Cited By ~ 1

Author(s):

Cheng Luo ◽

Ying Chen

Keyword(s):

Data Mining ◽

Association Rule ◽

Association Rule Mining ◽

Large Scale ◽

Frequent Itemsets ◽

Network Communication ◽

Data Mining Algorithm ◽

Distributed Data ◽

Rule Mining ◽

Mining Algorithm

Existing data miming algorithms have mostly implemented data mining under centralized environment, but the large-scale database exists in the distributed form. According to the existing problem of the distributed data mining algorithm FDM and its improved algorithms, which exist the problem that the frequent itemsets are lost and network communication cost too much. This paper proposes a association rule mining algorithm based on distributed data (ARADD). The mapping marks the array mechanism is included in the ARADD algorithm, which can not only keep the integrity of the frequent itemsets, but also reduces the cost of network communication. The efficiency of algorithm is proved in the experiment.

Download Full-text

The Novel Model of Construct Materials Science and Information Based on Association Rule Mining

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.327.197 ◽

2013 ◽

Vol 327 ◽

pp. 197-200

Author(s):

Guo Fang Kuang ◽

Ying Cun Cao

Keyword(s):

Experimental Data ◽

Data Mining ◽

Association Rule ◽

Association Rule Mining ◽

Materials Science ◽

Frequent Itemsets ◽

Data Sets ◽

The Novel ◽

Rule Mining ◽

Novel Model

The material is used by humans to manufacture the machines, components, devices and other products of substances. Association rules originated in the field of data mining, people use it to find large amounts of data between itemsets of the association. Apriori is a breadth-first algorithm to obtain the support is greater than the minimum support of frequent itemsets by repeatedly scanning the database. This paper presents the construction of materials science and information model based on association rule mining. Experimental data sets prove that the proposed algorithm is effective and reasonable.

Download Full-text

Application of Association Rule Mining Technology in Collection and Management of Wireless Sensor Network Node

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.685.575 ◽

2014 ◽

Vol 685 ◽

pp. 575-578

Author(s):

Guang Jiang Wang ◽

Shi Guo Jin

Keyword(s):

Wireless Sensor Network ◽

Sensor Network ◽

Association Rules ◽

Association Rule ◽

Association Rule Mining ◽

Frequent Itemsets ◽

Wireless Sensor ◽

Network Node ◽

Rule Mining ◽

Collection And Management

Association rule mining is an important data mining method; it is the key link of finding frequent itemsets. The process of association rules mining is roughly into two steps: the first step is to find out from all the concentration of all the frequent itemsets; the second step is to obtain the association rules from frequent itemsets. This paper analyzes the collected information of nodes in wireless sensor network and management. The paper presents application of association rule mining technology in the collection and management of wireless sensor network node.

Download Full-text

Confabulation-Inspired Association Rule Mining for Rare and Frequent Itemsets

IEEE Transactions on Neural Networks and Learning Systems ◽

10.1109/tnnls.2014.2303137 ◽

2014 ◽

Vol 25 (11) ◽

pp. 2053-2064 ◽

Cited By ~ 11

Author(s):

Azadeh Soltani ◽

M.-R Akbarzadeh-T

Keyword(s):

Association Rule ◽

Association Rule Mining ◽

Frequent Itemsets ◽

Rule Mining

Download Full-text

An Enhanced Approach to Mine Maximal Frequent Itemset using Maximal Frequent Itemset Prima Algorithm (MFIPA)

Asian Journal of Computer Science and Technology ◽

10.51983/ajcst-2019.8.s2.2035 ◽

2019 ◽

Vol 8 (S2) ◽

pp. 9-12

Author(s):

R. Smeeta Mary ◽

K. Perumal

Keyword(s):

Data Mining ◽

Association Rule ◽

Association Rule Mining ◽

Frequent Itemsets ◽

Frequent Itemset ◽

Decision Makers ◽

New Method ◽

Rule Mining

In data mining finding out the frequent itemsets is one of the very essential topics. Data mining helps in identifying the best knowledge for different decision makers. Frequent itemset generation is the precondition and most time-consuming method for association rule mining. In this paper we suggest a new algorithm for frequent itemset detection that works with datasets in distributed manner. The proposed algorithm brings in a new method to find frequent itemset not including the necessitate to create candidate itemsets. The proposed approach could be implemented using horizontal representation for transaction datasets and allocating prime value. It explores all the frequent itemset that is present in the input and according to the support the maximum frequent itemset is identified. It was applied on different transactions database and compared with well-known algorithms: FP-Growth and Parallel Apriori with different support levels. The try out showed that the proposed algorithm attain major time improvement over both algorithms.

Download Full-text