Closed-Itemset Incremental-Mining Problem

Author(s):  
Luminita Dumitriu

Association rules, introduced by Agrawal, Imielinski and Swami (1993), provide useful means to discover associations in data. The problem of mining association rules in a database is defined as finding all the association rules that hold with more than a user-given minimum support threshold and a user-given minimum confidence threshold. According to Agrawal, Imielinski and Swami, this problem is solved in two steps: 1. Find all frequent itemsets in the database. 2. For each frequent itemset I, generate all the association rules I′ ⇒ I \ I′, where I′ ⊂ I.
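
As a small illustration of step 2, the sketch below enumerates every rule I′ ⇒ I \ I′ for a single frequent itemset and keeps those whose confidence supp(I)/supp(I′) meets the threshold. This is only a minimal Python sketch; the item names and support values are hypothetical, not taken from the article.

    from itertools import combinations

    def rules_from_itemset(itemset, support, min_conf):
        """Generate rules I' => I \\ I' from one frequent itemset I.
        `support` maps frozensets to already-mined support values
        (the values used below are illustrative only)."""
        I = frozenset(itemset)
        rules = []
        for r in range(1, len(I)):                 # every non-empty proper subset I'
            for antecedent in combinations(I, r):
                A = frozenset(antecedent)
                conf = support[I] / support[A]     # conf(I' => I \ I') = supp(I) / supp(I')
                if conf >= min_conf:
                    rules.append((set(A), set(I - A), conf))
        return rules

    # Hypothetical supports for a toy two-item example.
    support = {frozenset({'a'}): 0.6, frozenset({'b'}): 0.5,
               frozenset({'a', 'b'}): 0.4}
    print(rules_from_itemset({'a', 'b'}, support, min_conf=0.7))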

Author(s):  
Weigang Huo ◽  
Xingjie Feng ◽  
Zhiyuan Zhang

Keeping the generated fuzzy frequent itemsets up to date and discovering new fuzzy frequent itemsets are challenging problems in dynamic databases. In this paper, the classical H-struct structure is extended to mining fuzzy frequent itemsets. The extended H-mine algorithm can use any t-norm operator to calculate the support of a fuzzy itemset. Two FP-tree-based structures, the Initial-FP-tree and the New-FP-tree, are built to maintain the fuzzy frequent itemsets in the original database and in the newly inserted transactions, respectively. Incremental mining of fuzzy frequent itemsets is achieved by breadth-first traversal of the Initial-FP-tree and the New-FP-tree. All fuzzy frequent itemsets in the updated database can be obtained by traversing the Initial-FP-tree. Experiments on real datasets show that the proposed approach runs faster than the batch extended H-mine algorithm. Compared with the existing algorithm for incremental mining of fuzzy frequent itemsets, the proposed approach is superior in terms of execution time, and its memory cost is lower when the minimum support threshold is low.
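
The claim that any t-norm can be plugged in to score a fuzzy itemset can be illustrated with the small sketch below. It is not the extended H-mine algorithm itself; the membership values and the choice of the minimum and product t-norms are assumptions made for the example.

    from functools import reduce

    def fuzzy_support(transactions, itemset, tnorm=min):
        """Support of a fuzzy itemset: for each transaction, combine the
        items' membership degrees with a t-norm, then average over the
        database. `transactions` is a list of dicts item -> degree in [0, 1]."""
        total = 0.0
        for t in transactions:
            degrees = [t.get(item, 0.0) for item in itemset]
            total += reduce(tnorm, degrees)        # t-norm combines the degrees
        return total / len(transactions)

    # Toy fuzzy transactions (illustrative membership values only).
    db = [{'milk.high': 0.8, 'bread.high': 0.6},
          {'milk.high': 0.4, 'bread.high': 0.9},
          {'milk.high': 0.0, 'bread.high': 0.7}]

    print(fuzzy_support(db, ['milk.high', 'bread.high'], tnorm=min))                 # minimum t-norm
    print(fuzzy_support(db, ['milk.high', 'bread.high'], tnorm=lambda a, b: a * b))  # product t-norm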


This paper explores the concept and limitations of frequent itemset mining (FIM), whose purpose is to extract previously unknown hidden patterns, in the form of itemsets, from a transactional database. Since candidate generation and support counting are the major tasks in FIM, its main limitations are: (i) a huge number of potentially frequent itemsets is generated as candidates at each pass; (ii) the database is scanned at each pass to compute the support of the generated itemsets; and (iii) the generated itemsets are highly sensitive to the minimum support threshold. SS-FIM, a single-scan algorithm, addresses these limitations, but it hashes many unnecessary itemsets into its buckets. To overcome this, a partition-based approach is proposed in this paper. The proposed approach, PSSFIM, requires a single scan of the database to identify frequent itemsets. A unique feature of PSSFIM is that the number of candidate itemsets generated is independent of the minimum support; only candidates that can possibly be frequent are hashed, which reduces the cost of verifying the support of the generated candidates. PSSFIM is compared with SS-FIM and Apriori on standard datasets, and the results show that it outperforms both.
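
One reason a partition-based pass can bound the candidate pool is the classical partition property: an itemset that is frequent in the whole database must be locally frequent in at least one partition. The sketch below is not the PSSFIM algorithm, only a minimal illustration of that property; the data, the support ratio, and the cap on itemset size are assumptions for the example.

    from itertools import combinations
    from collections import Counter

    def local_candidates(partition, min_sup_ratio, max_size=2):
        """Itemsets locally frequent in one partition (candidate pool only)."""
        counts = Counter()
        for t in partition:
            for k in range(1, max_size + 1):
                for s in combinations(sorted(t), k):
                    counts[s] += 1
        threshold = min_sup_ratio * len(partition)
        return {s for s, c in counts.items() if c >= threshold}

    def partition_candidates(partitions, min_sup_ratio):
        """Union of locally frequent itemsets: every globally frequent
        itemset must appear here, so only these need a global support check."""
        cands = set()
        for p in partitions:
            cands |= local_candidates(p, min_sup_ratio)
        return cands

    # Toy database split into two partitions (illustrative transactions).
    p1 = [{'a', 'b'}, {'a', 'c'}, {'a', 'b', 'c'}]
    p2 = [{'b', 'c'}, {'a', 'b'}]
    print(partition_candidates([p1, p2], min_sup_ratio=0.5))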


2021 ◽  
Vol 48 (4) ◽  
Author(s):  
Hafiz I. Ahmad ◽  
◽  
Alex T. H. Sim ◽  
Roliana Ibrahim ◽  
Mohammad Abrar ◽  
...  

Association rule mining (ARM) is used for discovering frequent itemsets that reveal interesting associative and correlative relationships within data. This gives new insights of great value, both commercial and academic. Traditional ARM techniques discover interesting association rules based on a predefined minimum support threshold. However, there is no standard way to define an exact minimum support, and providing an inappropriate minimum support value may result in missing important rules. In addition, most of the rules discovered by these traditional ARM techniques refer to already known knowledge. To address these limitations of the minimum support threshold in ARM techniques, this study proposes an algorithm to mine interesting association rules without minimum support, using predicate logic and a property of a proposed interestingness measure (the g measure). The algorithm scans the database and uses the g measure's property to search for interesting combinations. The selected combinations are mapped to pseudo-implications, and inference rules of logic are applied to the pseudo-implications to produce and validate the predicate rules. Experimental results of the proposed technique show better performance than state-of-the-art classification techniques, and reliable predicate rules are discovered based on the difference in reliability between the presence and absence of the rule's consequence.


2008 ◽  
pp. 3222-3234
Author(s):  
Yun Sing Koh ◽  
Nathan Rountree ◽  
Richard O’Keefe

Discovering association rules efficiently is an important data mining problem. We define sporadic rules as those with low support but high confidence; for example, a rare association of two symptoms indicating a rare disease. To find such rules using the well-known Apriori algorithm, minimum support has to be set very low, producing a large number of trivial frequent itemsets. To alleviate this problem, we propose a new method of discovering sporadic rules without having to produce all other rules above the minimum support threshold. The new method, called Apriori-Inverse, is a variation of the Apriori algorithm that uses the notion of maximum support instead of minimum support to generate candidate itemsets. Candidate itemsets of interest to us fall below a maximum support value but above a minimum absolute support value. Rules above maximum support are considered frequent rules, which are of no interest to us, whereas rules that occur by chance fall below the minimum absolute support value. We define two classes of sporadic rule: perfectly sporadic rules (those that consist only of items falling below maximum support) and imperfectly sporadic rules (those that may contain items over the maximum support threshold). This article is an expanded version of Koh and Rountree (2005).
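
A minimal sketch of the inverse idea follows; it is not Koh and Rountree's full algorithm and covers only the perfectly sporadic case, where every item must fall below the maximum support. Items above the maximum support never enter candidate generation, while a minimum absolute support count keeps out chance co-occurrences. The data and thresholds are illustrative assumptions.

    from collections import Counter
    from itertools import combinations

    def sporadic_seed_items(transactions, max_sup, min_abs_sup):
        """Items below the maximum support ratio but at or above the
        minimum absolute support count."""
        n = len(transactions)
        counts = Counter(item for t in transactions for item in t)
        return {i for i, c in counts.items()
                if c >= min_abs_sup and c / n < max_sup}

    def candidate_pairs(transactions, max_sup, min_abs_sup):
        """First level of Apriori-Inverse-style growth: pairs built only
        from sporadic seed items, kept if they clear the absolute floor."""
        seeds = sporadic_seed_items(transactions, max_sup, min_abs_sup)
        counts = Counter()
        for t in transactions:
            for pair in combinations(sorted(set(t) & seeds), 2):
                counts[pair] += 1
        return {p: c for p, c in counts.items() if c >= min_abs_sup}

    # Toy data: 'x' is too frequent, so it never appears in any candidate.
    db = [{'x', 'a', 'b'}, {'x', 'a', 'b'}, {'x', 'c'}, {'x', 'a'},
          {'x', 'b'}, {'x', 'c'}, {'x'}, {'x'}, {'x'}, {'x', 'a', 'b'}]
    print(candidate_pairs(db, max_sup=0.5, min_abs_sup=2))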


Author(s):  
Fatimah Audah Md. Zaki ◽  
Nurul Fariza Zulkurnain

Mining frequent itemsets from a large dataset has a major drawback: the explosive number of itemsets requires an additional mining process to filter out the interesting ones. As a solution, the concept of the closed frequent itemset was introduced, a lossless and condensed representation of all frequent itemsets and their corresponding supports. Unfortunately, many algorithms are not memory-efficient, since they require storing closed itemsets in main memory for duplication checks. This paper presents BFF, a scalable algorithm for discovering closed frequent itemsets from high-dimensional data. Unlike many well-known algorithms, BFF traverses the search tree in a breadth-first manner, resulting in minimal memory use and shorter running time. Tests conducted on a number of microarray datasets show that the performance of this algorithm improves significantly as the support threshold decreases, which is crucial for generating more interesting rules.
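
A closed frequent itemset is one with no proper superset of identical support. The brute-force filter below is not the BFF algorithm; it only spells out the definition that any duplication check must enforce, using toy support values.

    def closed_itemsets(frequent):
        """Keep itemsets that have no proper superset with equal support.
        `frequent` maps frozensets to supports (toy numbers below)."""
        closed = {}
        for itemset, sup in frequent.items():
            if not any(itemset < other and sup == other_sup
                       for other, other_sup in frequent.items()):
                closed[itemset] = sup
        return closed

    frequent = {frozenset({'a'}): 5,
                frozenset({'b'}): 4,
                frozenset({'a', 'b'}): 4}   # {'b'} is not closed: {'a','b'} has equal support
    print(closed_itemsets(frequent))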


2013 ◽  
Vol 411-414 ◽  
pp. 386-389 ◽  
Author(s):  
Tian Tian Xu ◽  
Xiang Jun Dong

Negative frequent itemsets (NFIS), such as (a1 a2 ¬a3 a4), play an important role in real applications because valuable negative association rules can be mined from them. In one of our previous works, we proposed a method, named e-NFIS, to mine NFIS from positive frequent itemsets (PFIS). However, e-NFIS uses only a single minimum support, which implicitly assumes that all items in the database are of the same nature or of similar frequencies. This is often not the case in real-life applications, so many methods have been proposed to mine frequent itemsets with multiple minimum supports; these methods allow users to assign different minimum supports to different items. However, they mine only PFIS and do not consider negative ones. In this paper, we therefore propose a new method, named e-msNFIS, to mine NFIS from PFIS based on multiple minimum supports. e-msNFIS contains three steps: 1) using existing methods to mine PFIS with multiple minimum supports; 2) using the same method as in e-NFIS to generate negative candidate itemsets (NCIS) from the PFIS obtained in step 1; 3) calculating the support of these NCIS using only the support of the PFIS, and then obtaining the NFIS. Experimental results show that e-msNFIS is efficient.
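
Step 3, deriving the support of a candidate with negated items purely from positive supports, can be illustrated with inclusion-exclusion; this is a generic identity, not necessarily the paper's exact formula, and the support values below are made up.

    from itertools import combinations

    def negative_support(pos_items, neg_items, supp):
        """Support of an itemset with negated items, computed only from
        positive supports by inclusion-exclusion:
            supp(X, not-Y) = sum over S subset of Y of (-1)^|S| * supp(X union S).
        `supp` maps frozensets of positive items to supports."""
        X, Y = frozenset(pos_items), list(neg_items)
        total = 0.0
        for k in range(len(Y) + 1):
            for S in combinations(Y, k):
                total += (-1) ** k * supp[X | frozenset(S)]
        return total

    # Illustrative positive supports for supp(a1 a2 ¬a3).
    supp = {frozenset({'a1', 'a2'}): 0.30,
            frozenset({'a1', 'a2', 'a3'}): 0.12}
    print(negative_support({'a1', 'a2'}, {'a3'}, supp))   # 0.30 - 0.12 = 0.18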


2013 ◽  
Vol 321-324 ◽  
pp. 2578-2582
Author(s):  
Qian Zhang

This paper examined the application of the Apriori algorithm to extracting association rules in data mining, using sample data on student enrollments. It studied data mining techniques for the extraction of association rules, analyzed the correlation between specialties and the characteristics of admitted students, and evaluated the algorithm for mining association rules with a minimum support of 30% and a minimum confidence of 40%.


2021 ◽  
Vol 14 (2) ◽  
pp. 125
Author(s):  
Ainul Mardiaha ◽  
Yulia Yulia

This research was carried out to simplify and assist the Candra Motor workshop owner in managing data and records of motorcycle spare-part sales by applying the Apriori data mining algorithm. Data mining uses particular techniques or methods to look for patterns in selected data. One year of sales data covering 15 items was selected and processed with the Apriori algorithm. The Apriori algorithm mines association rules to determine the associative relationships of item combinations. It determines the frequent 1-itemsets, 2-itemsets, and 3-itemsets, from which the association rules can be obtained from the previously selected data. To obtain the frequent itemsets, each selected item combination must meet the minimum support and minimum confidence requirements. This study used a minimum support of at least 7 transactions (0.583) and a minimum confidence of 90%. Several association rules were obtained, and the manual calculation of the association rules and the calculation using the WEKA software produced the same results. By fulfilling the minimum support and minimum confidence requirements, the best-selling spare parts were found to be inner tubes, Yamaha oil, and MPX oil.
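
The level-wise counting of frequent 1-, 2- and 3-itemsets described above can be sketched as follows. This brute-force version skips Apriori's candidate pruning from the previous level for brevity, and the transactions, item names and the minimum support count are illustrative, not the workshop's actual data or thresholds.

    from collections import Counter
    from itertools import combinations

    def frequent_k_itemsets(transactions, k, min_sup_count):
        """Count all k-item combinations per transaction and keep those
        meeting the minimum support count (level-wise, as in Apriori)."""
        counts = Counter()
        for t in transactions:
            for combo in combinations(sorted(t), k):
                counts[combo] += 1
        return {c: n for c, n in counts.items() if n >= min_sup_count}

    # Toy sales transactions with a scaled-down support count of 2.
    sales = [{'inner tube', 'Yamaha oil'}, {'inner tube', 'MPX oil'},
             {'inner tube', 'Yamaha oil', 'MPX oil'}, {'Yamaha oil'},
             {'inner tube', 'Yamaha oil'}]

    for k in (1, 2, 3):
        print(k, frequent_k_itemsets(sales, k, min_sup_count=2))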

