Interestingness Measures for Association Rules

Data Mining and Knowledge Discovery Technologies ◽

10.4018/978-1-59904-960-1.ch002 ◽

2008 ◽

pp. 36-58

Author(s):

Yun Sing Koh ◽

Richard O’Keefe ◽

Nathan Rountree

Keyword(s):

Experimental Study ◽

Selection Bias ◽

Association Rules ◽

Association Rule ◽

Association Rule Mining ◽

Information Gain ◽

Experimental Results ◽

Rule Mining ◽

Interestingness Measures

Association rules are patterns that offer useful information on dependencies that exist between the sets of items. Current association rule mining techniques such as apriori often extract a very large number of rules. To make sense of these rules we need to order or group the rules in some fashion such that the useful patterns are highlighted. The study of this process involves the investigation of an “interestingness” in the rules. To date, various measures have been proposed but unfortunately, these measures present inconsistent information about the interestingness of a rule. In this chapter, we show that different metrics try to capture different dependencies among variables. Each measure has its own selection bias that justifies the rationale for preferring it compared to other measures. We present an experimental study of the behaviour of the interestingness measures such as lift, rule interest, Laplace, and information gain. Our experimental results verify that many of these measures are very similar in nature. From the findings, we introduce a classification of the current interestingness measures.

Download Full-text

A Kernel Density Estimation Based Interestingness Measure for Association Rule Mining

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.20-23.389 ◽

2010 ◽

Vol 20-23 ◽

pp. 389-394

Author(s):

Zhi Feng Hao ◽

Rui Chu Cai ◽

Tang Wu ◽

Yi Yuan Zhou

Keyword(s):

Density Estimation ◽

Association Rules ◽

Kernel Density Estimation ◽

Association Rule ◽

Association Rule Mining ◽

Kernel Density ◽

Rule Mining ◽

Data Set ◽

Interestingness Measures ◽

Interestingness Measure

Association rules provide a concise statement of potentially useful information, and have been widely used in real applications. However, the usefulness of association rules highly depends on the interestingness measure which is used to select interesting rules from millions of candidates. In this study, a probability analysis of association rules is conducted, and a discrete kernel density estimation based interestingness measure is proposed accordingly. The new proposed interestingness measure makes the most of the information contained in the data set and obtains much lower falsely discovery rate than the existing interestingness measures. Experimental results show the effectiveness of the proposed interestingness measure.

Download Full-text

Frequent Closed Itemsets Based Condensed Representations for Association Rules

Post-Mining of Association Rules ◽

10.4018/978-1-60566-404-0.ch013 ◽

2009 ◽

pp. 246-271 ◽

Cited By ~ 3

Author(s):

Nicolas Pasquier

Keyword(s):

Association Rules ◽

Association Rule ◽

Association Rule Mining ◽

Rule Mining ◽

Theoretical Frameworks ◽

Interestingness Measures ◽

Rule Structure ◽

Closed Itemsets ◽

Condensed Representations ◽

Frequent Closed Itemset

After more than one decade of researches on association rule mining, efficient and scalable techniques for the discovery of relevant association rules from large high-dimensional datasets are now available. Most initial studies have focused on the development of theoretical frameworks and efficient algorithms and data structures for association rule mining. However, many applications of association rules to data from different domains have shown that techniques for filtering irrelevant and useless association rules are required to simplify their interpretation by the end-user. Solutions proposed to address this problem can be classified in four main trends: constraint-based mining, interestingness measures, association rule structure analysis, and condensed representations. This chapter focuses on condensed representations that are characterized in the frequent closed itemset framework to expose their advantages and drawbacks.

Download Full-text

Sensitivity Association Rule Mining using Weight based Fuzzy Logic

Global Journal of Enterprise Information System ◽

10.18311/gjeis/2017/15480 ◽

2017 ◽

Vol 9 (2) ◽

pp. 1 ◽

Cited By ~ 2

Author(s):

Meenakshi Bansal ◽

Dinesh Grover ◽

Dhiraj Sharma

Keyword(s):

Data Mining ◽

Fuzzy Logic ◽

Side Effects ◽

Association Rules ◽

Association Rule ◽

Association Rule Mining ◽

Privacy Preserving ◽

Experimental Results ◽

Rule Mining

Mining of sensitive rules is the most important task in data mining. Most of the existing techniques worked on finding sensitive rules based upon the crisp thresh hold value of support and confidence which cause serious side effects to the original database. To avoid these crisp boundaries this paper aims to use WFPPM (Weighted Fuzzy Privacy Preserving Mining) to extract sensitive association rules. WFPPM completely find the sensitive rules by calculating the weights of the rules. At first, we apply FP-Growth to mine association rules from the database. Next, we implement fuzzy to find the sensitive rules among the extracted rules. Experimental results show that the proposed scheme find actual sensitive rules without any modification along with maintaining the quality of the released data as compared to the previous techniques.

Download Full-text

Loss Profit Estimation Using Temporal Association Rule Mining

International Journal of Business Analytics ◽

10.4018/ijban.2016010103 ◽

2016 ◽

Vol 3 (1) ◽

pp. 45-57 ◽

Cited By ~ 4

Author(s):

Reshu Agarwal ◽

Mandeep Mittal ◽

Sarla Pareek

Keyword(s):

Association Rules ◽

Association Rule ◽

Association Rule Mining ◽

Temporal Association ◽

Timing Constraints ◽

Rule Mining ◽

Data Mining Technique ◽

Temporal Association Rule ◽

Frequent Items

Temporal association rule mining is a data mining technique in which relationships between items which satisfy certain timing constraints can be discovered. This paper presents the concept of temporal association rules in order to solve the problem of classification of inventories by including time expressions into association rules. Firstly, loss profit of frequent items is calculated by using temporal association rule mining algorithm. Then, the frequent items in particular time-periods are ranked according to descending order of loss profits. The manager can easily recognize most profitable items with the help of ranking found in the paper. An example is illustrated to validate the results.

Download Full-text

Binary Particle Swarm Optimization-Based Association Rule Mining for Discovering Relationships between Machine Capabilities and Product Features

Mathematical Problems in Engineering ◽

10.1155/2018/2456010 ◽

2018 ◽

Vol 2018 ◽

pp. 1-16 ◽

Cited By ~ 2

Author(s):

Zhicong Kou ◽

Lifeng Xi

Keyword(s):

Particle Swarm Optimization ◽

Association Rules ◽

Association Rule ◽

Association Rule Mining ◽

Particle Swarm ◽

Performance Comparison ◽

Binary Particle Swarm Optimization ◽

Rule Mining ◽

Swarm Optimization ◽

Product Features

An effective data mining method to automatically extract association rules between manufacturing capabilities and product features from the available historical data is essential for an efficient and cost-effective product development and production. This paper proposes a new binary particle swarm optimization- (BPSO-) based association rule mining (BPSO-ARM) method for discovering the hidden relationships between machine capabilities and product features. In particular, BPSO-ARM does not need to predefine thresholds of minimum support and confidence, which improves its applicability in real-world industrial cases. Moreover, a novel overlapping measure indication is further proposed to eliminate those lower quality rules to further improve the applicability of BPSO-ARM. The effectiveness of BPSO-ARM is demonstrated on a benchmark case and an industrial case about the automotive part manufacturing. The performance comparison indicates that BPSO-ARM outperforms other regular methods (e.g., Apriori) for ARM. The experimental results indicate that BPSO-ARM is capable of discovering important association rules between machine capabilities and product features. This will help support planners and engineers for the new product design and manufacturing.

Download Full-text

Finding Efficient Positive and Negative Itemsets Using Interestingness Measures

International Journal of Engineering & Technology ◽

10.14419/ijet.v7i4.36.24133 ◽

2018 ◽

Vol 7 (4.36) ◽

pp. 533

Author(s):

P. Asha ◽

T. Prem Jacob ◽

A. Pravin

Keyword(s):

Association Rule ◽

Association Rule Mining ◽

Data Gathering ◽

Unstructured Data ◽

Rule Mining ◽

Interestingness Measures ◽

Data Mining Algorithms ◽

Data Formats ◽

Mining Algorithms ◽

Rare Itemsets

Currently, data gathering techniques have increased through which unstructured data creeps in, along with well defined data formats. Mining these data and bringing out useful patterns seems difficult. Various data mining algorithms were put forth for this purpose. The associated patterns generated by the association rule mining algorithms are large in number. Every ARM focuses on positive rule mining and very few literature has focussed on rare_itemsets_mining. The work aims at retrieving the rare itemsets that are of most interest to the user by utilizing various interestingness measures. Both positive and negative itemset mining would be focused in this work.

Download Full-text

Analysis of Various Interestingness Measures in Class Association Rule Mining

SICE Journal of Control Measurement and System Integration ◽

10.9746/jcmsi.4.295 ◽

2011 ◽

Vol 4 (4) ◽

pp. 295-304 ◽

Cited By ~ 6

Author(s):

Xianneng LI ◽

Shingo MABU ◽

Huiyu ZHOU ◽

Kaoru SHIMADA ◽

Kotaro HIRASAWA

Keyword(s):

Association Rule ◽

Association Rule Mining ◽

Rule Mining ◽

Interestingness Measures ◽

Class Association Rule

Download Full-text

Semi-Automatic Ontology Construction by Exploiting Functional Dependencies and Association Rules

Semantic Web ◽

10.4018/978-1-4666-3610-1.ch004 ◽

2013 ◽

pp. 76-96

Author(s):

Luca Cagliero ◽

Tania Cerquitelli ◽

Paolo Garza

Keyword(s):

Association Rules ◽

Association Rule ◽

Association Rule Mining ◽

Description Logic ◽

Functional Dependency ◽

Functional Dependencies ◽

Rule Mining ◽

Domain Experts ◽

Data Schema ◽

Input Dataset

This paper presents a novel semi-automatic approach to construct conceptual ontologies over structured data by exploiting both the schema and content of the input dataset. It effectively combines two well-founded database and data mining techniques, i.e., functional dependency discovery and association rule mining, to support domain experts in the construction of meaningful ontologies, tailored to the analyzed data, by using Description Logic (DL). To this aim, functional dependencies are first discovered to highlight valuable conceptual relationships among attributes of the data schema (i.e., among concepts). The set of discovered correlations effectively support analysts in the assertion of the Tbox ontological statements (i.e., the statements involving shared data conceptualizations and their relationships). Then, the analyst-validated dependencies are exploited to drive the association rule mining process. Association rules represent relevant and hidden correlations among data content and they are used to provide valuable knowledge at the instance level. The pushing of functional dependency constraints into the rule mining process allows analysts to look into and exploit only the most significant data item recurrences in the assertion of the Abox ontological statements (i.e., the statements involving concept instances and their relationships).

Download Full-text

Constraint-Based Association Rule Mining

Encyclopedia of Data Warehousing and Mining, Second Edition ◽

10.4018/978-1-60566-010-3.ch049 ◽

2011 ◽

pp. 307-312 ◽

Cited By ~ 10

Author(s):

Carson Kai-Sang Leung

Keyword(s):

Data Mining ◽

Association Rules ◽

Association Rule ◽

Association Rule Mining ◽

Computational Cost ◽

Knowledge Discovery In Databases ◽

Rule Mining ◽

The Subject ◽

User Focus ◽

High Computational Cost

The problem of association rule mining was introduced in 1993 (Agrawal et al., 1993). Since then, it has been the subject of numerous studies. Most of these studies focused on either performance issues or functionality issues. The former considered how to compute association rules efficiently, whereas the latter considered what kinds of rules to compute. Examples of the former include the Apriori-based mining framework (Agrawal & Srikant, 1994), its performance enhancements (Park et al., 1997; Leung et al., 2002), and the tree-based mining framework (Han et al., 2000); examples of the latter include extensions of the initial notion of association rules to other rules such as dependence rules (Silverstein et al., 1998) and ratio rules (Korn et al., 1998). In general, most of these studies basically considered the data mining exercise in isolation. They did not explore how data mining can interact with the human user, which is a key component in the broader picture of knowledge discovery in databases. Hence, they provided little or no support for user focus. Consequently, the user usually needs to wait for a long period of time to get numerous association rules, out of which only a small fraction may be interesting to the user. In other words, the user often incurs a high computational cost that is disproportionate to what he wants to get. This calls for constraint-based association rule mining.

Download Full-text

Association Rule and Quantitative Association Rule Mining among Infrequent Items

Rare Association Rule Mining and Knowledge Discovery ◽

10.4018/978-1-60566-754-6.ch002 ◽

2010 ◽

pp. 15-32 ◽

Cited By ~ 1

Author(s):

Ling Zhou ◽

Stephen Yau

Keyword(s):

Data Mining ◽

Association Rules ◽

Association Rule ◽

Association Rule Mining ◽

Rule Mining ◽

Transactional Databases ◽

Frequent Items ◽

Increasing Demand ◽

Quantitative Association Rule

Association rule mining among frequent items has been extensively studied in data mining research. However, in recent years, there is an increasing demand for mining infrequent items (such as rare but expensive items). Since exploring interesting relationships among infrequent items has not been discussed much in the literature, in this chapter, the authors propose two simple, practical and effective schemes to mine association rules among rare items. Their algorithms can also be applied to frequent items with bounded length. Experiments are performed on the well-known IBM synthetic database. The authors’ schemes compare favorably to Apriori and FP-growth under the situation being evaluated. In addition, they explore quantitative association rule mining in transactional databases among infrequent items by associating quantities of items: some interesting examples are drawn to illustrate the significance of such mining.

Download Full-text