A Distributed Algorithm for Mining Fuzzy Association Rules in Traditional Databases

2011 ◽

pp. 685-705

Author(s):

Wai-Ho Au

Keyword(s):

Distributed System ◽

Association Rules ◽

Distributed Algorithm ◽

Data Sets ◽

Linguistic Variables ◽

Linguistic Terms ◽

Fuzzy Association Rules ◽

Interestingness Measure ◽

The Difference ◽

Fuzzy Association Rule

The mining of fuzzy association rules has been proposed in the literature recently. Many of the ensuing algorithms are developed to make use of only a single processor or machine. They can be further enhanced by taking advantage of the scalability of parallel or distributed computer systems. The increasing ability to collect data and the resulting huge data volume make the exploitation of parallel or distributed systems become more and more important to the success of fuzzy association rule mining algorithms. This chapter proposes a new distributed algorithm, called DFARM, for mining fuzzy association rules from very large databases. Unlike many existing algorithms that adopt the support-confidence framework such that an association is considered interesting if it satisfies some user-specified minimum percentage thresholds, DFARM embraces an objective measure to distinguish interesting associations from uninteresting ones. This measure is defined as a function of the difference in the actual and the expected number of tuples characterized by different linguistic variables (attributes) and linguistic terms (attribute values). Given a database, DFARM first divides it into several horizontal partitions and assigns them to different sites in a distributed system. It then has each site scan its own database partition to obtain the number of tuples characterized by different linguistic variables and linguistic terms (i.e., the local counts), and exchange the local counts with all the other sites to find the global counts. Based on the global counts, the values of the interestingness measure are computed, and the sites can uncover interesting associations. By repeating this process of counting, exchanging counts, and calculating the interestingness measure, it unveils the underlying interesting associations hidden in the data. We implemented DFARM in a distributed system and used a popular benchmark data set to evaluate its performance. The results show that it has very good size-up, speedup, and scale-up performance. We also evaluated the effectiveness of the proposed interestingness measure on two synthetic data sets. The experimental results show that it is very effective in differentiating between interesting and uninteresting associations.

Download Full-text

A novel interestingness measure based on fusion model for association rules mining

MATEC Web of Conferences ◽

10.1051/matecconf/202133605009 ◽

2021 ◽

Vol 336 ◽

pp. 05009

Author(s):

Junrui Yang ◽

Lin Xu

Keyword(s):

Association Rules ◽

Negative Association ◽

Fusion Model ◽

Interestingness Measures ◽

Association Rules Mining ◽

Interestingness Measure ◽

Advantages And Disadvantages ◽

Measure Model ◽

Negative Association Rules ◽

The Difference

Aiming at the shortcomings of the traditional "support-confidence" association rules mining framework and the problems of mining negative association rules, the concept of interestingness measure is introduced. Analyzed the advantages and disadvantages of some commonly used interestingness measures at present, and combined the cosine measure on the basis of the interestingness measure model based on the difference idea, and proposed a new interestingness measure model. The interestingness measure can effectively express the relationship between the antecedent and the subsequent part of the rule. According to this model, an association rules mining algorithm based on the interestingness measure fusion model is proposed to improve the accuracy of mining. Experiments show that the algorithm has better performance and can effectively help mining positive and negative association rules.

Download Full-text

Finding Active Membership Functions for Genetic-Fuzzy Data Mining

International Journal of Information Technology & Decision Making ◽

10.1142/s0219622015500297 ◽

2015 ◽

Vol 14 (06) ◽

pp. 1215-1242 ◽

Cited By ~ 3

Author(s):

Chun-Hao Chen ◽

Tzung-Pei Hong ◽

Yeong-Chyi Lee ◽

Vincent S. Tseng

Keyword(s):

Data Mining ◽

Association Rules ◽

Fuzzy Data ◽

Membership Functions ◽

Linguistic Terms ◽

Fuzzy Association Rules ◽

Fitness Functions ◽

The Given ◽

Binary Strings ◽

Active Membership

Since transactions may contain quantitative values, many approaches have been proposed to derive membership functions for mining fuzzy association rules using genetic algorithms (GAs), a process known as genetic-fuzzy data mining. However, existing approaches assume that the number of linguistic terms is predefined. Thus, this study proposes a genetic-fuzzy mining approach for extracting an appropriate number of linguistic terms and their membership functions used in fuzzy data mining for the given items. The proposed algorithm adjusts membership functions using GAs and then uses them to fuzzify the quantitative transactions. Each individual in the population represents a possible set of membership functions for the items and is divided into two parts, control genes (CGs) and parametric genes (PGs). CGs are encoded into binary strings and used to determine whether membership functions are active. Each set of membership functions for an item is encoded as PGs with real-number schema. In addition, seven fitness functions are proposed, each of which is used to evaluate the goodness of the obtained membership functions and used as the evolutionary criteria in GA. After the GA process terminates, a better set of association rules with a suitable set of membership functions is obtained. Experiments are made to show the effectiveness of the proposed approach.

Download Full-text

Finding Associations in Composite Data Sets

Developments in Data Extraction, Management, and Analysis ◽

10.4018/978-1-4666-2148-0.ch008 ◽

2013 ◽

pp. 162-186

Author(s):

M. Sulaiman Khan ◽

Maybin Muyeba ◽

Frans Coenen ◽

David Reid ◽

Hissam Tawfik

Keyword(s):

Synthetic Data ◽

Data Sets ◽

Formal Definition ◽

Rule Mining ◽

Fuzzy Association Rules ◽

Back Ground ◽

Fuzzy Association Rule ◽

Definition Of ◽

Fuzzy Association Rule Mining ◽

Composite Data

In this paper, a composite fuzzy association rule mining mechanism (CFARM), directed at identifying patterns in datasets comprised of composite attributes, is described. Composite attributes are defined as attributes that can take simultaneously two or more values that subscribe to a common schema. The objective is to generate fuzzy association rules using “properties” associated with these composite attributes. The exemplar application is the analysis of the nutrients contained in items found in grocery data sets. The paper commences with a review of the back ground and related work, and a formal definition of the CFARM concepts. The CFARM algorithm is then fully described and evaluated using both real and synthetic data sets.

Download Full-text

GENETIC-FUZZY MINING WITH TAXONOMY

International Journal of Uncertainty Fuzziness and Knowledge-Based Systems ◽

10.1142/s021848851240020x ◽

2012 ◽

Vol 20 (supp02) ◽

pp. 187-205 ◽

Cited By ~ 7

Author(s):

CHUN-HAO CHEN ◽

TZUNG-PEI HONG ◽

YEONG-CHYI LEE

Keyword(s):

Association Rules ◽

Multiple Level ◽

Membership Functions ◽

Rule Mining ◽

Fuzzy Association Rules ◽

Fitness Value ◽

Real World Applications ◽

Fuzzy Association Rule ◽

Fuzzy Association Rule Mining ◽

The Given

Data mining is most commonly used in attempts to induce association rules from transaction data. Since transactions in real-world applications usually consist of quantitative values, many fuzzy association-rule mining approaches have been proposed on single- or multiple-concept levels. However, the given membership functions may have a critical influence on the final mining results. In this paper, we propose a multiple-level genetic-fuzzy mining algorithm for mining membership functions and fuzzy association rules using multiple-concept levels. It first encodes the membership functions of each item class (category) into a chromosome according to the given taxonomy. The fitness value of each individual is then evaluated by the summation of large 1-itemsets of each item in different concept levels and the suitability of membership functions in the chromosome. After the GA process terminates, a better set of multiple-level fuzzy association rules can then be expected with a more suitable set of membership functions. Experimental results on a simulation dataset also show the effectiveness of the algorithm.

Download Full-text

A Survey on Fuzzy Association Rule Mining

International Journal of Data Warehousing and Mining ◽

10.4018/jdwm.2013010101 ◽

2013 ◽

Vol 9 (1) ◽

pp. 1-27 ◽

Cited By ~ 13

Author(s):

Harihar Kalia ◽

Satchidananda Dehuri ◽

Ashish Ghosh

Keyword(s):

Association Rules ◽

Association Rule ◽

Quantitative Data ◽

Association Rule Mining ◽

Real Life ◽

Rule Mining ◽

Fuzzy Association Rules ◽

Fuzzy Association Rule ◽

Mining Algorithms ◽

Fuzzy Association Rule Mining

Association rule mining is one of the fundamental tasks of data mining. The conventional association rule mining algorithms, using crisp set, are meant for handling Boolean data. However, in real life quantitative data are voluminous and need careful attention for discovering knowledge. Therefore, to extract association rules from quantitative data, the dataset at hand must be partitioned into intervals, and then converted into Boolean type. In the sequel, it may suffer with the problem of sharp boundary. Hence, fuzzy association rules are developed as a sharp knife to solve the aforesaid problem by handling quantitative data using fuzzy set. In this paper, the authors present an updated survey of fuzzy association rule mining procedures along with a discussion and relevant pointers for further research.

Download Full-text

Finding Associations in Composite Data Sets

International Journal of Data Warehousing and Mining ◽

10.4018/jdwm.2011070101 ◽

2011 ◽

Vol 7 (3) ◽

pp. 1-29 ◽

Cited By ~ 10

Author(s):

M. Sulaiman Khan ◽

Maybin Muyeba ◽

Frans Coenen ◽

David Reid ◽

Hissam Tawfik

Keyword(s):

Association Rule ◽

Synthetic Data ◽

Data Sets ◽

Rule Mining ◽

Fuzzy Association Rules ◽

Back Ground ◽

Fuzzy Association Rule ◽

Definition Of ◽

Fuzzy Association Rule Mining ◽

Composite Data

In this paper, a composite fuzzy association rule mining mechanism (CFARM), directed at identifying patterns in datasets comprised of composite attributes, is described. Composite attributes are defined as attributes that can take simultaneously two or more values that subscribe to a common schema. The objective is to generate fuzzy association rules using “properties” associated with these composite attributes. The exemplar application is the analysis of the nutrients contained in items found in grocery data sets. The paper commences with a review of the back ground and related work, and a formal definition of the CFARM concepts. The CFARM algorithm is then fully described and evaluated using both real and synthetic data sets.

Download Full-text

A DISTRIBUTED ALGORITHM FOR MINING FUZZY ASSOCIATION RULES

Proceedings of the First International Conference on Web Information Systems and Technologies ◽

10.5220/0001228802060209 ◽

2005 ◽

Keyword(s):

Association Rules ◽

Distributed Algorithm ◽

Fuzzy Association Rules

Download Full-text

Mining fuzzy association rules with 2-tuple linguistic terms in stock market data by using genetic algorithm

The 16th CSI International Symposium on Artificial Intelligence and Signal Processing (AISP 2012) ◽

10.1109/aisp.2012.6313772 ◽

2012 ◽

Author(s):

Hadi Lafzi Ghazi ◽

Mohammad Saniee Abadeh

Keyword(s):

Genetic Algorithm ◽

Stock Market ◽

Association Rules ◽

Linguistic Terms ◽

Market Data ◽

Fuzzy Association Rules

Download Full-text

Effective Mining of Weighted Fuzzy Association Rules

Rare Association Rule Mining and Knowledge Discovery ◽

10.4018/978-1-60566-754-6.ch004 ◽

2010 ◽

pp. 47-64 ◽

Cited By ~ 2

Author(s):

Maybin Muyeba ◽

M. Sulaiman Khan ◽

Frans Coenen

Keyword(s):

Association Rules ◽

Association Rule ◽

Association Rule Mining ◽

Real Data ◽

Closure Property ◽

Data Sets ◽

Rule Mining ◽

Fuzzy Association Rules ◽

Novel Approach ◽

Downward Closure

A novel approach is presented for effectively mining weighted fuzzy association rules (ARs). The authors address the issue of invalidation of downward closure property (DCP) in weighted association rule mining where each item is assigned a weight according to its significance wrt some user defined criteria. Most works on weighted association rule mining do not address the downward closure property while some make assumptions to validate the property. This chapter generalizes the weighted association rule mining problem with binary and fuzzy attributes with weighted settings. Their methodology follows an Apriori approach but employs T-tree data structure to improve efficiency of counting itemsets. The authors’ approach avoids pre and post processing as opposed to most weighted association rule mining algorithms, thus eliminating the extra steps during rules generation. The chapter presents experimental results on both synthetic and real-data sets and a discussion on evaluating the proposed approach.

Download Full-text