Finding Active Membership Functions for Genetic-Fuzzy Data Mining

Since transactions may contain quantitative values, many approaches have been proposed to derive membership functions for mining fuzzy association rules using genetic algorithms (GAs), a process known as genetic-fuzzy data mining. However, existing approaches assume that the number of linguistic terms is predefined. This study therefore proposes a genetic-fuzzy mining approach that extracts an appropriate number of linguistic terms, together with their membership functions, for the given items. The proposed algorithm adjusts membership functions using GAs and then uses them to fuzzify the quantitative transactions. Each individual in the population represents a possible set of membership functions for the items and is divided into two parts: control genes (CGs) and parametric genes (PGs). CGs are encoded as binary strings and determine whether each membership function is active. Each set of membership functions for an item is encoded as PGs using a real-number schema. In addition, seven fitness functions are proposed, each of which evaluates the goodness of the obtained membership functions and serves as the evolutionary criterion in the GA. After the GA process terminates, a better set of association rules with a suitable set of membership functions is obtained. Experiments show the effectiveness of the proposed approach.
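
The split into control genes and parametric genes can be illustrated with a minimal sketch. The abstract does not give the exact encoding, so the layout below is an assumption: one control bit per candidate function, triangular membership functions, and three real-valued parametric genes (left, center, right) per candidate. The names `decode_item` and `membership` are hypothetical.

```python
from typing import List, Tuple

# Assumed encoding (not from the paper): each candidate membership function
# is a triangle described by three real numbers (left, center, right).
Triangle = Tuple[float, float, float]


def decode_item(control_genes: List[int], parametric_genes: List[float]) -> List[Triangle]:
    """Return the active triangular membership functions for one item."""
    if len(parametric_genes) != 3 * len(control_genes):
        raise ValueError("expected three parametric genes per control gene")
    active = []
    for i, bit in enumerate(control_genes):
        if bit == 1:  # a control gene of 1 switches the candidate on
            left, center, right = parametric_genes[3 * i: 3 * i + 3]
            active.append((left, center, right))
    return active


def membership(x: float, tri: Triangle) -> float:
    """Degree to which quantity x belongs to a triangular fuzzy set."""
    left, center, right = tri
    if x == center:
        return 1.0
    if left < x < center:
        return (x - left) / (center - left)
    if center < x < right:
        return (right - x) / (right - center)
    return 0.0


# Three candidate functions for one item; the middle one is switched off.
cgs = [1, 0, 1]
pgs = [0.0, 3.0, 6.0, 3.0, 6.0, 9.0, 6.0, 9.0, 12.0]
active_functions = decode_item(cgs, pgs)

# Fuzzify a purchased quantity of 4.5 with the active functions only.
degrees = [membership(4.5, tri) for tri in active_functions]
```

Under this assumed layout, flipping a control bit changes the number of linguistic terms without touching the parametric genes, which is one way a GA could search over the number of terms and their shapes at the same time.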

GENETIC-FUZZY MINING WITH TAXONOMY

International Journal of Uncertainty Fuzziness and Knowledge-Based Systems ◽

10.1142/s021848851240020x ◽

2012 ◽

Vol 20 (supp02) ◽

pp. 187-205 ◽

Cited By ~ 7

Author(s):

CHUN-HAO CHEN ◽

TZUNG-PEI HONG ◽

YEONG-CHYI LEE

Keyword(s):

Association Rules ◽

Multiple Level ◽

Membership Functions ◽

Rule Mining ◽

Fuzzy Association Rules ◽

Fitness Value ◽

Real World Applications ◽

Fuzzy Association Rule ◽

Fuzzy Association Rule Mining

Data mining is most commonly used in attempts to induce association rules from transaction data. Since transactions in real-world applications usually consist of quantitative values, many fuzzy association-rule mining approaches have been proposed on single- or multiple-concept levels. However, the given membership functions may have a critical influence on the final mining results. In this paper, we propose a multiple-level genetic-fuzzy mining algorithm for mining membership functions and fuzzy association rules using multiple-concept levels. It first encodes the membership functions of each item class (category) into a chromosome according to the given taxonomy. The fitness value of each individual is then evaluated by the summation of large 1-itemsets of each item in different concept levels and the suitability of membership functions in the chromosome. After the GA process terminates, a better set of multiple-level fuzzy association rules can then be expected with a more suitable set of membership functions. Experimental results on a simulation dataset also show the effectiveness of the algorithm.
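
As a rough illustration of one ingredient of the fitness evaluation, the sketch below counts large 1-itemsets for a single concept level. It assumes triangular membership functions and a fuzzy support defined as the average membership degree over all transactions. The abstract specifies neither these choices nor the suitability term, so the suitability term is left out and the names `membership` and `count_large_1_itemsets` are hypothetical.

```python
from typing import Dict, List, Tuple

Triangle = Tuple[float, float, float]  # assumed shape: (left, center, right)


def membership(x: float, tri: Triangle) -> float:
    """Triangular membership degree of quantity x."""
    left, center, right = tri
    if x == center:
        return 1.0
    if left < x < center:
        return (x - left) / (center - left)
    if center < x < right:
        return (right - x) / (right - center)
    return 0.0


def count_large_1_itemsets(
    transactions: List[Dict[str, float]],
    functions: Dict[str, Dict[str, Triangle]],
    min_support: float,
) -> int:
    """Count (item, linguistic term) pairs whose fuzzy support reaches min_support."""
    count = 0
    for item, terms in functions.items():
        for tri in terms.values():
            # Transactions that do not contain the item contribute nothing.
            total = sum(membership(t[item], tri) for t in transactions if item in t)
            if total / len(transactions) >= min_support:
                count += 1
    return count


# Hypothetical data: quantities purchased per transaction.
transactions = [
    {"milk": 3.0},
    {"milk": 4.5, "bread": 9.0},
    {"bread": 2.0},
    {"milk": 6.0},
]
functions = {
    "milk": {"low": (0.0, 3.0, 6.0), "high": (3.0, 6.0, 9.0)},
    "bread": {"low": (0.0, 3.0, 6.0), "high": (6.0, 9.0, 12.0)},
}
large_count = count_large_1_itemsets(transactions, functions, min_support=0.3)
```

In a multiple-level setting, the same count would be repeated for the item classes at each level of the taxonomy and the results summed.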

A Comparison of Different Fitness Functions for Extracting Membership Functions Used in Fuzzy Data Mining

2007 IEEE Symposium on Foundations of Computational Intelligence ◽

10.1109/foci.2007.371526 ◽

2007 ◽

Cited By ~ 3

Author(s):

Chun-Hao Chen ◽

Tzung-Pei Hong ◽

Vincent S. Tseng

Keyword(s):

Data Mining ◽

Fuzzy Data ◽

Membership Functions ◽

Fitness Functions

Mining Fuzzy Association Rules in Quantitative Databases

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.182-183.2003 ◽

2012 ◽

Vol 182-183 ◽

pp. 2003-2007

Author(s):

Yi Ming Bai ◽

Xian Yao Meng ◽

Xin Jie Han

Keyword(s):

Data Mining ◽

Association Rules ◽

Fuzzy Rule ◽

Real Data ◽

Rule Base ◽

Linguistic Terms ◽

Fuzzy Association Rules ◽

Rule Number ◽

Discrete Values

In this paper, we introduce a novel technique for mining fuzzy association rules in quantitative databases. Unlike other data mining techniques, which can only discover association rules over discrete values, the algorithm reveals the relationships among different quantitative values by traversing the partition grids and produces the corresponding fuzzy association rules. Fuzzy association rules employ linguistic terms to represent the regularities and exceptions revealed in quantitative databases. After the fuzzy rule base is built, we use the definition of support degree from data mining to reduce the number of rules and retain the useful ones. Throughout the paper, we use a set of real data from a wine database to demonstrate the ideas and test the models.
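
The grid traversal and support-based pruning can be sketched as follows. The abstract does not define the support degree or the grid construction, so this sketch assumes two already-fuzzified attributes, takes the minimum of the two membership degrees as the degree of a grid cell, and averages it over the records. The attribute names, the data, and the functions `grid_supports` and `prune` are all hypothetical.

```python
from itertools import product
from typing import Dict, List, Tuple

# Hypothetical fuzzified records: the membership degree of each record
# in each linguistic term of two attributes.
alcohol = [
    {"low": 0.8, "high": 0.2},
    {"low": 0.1, "high": 0.9},
    {"low": 0.0, "high": 1.0},
]
quality = [
    {"poor": 0.7, "good": 0.3},
    {"poor": 0.2, "good": 0.8},
    {"poor": 0.0, "good": 1.0},
]


def grid_supports(
    a: List[Dict[str, float]], b: List[Dict[str, float]]
) -> Dict[Tuple[str, str], float]:
    """Visit every grid cell (term of a, term of b) and compute its support."""
    supports = {}
    for term_a, term_b in product(a[0].keys(), b[0].keys()):
        # Assumed support degree: mean of min(degree_a, degree_b) over records.
        total = sum(min(ra[term_a], rb[term_b]) for ra, rb in zip(a, b))
        supports[(term_a, term_b)] = total / len(a)
    return supports


def prune(
    supports: Dict[Tuple[str, str], float], min_support: float
) -> Dict[Tuple[str, str], float]:
    """Keep only the cells whose support reaches the threshold."""
    return {cell: s for cell, s in supports.items() if s >= min_support}


kept = prune(grid_supports(alcohol, quality), min_support=0.25)
```

Each surviving cell corresponds to a candidate rule relating the two linguistic terms, for example "alcohol is high" with "quality is good" in this made-up data.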

Finding Active Membership Functions in Fuzzy Data Mining

Data Mining: Foundations and Practice - Studies in Computational Intelligence ◽

10.1007/978-3-540-78488-3_11 ◽

2008 ◽

pp. 179-196 ◽

Cited By ~ 3

Author(s):

Tzung-Pei Hong ◽

Chun-Hao Chen ◽

Yu-Lung Wu ◽

Vincent S. Tseng

Keyword(s):

Data Mining ◽

Fuzzy Data ◽

Membership Functions ◽

Active Membership

A Novel and Improved Apriori Algorithm

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.721.543 ◽

2014 ◽

Vol 721 ◽

pp. 543-546 ◽

Cited By ~ 1

Author(s):

Dong Juan Gu ◽

Lei Xia

Keyword(s):

Data Mining ◽

Association Rules ◽

Inner Product ◽

Apriori Algorithm ◽

Product Operation ◽

Classical Algorithm ◽

Transaction Database ◽

Improved Algorithm

The Apriori algorithm is the classical algorithm for mining association rules. Because it must scan the database many times, it runs slowly. To improve running efficiency, this paper proposes an improved algorithm based on an analysis of Apriori. The idea is to transform the transaction database into a corresponding 0-1 matrix, in which the inner product of each vector with each subsequent vector yields the support. The supports are compared with the given minimum support (minsupport), and rows and columns whose vectors fall below it are deleted, which reduces the size of the matrix and improves running speed. Because the improved algorithm scans the database only once, it runs faster. Experiments also show that the improved algorithm is efficient and feasible.
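
The matrix idea can be sketched for 1-itemsets and 2-itemsets. This is not the authors' implementation: it assumes one 0-1 column vector per item, uses absolute support counts, prunes only columns (row deletion is omitted), and the function names are hypothetical.

```python
from itertools import combinations
from typing import Dict, List, Set, Tuple


def build_columns(transactions: List[Set[str]]) -> Dict[str, List[int]]:
    """Scan the database once and build one 0-1 vector per item."""
    items = sorted(set().union(*transactions))
    return {item: [1 if item in t else 0 for t in transactions] for item in items}


def inner(u: List[int], v: List[int]) -> int:
    """Inner product of two 0-1 vectors, i.e. the co-occurrence count."""
    return sum(x * y for x, y in zip(u, v))


def frequent_pairs(
    columns: Dict[str, List[int]], min_count: int
) -> Tuple[Dict[str, int], Dict[Tuple[str, str], int]]:
    # Support of a single item is the inner product of its vector with itself.
    singles = {}
    for item, col in columns.items():
        support = inner(col, col)
        if support >= min_count:
            singles[item] = support
    # Infrequent columns are dropped; only surviving items are paired up.
    pairs = {}
    for x, y in combinations(sorted(singles), 2):
        support = inner(columns[x], columns[y])
        if support >= min_count:
            pairs[(x, y)] = support
    return singles, pairs


# Hypothetical transaction database.
transactions = [{"a", "b", "c"}, {"a", "c"}, {"a", "d"}, {"b", "c"}]
columns = build_columns(transactions)
frequent_1, frequent_2 = frequent_pairs(columns, min_count=2)
```

After the matrix is built, all support counts come from vector arithmetic rather than from further passes over the transactions.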

Association Rules Mining Based on Adaptive Fuzzy Clustering Algorithm

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.998-999.842 ◽

2014 ◽

Vol 998-999 ◽

pp. 842-845 ◽

Cited By ~ 1

Author(s):

Jia Mei Guo ◽

Yin Xiang Pei

Keyword(s):

Data Mining ◽

Association Rules ◽

Clustering Algorithm ◽

Original Data ◽

Data Set ◽

Association Rules Mining ◽

Fuzzy Association Rules ◽

Redundant Data ◽

Fuzzy Partitions ◽

Rules Extraction

Association rule extraction is one of the important goals of data mining and analysis. To address the information loss caused by crisp partitioning of numerical attributes, this article puts forward a fuzzy association rule mining method based on fuzzy logic. First, c-means clustering is used to generate fuzzy partitions and eliminate redundant data. The original data set is then mapped into fuzzy intervals. Finally, fuzzy association rules are extracted from the fuzzy data set to provide a basis for decision-making. Results show that the method can effectively improve the efficiency of data mining and the semantic visualization and credibility of the association rules.
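
The step that generates fuzzy partitions can be illustrated with the standard fuzzy c-means updates on a single numerical attribute. The abstract does not say which variant or parameters are used, so the fuzzifier `m = 2`, the fixed starting centers, the fixed iteration count, and the function names are assumptions made for this sketch.

```python
from typing import List, Tuple


def fcm_memberships(values: List[float], centers: List[float], m: float = 2.0) -> List[List[float]]:
    """Standard FCM membership update for 1-D data."""
    power = 2.0 / (m - 1.0)
    rows = []
    for x in values:
        dists = [abs(x - c) for c in centers]
        if 0.0 in dists:
            # The point sits exactly on a center: give it full membership there.
            hits = dists.count(0.0)
            rows.append([1.0 / hits if d == 0.0 else 0.0 for d in dists])
            continue
        rows.append([1.0 / sum((d_i / d_k) ** power for d_k in dists) for d_i in dists])
    return rows


def fcm_update_centers(values: List[float], memberships: List[List[float]], m: float = 2.0) -> List[float]:
    """Standard FCM center update: membership-weighted mean of the data.

    Assumes every cluster has some nonzero membership, which holds for the
    example data below.
    """
    centers = []
    for j in range(len(memberships[0])):
        weights = [row[j] ** m for row in memberships]
        centers.append(sum(w * x for w, x in zip(weights, values)) / sum(weights))
    return centers


def fuzzy_c_means(
    values: List[float], centers: List[float], m: float = 2.0, iterations: int = 20
) -> Tuple[List[float], List[List[float]]]:
    """Alternate the two updates for a fixed number of iterations."""
    for _ in range(iterations):
        memberships = fcm_memberships(values, centers, m)
        centers = fcm_update_centers(values, memberships, m)
    return centers, fcm_memberships(values, centers, m)


# Hypothetical attribute values forming two loose groups.
values = [1.0, 2.0, 3.0, 10.0, 11.0, 12.0]
centers, memberships = fuzzy_c_means(values, centers=[0.0, 15.0])
```

Each row of `memberships` gives the degree to which one value belongs to each cluster, which can then play the role of a fuzzy interval when the data set is mapped for rule mining.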

Optimization of membership functions in anomaly detection based on fuzzy data mining

2005 International Conference on Machine Learning and Cybernetics ◽

10.1109/icmlc.2005.1527271 ◽

2005 ◽

Cited By ~ 1

Author(s):

Tian-Qing Zhu ◽

Ping Xiong

Keyword(s):

Data Mining ◽

Anomaly Detection ◽

Fuzzy Data ◽

Membership Functions

Genetic learning of the membership functions for mining fuzzy association rules from low quality data

Information Sciences ◽

10.1016/j.ins.2014.10.027 ◽

2015 ◽

Vol 295 ◽

pp. 358-378 ◽

Cited By ~ 23

Author(s):

Ana María Palacios ◽

José Luis Palacios ◽

Luciano Sánchez ◽

Jesús Alcalá-Fdez

Keyword(s):

Association Rules ◽

Quality Data ◽

Membership Functions ◽

Fuzzy Association Rules

Fuzzy data mining for discovering changes in association rules over time

2002 IEEE World Congress on Computational Intelligence. 2002 IEEE International Conference on Fuzzy Systems. FUZZ-IEEE'02. Proceedings (Cat. No.02CH37291) ◽

10.1109/fuzz.2002.1006622 ◽

2003 ◽

Author(s):

Wai-Ho Au ◽

K.C.C. Chan

Keyword(s):

Data Mining ◽

Association Rules ◽

Fuzzy Data ◽

Over Time

Research on automobile insurance fraud identification based on fuzzy association rules

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-201301 ◽

2021 ◽

pp. 1-14

Author(s):

Chun Yan ◽

Jiahui Liu ◽

Wei Liu ◽

Xinhong Liu

Keyword(s):

Simulated Annealing ◽

Association Rules ◽

Insurance Industry ◽

Automobile Insurance ◽

Insurance Fraud ◽

Fuzzy Data ◽

Apriori Algorithm ◽

Rule Mining ◽

Fuzzy Association Rules ◽

Fuzzy C Means

With the development of the automobile insurance industry, identifying automobile insurance fraud in massive data has become particularly important. The purpose of this paper is to improve automobile insurance fraud management and explore the application of data mining technology to automobile insurance fraud identification. To this end, an Apriori algorithm based on simulated annealing genetic fuzzy C-means (SAGFCM-Apriori) is proposed. The SAGFCM-Apriori algorithm combines fuzzy theory with association rule mining, expanding the application scope of the Apriori algorithm. Because the cluster centers of the traditional fuzzy C-means (FCM) algorithm tend to fall into local optima, the simulated annealing genetic (SAG) algorithm is used to optimize them. The SAG-optimized FCM (SAGFCM) is used to generate fuzzy membership degrees and introduce fuzzy data into the Apriori algorithm. The Apriori algorithm is improved by reducing the rule mining time when acquiring rules. The results of empirical studies on several data sets demonstrate that optimizing FCM with SAG can effectively avoid the local optimum problem, improve the accuracy of clustering, and enable SAGFCM-Apriori to obtain better fuzzy data during data preprocessing. Moreover, the proposed algorithm can reduce the mining time of association rules and improve mining efficiency. Finally, the SAGFCM-Apriori algorithm is applied to automobile insurance fraud identification, and the automobile insurance fraud data is mined to obtain fuzzy association rules that can identify fraudulent claims.
