Association Rule Hiding Based on Intersection Lattice

Association rule hiding has been playing a vital role in sensitive knowledge preservation when sharing data between enterprises. The aim of association rule hiding is to remove sensitive association rules from the released database such that side effects are reduced as low as possible. This research proposes an efficient algorithm for hiding a specified set of sensitive association rules based on intersection lattice of frequent itemsets. In this research, we begin by analyzing the theory of the intersection lattice of frequent itemsets and the applicability of this theory into association rule hiding problem. We then formulate two heuristics in order to (a) specify the victim items based on the characteristics of the intersection lattice of frequent itemsets and (b) identify transactions for data sanitization based on the weight of transactions. Next, we propose a new algorithm for hiding a specific set of sensitive association rules with minimum side effects and low complexity. Finally, experiments were carried out to clarify the efficiency of the proposed approach. Our results showed that the proposed algorithm, AARHIL, achieved minimum side effects and CPU-Time when compared to current similar state of the art approaches in the context of hiding a specified set of sensitive association rules.

Download Full-text

Privacy Preserving Fuzzy Association Rule Mining in Data Clusters Using Particle Swarm Optimization

Research Anthology on Privatizing and Securing Data ◽

10.4018/978-1-7998-8954-0.ch050 ◽

2021 ◽

pp. 1096-1116

Author(s):

Sathiyapriya Krishnamoorthy ◽

G. Sudha Sadasivam ◽

M. Rajalakshmi ◽

K. Kowsalyaa ◽

M. Dhivya

Keyword(s):

Particle Swarm Optimization ◽

Side Effects ◽

Association Rule ◽

Association Rule Mining ◽

Particle Swarm ◽

Privacy Preserving ◽

Rule Mining ◽

Swarm Optimization ◽

Fuzzy Association Rule ◽

Sensitive Knowledge

An association rule is classified as sensitive if its thread of revelation is above certain confidence value. If these sensitive rules were revealed to the public, it is possible to deduce sensitive knowledge from the published data and offers benefit for the business competitors. Earlier studies in privacy preserving association rule mining focus on binary data and has more side effects. But in practical applications the transactions contain the purchased quantities of the items. Hence preserving privacy of quantitative data is essential. The main goal of the proposed system is to hide a group of interesting patterns which contains sensitive knowledge such that modifications have minimum side effects like lost rules, ghost rules, and number of modifications. The proposed system applies Particle Swarm Optimization to a few clusters of particles thus reducing the number of modification. Experimental results demonstrate that the proposed approach is efficient in terms of lost rules, number of modifications, hiding failure with complete avoidance of ghost rules.

Download Full-text

Survey on Association Rule Hiding Techniques

International Journal of Scientific Research in Science Engineering and Technology ◽

10.32628/ijsrset196368 ◽

2019 ◽

pp. 300-305

Author(s):

G. Bhavani ◽

S. Sivakumari

Keyword(s):

Data Mining ◽

Comparative Analysis ◽

Association Rules ◽

Association Rule ◽

Privacy Preservation ◽

Sensitive Information ◽

Privacy Preserving Data Mining ◽

Interesting Part ◽

Future Improvement ◽

Sensitive Knowledge

Data mining process extracts useful information from a large amount of data. The most interesting part of data mining is discovering the unseen patterns without unpacking sensitive knowledge. Privacy Preserving Data Mining abbreviated as PPDM deals with the issue of sustaining the privacy of information. This methodology covers the sensitive information from disclosure. PPDM techniques are established for hiding the sensitive information even after performing the data mining. One of the practices to hide the sensitive association rules is termed as association rule hiding. The main objective of association rule hiding algorithm is to slightly adjust the original database so that no sensitive association rule is derived from it. The following article presents a detailed survey of various association rule hiding techniques for preserving privacy in data mining. At first, different techniques developed by previous researchers are studied in detail. Then, a comparative analysis is carried out to know the limitations of each technique and then providing a suggestion for future improvement in association rule hiding for privacy preservation.

Download Full-text

Review of Association Mining Methods for the Extraction of Rules Based on the Frequency and Utility Factors

International Journal of Information Technology Project Management ◽

10.4018/ijitpm.2021100101 ◽

2021 ◽

Vol 12 (4) ◽

pp. 1-10

Author(s):

Subba Reddy Meruva ◽

Venkateswarlu Bondu

Keyword(s):

Association Rules ◽

Association Rule ◽

Strong Association ◽

Vital Role ◽

Frequent Pattern ◽

Association Mining ◽

Observation Study ◽

Mining Methods ◽

High Utility ◽

High Utility Itemsets

Association rule defines the relationship among the items and discovers the frequent items using a support-confidence framework. This framework establishes user-interested or strong association rules with two thresholds (i.e., minimum support and minimum confidence). Traditional association rule mining methods (i.e., apriori and frequent pattern growth [FP-growth]) are widely used for discovering of frequent itemsets, and limitation of these methods is that they are not considering the key factors of the items such as profit, quantity, or cost of items during the mining process. Applications like e-commerce, marketing, healthcare, and web recommendations, etc. consist of items with their utility or profit. Such cases, utility-based itemsets mining methods, are playing a vital role in the generation of effective association rules and are also useful in the mining of high utility itemsets. This paper presents the survey on high-utility itemsets mining methods and discusses the observation study of existing methods with their experimental study using benchmarked datasets.

Download Full-text

Application of Association Rule Mining Technology in Collection and Management of Wireless Sensor Network Node

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.685.575 ◽

2014 ◽

Vol 685 ◽

pp. 575-578

Author(s):

Guang Jiang Wang ◽

Shi Guo Jin

Keyword(s):

Wireless Sensor Network ◽

Sensor Network ◽

Association Rules ◽

Association Rule ◽

Association Rule Mining ◽

Frequent Itemsets ◽

Wireless Sensor ◽

Network Node ◽

Rule Mining ◽

Collection And Management

Association rule mining is an important data mining method; it is the key link of finding frequent itemsets. The process of association rules mining is roughly into two steps: the first step is to find out from all the concentration of all the frequent itemsets; the second step is to obtain the association rules from frequent itemsets. This paper analyzes the collected information of nodes in wireless sensor network and management. The paper presents application of association rule mining technology in the collection and management of wireless sensor network node.

Download Full-text

An Approach to the Mining of User Focused Frequent Itemsets Based on Attention

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.536-537.520 ◽

2014 ◽

Vol 536-537 ◽

pp. 520-523

Author(s):

Jia Liu ◽

Zhen Ya Zhang ◽

Hong Mei Cheng ◽

Qian Sheng Fang

Keyword(s):

Association Rules ◽

Association Rule ◽

Frequent Itemsets ◽

Selection Model ◽

Experimental Results ◽

Apriori Algorithm ◽

Mining Method ◽

Information Filter ◽

Frequent Itemsets Mining ◽

Log File

Usually, non trivial network visiting behaviors implied in network visiting log can be treated as the frequent itemsets or association rules if data in networking log file are transformed into transaction and technologies on association rule can be used to mine those frequent itemsets which are focused by user or some application. To mine non trivial behaviors of network visiting effectively, an attention based frequent itemsets mining method is proposed in this paper. In our proposed method, properties of users focusing is described as attention set and the early selection model of attention as information filter is referenced in the design of our method. Experimental results show that our proposed method is faster than apriori algorithm on the mining of frequent itemsets which is focused by our attention.

Download Full-text

EXACT KNOWLEDGE HIDING IN TRANSACTIONAL DATABASES

International Journal of Artificial Intelligence Tools ◽

10.1142/s0218213009000020 ◽

2009 ◽

Vol 18 (01) ◽

pp. 17-37

Author(s):

ARIS GKOULALAS-DIVANIS ◽

VASSILIOS S. VERYKIOS

Keyword(s):

Side Effects ◽

Heuristic Algorithms ◽

State Of The Art ◽

Frequent Itemsets ◽

Efficient Algorithms ◽

Optimal Solutions ◽

The Past ◽

Transactional Databases ◽

Sensitive Knowledge ◽

Knowledge Hiding

The hiding of sensitive knowledge in the form of frequent itemsets, has gained increasing attention over the past years. This paper highlights the process of border revision, which is essential for the identification of hiding solutions bearing no side-effects, and provides efficient algorithms for the computation of the revised positive and the revised negative borders. By utilizing border revision, we unify the theory behind two exact hiding algorithms that guarantee optimal solutions both in terms of database distortion and side-effects introduced by the hiding process. Following that, we propose a novel extension to one of the hiding algorithms that allows it to identify exact hiding solutions to a much wider range of problems (than its original counterpart). Through experimentation, we compare the exact hiding schemes against two state-of-the-art heuristic algorithms and demonstrate their ability to consistently provide solutions of higher quality to a wide variety of hiding problems.

Download Full-text

A Scalable Vertical Model for Mining Association Rules

Journal of Information & Knowledge Management ◽

10.1142/s0219649204000912 ◽

2004 ◽

Vol 03 (04) ◽

pp. 317-329 ◽

Cited By ~ 2

Author(s):

Imad Rahal ◽

Dongmei Ren ◽

William Perrizo

Keyword(s):

Association Rules ◽

Association Rule ◽

Monotonicity Property ◽

Frequent Itemsets ◽

Large Datasets ◽

Rule Mining ◽

Very Large Datasets ◽

Computationally Intensive ◽

Downward Closure ◽

Mining Frequent Itemsets

Association rule mining (ARM) is the data-mining process for finding all association rules in datasets matching user-defined measures of interest such as support and confidence. Usually, ARM proceeds by mining all frequent itemsets — a step known to be very computationally intensive — from which rules are then derived in a straight forward manner. In general, mining all frequent itemsets prunes the space by using the downward closure (or anti-monotonicity) property of support which states that no itemset can be frequent unless all of its subsets are frequent. A large number of papers have addressed the problem of ARM but not many of them have focused on scalability over very large datasets (i.e. when datasets contain a very large number of transactions). In this paper, we propose a new model for representing data and mining frequent itemsets that is based on the P-tree technology for compression and faster logical operations over vertically structured data and on set enumeration trees for fast itemset enumeration. Experimental results presented hereinafter show big improvements for our approach over large datasets when compared to other contemporary approaches in the literature.

Download Full-text

Knowledge acquisition of association rules from the customer-lifetime-value perspective

Kybernetes ◽

10.1108/k-03-2016-0042 ◽

2018 ◽

Vol 47 (3) ◽

pp. 441-457 ◽

Cited By ~ 3

Author(s):

Cheng-Hsiung Weng ◽

Tony Cheng-Kui Huang

Keyword(s):

Association Rules ◽

Association Rule ◽

Frequent Itemsets ◽

Customer Lifetime Value ◽

Rule Mining ◽

Content Type ◽

Customer Lifetime ◽

Lifetime Value ◽

Optimum Approach ◽

Customer Values

Purpose Customer lifetime value (CLV) scoring is highly effective when applied to marketing databases. Some researchers have extended the traditional association rule problem by associating a weight with each item in a transaction. However, studies of association rule mining have considered the relative benefits or significance of “items” rather than “transactions” belonging to different customers. Because not all customers are financially attractive to firms, it is crucial that their profitability be determined and that transactions be weighted according to CLV. This study aims to discover association rules from the CLV perspective. Design/methodology/approach This study extended the traditional association rule problem by allowing the association of CLV weight with a transaction to reflect the interest and intensity of customer values. Furthermore, the authors proposed a new algorithm, frequent itemsets of CLV weight (FICLV), to discover frequent itemsets from CLV-weighted transactions. Findings Experimental results from the survey data indicate that the proposed FICLV algorithm can discover valuable frequent itemsets. Moreover, the frequent itemsets identified using the FICLV algorithm outperform those discovered through conventional approaches for predicting customer purchasing itemsets in the coming period. Originality/value This study is the first to introduce the optimum approach for discovering frequent itemsets from transactions through considering CLV.

Download Full-text

Mining predicate rules without minimum support threshold

Kuwait Journal of Science ◽

10.48129/kjs.v48i4.9782 ◽

2021 ◽

Vol 48 (4) ◽

Author(s):

Hafiz I. Ahmad ◽

◽

Alex T. H. Sim ◽

Roliana Ibrahim ◽

Mohammad Abrar ◽

...

Keyword(s):

Association Rules ◽

Association Rule ◽

Association Rule Mining ◽

State Of The Art ◽

Predicate Logic ◽

Frequent Itemsets ◽

Rule Mining ◽

Minimum Support ◽

Support Threshold ◽

Definition Of

Association rule mining (ARM) is used for discovering frequent itemsets for interesting relationships of associative and correlative behaviors within the data. This gives new insights of great value, both commercial and academic. The traditional ARM techniques discover interesting association rules based on a predefined minimum support threshold. However, there is no known standard of an exact definition of minimum support and providing an inappropriate minimum support value may result in missing important rules. In addition, most of the rules discovered by these traditional ARM techniques refer to already known knowledge. To address these limitations of the minimum support threshold in ARM techniques, this study proposes an algorithm to mine interesting association rules without minimum support using predicate logic and a property of a proposed interestingness measure (g measure). The algorithm scans the database and uses g measure’s property to search for interesting combinations. The selected combinations are mapped to pseudo-implications and inference rules of logic are used on the pseudo-implications to produce and validate the predicate rules. Experimental results of the proposed technique show better performance against state-of-the-art classification techniques, and reliable predicate rules are discovered based on the reliability differences of the presence and absence of the rule’s consequence.

Download Full-text

Hiding Sensitive Association Rules by Sanitizing

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.694-697.2317 ◽

2013 ◽

Vol 694-697 ◽

pp. 2317-2321

Author(s):

Hui Wang

Keyword(s):

Data Mining ◽

Side Effects ◽

Knowledge Discovery ◽

Association Rules ◽

Association Rule ◽

Mining Technology ◽

Data Mining Techniques ◽

Confidential Data ◽

Knowledge Hiding ◽

Existing Data

The goal of knowledge discovery is to extract hidden or useful unknown knowledge from databases, while the objective of knowledge hiding is to prevent certain confidential data or knowledge from being extracted through data mining techniques. Hiding sensitive association rules is focused. The side-effects of the existing data mining technology are investigated. The problem of sensitive association rule hiding is described formally. The representative sanitizing strategies for sensitive association rule hiding are discussed.

Download Full-text