Personalized Privacy-Preserving Frequent Itemset Mining Using Randomized Response

Frequent itemset mining is an important first step of association rule mining, which discovers interesting patterns in massive data. Concern about privacy in frequent itemset mining is growing, and several approaches have been proposed to address it. In this paper, we introduce a personalized privacy problem in which different attributes may need different levels of privacy protection. To solve this problem, we propose a personalized privacy-preserving method based on the randomized response technique. By providing different privacy levels for different attributes, this method can achieve higher accuracy in frequent itemset mining than the traditional method, which applies the same privacy level to every attribute. Our experimental results show that the method yields better frequent itemset mining results while preserving personalized privacy.
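
As a rough illustration of the idea in this abstract, the sketch below applies a Warner-style randomized response to binary attributes, with a separate retention probability per attribute, and then inverts the perturbation to estimate single-item support. It is not the paper's actual scheme: the function names, the `keep_prob` values, and the synthetic data are all assumptions made for the example.

```python
import random

def perturb(transactions, keep_prob, rng):
    """Attribute j keeps its true bit with probability keep_prob[j], else it is flipped."""
    return [
        [bit if rng.random() < keep_prob[j] else 1 - bit for j, bit in enumerate(row)]
        for row in transactions
    ]

def estimate_support(observed, p):
    """Invert observed = p*s + (1 - p)*(1 - s) for the true support s (needs p != 0.5)."""
    return (observed - (1.0 - p)) / (2.0 * p - 1.0)

rng = random.Random(0)
# Hypothetical privacy levels: attribute 0 is less sensitive (kept 90% of the time),
# attribute 1 is more sensitive (kept only 60% of the time).
keep_prob = [0.9, 0.6]
# Synthetic data: attribute 0 is set in about 30% of rows, attribute 1 in about 50%.
data = [[int(rng.random() < 0.3), int(rng.random() < 0.5)] for _ in range(20000)]
noisy = perturb(data, keep_prob, rng)
estimates = [
    estimate_support(sum(row[j] for row in noisy) / len(noisy), keep_prob[j])
    for j in range(len(keep_prob))
]
```

A retention probability closer to 0.5 hides more about an attribute but makes its support estimate noisier, which is why giving less sensitive attributes a higher retention probability can improve overall accuracy.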

Mining Association Rules: A Case Study on Benchmark Dense Data

Indonesian Journal of Electrical Engineering and Computer Science ◽

10.11591/ijeecs.v3.i3.pp546-553 ◽

2016 ◽

Vol 3 (3) ◽

pp. 546 ◽

Cited By ~ 2

Author(s):

Mustafa Bin Man ◽

Wan Aezwani Wan Abu Bakar ◽

Zailani Abdullah ◽

Masita@Masila Abd Jalil ◽

Tutut Herawan

Keyword(s):

Association Rules ◽

Association Rule ◽

Frequent Itemset ◽

Frequent Itemset Mining ◽

Data Repository ◽

Rule Mining ◽

Itemset Mining ◽

Major Attention ◽

Performance Results

Data mining is the process of discovering knowledge and previously unknown patterns from large amounts of data. Association rule mining (ARM) has become popular because the patterns it discovers can be used to make predictions. Since its introduction, frequent itemset mining has received major attention from researchers, and various efficient and sophisticated algorithms have been proposed for it. Among the best-known algorithms are Apriori and FP-Growth. In this paper, we explore these two algorithms and compare their results in generating association rules on benchmark dense datasets. The datasets are taken from the frequent itemset mining data repository. The two algorithms are implemented in RapidMiner 5.3.007, and their performance results are compared. FP-Growth is found to be the better algorithm under the support-confidence framework.
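
For readers unfamiliar with the support-confidence framework, here is a minimal textbook-style Apriori sketch in plain Python. It is not the RapidMiner implementation used in the paper; the function names and the toy baskets are assumptions for illustration only.

```python
from itertools import combinations

def apriori(transactions, min_support):
    """Return {frozenset: count} for itemsets found in at least min_support transactions."""
    transactions = [frozenset(t) for t in transactions]
    counts = {}
    for t in transactions:
        for item in t:
            key = frozenset([item])
            counts[key] = counts.get(key, 0) + 1
    current = {s: c for s, c in counts.items() if c >= min_support}
    frequent = dict(current)
    k = 2
    while current:
        # Join step: merge pairs of frequent (k-1)-itemsets whose union has k items.
        candidates = {a | b for a, b in combinations(list(current), 2) if len(a | b) == k}
        # Prune step: every (k-1)-subset of a candidate must itself be frequent.
        candidates = {
            c for c in candidates
            if all(frozenset(s) in current for s in combinations(c, k - 1))
        }
        # Count step: one pass over the database per level.
        current = {}
        for c in candidates:
            n = sum(1 for t in transactions if c <= t)
            if n >= min_support:
                current[c] = n
        frequent.update(current)
        k += 1
    return frequent

def confidence(frequent, antecedent, consequent):
    """Confidence of antecedent -> consequent = support(both) / support(antecedent)."""
    a = frozenset(antecedent)
    return frequent[a | frozenset(consequent)] / frequent[a]

# Toy data (hypothetical), minimum support of 2 transactions.
baskets = [{"bread", "milk"}, {"bread", "butter"}, {"bread", "milk", "butter"}, {"milk"}]
result = apriori(baskets, min_support=2)
rule_conf = confidence(result, {"bread"}, {"milk"})
```

The repeated counting pass per level is the cost that FP-Growth avoids by compressing the database into a prefix tree, which is one plausible reason it tends to do better on dense data.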

Game-theoretic privacy preserving constructions for rational and malicious secret sharing models for collaborative frequent itemset mining

International Journal of Knowledge Engineering and Data Mining ◽

10.1504/ijkedm.2017.091025 ◽

2017 ◽

Vol 4 (3/4) ◽

pp. 320

Author(s):

Nirali R. Nanavati ◽

Prakash Lalwani ◽

Devesh C. Jinwala

Keyword(s):

Secret Sharing ◽

Privacy Preserving ◽

Frequent Itemset ◽

Frequent Itemset Mining ◽

Itemset Mining ◽

Game Theoretic

Privacy-preserving frequent itemset mining in outsourced transaction databases

2015 International Conference on Advances in Computing, Communications and Informatics (ICACCI) ◽

10.1109/icacci.2015.7275706 ◽

2015 ◽

Cited By ~ 1

Author(s):

Iyer Chandrasekharan ◽

P.K. Baruah ◽

Ravi Mukkamala

Keyword(s):

Privacy Preserving ◽

Frequent Itemset ◽

Frequent Itemset Mining ◽

Itemset Mining

Privacy-Preserving Frequent Itemset Mining for Sparse and Dense Data

Secure IT Systems - Lecture Notes in Computer Science ◽

10.1007/978-3-319-70290-2_9 ◽

2017 ◽

pp. 139-155

Author(s):

Peeter Laud ◽

Alisa Pankova

Keyword(s):

Privacy Preserving ◽

Frequent Itemset ◽

Frequent Itemset Mining ◽

Itemset Mining ◽

Dense Data

CFM: collusion-free model of privacy preserving frequent itemset mining

International Journal of Information and Computer Security ◽

10.1504/ijics.2020.109476 ◽

2020 ◽

Vol 13 (3/4) ◽

pp. 249

Author(s):

Yoones A. Sekhavat

Keyword(s):

Privacy Preserving ◽

Frequent Itemset ◽

Frequent Itemset Mining ◽

Itemset Mining ◽

Free Model

Dummy Data Insert Scheme for Privacy Preserving Frequent Itemset Mining in Data Stream

Journal of the Korea Institute of Information Security and Cryptology ◽

10.13089/jkiisc.2013.23.3.383 ◽

2013 ◽

Vol 23 (3) ◽

pp. 383-393

Author(s):

Jay Yeol Jung ◽

Kee Sung Kim ◽

Ik Rae Jeong

Keyword(s):

Data Stream ◽

Privacy Preserving ◽

Frequent Itemset ◽

Frequent Itemset Mining ◽

Itemset Mining

Privacy-preserving frequent itemset mining in vertically partitioned database using symmetric homomorphic encryption scheme

International Journal of Information Privacy Security and Integrity ◽

10.1504/ijipsi.2020.111464 ◽

2020 ◽

Vol 4 (3) ◽

pp. 203

Author(s):

Jyoti Lamba ◽

V.C. Venkaiah

Keyword(s):

Homomorphic Encryption ◽

Privacy Preserving ◽

Frequent Itemset ◽

Frequent Itemset Mining ◽

Encryption Scheme ◽

Itemset Mining

Postdiffset: an Eclat-like algorithm for frequent itemset mining

International Journal of Engineering & Technology ◽

10.14419/ijet.v7i2.28.12911 ◽

2018 ◽

Vol 7 (2.28) ◽

pp. 197

Author(s):

W A.W.A. Bakar ◽

M A. Jalil ◽

M Man ◽

Z Abdullah ◽

F Mohd

Keyword(s):

Data Mining ◽

Association Rule ◽

Frequent Itemset ◽

Frequent Itemset Mining ◽

Underlying Structure ◽

Data Format ◽

Itemset Mining ◽

Data Formats ◽

Vertical Data ◽

Mining Algorithms

Frequent itemset mining is a major field within data mining because it deals with commonly occurring sets of items in transaction databases. Originating from market basket analysis, frequent itemset generation can lead to the formulation of association rules that capture correlations or patterns. Association rule mining remains one of the most prominent areas of data mining; it aims to extract interesting correlations, frequent patterns, associations, or causal structures among sets of items in transaction databases. Association rule mining algorithms are built on either a horizontal or a vertical data format. Both formats have been widely discussed, with a few example algorithms given for each. Horizontal approaches suffer from generating many candidates and scanning the database multiple times, which contributes to higher memory consumption. Vertical approaches were developed to improve on them, and the Eclat algorithm is one example. Motivated by its ‘fast intersection’, in this paper we review and analyze the fundamental Eclat algorithm and Eclat variants such as tidset, diffset, and sortdiffset. Building on the vertical data format and continuing the line of Eclat extensions, we propose postdiffset, a new Eclat variant that uses the tidset format in the first loop and the diffset format in later loops. We report the execution time of postdiffset, which indicates that some improvement in frequent itemset mining has been achieved.
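
To make the vertical format concrete, the sketch below shows a basic tidset-based Eclat search together with the standard diffset identity, sup(XY) = sup(X) - |t(X) - t(Y)|. It illustrates the two building blocks named in the abstract, not the postdiffset algorithm itself; all function names and the toy database are assumptions.

```python
def vertical(transactions):
    """Convert horizontal transactions into a vertical item -> tidset map."""
    tidsets = {}
    for tid, items in enumerate(transactions):
        for item in items:
            tidsets.setdefault(item, set()).add(tid)
    return tidsets

def eclat(prefix, items, min_support, out):
    """Depth-first search; `items` holds (item, tidset) pairs that share `prefix`."""
    for i, (item, tids) in enumerate(items):
        itemset = prefix + (item,)
        out[itemset] = len(tids)
        # Extend only with later items; support comes from a tidset intersection.
        suffix = []
        for other, other_tids in items[i + 1:]:
            shared = tids & other_tids
            if len(shared) >= min_support:
                suffix.append((other, shared))
        eclat(itemset, suffix, min_support, out)

def diffset_support(tids_x, tids_y):
    """Support of XY via the diffset d(XY) = t(X) - t(Y): sup(XY) = sup(X) - |d(XY)|."""
    return len(tids_x) - len(tids_x - tids_y)

# Toy database (hypothetical), minimum support of 2 transactions.
db = [{"a", "b", "c"}, {"a", "b"}, {"a", "c"}, {"b", "c"}, {"a", "b", "c"}]
tidsets = vertical(db)
start = sorted(
    ((item, tids) for item, tids in tidsets.items() if len(tids) >= 2),
    key=lambda pair: pair[0],
)
found = {}
eclat((), start, 2, found)
ab_via_diffset = diffset_support(tidsets["a"], tidsets["b"])
```

On dense data the diffset is usually much smaller than the tidset it replaces, which is the motivation for switching representations after the first level.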
