Lazy Update of Frequent Item set Mining with Differential Privacy in Streaming Data Environment

In the paper the author introduces FCW_MRFI, which is a streaming data frequent item mining algorithm based on variable window. The FCW_MRFI algorithm can mine frequent item in any window of recent streaming data, whose given length is L. Meanwhile, it divides recent streaming data into several windows of variable length according to m, which is the number of the counter array. This algorithm can achieve smaller query error in recent windows, and can minimize the maximum query error in the whole recent streaming data.

Download Full-text

The Interactive Query Method with Clustering and Differential Privacy Protection Model Under Big Data Environment

Communications in Computer and Information Science - Big Data and Security ◽

10.1007/978-981-16-3150-4_28 ◽

2021 ◽

pp. 327-336

Author(s):

Huanyu Fan ◽

Yunan Zhu ◽

Chao Shan

Keyword(s):

Big Data ◽

Privacy Protection ◽

Differential Privacy ◽

Interactive Query ◽

Data Environment

Download Full-text

Trajectory Privacy Protection on Spatial Streaming Data with Differential Privacy

2018 IEEE Global Communications Conference (GLOBECOM) ◽

10.1109/glocom.2018.8647918 ◽

2018 ◽

Cited By ~ 1

Author(s):

Xiang Liu ◽

Yuchun Guo ◽

Yishuai Chen ◽

Xiaoying Tan

Keyword(s):

Privacy Protection ◽

Differential Privacy ◽

Streaming Data

Download Full-text

An Optimized Frequent Item Query Algorithm for Uncertain Streaming Data

Journal of Networks ◽

10.4304/jnw.9.11.3030-3037 ◽

1969 ◽

Vol 9 (11) ◽

Author(s):

Yanru Xue ◽

Min Liu ◽

Feng Wang

Keyword(s):

Streaming Data ◽

Frequent Item ◽

Query Algorithm

Download Full-text

An algorithm for differential privacy streaming data publication based on matrix mechanism under exponential decay mode

Scientia Sinica Informationis ◽

10.1360/n112017-00111 ◽

2017 ◽

Vol 47 (11) ◽

pp. 1493-1509 ◽

Cited By ~ 1

Author(s):

Lan SUN ◽

Liqun ZHANG ◽

Chen GE ◽

Yingjie WU

Keyword(s):

Decay Mode ◽

Exponential Decay ◽

Differential Privacy ◽

Streaming Data ◽

Data Publication ◽

Matrix Mechanism

Download Full-text

Measurement of Local Differential Privacy Techniques for IoT-based Streaming Data

10.1109/pst52912.2021.9647839 ◽

2021 ◽

Author(s):

Sharmin Afrose ◽

Danfeng Daphne Yao ◽

Olivera Kotevska

Keyword(s):

Differential Privacy ◽

Streaming Data

Download Full-text

CGM

Proceedings of the VLDB Endowment ◽

10.14778/3476249.3476277 ◽

2021 ◽

Vol 14 (11) ◽

pp. 2258-2270

Author(s):

Ergute Bao ◽

Yin Yang ◽

Xiaokui Xiao ◽

Bolin Ding

Keyword(s):

Differential Privacy ◽

Large Population ◽

Main Idea ◽

Formal Proof ◽

Streaming Data ◽

Correlated Noise ◽

Sensitive Data ◽

Aggregated Data ◽

Protection Scheme ◽

Value Range

Local differential privacy (LDP) is a well-established privacy protection scheme for collecting sensitive data, which has been integrated into major platforms such as iOS, Chrome, and Windows. The main idea is that each individual randomly perturbs her data on her local device, and only uploads the noisy version to an untrusted data aggregator. This paper focuses on the collection of streaming data consisting of regular updates, e.g. , daily app usage. Such streams, when aggregated over a large population, often exhibit strong autocorrelations , e.g. , the average usage of an app usually does not change dramatically from one day to the next. To our knowledge, this property has been largely neglected in existing LDP mechanisms. Consequently, data collected with current LDP methods often exhibit unrealistically violent fluctuations due to the added noise, drowning the overall trend, as shown in our experiments. This paper proposes a novel correlated Gaussian mechanism ( CGM ) for enforcing (ϵ, δ)-LDP on streaming data collection, which reduces noise by exploiting public-known autocorrelation patterns of the aggregated data. This is done through non-trivial modifications to the core of the underlying Gaussian Mechanism; in particular, CGM injects temporally correlated noise, computed through an optimization program that takes into account the given autocorrelation pattern, data value range, and utility metric. CGM comes with formal proof of correctness, and consumes negligible computational resources. Extensive experiments using real datasets from different application domains demonstrate that CGM achieves consistent and significant utility gains compared to the baseline method of repeatedly running the underlying one-shot LDP mechanism.

Download Full-text