Adaptive Differentially Private Data Release for Data Sharing and Data Mining

Author(s):  
Li Xiong
Author(s):  
Antonio Famulari ◽  
Francesco Longo ◽  
Giuseppe Campobello ◽  
Thomas Bonald ◽  
Marco Scarpa
Keyword(s):  

Author(s):  
Bogdan C. Popescu ◽  
Bruno Crispo ◽  
Andrew S. Tanenbaum
Keyword(s):  

2019 ◽  
Vol 2019 (1) ◽  
pp. 26-46 ◽  
Author(s):  
Thee Chanyaswad ◽  
Changchang Liu ◽  
Prateek Mittal

Abstract A key challenge facing the design of differential privacy in the non-interactive setting is to maintain the utility of the released data. To overcome this challenge, we utilize the Diaconis-Freedman-Meckes (DFM) effect, which states that most projections of high-dimensional data are nearly Gaussian. Hence, we propose the RON-Gauss model that leverages the novel combination of dimensionality reduction via random orthonormal (RON) projection and the Gaussian generative model for synthesizing differentially-private data. We analyze how RON-Gauss benefits from the DFM effect, and present multiple algorithms for a range of machine learning applications, including both unsupervised and supervised learning. Furthermore, we rigorously prove that (a) our algorithms satisfy the strong ɛ-differential privacy guarantee, and (b) RON projection can lower the level of perturbation required for differential privacy. Finally, we illustrate the effectiveness of RON-Gauss under three common machine learning applications – clustering, classification, and regression – on three large real-world datasets. Our empirical results show that (a) RON-Gauss outperforms previous approaches by up to an order of magnitude, and (b) loss in utility compared to the non-private real data is small. Thus, RON-Gauss can serve as a key enabler for real-world deployment of privacy-preserving data release.


2017 ◽  
Vol 12 (S330) ◽  
pp. 181-184
Author(s):  
T. Marchetti ◽  
E. M. Rossi ◽  
G. Kordopatis ◽  
A. G. A. Brown ◽  
A. Rimoldi ◽  
...  

AbstractHypervelocity stars (HVSs) are characterized by a total velocity in excess of the Galactic escape speed, and with trajectories consistent with coming from the Galactic Centre. We apply a novel data mining routine, an artificial neural network, to discover HVSs in the TGAS subset of the first data release of the Gaia satellite, using only the astrometry of the stars. We find 80 stars with a predicted probability >90% of being HVSs, and we retrieved radial velocities for 47 of those. We discover 14 objects with a total velocity in the Galactic rest frame >400 km s−1, and 5 of these have a probability >50% of being unbound from the Milky Way. Tracing back orbits in different Galactic potentials, we discover 1 HVS candidate, 5 bound HVS candidates, and 5 runaway star candidates with remarkably high velocities, between 400 and 780 km s−1. We wait for future Gaia releases to confirm the goodness of our sample and to increase the number of HVS candidates.


2019 ◽  
Vol 96 ◽  
pp. 1-10
Author(s):  
Giuseppe Bianchi ◽  
Tooska Dargahi ◽  
Alberto Caponi ◽  
Mauro Conti
Keyword(s):  

2017 ◽  
Vol 17 (2) ◽  
pp. 44-55 ◽  
Author(s):  
M. Antony Sheela ◽  
K. Vijayalakshmi

Abstract Data mining on vertically or horizontally partitioned dataset has the overhead of protecting the private data. Perturbation is a technique that protects the revealing of data. This paper proposes a perturbation and anonymization technique that is performed on the vertically partitioned data. A third-party coordinator is used to partition the data recursively in various parties. The parties perturb the data by finding the mean, when the specified threshold level is reached. The perturbation maintains the statistical relationship among attributes.


Sign in / Sign up

Export Citation Format

Share Document