Privacy-Preserving Trajectory Data Publishing by Dynamic Anonymization with Bounded Distortion

2021, Vol 10 (2), pp. 78
Author(s):  
Songyuan Li ◽  
Hui Tian ◽  
Hong Shen ◽  
Yingpeng Sang

Publication of trajectory data, which contain rich information about vehicles in the dimensions of time and space (location), enables online monitoring and supervision of vehicles in motion as well as offline traffic analysis for various management tasks. However, it also opens security holes for privacy breaches, as exposing individuals' private information to the public may result in attacks that threaten their safety. The privacy protection of published trajectory data has therefore received increased attention in recent years. Existing methods, such as generalization via anonymization and suppression via randomization, achieve protection by modifying the original trajectories to form publishable ones, which causes significant data distortion and hence low data utility. In this work, we propose a trajectory privacy-preserving method called dynamic anonymization with bounded distortion. In our method, individual trajectories in the original trajectory set are mixed in a localized manner to form a synthetic trajectory data set with bounded distortion for publishing, which protects the location privacy of the individuals in the data set and guarantees the utility of the published data both individually and collectively. Through experiments conducted on real taxi trajectory data from Guangzhou City, we evaluate the performance of the proposed method and compare it with existing mainstream methods in terms of privacy preservation against attacks and trajectory data utility. The results show that our method achieves better data utility than existing methods based on globally static anonymization, without trading off data security against attacks.
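As an illustration of the kind of localized mixing the abstract describes, the following Python sketch pairs trajectories of equal length and swaps route segments only when the distortion of each resulting synthetic trajectory stays within a given bound. The pairing rule, the midpoint split, and the average-distance distortion measure are illustrative assumptions, not the paper's exact construction.

    import math
    from itertools import combinations

    def distortion(synthetic, original):
        # Average point-wise Euclidean distance between two equal-length trajectories.
        return sum(math.dist(p, q) for p, q in zip(synthetic, original)) / len(original)

    def mix_locally(trajectories, bound):
        # Swap the second halves of trajectory pairs, keeping a swap only if
        # both synthetic trajectories stay within the distortion bound of their originals.
        synthetic = [list(t) for t in trajectories]
        for i, j in combinations(range(len(synthetic)), 2):
            a, b = synthetic[i], synthetic[j]
            if len(a) != len(b):
                continue
            mid = len(a) // 2
            cand_a, cand_b = a[:mid] + b[mid:], b[:mid] + a[mid:]
            if (distortion(cand_a, trajectories[i]) <= bound and
                    distortion(cand_b, trajectories[j]) <= bound):
                synthetic[i], synthetic[j] = cand_a, cand_b
        return synthetic

Here each trajectory is assumed to be a list of (x, y) location points sampled at the same timestamps; a bound of zero would force the published set to equal the original.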

Author(s):  
Alexandre Evfimievski ◽  
Tyrone Grandison

Privacy-preserving data mining (PPDM) refers to the area of data mining that seeks to safeguard sensitive information from unsolicited or unsanctioned disclosure. Most traditional data mining techniques analyze and model the data set statistically, in aggregated form, while privacy preservation is primarily concerned with protecting against the disclosure of individual data records. This domain separation points to the technical feasibility of PPDM. Historically, issues related to PPDM were first studied by national statistical agencies interested in collecting private social and economic data, such as census and tax records, and making them available for analysis by public servants, companies, and researchers. Building accurate socioeconomic models is vital for business planning and public policy. Yet there is no way of knowing in advance what models may be needed, nor is it feasible for the statistical agency to perform all data processing for everyone, playing the role of a trusted third party. Instead, the agency provides the data in a sanitized form that allows statistical processing and protects the privacy of individual records, solving a problem known as privacy-preserving data publishing. For a survey of work in statistical databases, see Adam and Wortmann (1989) and Willenborg and de Waal (2001).


2021, Vol 11 (12), pp. 3164-3173
Author(s):  
R. Indhumathi ◽  
S. Sathiya Devi

Data sharing is essential in present-day biomedical research. A large quantity of medical information is gathered for different objectives of analysis and study, and because of the scale of these collections, anonymity is essential. It is therefore important to preserve privacy and prevent the leakage of patients' sensitive information. Anonymization methods such as generalisation, suppression and perturbation have been proposed to prevent information leakage, but they degrade the utility of the collected data: during data sanitization, utility is automatically diminished. The main drawback of privacy-preserving data publishing is maintaining the trade-off between privacy and data utility. To address this issue, an efficient algorithm called Anonymization based on Improved Bucketization (AIB) is proposed, which increases the utility of published data while maintaining privacy. The Bucketization technique is used in this paper together with a clustering method. The proposed work is divided into four stages: (i) vertical and horizontal partitioning, (ii) assigning a sensitivity index to the attributes in each cluster, (iii) verifying each cluster against a privacy threshold, and (iv) examining the quasi-identifiers (QI) for privacy breaches. To increase the utility of the published data, the threshold value is determined from the distribution of elements in each attribute, and the anonymization method is applied only to the specific QI elements. As a result, the data utility is improved. Finally, the evaluation results validate the design and demonstrate that it is effective in improving data utility.
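A minimal sketch of the selective-anonymization idea described above, assuming tabular records stored as Python dictionaries: values of a quasi-identifier are suppressed only when their share of the column falls below a frequency threshold derived from that attribute's distribution. The threshold rule and the suppression symbol are assumptions for illustration, not the AIB algorithm itself.

    from collections import Counter

    def selective_anonymize(records, qi_attrs, min_share=0.05):
        # Suppress only rare quasi-identifier values; frequent values are
        # already shared by many records and are left untouched.
        anonymized = [dict(r) for r in records]
        n = len(records)
        for attr in qi_attrs:
            counts = Counter(r[attr] for r in records)
            rare = {v for v, c in counts.items() if c / n < min_share}
            for r in anonymized:
                if r[attr] in rare:
                    r[attr] = "*"
        return anonymized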


2017, Vol 26 (2), pp. 285-291
Author(s):  
Qiwei Lu ◽  
Caimei Wang ◽  
Yan Xiong ◽  
Huihua Xia ◽  
Wenchao Huang ◽  
...  

2019, pp. 1518-1538
Author(s):  
Sowmyarani C. N. ◽  
Dayananda P.

Privacy attacks on individual records are of great concern in privacy-preserving data publishing. An intruder who wants to learn the private information of a particular person will acquire background knowledge about that person. This background knowledge may be gained through publicly available information, such as voter identification records, or through social networks. By combining this background knowledge with the published data, the intruder may obtain the private information, causing a privacy attack on that person. There are many privacy attack models; the most popular ones are discussed in this chapter. The study of these attack models plays a significant role in the design of robust privacy-preserving models.
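The background-knowledge attack described above can be illustrated by the following sketch, assuming the intruder holds a public list (for example, a voter roll) that shares quasi-identifiers with the published table; the attribute names are hypothetical.

    def linkage_attack(public_list, published_table, quasi_ids):
        # Link each public record to published rows that agree on all
        # quasi-identifiers; a unique match re-identifies that individual.
        matches = {}
        for person in public_list:
            key = tuple(person[a] for a in quasi_ids)
            hits = [row for row in published_table
                    if tuple(row[a] for a in quasi_ids) == key]
            if len(hits) == 1:
                # "name" stands in for whatever direct identifier the public list carries.
                matches[person["name"]] = hits[0]
        return matches

For instance, calling linkage_attack(voter_roll, published, ["zip", "age", "sex"]) would return every person whose quasi-identifier combination is unique in the published data.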


Author(s):  
Ashoka Kukkuvada ◽  
Poornima Basavaraju

Currently, the industry is focused on managing, retrieving, and securing massive amounts of data. Hence, privacy preservation is a significant concern for organizations that publish or share personal data for analysis. In this chapter, the authors present an innovative approach that uses the information gain of the quasi-identifier attributes with respect to the sensitive attributes for anonymizing the data. Information gain captures the usefulness of an attribute in classifying the data elements and acts as a two-way correlation between attributes. The authors show that the proposed approach preserves better data utility and has lower complexity than former methods.
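A minimal sketch of the information-gain computation the chapter relies on, assuming categorical attributes stored in record dictionaries; how the resulting scores drive the anonymization step is not shown here.

    import math
    from collections import Counter

    def entropy(values):
        # Shannon entropy of a list of categorical values.
        counts = Counter(values)
        total = len(values)
        return -sum((c / total) * math.log2(c / total) for c in counts.values())

    def information_gain(records, quasi_attr, sensitive_attr):
        # Reduction in entropy of the sensitive attribute when the records
        # are partitioned by the values of the quasi-identifier.
        base = entropy([r[sensitive_attr] for r in records])
        total = len(records)
        remainder = 0.0
        for value in {r[quasi_attr] for r in records}:
            subset = [r[sensitive_attr] for r in records if r[quasi_attr] == value]
            remainder += (len(subset) / total) * entropy(subset)
        return base - remainder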


Author(s):  
Nancy Victor ◽  
Daphne Lopez

Data privacy plays a noteworthy part in today's digital world, where information is gathered at exceptional rates from different sources. Privacy-preserving data publishing refers to the process of publishing personal data without compromising the privacy of individuals in any manner. A variety of approaches have been devised to protect consumer privacy by applying traditional anonymization mechanisms, but these mechanisms are not well suited to Big Data, as the data generated nowadays is not purely structured. The data, generated at very high velocity from various sources, includes unstructured and semi-structured information and thus becomes very difficult to process using traditional mechanisms. This chapter focuses on the various challenges of Big Data, on PPDM and PPDP techniques for Big Data, and on how well they can be scaled to process both historical and real-time data together using the Lambda architecture. A distributed framework for privacy preservation in Big Data that incorporates natural language processing techniques is also proposed in this chapter.
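The Lambda-style split mentioned above can be sketched as follows, with a batch layer re-anonymizing historical records, a speed layer handling fresh records, and a serving layer merging both views; the anonymization functions passed in are placeholders, not the chapter's framework.

    def batch_layer(historical_records, anonymize_batch):
        # Periodically recompute the anonymized view over all historical data.
        return anonymize_batch(historical_records)

    def speed_layer(incoming_records, anonymize_one):
        # Anonymize records as they arrive, before the next batch run catches up.
        return [anonymize_one(r) for r in incoming_records]

    def serving_layer(batch_view, realtime_view):
        # Answer queries over the union of the batch and real-time views.
        return list(batch_view) + list(realtime_view)

    # Toy usage with a placeholder rule that suppresses the direct identifier.
    suppress = lambda r: {k: ("*" if k == "name" else v) for k, v in r.items()}
    batch_view = batch_layer([{"name": "A", "zip": "530013"}],
                             lambda rs: [suppress(r) for r in rs])
    realtime_view = speed_layer([{"name": "B", "zip": "530014"}], suppress)
    merged = serving_layer(batch_view, realtime_view)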


2020, Vol 17 (9), pp. 4623-4626
Author(s):  
Nisha Nehra ◽  
Suneet Kumar

Nowadays, due to the sheer amount of data, its complexity, and the rate at which it is generated, the traditional algorithms proposed so far for privacy preservation in relational data publishing are not capable of ensuring privacy as efficiently for transactional data. Over the last two decades, interest has also grown in providing better privacy-preserving schemes for data publishing. There are a number of algorithms, schemes, models and techniques in the literature that ensure privacy against identity disclosure and attribute disclosure attacks. This paper is a comprehensive survey of past work in the field of anonymization that provides privacy for transactional data publishing.
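As a concrete illustration of the two disclosure risks the survey covers, the following sketch groups published records into equivalence classes over their quasi-identifiers and flags identity disclosure (a class smaller than k) and attribute disclosure (a class with a single sensitive value); the value of k and the attribute names are assumptions for illustration.

    from collections import defaultdict

    def disclosure_report(records, quasi_ids, sensitive_attr, k=3):
        # Group records by their quasi-identifier combination.
        classes = defaultdict(list)
        for r in records:
            classes[tuple(r[a] for a in quasi_ids)].append(r[sensitive_attr])
        # Flag classes violating k-anonymity or lacking sensitive-value diversity.
        return {
            key: {
                "size": len(sens),
                "identity_risk": len(sens) < k,
                "attribute_risk": len(set(sens)) == 1,
            }
            for key, sens in classes.items()
        }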

