Big Data Privacy Preservation Using Two Phase Top-Down Specialization Algorithm with Multidimensional Map Reduce Framework on Hadoop

Big data privacy preservation is one of the most pressing concerns in industry today. Privacy problems often go unnoticed when input data is published to a cloud environment. Data privacy preservation in Hadoop concerns hiding sensitive values in an input dataset before publishing it to the distributed environment. This paper investigates big data anonymization for privacy preservation from the perspectives of scalability and execution time. Many current cloud applications that anonymize big data face the same problems. To address them, we introduce a data anonymization algorithm, Two-Phase Top-Down Specialization (TPTDS), implemented in Hadoop. The input data is an adult dataset of 45,222 records with 15 attributes. TPTDS is implemented with multidimensional anonymization in the MapReduce framework, which increases the efficiency of the big data processing system. Experiments ran TPTDS on Hadoop in both one-dimensional and multidimensional MapReduce frameworks; multidimensional anonymization gave the better result on the adult dataset, as shown by the IGPL values the algorithm generated. Datasets are generalized in a top-down manner, and anonymization is performed through specialization operations on a taxonomy tree. The experiments show that the solution improves the IGPL values and the anonymity parameter and decreases the execution time of big data privacy preservation compared with the existing algorithm. These results are promising for applications in distributed environments.
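
The abstract ranks specializations by IGPL but does not define it. The following is a minimal sketch, assuming the definition commonly used in the top-down specialization literature: IGPL = IG / (PL + 1), where IG is the entropy-based information gain of splitting a generalized value into its taxonomy children and PL is the resulting drop in anonymity (smallest group size). The toy taxonomy, the records, and every function name below are hypothetical and are not taken from the paper.

```python
from collections import Counter, defaultdict
from math import log2

# Hypothetical one-level taxonomy: raw education value -> child of the root "Any".
LEAF_TO_CHILD = {
    "9th": "Secondary",
    "12th": "Secondary",
    "Bachelors": "University",
    "Masters": "University",
}


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())


def igpl(records, leaf_to_child):
    """Score the specialization of the root value into its children.

    records: list of (raw_value, class_label) pairs, all assumed to be
    generalized to the root value before the specialization.
    """
    by_child = defaultdict(list)
    for raw_value, label in records:
        by_child[leaf_to_child[raw_value]].append(label)

    parent_labels = [label for _, label in records]
    info_gain = entropy(parent_labels) - sum(
        len(labels) / len(records) * entropy(labels)
        for labels in by_child.values()
    )

    # Anonymity = size of the smallest group sharing one generalized value.
    anonymity_before = len(records)
    anonymity_after = min(len(labels) for labels in by_child.values())
    privacy_loss = anonymity_before - anonymity_after

    return info_gain / (privacy_loss + 1)


records = [
    ("9th", "<=50K"),
    ("12th", "<=50K"),
    ("Bachelors", ">50K"),
    ("Masters", ">50K"),
]
score = igpl(records, LEAF_TO_CHILD)
```

In this toy case the split separates the two classes perfectly (IG = 1 bit) and lowers anonymity from 4 to 2 (PL = 2), so the score is 1/3. A top-down procedure would apply the highest-scoring specialization that still satisfies the anonymity requirement and then repeat.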

LRDM: Local Record-Driving Mechanism for Big Data Privacy Preservation in Social Networks

2016 IEEE First International Conference on Data Science in Cyberspace (DSC) ◽

10.1109/dsc.2016.94 ◽

2016 ◽

Cited By ~ 2

Author(s):

Weihao Li ◽

Hui Li

Keyword(s):

Social Networks ◽

Big Data ◽

Data Privacy ◽

Privacy Preservation ◽

Driving Mechanism ◽

Big Data Privacy

Scalable Two-Phase Top-Down Specification for Big Data Anonymization Using Apache Pig

Advances in Intelligent Systems and Computing - Advances in Artificial Intelligence and Data Engineering ◽

10.1007/978-981-15-3514-7_75 ◽

2020 ◽

pp. 1009-1021

Author(s):

Anushree Raj ◽

Rio D’Souza

Keyword(s):

Big Data ◽

Top Down ◽

Two Phase ◽

Data Anonymization ◽

Apache Pig

A Scalable Two-Phase Top-Down Specialization Approach for Data Anonymization Using Map Reduce on Cloud

International Journal of Computer Applications Technology and Research ◽

10.7753/ijcatr0405.1015 ◽

2015 ◽

Vol 4 (5) ◽

pp. 409-413

Author(s):

R. Thaayumaanavan ◽

N. Priya ◽

J. Balaguru

Keyword(s):

Map Reduce ◽

Top Down ◽

Two Phase ◽

Data Anonymization

Big Data Privacy Preservation for Cyber-Physical Systems

10.1007/978-3-030-13370-2 ◽

2019 ◽

Cited By ~ 1

Author(s):

Miao Pan ◽

Jingyi Wang ◽

Sai Mounika Errapotu ◽

Xinyue Zhang ◽

Jiahao Ding ◽

...

Keyword(s):

Big Data ◽

Data Privacy ◽

Privacy Preservation ◽

Cyber Physical Systems ◽

Physical Systems ◽

Big Data Privacy

D2D Big Data Privacy-Preserving Framework Based on (a, k)-Anonymity Model

Mathematical Problems in Engineering ◽

10.1155/2019/2076542 ◽

2019 ◽

Vol 2019 ◽

pp. 1-11 ◽

Cited By ~ 1

Author(s):

Jie Wang ◽

Hongtao Li ◽

Feng Guo ◽

Wenyin Zhang ◽

Yifeng Cui

Keyword(s):

Big Data ◽

Private Information ◽

Data Privacy ◽

Privacy Preservation ◽

Computing Time ◽

Privacy Preserving ◽

D2D Communication ◽

Group Data ◽

Big Data Privacy ◽

Daunting Challenge

As a novel and promising technology for 5G networks, device-to-device (D2D) communication has garnered significant research interest because of its rapid sharing, high delivery accuracy, and variety of applications and services. Big data technology offers unprecedented opportunities and poses a daunting challenge to D2D communication and sharing, where the data often contain private information concerning users or organizations and thus are at risk of being leaked. Privacy preservation is necessary for D2D services but has not been extensively studied. In this paper, we propose an (a, k)-anonymity privacy-preserving framework for D2D big data deployed on MapReduce. First, we provide a framework for D2D big data sharing and analyze the threat model. Then, in our privacy-preserving framework, we adopt (a, k)-anonymity as the privacy-preserving model for D2D big data and use distributed MapReduce to classify and group data for massive datasets. The results of experiments and theoretical analysis show that our privacy-preserving algorithm deployed on MapReduce is effective for D2D big data privacy protection, with less information loss and computing time.
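
The abstract names (a, k)-anonymity without stating the condition. The following is a minimal sketch of a check, assuming the usual reading of the model: every group of records sharing the same quasi-identifier values has at least k records, and no sensitive value accounts for more than a fraction a of any group. The map and reduce steps are simulated in plain Python; the column names, sample records, and function names are hypothetical and do not come from the paper.

```python
from collections import Counter, defaultdict


def map_record(record, qi_columns, sensitive_column):
    """Map step: emit (quasi-identifier tuple, sensitive value)."""
    key = tuple(record[column] for column in qi_columns)
    return key, record[sensitive_column]


def reduce_group(sensitive_values, a, k):
    """Reduce step: check one quasi-identifier group against (a, k)."""
    size = len(sensitive_values)
    if size < k:
        return False
    # Count of the most frequent sensitive value in this group.
    top_count = Counter(sensitive_values).most_common(1)[0][1]
    return top_count / size <= a


def satisfies_a_k_anonymity(records, qi_columns, sensitive_column, a, k):
    """Group records by quasi-identifiers, then check every group."""
    groups = defaultdict(list)
    for record in records:
        key, value = map_record(record, qi_columns, sensitive_column)
        groups[key].append(value)
    return all(reduce_group(values, a, k) for values in groups.values())


# Hypothetical, already-generalized records.
sample = [
    {"age": "20-30", "zip": "130**", "disease": "flu"},
    {"age": "20-30", "zip": "130**", "disease": "cold"},
    {"age": "30-40", "zip": "148**", "disease": "flu"},
    {"age": "30-40", "zip": "148**", "disease": "flu"},
]
```

With a = 0.5 and k = 2, the first group in `sample` passes (two records, each disease at 50%) while the second fails because "flu" makes up 100% of it, so the dataset as a whole does not satisfy the condition.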

Scalable Local-Recoding Anonymization using Locality Sensitive Hashing for Big Data Privacy Preservation

Proceedings of the 25th ACM International on Conference on Information and Knowledge Management - CIKM '16 ◽

10.1145/2983323.2983841 ◽

2016 ◽

Cited By ~ 5

Author(s):

Xuyun Zhang ◽

Christopher Leckie ◽

Wanchun Dou ◽

Jinjun Chen ◽

Ramamohanarao Kotagiri ◽

...

Keyword(s):

Big Data ◽

Data Privacy ◽

Privacy Preservation ◽

Locality Sensitive Hashing ◽

Big Data Privacy

Proximity-Aware Local-Recoding Anonymization with MapReduce for Scalable Big Data Privacy Preservation in Cloud

IEEE Transactions on Computers ◽

10.1109/tc.2014.2360516 ◽

2015 ◽

Vol 64 (8) ◽

pp. 2293-2307 ◽

Cited By ~ 49

Author(s):

Xuyun Zhang ◽

Wanchun Dou ◽

Jian Pei ◽

Surya Nepal ◽

Chi Yang ◽

...

Keyword(s):

Big Data ◽

Data Privacy ◽

Privacy Preservation ◽

Big Data Privacy

Critical Analysis of Big Data Privacy Preservation Techniques and Challenges

10.1007/978-981-16-3071-2_23 ◽

2021 ◽

pp. 267-278

Author(s):

Suman Madan ◽

Kirti Bhardwaj ◽

Shubhangi Gupta

Keyword(s):

Big Data ◽

Critical Analysis ◽

Data Privacy ◽

Privacy Preservation ◽

Big Data Privacy

MRMondrian: Scalable Multidimensional Anonymisation for Big Data Privacy Preservation

IEEE Transactions on Big Data ◽

10.1109/tbdata.2017.2787661 ◽

2017 ◽

pp. 1-1 ◽

Cited By ~ 5

Author(s):

Xuyun Zhang ◽

Lianyong Qi ◽

Wanchun Dou ◽

Qiang He ◽

Christopher Leckie ◽

...

Keyword(s):

Big Data ◽

Data Privacy ◽

Privacy Preservation ◽

Big Data Privacy

Big Data Anonymization in Cloud using k-Anonymity Algorithm using Map Reduce Framework

International Journal of Scientific Research in Computer Science Engineering and Information Technology ◽

10.32628/cseit19516 ◽

2019 ◽

pp. 50-56

Author(s):

Anushree Raj ◽

Rio G L D'Souza

Keyword(s):

Big Data ◽

Data Analysis ◽

Privacy Protection ◽

Map Reduce ◽

Top Down ◽

Huge Amount ◽

Cloud Data ◽

Data Anonymization

Anonymization techniques provide privacy protection for data published on the cloud. These techniques include various algorithms that generalize or suppress the data. Top Down Specification in k-anonymity is the best generalization algorithm for data anonymization. As the volume of data on the cloud grows, data analysis becomes very tedious. The MapReduce framework can be adapted to process this huge amount of big data. We implement a generalization method using the Map phase and the Reduce phase for data anonymization on the cloud, in two different phases of Top Down Specification.
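
As an illustration of how Map and Reduce phases can support a k-anonymity check, the sketch below generalizes each record in a map step, counts identical generalized records in a reduce step, and tests whether every group has at least k members. It is not the authors' Top Down Specification procedure and runs in plain Python rather than on Hadoop; the generalization rules (age ranges, masked ZIP codes), the sample data, and the function names are all assumptions made for the example.

```python
from collections import defaultdict


def generalize_age(age, width):
    """Replace an exact age with a range of the given width, e.g. 23 -> '20-29'."""
    low = (age // width) * width
    return f"{low}-{low + width - 1}"


def map_phase(records, width):
    """Map: emit (generalized quasi-identifier tuple, 1) for every record."""
    for age, zipcode in records:
        yield (generalize_age(age, width), zipcode[:3] + "**"), 1


def reduce_phase(pairs):
    """Reduce: sum the counts emitted for each generalized key."""
    counts = defaultdict(int)
    for key, one in pairs:
        counts[key] += one
    return dict(counts)


def is_k_anonymous(records, width, k):
    """True if every generalized group contains at least k records."""
    counts = reduce_phase(map_phase(records, width))
    return min(counts.values()) >= k


# Hypothetical (age, ZIP code) records.
people = [(23, "13053"), (27, "13068"), (34, "13053"), (38, "13012")]
```

With an age range width of 10, `people` forms two groups of two records and is 2-anonymous; with a width of 5, every record falls into its own group and the check fails. A top-down method would start from the most general values and specialize only while a check of this kind still passes.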
