Multi-Party Verifiable Privacy-Preserving Federated k-Means Clustering in Outsourced Environment

Security and Communication Networks ◽

10.1155/2021/3630312 ◽

2021 ◽

Vol 2021 ◽

pp. 1-11

Author(s):

Ruiqi Hou ◽

Fei Tang ◽

Shikai Liang ◽

Guowei Ling

Keyword(s):

Hash Function ◽

Clustering Algorithm ◽

High Accuracy ◽

Privacy Preserving ◽

Sensitive Information ◽

Encrypted Data ◽

Multiple Data ◽

Private Data ◽

Data Volume ◽

Cloud Servers

As a commonly used algorithm in data mining, clustering has been widely applied in many fields, such as machine learning, information retrieval, and pattern recognition. In reality, data to be analyzed are often distributed to multiple parties. Moreover, the rapidly increasing data volume puts heavy computing pressure on data owners. Thus, data owners tend to outsource their own data to cloud servers and obtain data analysis results for the federated data. However, the existing privacy-preserving outsourced k -means schemes cannot verify whether participants share consistent data. Considering the scenarios with multiple data owners and sensitive information security in an outsourced environment, we propose a verifiable privacy-preserving federated k -means clustering scheme. In this article, cloud servers and participants perform k -means clustering algorithm over encrypted data without exposing private data and intermediate results in each iteration. In particular, our scheme can verify the shares from participants when updating the cluster centers based on secret sharing, hash function and blockchain, so that our scheme can resist inconsistent share attacks by malicious participants. Finally, the security and experimental analysis are carried out to show that our scheme can protect private data and get high-accuracy clustering results.

Privacy-Preserving Sorting Algorithms Based on Logistic Map for Clouds

Security and Communication Networks ◽

10.1155/2018/2373545 ◽

2018 ◽

Vol 2018 ◽

pp. 1-10

Author(s):

Hua Dai ◽

Hui Ren ◽

Zhiye Chen ◽

Geng Yang ◽

Xun Yi

Keyword(s):

Data Privacy ◽

Logistic Map ◽

Security Analysis ◽

Privacy Preserving ◽

Service Recommendation ◽

Sensitive Data ◽

Encrypted Data ◽

Sorting Algorithms ◽

Common Operation ◽

Cloud Servers

Outsourcing data in clouds is adopted by more and more companies and individuals due to the profits from data sharing and parallel, elastic, and on-demand computing. However, it forces data owners to lose control of their own data, which causes privacy-preserving problems on sensitive data. Sorting is a common operation in many areas, such as machine learning, service recommendation, and data query. It is a challenge to implement privacy-preserving sorting over encrypted data without leaking privacy of sensitive data. In this paper, we propose privacy-preserving sorting algorithms which are on the basis of the logistic map. Secure comparable codes are constructed by logistic map functions, which can be utilized to compare the corresponding encrypted data items even without knowing their plaintext values. Data owners firstly encrypt their data and generate the corresponding comparable codes and then outsource them to clouds. Cloud servers are capable of sorting the outsourced encrypted data in accordance with their corresponding comparable codes by the proposed privacy-preserving sorting algorithms. Security analysis and experimental results show that the proposed algorithms can protect data privacy, while providing efficient sorting on encrypted data.

Efficient and Privacy-Preserving Multi-User Outsourced K-Means Clustering

Computer and Information Science ◽

10.5539/cis.v14n2p26 ◽

2021 ◽

Vol 14 (2) ◽

pp. 26

Author(s):

Na Li ◽

Lianguan Huang ◽

Yanling Li ◽

Meng Sun

Keyword(s):

Data Mining ◽

Big Data ◽

Clustering Algorithm ◽

Privacy Preserving ◽

Locality Sensitive Hashing ◽

Sensitive Information ◽

The Public ◽

Big Data Mining ◽

Euclidean Distances ◽

Computational Resources

In recent years, with the development of the Internet, the data on the network presents an outbreak trend. Big data mining aims at obtaining useful information through data processing, such as clustering, clarifying and so on. Clustering is an important branch of big data mining and it is popular because of its simplicity. A new trend for clients who lack of storage and computational resources is to outsource the data and clustering task to the public cloud platforms. However, as datasets used for clustering may contain some sensitive information (e.g., identity information, health information), simply outsourcing them to the cloud platforms can't protect the privacy. So clients tend to encrypt their databases before uploading to the cloud for clustering. In this paper, we focus on privacy protection and efficiency promotion with respect to k-means clustering, and we propose a new privacy-preserving multi-user outsourced k-means clustering algorithm which is based on locality sensitive hashing (LSH). In this algorithm, we use a Paillier cryptosystem encrypting databases, and combine LSH to prune off some unnecessary computations during the clustering. That is, we don't need to compute the Euclidean distances between each data record and each clustering center. Finally, the theoretical and experimental results show that our algorithm is more efficient than most existing privacy-preserving k-means clustering.

PP-VCA: A Privacy-Preserving and Verifiable Combinatorial Auction Mechanism

Wireless Communications and Mobile Computing ◽

10.1155/2020/8888284 ◽

2020 ◽

Vol 2020 ◽

pp. 1-11

Author(s):

Mingwu Zhang ◽

Bingruolan Zhou

Keyword(s):

Privacy Preserving ◽

Network Routing ◽

Combinatorial Auction ◽

Sensitive Information ◽

Sensitive Data ◽

Winner Determination ◽

Spectrum Auction ◽

Auction Price ◽

Data Volume ◽

Auction Mechanisms

Combinatorial auctions can be employed in the fields such as spectrum auction, network routing, railroad segment, and energy auction, which allow multiple goods to be sold simultaneously and any combination of goods to be bid and the maximum sum of combinations of bidding prices to be calculated. However, in traditional combinatorial auction mechanisms, data concerning bidders’ price and bundle might reveal sensitive information, such as personal preference and competitive relation since the winner determination problem needs to be resolved in terms of sensitive data as above. In order to solve this issue, this paper exploits a privacy-preserving and verifiable combinatorial auction protocol (PP-VCA) to protect bidders’ privacy and ensure the correct auction price in a secure manner, in which we design a one-way and monotonically increasing function to protect a bidder’s bid to enable the auctioneer to pick out the largest bid without revealing any information about bids. Moreover, we design and employ three subprotocols, namely, privacy-preserving winner determination protocol, privacy-preserving scalar protocol, and privacy-preserving verifiable payment determination protocol, to implement the combinatorial auction with bidder privacy and payment verifiability. The results of comprehensive experimental evaluations indicate that our proposed scheme provides a better efficiency and flexibility to meet different types of data volume in terms of the number of goods and bidders.

Optimal Privacy Preserving Technique Over Big Data Analytics Using Oppositional Fruit Fly Algorithm

Recent Advances in Computer Science and Communications ◽

10.2174/2213275911666181119113913 ◽

2020 ◽

Vol 13 (2) ◽

pp. 283-295

Author(s):

Ajmeera Kiran ◽

Vasumathi Devara

Keyword(s):

Big Data ◽

Data Analytics ◽

Input Data ◽

Clustering Algorithm ◽

Big Data Analytics ◽

Fruit Fly ◽

Privacy Preserving ◽

Sensitive Information ◽

Convolution Process ◽

Fuzzy C Means Clustering

Background: Big data analytics is the process of utilizing a collection of data accompanied on the internet to store and retrieve anywhere and at any time. Big data is not simply a data but it involves the data generated by variety of gadgets or devices or applications. Objective: When massive volume of data is stored, there is a possibility for malevolent attacks on the searching data are stored in the server because of under privileged privacy preserving approaches. These traditional methods result in many drawbacks due to various attacks on sensitive information. Hence, to enhance the privacy preserving for sensitive information stored in the database, the proposed method makes use of efficient methods. Methods: In this manuscript, an optimal privacy preserving over big data using Hadoop and mapreduce framework is proposed. Initially, the input data is grouped by modified fuzzy c means clustering algorithm. Then we are performing a map reduce framework. And then the clustered data is fed to the mapper; in mapper the privacy of input data is done by convolution process. To validate the privacy of input data the recommended technique utilizes the optimal artificial neural network. Here, oppositional fruit fly algorithm is used to enhancing the neural networks. Results: The routine of the suggested system is assessed by means of clustering accuracy, error value, memory, and time. The experimentation is performed by KDD dataset. Conclusion: A result shows that our proposed system has maximum accuracy and attains the effective convolution process to improve privacy preserving.

SoK: Efficient Privacy-preserving Clustering

Proceedings on Privacy Enhancing Technologies ◽

10.2478/popets-2021-0068 ◽

2021 ◽

Vol 2021 (4) ◽

pp. 225-248

Author(s):

Aditya Hegde ◽

Helen Möllering ◽

Thomas Schneider ◽

Hossein Yalame

Keyword(s):

Privacy Preserving ◽

Sensitive Information ◽

Multiple Sources ◽

Machine Learning Technique ◽

Clustering Quality ◽

Business Analysis ◽

Learning Technique ◽

Real World Applications ◽

Cloud Servers

Abstract Clustering is a popular unsupervised machine learning technique that groups similar input elements into clusters. It is used in many areas ranging from business analysis to health care. In many of these applications, sensitive information is clustered that should not be leaked. Moreover, nowadays it is often required to combine data from multiple sources to increase the quality of the analysis as well as to outsource complex computation to powerful cloud servers. This calls for efficient privacy-preserving clustering. In this work, we systematically analyze the state-of-the-art in privacy-preserving clustering. We implement and benchmark today’s four most efficient fully private clustering protocols by Cheon et al. (SAC’19), Meng et al. (ArXiv’19), Mohassel et al. (PETS’20), and Bozdemir et al. (ASIACCS’21) with respect to communication, computation, and clustering quality. We compare them, assess their limitations for a practical use in real-world applications, and conclude with open challenges.

Data mining as a tool in privacy-preserving data publishing

Tatra Mountains Mathematical Publications ◽

10.2478/v10127-010-0011-z ◽

2010 ◽

Vol 45 (1) ◽

pp. 151-159 ◽

Cited By ~ 2

Author(s):

Michal Sramka

Keyword(s):

Data Mining ◽

Decision Making ◽

Data Analysis ◽

Privacy Preserving ◽

Data Publishing ◽

Sensitive Information ◽

Private Data ◽

Privacy Preserving Data Publishing

ABSTRACTMany databases contain data about individuals that are valuable for research, marketing, and decision making. Sharing or publishing data about individuals is however prone to privacy attacks, breaches, and disclosures. The concern here is about individuals’ privacy-keeping the sensitive information about individuals private to them. Data mining in this setting has been shown to be a powerful tool to breach privacy and make disclosures. In contrast, data mining can be also used in practice to aid data owners in their decision on how to share and publish their databases. We present and discuss the role and uses of data mining in these scenarios and also briefly discuss other approaches to private data analysis.

An Efficient Privacy-Preserving Rank Query over Encrypted Data in Cloud Computing

Chinese Journal of Computers ◽

10.3724/sp.j.1016.2012.02215 ◽

2012 ◽

Vol 35 (11) ◽

pp. 2215 ◽

Cited By ~ 2

Author(s):

Fang-Quan CHENG ◽

Zhi-Yong PENG ◽

Wei SONG ◽

Shu-Lin WANG ◽

Yi-Hui CUI

Keyword(s):

Cloud Computing ◽

Privacy Preserving ◽

Encrypted Data

A Survey on Privacy-Preserving Outsourced Data on Cloud with Multiple Data Providers

SSRN Electronic Journal ◽

10.2139/ssrn.3601814 ◽

2020 ◽

Author(s):

Aman Singh Chauhan ◽

Dikshika Rani ◽

Akash Kumar ◽

Rishabh Gupta ◽

Ashutosh Kumar Singh

Keyword(s):

Privacy Preserving ◽

Multiple Data ◽

Outsourced Data

Privacy-Preserving Deep Neural Network Methods: Computational and Perceptual Methods—An Overview

Electronics ◽

10.3390/electronics10111367 ◽

2021 ◽

Vol 10 (11) ◽

pp. 1367

Author(s):

Raghida El El Saj ◽

Ehsan Sedgh Sedgh Gooya ◽

Ayman Alfalou ◽

Mohamad Khalil

Keyword(s):

Neural Networks ◽

Classification Accuracy ◽

Deep Neural Networks ◽

Privacy Preserving ◽

Sensitive Data ◽

Encrypted Data ◽

High Classification Accuracy ◽

Hide Information ◽

Network Methods ◽

Cloud Environments

Privacy-preserving deep neural networks have become essential and have attracted the attention of many researchers due to the need to maintain the privacy and the confidentiality of personal and sensitive data. The importance of privacy-preserving networks has increased with the widespread use of neural networks as a service in unsecured cloud environments. Different methods have been proposed and developed to solve the privacy-preserving problem using deep neural networks on encrypted data. In this article, we reviewed some of the most relevant and well-known computational and perceptual image encryption methods. These methods as well as their results have been presented, compared, and the conditions of their use, the durability and robustness of some of them against attacks, have been discussed. Some of the mentioned methods have demonstrated an ability to hide information and make it difficult for adversaries to retrieve it while maintaining high classification accuracy. Based on the obtained results, it was suggested to develop and use some of the cited privacy-preserving methods in applications other than classification.

Achieving Efficient and Privacy-Preserving Set Containment Search over Encrypted Data

IEEE Transactions on Services Computing ◽

10.1109/tsc.2021.3065240 ◽

2021 ◽

pp. 1-1

Author(s):

Yandong Zheng ◽

Rongxing Lu ◽

Yunguo Guan ◽

Jun Shao ◽

Hui Zhu

Keyword(s):

Privacy Preserving ◽

Encrypted Data ◽

Set Containment