A Privacy Preserving Cloud-Based K-NN Search Scheme with Lightweight User Loads

Yeong-Cherng Hsu; Chih-Hsin Hsueh; Ja-Ling Wu

doi:10.3390/computers9010001

Computers ◽

10.3390/computers9010001 ◽

2020 ◽

Vol 9 (1) ◽

pp. 1 ◽

Cited By ~ 1

Author(s):

Yeong-Cherng Hsu ◽

Chih-Hsin Hsueh ◽

Ja-Ling Wu

Keyword(s):

Data Privacy ◽

Nearest Neighbor ◽

Search Algorithm ◽

Data Access ◽

Privacy Preserving ◽

Secret Key ◽

K Nearest Neighbor ◽

Sensitive Data ◽

Cloud Data ◽

Cloud Server

With the growing popularity of cloud computing, it is convenient for data owners to outsource their data to a cloud server. By utilizing the massive storage and computational resources of the cloud, data owners can also provide a platform for users to make query requests. However, due to privacy concerns, sensitive data should be encrypted before outsourcing. In this work, a novel privacy preserving K-nearest neighbor (K-NN) search scheme over the encrypted outsourced cloud dataset is proposed. The problem is to let the cloud server find the K nearest points with respect to an encrypted query on the encrypted dataset, which was outsourced by data owners, and return the search results to the querying user. Compared with existing methods, our approach makes greater use of cloud resources by shifting most of the required computational load from data owners and query users to the cloud server. In addition, there is no need for data owners to share their secret key with others. In a nutshell, in the proposed scheme, data points and user queries are encrypted attribute-wise and the entire search algorithm is performed in the encrypted domain; therefore, our approach not only preserves data privacy and query privacy but also hides the data access pattern from the cloud server. Moreover, by using a tree structure, the proposed scheme can accomplish query requests in sublinear time, according to our performance analysis. Finally, experimental results demonstrate the practicability and efficiency of our method.
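
To make the idea of tree-based K-NN search concrete, the following is a minimal plaintext sketch, not the scheme from the paper. It assumes a k-d tree as the tree structure (the abstract only says "a tree structure") and performs no encryption at all; the names `Node`, `build`, and `knn` are hypothetical. It shows only why a tree lets the search skip whole subtrees instead of scanning every point.

```python
import heapq
from typing import List, Optional, Tuple

Point = Tuple[float, ...]


class Node:
    """One k-d tree node: a point, its splitting axis, and two subtrees."""

    def __init__(self, point: Point, axis: int,
                 left: Optional["Node"], right: Optional["Node"]):
        self.point = point
        self.axis = axis
        self.left = left
        self.right = right


def build(points: List[Point], depth: int = 0) -> Optional[Node]:
    """Build a k-d tree by splitting on the median along a cycling axis."""
    if not points:
        return None
    axis = depth % len(points[0])
    ordered = sorted(points, key=lambda p: p[axis])
    mid = len(ordered) // 2
    return Node(ordered[mid], axis,
                build(ordered[:mid], depth + 1),
                build(ordered[mid + 1:], depth + 1))


def sq_dist(a: Point, b: Point) -> float:
    """Squared Euclidean distance (ordering is the same as for the distance)."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def knn(root: Optional[Node], query: Point, k: int) -> List[Point]:
    """Return the k points nearest to query, closest first."""
    heap: List[Tuple[float, Point]] = []  # max-heap via negated distances

    def visit(node: Optional[Node]) -> None:
        if node is None:
            return
        d = sq_dist(node.point, query)
        if len(heap) < k:
            heapq.heappush(heap, (-d, node.point))
        elif d < -heap[0][0]:
            heapq.heapreplace(heap, (-d, node.point))
        diff = query[node.axis] - node.point[node.axis]
        near, far = (node.left, node.right) if diff < 0 else (node.right, node.left)
        visit(near)
        # Descend into the far side only if the splitting plane is closer
        # than the current k-th best distance; otherwise prune the subtree.
        if len(heap) < k or diff * diff < -heap[0][0]:
            visit(far)

    visit(root)
    return [p for _, p in sorted(heap, key=lambda t: -t[0])]


points = [(2, 3), (5, 4), (9, 6), (4, 7), (8, 1), (7, 2)]
tree = build(points)
nearest = knn(tree, (9, 2), k=2)
```

In an encrypted-domain scheme such as the one the abstract describes, the comparisons and distance computations would have to be carried out on ciphertexts; that part is outside this sketch.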

Privacy Preserving k-Nearest Neighbor for Medical Diagnosis in e-Health Cloud

Journal of Healthcare Engineering ◽

10.1155/2018/4073103 ◽

2018 ◽

Vol 2018 ◽

pp. 1-11 ◽

Cited By ~ 7

Author(s):

Jeongsu Park ◽

Dong Hoon Lee

Keyword(s):

Cloud Computing ◽

Medical Diagnosis ◽

Nearest Neighbor ◽

Data Access ◽

Privacy Preserving ◽

K Nearest Neighbor ◽

Diagnosis System ◽

Cloud Servers ◽

Health Cloud ◽

Medical Dataset

Cloud computing is highly suitable for medical diagnosis in e-health services where strong computing ability is required. However, in spite of the huge benefits of adopting cloud computing, the medical diagnosis field is not yet ready to adopt it because the field handles sensitive data, and using cloud computing might therefore raise serious concerns about privacy infringement. For instance, a compromised e-health cloud server might expose the medical dataset outsourced from multiple medical data owners or infringe on the privacy of a patient inquirer by leaking his/her symptom or diagnosis result. In this paper, we propose a medical diagnosis system using e-health cloud servers in a privacy preserving manner when medical datasets are owned by multiple data owners. The proposed system is the first one that achieves the privacy of the medical dataset, symptoms, and diagnosis results and hides the data access pattern even from the e-health cloud servers performing computations on the data, while remaining robust against collusion of the entities. As a building block of the proposed diagnosis system, we design a novel privacy preserving protocol for finding the k data with the highest similarity (PE-FTK) to a given symptom. The protocol reduces the average running time by 35% compared to that of a previous work in the literature. Moreover, the result of the previous work is probabilistic, i.e., the result can contain some error, while the result of our PE-FTK is deterministic, i.e., the result is correct without any error probability.

Using Cryptography For Privacy-Preserving Data Mining

Data Mining and Knowledge Discovery Technologies ◽

10.4018/978-1-60566-218-3.ch014 ◽

2008 ◽

pp. 175-194

Author(s):

Justin Zhan

Keyword(s):

Data Mining ◽

Data Privacy ◽

Nearest Neighbor ◽

Privacy Preserving ◽

K Nearest Neighbor ◽

Privacy Concerns ◽

Private Data ◽

Definition Of ◽

Types Of Information ◽

Neighbor Classification

To conduct data mining, we often need to collect data from various parties. Privacy concerns may prevent the parties from directly sharing the data and some types of information about the data. How multiple parties can collaboratively conduct data mining without breaching data privacy presents a challenge. The goal of this paper is to provide solutions for privacy-preserving k-nearest neighbor classification, which is one of the common data mining tasks. Our goal is to obtain accurate data mining results without disclosing private data. We propose a formal definition of privacy and show that our solutions preserve data privacy.

Control Cloud Data Access Privilge Anonymity with Attributed Based Encryption

International Journal of Advanced Research in Computer Science and Software Engineering ◽

10.23956/ijarcsse.v7i8.68 ◽

2017 ◽

Vol 7 (8) ◽

pp. 279

Author(s):

P. Sudheer ◽

T. Lakshmi Surekha

Keyword(s):

Data Privacy ◽

Low Cost ◽

Data Access ◽

Privacy Concerns ◽

Cloud Data ◽

Computing Paradigm ◽

Attribute Based Encryption ◽

Data Content ◽

Identity Privacy ◽

Cloud Servers

Cloud computing is a revolutionary computing paradigm that enables flexible, on-demand, and low-cost usage of computing resources, but the data is outsourced to cloud servers, and various privacy concerns emerge from this. Various schemes based on attribute-based encryption have been proposed to secure cloud storage, mostly with a focus on data content privacy. A semi-anonymous privilege control scheme, AnonyControl, addresses not only data privacy but also user identity privacy. AnonyControl decentralizes the central authority to limit identity leakage and thus achieves semi-anonymity. A further scheme, Anonymity-F, fully prevents identity leakage and achieves full anonymity.

Federated deep learning for detecting COVID-19 lung abnormalities in CT: a privacy-preserving multinational validation study

npj Digital Medicine ◽

10.1038/s41746-021-00431-6 ◽

2021 ◽

Vol 4 (1) ◽

Author(s):

Qi Dou ◽

Tiffany Y. So ◽

Meirui Jiang ◽

Quande Liu ◽

Varut Vardhanabhuti ◽

...

Keyword(s):

Data Privacy ◽

Medical Training ◽

External Validation ◽

Mainland China ◽

Privacy Preserving ◽

Generalization Capability ◽

Major Focus ◽

Sensitive Data ◽

Lung Abnormalities ◽

Medical Image Diagnosis

Data privacy mechanisms are essential for rapidly scaling medical training databases to capture the heterogeneity of patient data distributions toward robust and generalizable machine learning systems. In the current COVID-19 pandemic, a major focus of artificial intelligence (AI) is interpreting chest CT, which can be readily used in the assessment and management of the disease. This paper demonstrates the feasibility of a federated learning method for detecting COVID-19 related CT abnormalities with external validation on patients from a multinational study. We recruited 132 patients from seven different multinational centers, with three internal hospitals from Hong Kong for training and testing, and four external, independent datasets from Mainland China and Germany for validating model generalizability. We also conducted case studies on longitudinal scans for automated estimation of lesion burden for hospitalized COVID-19 patients. We explore federated learning algorithms to develop a privacy-preserving AI model for COVID-19 medical image diagnosis with good generalization capability on unseen multinational datasets. Federated learning could provide an effective mechanism during pandemics to rapidly develop clinically useful AI across institutions and countries, overcoming the burden of central aggregation of large amounts of sensitive data.
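
As a rough illustration of how federated learning avoids central aggregation of raw data, the sketch below shows one aggregation round of federated averaging (FedAvg). This is an assumption for illustration only: the abstract does not state which aggregation rule the study used, and the function name `federated_average` and the flat-list weight format are hypothetical. Each site sends only its locally trained parameters and its sample count; the images themselves never leave the hospital.

```python
from typing import List


def federated_average(client_weights: List[List[float]],
                      client_sizes: List[int]) -> List[float]:
    """Average client parameter vectors, weighted by local sample count."""
    if not client_weights or len(client_weights) != len(client_sizes):
        raise ValueError("need one sample count per client")
    total = sum(client_sizes)
    dim = len(client_weights[0])
    return [
        sum(w[d] * n for w, n in zip(client_weights, client_sizes)) / total
        for d in range(dim)
    ]


# Two hypothetical sites with 1 and 3 local samples respectively.
global_weights = federated_average([[1.0, 2.0], [3.0, 4.0]], [1, 3])
```

A real system would repeat this round many times, sending the averaged parameters back to each site for further local training.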

Clustering of cancer data based on Stiefel manifold for multiple views

BMC Bioinformatics ◽

10.1186/s12859-021-04195-4 ◽

2021 ◽

Vol 22 (1) ◽

Author(s):

Jing Tian ◽

Jianping Zhao ◽

Chunhou Zheng

Keyword(s):

Optimization Problem ◽

Nearest Neighbor ◽

Search Algorithm ◽

Stiefel Manifold ◽

Omics Data ◽

K Nearest Neighbor ◽

Cancer Data ◽

Clustering Problem ◽

Multiple Datasets ◽

Cluster Class

Background: In recent years, various sequencing techniques have been used to collect biomedical omics datasets. It is usually possible to obtain multiple types of omics data from a single patient sample. Clustering of omics data plays an indispensable role in biological and medical research, and it helps to reveal data structures from multiple collections. Nevertheless, clustering of omics data poses many challenges. The primary challenges in omics data analysis come from the high dimension of the data and the small sample size. Therefore, it is difficult to find a suitable integration method for structural analysis of multiple datasets. Results: In this paper, a multi-view clustering based on Stiefel manifold method (MCSM) is proposed. The MCSM method comprises three core steps. Firstly, we established a binary optimization model for the simultaneous clustering problem. Secondly, we solved the optimization problem by a linear search algorithm based on the Stiefel manifold. Finally, we integrated the clustering results obtained from three omics by using the k-nearest neighbor method. We applied this approach to four cancer datasets on TCGA. The results show that our method is superior to several state-of-the-art methods that depend on the hypothesis that the underlying omics cluster class is the same. Conclusion: In particular, our approach performs better than the compared approaches when the underlying clusters are inconsistent. For patients with different subtypes, both consistent and differential clusters can be identified at the same time.

Privacy preserving cloud data access with multi-authorities

2013 Proceedings IEEE INFOCOM ◽

10.1109/infcom.2013.6567070 ◽

2013 ◽

Cited By ~ 72

Author(s):

Taeho Jung ◽

Xiang-Yang Li ◽

Zhiguo Wan ◽

Meng Wan

Keyword(s):

Data Access ◽

Privacy Preserving ◽

Cloud Data

KVGCN: A KNN Searching and VLAD Combined Graph Convolutional Network for Point Cloud Segmentation

Remote Sensing ◽

10.3390/rs13051003 ◽

2021 ◽

Vol 13 (5) ◽

pp. 1003

Author(s):

Nan Luo ◽

Hongquan Yu ◽

Zhenfeng Huo ◽

Jinhui Liu ◽

Quan Wang ◽

...

Keyword(s):

Point Cloud ◽

Nearest Neighbor ◽

Semantic Segmentation ◽

K Nearest Neighbor ◽

Topological Graph ◽

Convolutional Network ◽

Cloud Data ◽

Nearest Neighbor Searching ◽

Point Cloud Segmentation ◽

Local Feature Extraction

Semantic segmentation of sensed point cloud data plays a significant role in scene understanding and reconstruction, robot navigation, etc. This work presents a Graph Convolutional Network integrating K-Nearest Neighbor searching (KNN) and Vector of Locally Aggregated Descriptors (VLAD). KNN searching is utilized to construct the topological graph of each point and its neighbors. Then, we perform convolution on the edges of the constructed graph to extract representative local features with multiple Multilayer Perceptrons (MLPs). Afterwards, a trainable VLAD layer, NetVLAD, is embedded in the feature encoder to aggregate the local and global contextual features. The designed feature encoder is repeated multiple times, and the extracted features are concatenated in a jump-connection style to strengthen the distinctiveness of the features and thereby improve the segmentation. Experimental results on two datasets show that the proposed work addresses the shortcoming of insufficient local feature extraction and improves the accuracy of semantic segmentation (mIoU 60.9% and oAcc 87.4% for S3DIS) compared to existing models.
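
The graph construction step described above can be sketched as follows. This is a minimal brute-force illustration, not the KVGCN implementation: the function name `knn_graph` is hypothetical, the neighbor search is a plain sort over all distances, and nothing here covers the convolution or NetVLAD stages.

```python
import math
from typing import Dict, List, Sequence, Tuple

Point3D = Tuple[float, float, float]


def knn_graph(points: Sequence[Point3D], k: int) -> Dict[int, List[int]]:
    """Map each point index to the indices of its k nearest other points."""
    graph: Dict[int, List[int]] = {}
    for i, p in enumerate(points):
        # Distances from point i to every other point, closest first.
        ranked = sorted(
            (math.dist(p, q), j) for j, q in enumerate(points) if j != i
        )
        graph[i] = [j for _, j in ranked[:k]]
    return graph


cloud = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 2.0, 0.0), (5.0, 5.0, 5.0)]
graph = knn_graph(cloud, k=2)
```

Each entry `graph[i]` lists the directed edges from point `i` to its neighbors; an edge-based convolution would then compute features over these pairs.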

Privacy-Preserving Sorting Algorithms Based on Logistic Map for Clouds

Security and Communication Networks ◽

10.1155/2018/2373545 ◽

2018 ◽

Vol 2018 ◽

pp. 1-10

Author(s):

Hua Dai ◽

Hui Ren ◽

Zhiye Chen ◽

Geng Yang ◽

Xun Yi

Keyword(s):

Data Privacy ◽

Logistic Map ◽

Security Analysis ◽

Privacy Preserving ◽

Service Recommendation ◽

Sensitive Data ◽

Encrypted Data ◽

Sorting Algorithms ◽

Common Operation ◽

Cloud Servers

Outsourcing data to clouds is being adopted by more and more companies and individuals because of the benefits of data sharing and of parallel, elastic, and on-demand computing. However, it forces data owners to give up control of their own data, which raises privacy problems for sensitive data. Sorting is a common operation in many areas, such as machine learning, service recommendation, and data query. It is a challenge to implement privacy-preserving sorting over encrypted data without leaking private information about the sensitive data. In this paper, we propose privacy-preserving sorting algorithms based on the logistic map. Secure comparable codes are constructed by logistic map functions and can be used to compare the corresponding encrypted data items even without knowing their plaintext values. Data owners first encrypt their data and generate the corresponding comparable codes and then outsource them to clouds. Cloud servers are capable of sorting the outsourced encrypted data in accordance with their corresponding comparable codes by the proposed privacy-preserving sorting algorithms. Security analysis and experimental results show that the proposed algorithms can protect data privacy while providing efficient sorting on encrypted data.
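
For readers unfamiliar with the logistic map itself, the sketch below iterates the standard recurrence x_{n+1} = r * x_n * (1 - x_n). It illustrates only the map, under the assumption that this standard form is the one meant; how the paper derives comparable codes from it is not described in the abstract and is not reproduced here. The function name `logistic_orbit` is hypothetical.

```python
from typing import List


def logistic_orbit(x0: float, r: float, steps: int) -> List[float]:
    """Return [x0, x1, ..., x_steps] under x_{n+1} = r * x_n * (1 - x_n)."""
    orbit = [x0]
    for _ in range(steps):
        x = orbit[-1]
        orbit.append(r * x * (1.0 - x))
    return orbit


# For r = 2 the value 0.5 is a fixed point, so the orbit never moves.
fixed = logistic_orbit(0.5, 2.0, 3)

# For r close to 4 the map is chaotic: two nearby starting values
# produce orbits that drift apart after enough iterations.
a = logistic_orbit(0.300000, 3.9, 50)
b = logistic_orbit(0.300001, 3.9, 50)
```

The sensitivity to the starting value and to `r` is the general property that makes the map attractive as a key-dependent building block.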

A GROSS ERROR ELIMINATION METHOD FOR POINT CLOUD DATA BASED ON KD-TREE

ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-archives-xlii-3-719-2018 ◽

2018 ◽

Vol XLII-3 ◽

pp. 719-722 ◽

Cited By ~ 1

Author(s):

Q. Kang ◽

G. Huang ◽

S. Yang

Keyword(s):

Point Cloud ◽

Nearest Neighbor ◽

Target Point ◽

Point Cloud Data ◽

K Nearest Neighbor ◽

Cloud Data ◽

Gross Error ◽

Error Elimination ◽

K Nearest Neighbor Algorithm ◽

Key Steps

Point cloud data has become a widely used data source in the field of remote sensing. Key steps of point cloud data pre-processing focus on gross error elimination and quality control. Owing to the large volume of point cloud data, existing gross error elimination methods consume a great deal of time and memory. This paper employs a new method that builds a Kd-tree over the data, searches it with the k-nearest neighbor algorithm, and applies an appropriate threshold to judge whether or not a target point is an outlier. Experimental results show that the proposed algorithm helps to delete gross errors in point cloud data, decrease memory consumption, and improve efficiency.
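
The thresholding idea can be sketched as follows. This is a simplified illustration under stated assumptions: the outlier score is taken to be the mean distance to the k nearest neighbors (the abstract does not give the exact criterion), the neighbor search is brute force rather than a Kd-tree so that the example stays short, and the names `mean_knn_distance` and `flag_outliers` are hypothetical.

```python
import math
from typing import List, Sequence, Tuple

Point3D = Tuple[float, float, float]


def mean_knn_distance(points: Sequence[Point3D], i: int, k: int) -> float:
    """Mean distance from point i to its k nearest other points."""
    dists = sorted(
        math.dist(points[i], q) for j, q in enumerate(points) if j != i
    )
    return sum(dists[:k]) / k


def flag_outliers(points: Sequence[Point3D], k: int,
                  threshold: float) -> List[int]:
    """Indices of points whose mean k-NN distance exceeds the threshold."""
    return [
        i for i in range(len(points))
        if mean_knn_distance(points, i, k) > threshold
    ]


cloud = [
    (0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (1.0, 1.0, 0.0),
    (10.0, 10.0, 10.0),  # isolated point, far from the cluster
]
outliers = flag_outliers(cloud, k=2, threshold=3.0)
```

Replacing the brute-force search with a Kd-tree query changes only how the neighbors are found, not the thresholding step.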

Enhanced Integrity Checking for Preserve Data Owner and User Level Privacy Using Dual Cryptography Approach

International Journal of Scientific Research in Computer Science Engineering and Information Technology ◽

10.32628/cseit195346 ◽

2019 ◽

pp. 138-146

Author(s):

Poovizhi. M ◽

Raja. G

Keyword(s):

Data Storage ◽

Data Privacy ◽

Capital Expenditure ◽

Data Access ◽

Third Party ◽

Configurable Computing ◽

Local Data ◽

Cloud Data ◽

On Demand ◽

Integrity Checking

Using Cloud Storage, users can remotely store their data and enjoy on-demand, high-quality applications and services from a shared pool of configurable computing resources, without the burden of local data storage and maintenance. However, the fact that users no longer have physical possession of the outsourced data makes data integrity protection in Cloud Computing a formidable task, especially for users with constrained computing resources. From the users' perspective, including both individuals and IT systems, storing data remotely in the cloud in a flexible on-demand manner brings tempting benefits: relief of the burden of storage management, universal data access independent of geographical location, and avoidance of capital expenditure on hardware, software, and personnel maintenance, etc. To securely introduce an effective Sanitizer and third party auditor (TPA), the following two fundamental requirements have to be met: 1) the TPA should be able to capably audit the cloud data storage without demanding a local copy of the data, and introduce no additional on-line burden to the cloud user; 2) the third party auditing process should bring in no new vulnerabilities towards user data privacy. In this project, we utilize and uniquely combine public auditing protocols with a double encryption approach to achieve a privacy-preserving public cloud data auditing system, which meets all integrity checking requirements without any leakage of data. To support efficient handling of multiple auditing tasks, we further explore the technique of online signature to extend our main result into a multi-user setting, where the TPA can perform multiple auditing tasks simultaneously. We implement a double encryption algorithm that encrypts the data twice before it is stored on the cloud server in Electronic Health Record applications.
