The issues connected with the anonymization of medical data. Part 1. The introduction to the anonymization of medical data. Ensuring the protection of sensitive information with the use of such methods as f(a) and f(a,b)

Introduction: Medical documentation must be protected against damage or loss, in compliance with its integrity and credibility and the opportunity to a permanent access by the authorized staff and, finally, protected against the access of unauthorized persons. Anonymization is one of the methods to safeguard the data against the disclosure.Aim of the study: The study aims at the analysis of methods of anonymization, the analysis of methods of the protection of anonymized data and the study of a new security type of privacy enabling to control sensitive data by the entity which the data concerns.Material and methods: The analytical and algebraic methods were used.Results: The study ought to deliver the materials supporting the choice and analysis of the ways of the anonymization of medical data, and develop a new privacy protection solution enabling the control of sensitive data by entities whom this data concerns.Conclusions: In the paper, the analysis of solutions of data anonymizing used for medical data privacy protection was con-ducted. The methods, such as k-Anonymity, (X,y)- Anonymity, (a,k)- Anonymity, (k,e)-Anonymity, (X,y)-Privacy, LKC-Privacy, l-Diversity, (X,y)-Linkability, t-Closeness, Confidence Bounding and Personalized Privacy were described, explained and analyzed. The analysis of solutions to control sensitive data by their owners was also conducted. Apart from the existing methods of the anonymization, the analysis of methods of the anonimized data protection was conducted, in particular the methods of: d-Presence, e-Differential Privacy, (d,g)-Privacy, (a,b)-Distributing Privacy and protections against (c,t)-Isolation were analyzed. The author introduced a new solution of the controlled protection of privacy. The solution is based on marking a protected field and multi-key encryption of the sensitive value. The suggested way of fields marking is in accordance to the XML standard. For the encryption (n,p) different key cipher was selected. To decipher the content the p keys of n is used. The proposed solution enables to apply brand new methods for the control of privacy of disclosing sensitive data.

Download Full-text

The issues connected with the anonymization of medical data. Part 2. Advanced anonymization and anonymization controlled by owner of protected sensitive data

HIGHER SCHOOL’S PULSE ◽

10.5604/01.3001.0003.3161 ◽

2014 ◽

Vol 8 (2) ◽

pp. 13-24 ◽

Cited By ~ 1

Author(s):

Arkadiusz Liber

Keyword(s):

Privacy Protection ◽

Differential Privacy ◽

Data Access ◽

Medical Data ◽

Medical Documentation ◽

Research Review ◽

Sensitive Data ◽

Data Anonymization ◽

Data Access Control ◽

Anonymized Data

Introduction: Medical documentation ought to be accessible with the preservation of its integrity as well as the protection of personal data. One of the manners of its protection against disclosure is anonymization. Contemporary methods ensure anonymity without the possibility of sensitive data access control. it seems that the future of sensitive data processing systems belongs to the personalized method. In the first part of the paper k-Anonymity, (X,y)- Anonymity, (α,k)- Anonymity, and (k,e)-Anonymity methods were discussed. these methods belong to well - known elementary methods which are the subject of a significant number of publications. As the source papers to this part, Samarati, Sweeney, wang, wong and zhang’s works were accredited. the selection of these publications is justified by their wider research review work led, for instance, by Fung, Wang, Fu and y. however, it should be noted that the methods of anonymization derive from the methods of statistical databases protection from the 70s of 20th century. Due to the interrelated content and literature references the first and the second part of this article constitute the integral whole.Aim of the study: The analysis of the methods of anonymization, the analysis of the methods of protection of anonymized data, the study of a new security type of privacy enabling device to control disclosing sensitive data by the entity which this data concerns.Material and methods: Analytical methods, algebraic methods.Results: Delivering material supporting the choice and analysis of the ways of anonymization of medical data, developing a new privacy protection solution enabling the control of sensitive data by entities which this data concerns.Conclusions: In the paper the analysis of solutions for data anonymization, to ensure privacy protection in medical data sets, was conducted. the methods of: k-Anonymity, (X,y)- Anonymity, (α,k)- Anonymity, (k,e)-Anonymity, (X,y)-Privacy, lKc-Privacy, l-Diversity, (X,y)-linkability, t-closeness, confidence Bounding and Personalized Privacy were described, explained and analyzed. The analysis of solutions of controlling sensitive data by their owner was also conducted. Apart from the existing methods of the anonymization, the analysis of methods of the protection of anonymized data was included. In particular, the methods of: δ-Presence, e-Differential Privacy, (d,γ)-Privacy, (α,β)-Distributing Privacy and protections against (c,t)-isolation were analyzed. Moreover, the author introduced a new solution of the controlled protection of privacy. the solution is based on marking a protected field and the multi-key encryption of sensitive value. The suggested way of marking the fields is in accordance with Xmlstandard. For the encryption, (n,p) different keys cipher was selected. to decipher the content the p keys of n were used. The proposed solution enables to apply brand new methods to control privacy of disclosing sensitive data.

Download Full-text

Research on Data Privacy Protection of Internet of Vehicles Based on Differential Privacy

IOP Conference Series Earth and Environmental Science ◽

10.1088/1755-1315/428/1/012007 ◽

2020 ◽

Vol 428 ◽

pp. 012007

Author(s):

Xue Luo ◽

Juan Wang ◽

Jing Xu ◽

Mengting Shen

Keyword(s):

Privacy Protection ◽

Data Privacy ◽

Differential Privacy ◽

Internet Of Vehicles ◽

Data Privacy Protection

Download Full-text

Trajectory data privacy protection based on differential privacy mechanism

IOP Conference Series Materials Science and Engineering ◽

10.1088/1757-899x/351/1/012017 ◽

2018 ◽

Vol 351 ◽

pp. 012017 ◽

Cited By ~ 1

Author(s):

Ke Gu ◽

Lihao Yang ◽

Yongzhi Liu ◽

Niandong Liao

Keyword(s):

Privacy Protection ◽

Data Privacy ◽

Differential Privacy ◽

Trajectory Data ◽

Data Privacy Protection

Download Full-text

Medical Data Privacy Protection Act should be Reappraised

Journal of Korean Medical Association ◽

10.5124/jkma.2007.50.3.204 ◽

2007 ◽

Vol 50 (3) ◽

pp. 204

Author(s):

Yang Dong Park

Keyword(s):

Privacy Protection ◽

Data Privacy ◽

Medical Data ◽

Data Privacy Protection

Download Full-text

Secure Anonymous Query-Based Encryption for Data Privacy Preserving in Cloud

Research Anthology on Privatizing and Securing Data ◽

10.4018/978-1-7998-8954-0.ch037 ◽

2021 ◽

pp. 815-839

Author(s):

Martin Konan ◽

Wenyong Wang

Keyword(s):

Privacy Protection ◽

Data Privacy ◽

Keyword Search ◽

Data Encryption ◽

Public Key ◽

Primary Method ◽

Sensitive Data ◽

Security Issues ◽

Data Privacy Protection ◽

Cloud Applications

Data privacy protection is a paramount issue in cloud applications for the last decade. In addition, data encryption, which is the primary method to impart security in clouds, is proved insufficient to guarantee data privacy protection from some security issues like homogeneity and background knowledge attacks. Therefore, it is important to provide a security mechanism that provide not only anonymous data but also anonymous continuous queries. So, this paper proposes a new scheme (Moye) that tackles this challenge by protecting queries to be linked to specific sensitive data. Specifically, the proposed solution is based on the design of a hybrid implementation of public key encryption with keyword search (PEKS) and subset membership encryption (SME) cryptosystem to enhance both data and query privacy protection. In addition, this approach provides an efficient and anonymous data processing by using an optimized k-anonymity scheme. Doing so, the authors protect searchable keywords and queries from inside and outside guessing attacks for the effectiveness of the proposed solution.

Download Full-text

Frequent Symptom Sets Identification from Uncertain Medical Data in Differentially Private Way

Scientific Programming ◽

10.1155/2017/7545347 ◽

2017 ◽

Vol 2017 ◽

pp. 1-10 ◽

Cited By ~ 1

Author(s):

Zhe Ding ◽

Zhen Qin ◽

Zhiguang Qin

Keyword(s):

Data Mining ◽

Data Privacy ◽

Differential Privacy ◽

Uncertain Data ◽

Frequent Itemsets ◽

Medical Data ◽

Patient Data ◽

Background Information ◽

Sensitive Information ◽

Symptom Association Probability

Data mining techniques are applied to identify hidden patterns in large amounts of patient data. These patterns can assist physicians in making more accurate diagnosis. For different physical conditions of patients, the same physiological index corresponds to a different symptom association probability for each patient. Data mining technologies based on certain data cannot be directly applied to these patients’ data. Patient data are sensitive data. An adversary with sufficient background information can make use of the patterns mined from uncertain medical data to obtain the sensitive information of patients. In this paper, a new algorithm is presented to determine the top K most frequent itemsets from uncertain medical data and to protect data privacy. Based on traditional algorithms for mining frequent itemsets from uncertain data, our algorithm applies sparse vector algorithm and the Laplace mechanism to ensure differential privacy for the top K most frequent itemsets for uncertain medical data and the expected supports of these frequent itemsets. We prove that our algorithm can guarantee differential privacy in theory. Moreover, we carry out experiments with four real-world scenario datasets and two synthetic datasets. The experimental results demonstrate the performance of our algorithm.

Download Full-text

Trajectory Data Privacy Protection based on Differential Privacy Mechanism

Informatica ◽

10.31449/inf.v42i3.1638 ◽

2018 ◽

Vol 42 (3) ◽

Cited By ~ 1

Author(s):

Ke Gu ◽

Lihao Yang ◽

Yongzhi Liu ◽

Bo Yin

Keyword(s):

Privacy Protection ◽

Data Privacy ◽

Differential Privacy ◽

Trajectory Data ◽

Data Privacy Protection

Download Full-text

An Effective Data Privacy Protection Algorithm Based on Differential Privacy in Edge Computing

IEEE Access ◽

10.1109/access.2019.2939015 ◽

2019 ◽

Vol 7 ◽

pp. 136203-136213 ◽

Cited By ~ 2

Author(s):

Yi Qiao ◽

Zhaobin Liu ◽

Haoze Lv ◽

Minghui Li ◽

Zhiyi Huang ◽

...

Keyword(s):

Privacy Protection ◽

Data Privacy ◽

Differential Privacy ◽

Edge Computing ◽

Data Privacy Protection

Download Full-text

Secure Anonymous Query-Based Encryption for Data Privacy Preserving in Cloud

International Journal of Information Security and Privacy ◽

10.4018/ijisp.2018100101 ◽

2018 ◽

Vol 12 (4) ◽

pp. 1-23 ◽

Cited By ~ 1

Author(s):

Martin Konan ◽

Wenyong Wang

Keyword(s):

Privacy Protection ◽

Data Privacy ◽

Keyword Search ◽

Data Encryption ◽

Public Key Encryption ◽

Sensitive Data ◽

Security Issues ◽

Query Privacy ◽

Data Privacy Protection ◽

Cloud Applications

Download Full-text

A federated interpretable scorecard and its application in credit scoring

International Journal of Financial Engineering ◽

10.1142/s2424786321420093 ◽

2021 ◽

pp. 2142009

Author(s):

Fanglan Zheng ◽

Erihe ◽

Kun Li ◽

Jiang Tian ◽

Xiaojia Xiang

Keyword(s):

Privacy Protection ◽

Data Privacy ◽

Credit Scoring ◽

Training Session ◽

Model Performance ◽

Parameter Tuning ◽

Kolmogorov Smirnov ◽

Data Privacy Protection ◽

Tuning Process ◽

Positive Coefficients

In this paper, we propose a vertical federated learning (VFL) structure for logistic regression with bounded constraint for the traditional scorecard, namely FL-LRBC. Under the premise of data privacy protection, FL-LRBC enables multiple agencies to jointly obtain an optimized scorecard model in a single training session. It leads to the formation of scorecard model with positive coefficients to guarantee its desirable characteristics (e.g., interpretability and robustness), while the time-consuming parameter-tuning process can be avoided. Moreover, model performance in terms of both AUC and the Kolmogorov–Smirnov (KS) statistics is significantly improved by FL-LRBC, due to the feature enrichment in our algorithm architecture. Currently, FL-LRBC has already been applied to credit business in a China nation-wide financial holdings group.

Download Full-text