scholarly journals Routes for breaching and protecting genetic privacy

2013 ◽  
Author(s):  
Yaniv Erlich ◽  
Arvind Narayanan

We are entering the era of ubiquitous genetic information for research, clinical care, and personal curiosity. Sharing these datasets is vital for rapid progress in understanding the genetic basis of human diseases. However, one growing concern is the ability to protect the genetic privacy of the data originators. Here, we technically map threats to genetic privacy and discuss potential mitigation strategies for privacy-preserving dissemination of genetic data.

eLife ◽  
2020 ◽  
Vol 9 ◽  
Author(s):  
Michael D Edge ◽  
Graham Coop

Direct-to-consumer (DTC) genetics services are increasingly popular, with tens of millions of customers. Several DTC genealogy services allow users to upload genetic data to search for relatives, identified as people with genomes that share identical by state (IBS) regions. Here, we describe methods by which an adversary can learn database genotypes by uploading multiple datasets. For example, an adversary who uploads approximately 900 genomes could recover at least one allele at SNP sites across up to 82% of the genome of a median person of European ancestries. In databases that detect IBS segments using unphased genotypes, approximately 100 falsified uploads can reveal enough genetic information to allow genome-wide genetic imputation. We provide a proof-of-concept demonstration in the GEDmatch database, and we suggest countermeasures that will prevent the exploits we describe.


2019 ◽  
Vol 6 (1) ◽  
pp. 1-36 ◽  
Author(s):  
Ellen Wright Clayton ◽  
Barbara J Evans ◽  
James W Hazel ◽  
Mark A Rothstein

Abstract Recent advances in technology have significantly improved the accuracy of genetic testing and analysis, and substantially reduced its cost, resulting in a dramatic increase in the amount of genetic information generated, analysed, shared, and stored by diverse individuals and entities. Given the diversity of actors and their interests, coupled with the wide variety of ways genetic data are held, it has been difficult to develop broadly applicable legal principles for genetic privacy. This article examines the current landscape of genetic privacy to identify the roles that the law does or should play, with a focus on federal statutes and regulations, including the Health Insurance Portability and Accountability Act (HIPAA) and the Genetic Information Nondiscrimination Act (GINA). After considering the many contexts in which issues of genetic privacy arise, the article concludes that few, if any, applicable legal doctrines or enactments provide adequate protection or meaningful control to individuals over disclosures that may affect them. The article describes why it may be time to shift attention from attempting to control access to genetic information to considering the more challenging question of how these data can be used and under what conditions, explicitly addressing trade-offs between individual and social goods in numerous applications.


2021 ◽  
Vol 21 (1) ◽  
Author(s):  
Yiqing Zhao ◽  
Saravut J. Weroha ◽  
Ellen L. Goode ◽  
Hongfang Liu ◽  
Chen Wang

Abstract Background Next-generation sequencing provides comprehensive information about individuals’ genetic makeup and is commonplace in oncology clinical practice. However, the utility of genetic information in the clinical decision-making process has not been examined extensively from a real-world, data-driven perspective. Through mining real-world data (RWD) from clinical notes, we could extract patients’ genetic information and further associate treatment decisions with genetic information. Methods We proposed a real-world evidence (RWE) study framework that incorporates context-based natural language processing (NLP) methods and data quality examination before final association analysis. The framework was demonstrated in a Foundation-tested women cancer cohort (N = 196). Upon retrieval of patients’ genetic information using NLP system, we assessed the completeness of genetic data captured in unstructured clinical notes according to a genetic data-model. We examined the distribution of different topics regarding BRCA1/2 throughout patients’ treatment process, and then analyzed the association between BRCA1/2 mutation status and the discussion/prescription of targeted therapy. Results We identified seven topics in the clinical context of genetic mentions including: Information, Evaluation, Insurance, Order, Negative, Positive, and Variants of unknown significance. Our rule-based system achieved a precision of 0.87, recall of 0.93 and F-measure of 0.91. Our machine learning system achieved a precision of 0.901, recall of 0.899 and F-measure of 0.9 for four-topic classification and a precision of 0.833, recall of 0.823 and F-measure of 0.82 for seven-topic classification. We found in result-containing sentences, the capture of BRCA1/2 mutation information was 75%, but detailed variant information (e.g. variant types) is largely missing. Using cleaned RWD, significant associations were found between BRCA1/2 positive mutation and targeted therapies. Conclusions In conclusion, we demonstrated a framework to generate RWE using RWD from different clinical sources. Rule-based NLP system achieved the best performance for resolving contextual variability when extracting RWD from unstructured clinical notes. Data quality issues such as incompleteness and discrepancies exist thus manual data cleaning is needed before further analysis can be performed. Finally, we were able to use cleaned RWD to evaluate the real-world utility of genetic information to initiate a prescription of targeted therapy.


2013 ◽  
Vol 41 (S1) ◽  
pp. 65-68 ◽  
Author(s):  
Michelle Huckaby Lewis

Human biological tissue samples are an invaluable resource for biomedical research designed to find causes of diseases and their treatments. Controversy has arisen, however, when research has been conducted with laboratory specimens either without the consent of the source of the specimen or when the research conducted with the specimen has expanded beyond the scope of the original consent agreement. Moreover, disputes have arisen regarding which party, the researcher or the source of the specimen, has control over who may use the specimens and for what purposes. The purposes of this article are: (1) to summarize the most important litigation regarding the use of laboratory specimens, and (2) to demonstrate how legal theory regarding control of laboratory specimens has evolved from arguments based upon property interests in biological samples to claims that the origins of laboratory specimens have privacy interests in their genetic information that should be protected.


2010 ◽  
pp. 91-113
Author(s):  
Juri Monducci

The law pertaining to personal data has developed in Italy over a thirty-year span that took us from recognition of such data in the case law, in 1975, to its statutory protection, in 2003. This evolution would subsequently come to the point of specifically regulating the processing of genetic data as data revealing an individual's genetic makeup, thereby also revealing the biological future of individuals and their offspring: this information describes an individual at a core level where the deepest, most unchangeable traits are found and can therefore nurture what is nowadays referred to as genetic determinism, which reduces the person to a complex of genetic data and so ignores the whole layer of characteristics that make each of us unique. There is, then, a discriminatory risk inherent in the processing of genetic data, and equally clear are the psychological implications of such processing, so much so that the need has arisen to have rules in place aimed at regulating the biotechnologies and genetics in particular. These rules have given birth to the so-called fourthgeneration rights, inclusive of the right to ones genetic identity and the right not to know ones genetics (although this is something that had been discussed earlier, too), and it is to a discussion of these rights that this essay is devoted.


2000 ◽  
Vol 28 (3) ◽  
pp. 245-257 ◽  
Author(s):  
Mark A. Hall ◽  
Stephen S. Rich

Since 1991, over half the states have enacted laws that restrict or prohibit insurers’ use of genetic information in pricing, issuing, or structuring health insurance. Wisconsin was the first state to do so, in 1991, followed by Ohio in 1993, California and Colorado in 1994, and then several more states a year in each of the next five years. Similar legislation has been pending in Congress for several years. Also, a 1996 federal law known as the Health Insurance Portability and Accountability Act (HIPAA) prohibits group health insurers from applying “preexisting condition” exclusions to genetic conditions that are indicated solely by genetic tests and not by any actual symptoms.


2020 ◽  
Author(s):  
Yiqing ZHAO ◽  
Saravut J Weroha ◽  
Ellen Goode ◽  
Hongfang Liu ◽  
Chen Wang

Abstract Background: Next-generation sequencing provides comprehensive information about individuals’ genetic makeup and is commonplace in oncology clinical practice. However, the utility of genetic information in clinical decision-making process has not been examined extensively from a real-world, data-driven perspective. Through mining real-world data (RWD) from clinical notes, we could extract patients’ genetic information and further associate treatment decisions with genetic information.Methods: We proposed a real-world evidence (RWE) study framework that incorporates context-based natural language processing (NLP) methods and data quality examination before final association analysis. The framework was demonstrated on a Foundation-tested women cancer cohort (N=196). Upon retrieval of patients’ genetic information using NLP system, we assessed completeness of genetic data captured in unstructured clinical notes according a genetic data-model. We examined the distribution of different topics regarding BRCA1/2 throughout patients’ treatment process, and then analyzed the association between BRCA1/2 mutation status and the discussion/prescription of targeted therapy. Results: We identified seven topics in clinical context of genetic mentions including: Information, Evaluation, Insurance, Order, Negative, Positive, and Variants of unknown significance (VUS). Our rule-based system achieved a precision of 0.87, recall of 0.93 and F-measure of 0.91. Our machine learning system achieved a precision of 0.901, recall of 0.899 and F-measure of 0.9 for four-topic classification and a precision of 0.833, recall of 0.823 and F-measure of 0.82 for seven-topic classification. We found in result-containing sentences, capture of BRCA1/2 mutation information was 75%, but detailed variant information (e.g. variant types) is largely missing. Using cleaned RWD, significant associations were found between BRCA1/2 positive mutation and targeted therapies.Conclusions: In conclusion, we demonstrated a framework to generate RWE using RWD from different clinical sources. Rule-based NLP system achieved the best performance for resolving contextual variability when extracting RWD from unstructured clinical notes. Data quality issue such as incompleteness and discrepancies exist thus manual data cleaning is needed before further analysis can be performed. Finally, we were able to use cleaned RWD to evaluate real-world utility of genetic information to initiate prescription of targeted therapy.


2012 ◽  
Vol 2012 ◽  
pp. 1-10 ◽  
Author(s):  
Charles McCartan ◽  
Robert Mason ◽  
S. R. Jayasinghe ◽  
Lyn R. Griffiths

Cardiomyopathies represent a group of diseases of the myocardium of the heart and include diseases both primarily of the cardiac muscle and systemic diseases leading to adverse effects on the heart muscle size, shape, and function. Traditionally cardiomyopathies were defined according to phenotypical appearance. Now, as our understanding of the pathophysiology of the different entities classified under each of the different phenotypes improves and our knowledge of the molecular and genetic basis for these entities progresses, the traditional classifications seem oversimplistic and do not reflect current understanding of this myriad of diseases and disease processes. Although our knowledge of the exact basis of many of the disease processes of cardiomyopathies is still in its infancy, it is important to have a classification system that has the ability to incorporate the coming tide of molecular and genetic information. This paper discusses how the traditional classification of cardiomyopathies based on morphology has evolved due to rapid advances in our understanding of the genetic and molecular basis for many of these clinical entities.


Author(s):  
Elsa Supiot ◽  
Margo Bernelin

This chapter analyzes the European Union framing of the protection of genetic privacy in the context of the European Commission's 2012 proposal to amend the 95/46/EC Data Protection Directive. This market-driven proposal, fitting a wider European movement with regard to health-related legal framework, takes into account the challenges to privacy protection brought by rapid technological development. Although the proposal is an attempt to clarify the 1995 Data Protection Directive, including the question of genetic data, it also creates some controversial grey areas, especially concerning the extensive regulatory role to be played by the European Commission. With regard to genetic privacy, this chapter takes the opportunity to develop on this paradox, and gives an analysis of the European design on the matter.


Author(s):  
Dara Hallinan

This chapter evaluates the concept of genetic privacy and its relationship with biobanking. Genetic privacy is simply a sub-concept of privacy referring to states of separation and exclusivity arising in relation to the processing of genetic data. Genetic privacy rights, then, are simply a subset of privacy rights relating to the processing of genetic data. The chapter then proceeds to map the range of genetic privacy rights engaged by the biobanking process along two axes: the transactional axis—genetic privacy rights held by research subjects; and the relational axis—genetic privacy right held by genetic relatives and genetic groups. Subsequently, it moves to map other types of interests engaged by biobanking, including interests related to the research process and third-party non-research interests in accessing biobank substances. Finally, the chapter offers a rough schematic of the relationships, including conflicts and confluences, between identified rights and interests.


Sign in / Sign up

Export Citation Format

Share Document