Linked-read whole-genome sequencing resolves common and private structural variants in multiple myeloma

Multiple myeloma (MM) is an incurable and aggressive plasma cell malignancy characterized by a complex karyotype with multiple structural variants (SVs) and copy number variations (CNVs). Linked-read whole-genome sequencing (lrWGS) allows for refined detection and reconstruction of SVs by providing long-range genetic information from standard short-read sequencing. This makes lrWGS an attractive solution for capturing the full genomic complexity of MM. Here we show that high-quality lrWGS data can be generated from low numbers of FACS-sorted cells without DNA purification. Using this protocol, we analyzed FACS-sorted MM cells from 37 patients with lrWGS. We found high concordance between lrWGS and FISH for the detection of recurrent translocations and CNVs. Outside of the regions investigated by FISH, we identified >150 additional SVs and CNVs across the cohort. Analysis of the lrWGS data resolved the structure of diverse SVs affecting the MYC and t(11;14) loci that cause the duplication of genes and gene regulatory elements. In addition, we identified private SVs causing the dysregulation of genes recurrently involved in translocations with the IGH locus and show that these can alter the molecular classification of the MM. Overall, we conclude that lrWGS detects aberrations critical for MM prognostics and offers a feasible route to comprehensive genetics. Implementing lrWGS could provide more accurate clinical prognostics, facilitate genomic medicine initiatives, and greatly improve the stratification of patients included in clinical trials.

Whole genome sequencing identifies common and rare structural variants contributing to hematologic traits in the NHLBI TOPMed program

10.1101/2021.12.16.21267871 ◽

2021 ◽

Author(s):

Marsha M. Wheeler ◽

Adrienne M Stilp ◽

Shuquan Rao ◽

Bjarni V Halldorsson ◽

Doruk V Beyter ◽

...

Keyword(s):

Blood Cell ◽

Whole Genome Sequencing ◽

Genome Sequencing ◽

Association Studies ◽

Regulatory Elements ◽

Chromatin Domain ◽

Genome Wide Association Studies ◽

Whole Genome ◽

Structural Variants ◽

Genome Wide

Genome-wide association studies (GWAS) have identified thousands of single nucleotide variants and small indels that contribute to the genetic architecture of hematologic traits. While structural variants (SVs) are known to cause rare blood or hematopoietic disorders, the genome-wide contribution of SVs to quantitative blood cell trait variation is unknown. Here we utilized SVs detected from whole genome sequencing (WGS) in ancestrally diverse participants of the NHLBI TOPMed program (N=50,675). Using single variant tests, we assessed the association of common and rare SVs with red cell-, white cell-, and platelet-related quantitative traits. The results show 33 independent SVs (23 common and 10 rare) reaching genome-wide significance. The majority of significant association signals (N=27) replicated in independent datasets from deCODE genetics and the UK Biobank. Moreover, most trait-associated SVs (N=24) are within 1 Mb of previously reported GWAS loci. SV analyses additionally discovered an association between a complex structural variant on 17p11.2 and white blood cell-related phenotypes. Based on functional annotation, the majority of significant SVs are located in non-coding regions (N=26) and are predicted to impact regulatory elements and/or local chromatin domain boundaries in blood cells. We predict that several trait-associated SVs represent the causal variant. This is supported by genome-editing experiments, which provide evidence that a deletion associated with lower monocyte counts leads to disruption of an S1PR3 monocyte enhancer and decreased S1PR3 expression.

Continuing the sequence? Towards an economic evaluation of whole genome sequencing for the diagnosis of rare diseases in Scotland

Journal of Community Genetics ◽

10.1007/s12687-021-00541-4 ◽

2021 ◽

Author(s):

Michael Abbott ◽

Lynda McKenzie ◽

Blanca Viridiana Guizar Moran ◽

Sebastian Heidenreich ◽

Rodolfo Hernández ◽

...

Keyword(s):

Economic Evaluation ◽

Genetic Testing ◽

Whole Genome Sequencing ◽

Genome Sequencing ◽

Rare Diseases ◽

Genomic Medicine ◽

Future Research ◽

Clinical Genetics ◽

Whole Genome ◽

Peace Of Mind

Abstract: Novel developments in genomic medicine may reduce the length of the diagnostic odyssey for patients with rare diseases. Health providers must thus decide whether to offer genome sequencing for the diagnosis of rare conditions in a routine clinical setting. We estimated the costs of singleton standard genetic testing and trio-based whole genome sequencing (WGS), in the context of the Scottish Genomes Partnership (SGP) study. We also explored what users value about genomic sequencing. Insights from the costing and value assessments will inform a subsequent economic evaluation of genomic medicine in Scotland. An average cost of £1,841 per singleton was estimated for the standard genetic testing pathway, with significant variability between phenotypes. WGS cost £6,625 per family trio, but this estimate reflects the use of WGS during the SGP project, and large cost savings may be realised if sequencing were scaled up. Patients and families valued (i) the chance of receiving a diagnosis (and the peace of mind and closure that brings); (ii) the information provided by WGS (including implications for family planning and secondary findings); and (iii) contributions to future research. Our costings will be updated to address limitations of the current study for incorporation in budget impact modelling and cost-effectiveness analysis (cost per diagnostic yield). Our insights into the benefits of WGS will guide the development of a discrete choice experiment valuation study. This will inform a user-perspective cost–benefit analysis of genome-wide sequencing, accounting for the broader non-health outcomes. Taken together, our research will inform the long-term strategic development of NHS Scotland clinical genetics testing services, and will be of benefit to others seeking to undertake similar evaluations in different contexts.

Identifying Aspects of Public Attitudes Toward Whole Genome Sequencing to Inform the Integration of Genomics into Care

Public Health Genomics ◽

10.1159/000515952 ◽

2021 ◽

pp. 1-12

Author(s):

Holly Etchegary ◽

Daryl Pullman ◽

Charlene Simmonds ◽

Zoha Rabie ◽

Proton Rahman

Keyword(s):

Whole Genome Sequencing ◽

Genome Sequencing ◽

Online Survey ◽

Public Attitudes ◽

Genetic Counselor ◽

Genomic Medicine ◽

Genomic Data ◽

Whole Genome ◽

The Public ◽

Genomic Test

Introduction: The growth of global sequencing initiatives and commercial genomic test offerings suggests the public will increasingly be confronted with decisions about sequencing. Understanding public attitudes can assist efforts to integrate sequencing into care and inform the development of public education and outreach strategies. Methods: A 48-item online survey was advertised on Facebook in Eastern Canada and hosted on SurveyMonkey in late 2018. The survey measured public interest in whole genome sequencing and attitudes toward various aspects of sequencing using vignettes, scaled items, and open-ended items. Results: While interest in sequencing was high, critical attitudes were observed. In particular, items measuring features of patient control and choice regarding genomic data were strongly endorsed by respondents. The majority wanted to specify upfront how their data could be used, retain the ability to withdraw their sample at a later date, sign a written consent form, and speak to a genetic counselor prior to sequencing. Concerns about privacy and unauthorized access to data were frequently observed. Education level was the sociodemographic variable most often related to attitude statements, such that those with higher levels of education generally displayed more critical attitudes. Conclusions: Attitudes identified here could be used to inform the development of implementation strategies for genomic medicine. Findings suggest health systems must address patient concerns about privacy, consent practices, and the strong desire to control what happens to their genomic data through public outreach and education. Specific oversight procedures and policies that are clearly communicated to the public will be required.

Abstract 343: Bayesian Selection of Modifier Genes in Hypertrophic Cardiomyopathy Through Whole Genome Sequencing

Circulation Research ◽

10.1161/res.117.suppl_1.343 ◽

2015 ◽

Vol 117 (suppl_1) ◽

Author(s):

Matthew Wheeler ◽

Daryl Waggott ◽

Megan Grove ◽

Frederick Dewey ◽

Cuiping Pan ◽

...

Keyword(s):

Hypertrophic Cardiomyopathy ◽

Whole Genome Sequencing ◽

Genome Sequencing ◽

A Priori ◽

Copy Number Variants ◽

Whole Genome Sequence ◽

Monogenic Disease ◽

Whole Genome ◽

Genetic Modifiers ◽

Structural Variants

Background: Technological advances have greatly reduced the cost of whole genome sequencing. For single individuals, clinical application is apparent, while exome sequencing in tens of thousands of people has allowed a more global view of genetic variation that can inform interpretation of specific variants in individuals. We hypothesized that genome sequencing of patients with monogenic cardiomyopathy would facilitate discovery of genetic modifiers of phenotype. Methods and Results: We identified 48 individuals diagnosed with cardiomyopathy and with putative mutations in MYH7, the gene encoding beta myosin heavy chain. We carried out whole genome sequencing and applied a newly developed analytical pipeline optimized for discovery of genes modifying severity of clinical presentation and outcomes. Using a combination of external priors and rare variant burden tests, we scored genes as potential modifiers. There were 96 genes that reached a modifier score of 6 out of 12 or better (9=2, 8=8, 7=17, 6=69). We identified NCKAP1, a gene that regulates actin filament dynamics, and CAMSAP1, a calmodulin-regulated gene that regulates microtubule dynamics, as top scoring modifiers of hypertrophic cardiomyopathy phenotypes (score=9), while LDB2, RYR2, FBN1 and ATP1A2 had modifier scores of 8. Of the top scoring genes, 21 out of 96 were identified as candidates a priori. Our candidate prioritization scheme identified the previously described modifiers of cardiomyopathy phenotype, FHOD3 and MYBPC3, as top scoring genes. We identified structural variants in 21 clinically sequenced cardiomyopathy-associated genes, 13 of which were at less than 10% frequency. Copy number variants in ILK and CSRP3 were nominally associated with ejection fraction (p=0.03), while 8 genes showed copy gains (GLA, FKTN, SGCD, TTN, SOS1, ANKRD1, VCL and NEBL). Structural variants were found in CSRP3, MYL3 and TNNC1, all of which have been implicated as causative for HCM.
Conclusion: Evaluation of the whole genome sequence, even in the case of putatively monogenic disease, leads to important diagnostic and scientific insights not revealed by panel-based sequencing.
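
As a small consistency check on the score distribution quoted above (9=2, 8=8, 7=17, 6=69), the following sketch tallies the per-score gene counts. It uses only the numbers stated in the abstract; the variable and function names are illustrative and are not taken from the authors' pipeline.

```python
# Number of genes at each modifier score, as quoted in the abstract.
genes_per_score = {9: 2, 8: 8, 7: 17, 6: 69}

# Total genes reaching a score of 6 or better.
total_genes = sum(genes_per_score.values())


def genes_at_or_above(threshold):
    """Count genes whose modifier score is at least `threshold`."""
    return sum(n for score, n in genes_per_score.items() if score >= threshold)


print(total_genes)           # total across scores 6 to 9
print(genes_at_or_above(8))  # genes scoring 8 or 9
```

The total matches the 96 genes reported, and the two genes at score 9 correspond to NCKAP1 and CAMSAP1; the four genes named at score 8 are a subset of the eight reported at that score.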

6 Translating genomics for clinical benefit

Postgraduate Medical Journal ◽

10.1136/postgradmedj-2019-fpm.6 ◽

2019 ◽

Vol 95 (1130) ◽

pp. 686.3-686

Author(s):

Mark Caulfield

Keyword(s):

Whole Genome Sequencing ◽

Genome Sequencing ◽

Genomic Medicine ◽

Health Gain ◽

Annual Review ◽

Whole Genome ◽

Department Of Health ◽

Data Points ◽

Genome Analyses ◽

The UK

The UK 100,000 Genomes Project has focussed on transforming genomic medicine in the National Health Service using whole genome sequencing in rare disease, cancer and infection. Genomics England, partnering with the NHS, established 13 Genomic Medicine Centres, the NHS whole genome sequencing centre and the Genomics England Clinical Interpretation Partnership (3337 researchers from 24 countries). We sequenced the 100,000th genome on 5 December 2018 and completed an initial analysis for participants in July 2019. Alongside these genomes we have assembled a longitudinal life course dataset for research and diagnosis, including 2.6 billion clinical data points, for the 3000-plus researchers to work on to drive up the value of the genomes for direct healthcare. In parallel we have partnered with the NHS to establish one of the world’s most advanced Genomic Medicine Services, where we re-evaluated 300,000 genomic tests and upgraded 25% of tests to newer technologies with an annual review. The Department of Health have announced the ambition to undertake 5 million genome analyses over the next 5 years, focused on new areas tractable to health gain.

Whole-genome sequencing analysis of structural variants in oesophageal adenocarcinoma

The Lancet ◽

10.1016/s0140-6736(17)30430-0 ◽

2017 ◽

Vol 389 ◽

pp. S34

Author(s):

Gianmarco Contino ◽

Maria Secrier ◽

Paul A W Edward ◽

Rebecca Fitzgerald

Keyword(s):

Whole Genome Sequencing ◽

Genome Sequencing ◽

Oesophageal Adenocarcinoma ◽

Whole Genome ◽

Sequencing Analysis ◽

Structural Variants

Discovery of Novel Recurrent Mutations in Childhood Early T-Cell Precursor Acute Lymphoblastic Leukemia by Whole Genome Sequencing - a Report From the St Jude Children's Research Hospital - Washington University Pediatric Cancer Genome Project

Blood ◽

10.1182/blood.v118.21.68.68 ◽

2011 ◽

Vol 118 (21) ◽

pp. 68-68

Author(s):

Jinghui Zhang ◽

Li Ding ◽

Linda Holmfeldt ◽

Gang Wu ◽

Susan L. Heatley ◽

...

Keyword(s):

Gene Expression ◽

Acute Lymphoblastic Leukemia ◽

T Cell ◽

Whole Genome Sequencing ◽

Genome Sequencing ◽

Lymphoblastic Leukemia ◽

Whole Genome ◽

Structural Variants ◽

Cell Precursor ◽

Recurrent Mutations

Abstract 68: Early T-cell precursor acute lymphoblastic leukemia (ETP ALL) is characterized by an immature T-lineage immunophenotype (cCD3+, CD1a-, CD8- and CD5dim), aberrant expression of myeloid and stem cell markers, a distinct gene expression profile and very poor outcome. The underlying genetic basis of this form of leukemia is unknown. Here we report results of whole genome sequencing (WGS) of tumor and normal DNA from 12 children with ETP ALL. Genomes were sequenced to 30-fold haploid coverage using the Illumina GAIIx platform, and all putative somatic sequence and structural variants were validated. The frequency of mutations in 43 genes was assessed in a recurrence cohort of 52 ETP and 42 non-ETP T-ALL samples from patients enrolled in St Jude, Children's Oncology Group and AIEOP trials. Transcriptomic resequencing was performed for two WGS cases, and whole exome sequencing for three ETP ALL cases in the recurrence cohort. We identified 44 interchromosomal translocations (mean 4 per patient, range 0–12), 32 intrachromosomal translocations (mean 3, 0–7), 53 deletions (mean 4, 0–10) and 16 insertions (mean 1, 0–5). Three cases exhibited a pattern of complex rearrangements suggestive of a single cellular catastrophe (“chromothripsis”), two of which had mutations targeting mismatch and DNA repair (MLH3 and DCLRE1C). While no single chromosomal alteration was present in all cases, 10 of 12 ETP ALLs harbored chromosomal rearrangements, several of which involved complex multichromosomal translocations and resulted in the expression of chimeric in-frame novel fusion genes disrupting hematopoietic regulators, including ETV6-INO80D, NAP1L1-MLLT10, RUNX1-EVX1 and NUP214-SQSTM1, each occurring in a single case. An additional ETP case with the ETV6-INO80D fusion was identified in the recurrence cohort.
Additionally, 51% of structural variants had breakpoints in genes, including those with roles in hematopoiesis and leukemogenesis, and genes also targeted by mutation in other cases (MLH3, SUZ12, RUNX1). We identified a high frequency of activating mutations in genes regulating cytokine receptor and Ras signalling in ETP ALL (67.2% of ETP compared to 19% of non-ETP T-ALL) including NRAS (17%), FLT3 (14%), JAK3 (9%), SH2B3 (or LNK; 9%), IL7R (8%), JAK1 (8%), KRAS (3%), and BRAF (2%). Seven cases (5 ETP, 2 non-ETP) harbored in-frame insertion mutations in the transmembrane domain of IL7R, which were transforming when expressed in murine cell lines and resulted in enhanced colony formation when expressed in primary murine hematopoietic cells. The IL7R mutations resulted in constitutive Jak-Stat activation in these cell lines and in primary leukemic cells expressing these mutations. Fifty-eight percent of ETP cases (compared to 17% of non-ETP cases) harbored mutations known or predicted to disrupt hematopoietic and lymphoid development, including ETV6 (33%), RUNX1 (16%), IKZF1 (14%), GATA3 (10%), EP300 (5%) and GATA2 (2%). GATA3 regulates early T cell development, and mutations in this gene were observed exclusively in ETP ALL. The mutations were commonly biallelic, and were clustered at R276, a residue critical for binding of GATA3 to DNA. Strikingly, mutations disrupting chromatin modifying genes were also highly enriched in ETP ALL. Genes encoding the polycomb repressor complex 2 (EZH2, SUZ12 and EED), which mediates histone 3 lysine 27 (H3K27) trimethylation, were deleted or mutated in 42% of ETP ALL compared to 12% of non-ETP T-ALL. In addition, alterations of the H3K36 trimethylase SETD2 were observed in 5 ETP cases, but not in non-ETP ALL. We also identified recurrent mutations in genes that have not previously been implicated in hematopoietic malignancies including RELN, DNM2, ECT2L, HNRNPA1 and HNRNPR.
Using gene set enrichment analysis, we demonstrate that the gene expression profile of ETP ALL shares features not only with normal human hematopoietic stem cells, but also with leukemic initiating cells (LIC) purified from patients with acute myeloid leukemia (AML). These results indicate that mutations that drive proliferation, impair differentiation and disrupt histone modification cooperate to induce an aggressive leukemia with an aberrant immature phenotype. The similarity of the gene expression pattern with that observed in the LIC of AML raises the possibility that myeloid-directed therapies might improve the outcome of ETP ALL. Disclosures: Evans: St. Jude Children's Research Hospital: Employment, Patents & Royalties; NIH & NCI: Research Funding; Aldagen: Membership on an entity's Board of Directors or advisory committees.
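
The per-patient means quoted in this abstract can be cross-checked against the structural variant totals. The sketch below assumes the means were taken over the 12 whole-genome-sequenced ETP ALL cases, which the abstract implies but does not state outright; all names in the code are illustrative.

```python
# Assumption: means are computed over the 12 WGS cases described in the abstract.
N_PATIENTS = 12

# Structural variant totals as quoted in the abstract.
sv_totals = {
    "interchromosomal translocations": 44,
    "intrachromosomal translocations": 32,
    "deletions": 53,
    "insertions": 16,
}

# Rounded per-patient means as quoted in the abstract.
reported_means = {
    "interchromosomal translocations": 4,
    "intrachromosomal translocations": 3,
    "deletions": 4,
    "insertions": 1,
}


def mean_per_patient(total, n_patients=N_PATIENTS):
    """Average number of events per sequenced case."""
    return total / n_patients


for kind, total in sv_totals.items():
    mean = mean_per_patient(total)
    print(f"{kind}: {mean:.2f} per patient (reported as {reported_means[kind]})")
```

Under that assumption, each computed mean rounds to the value given in the abstract.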

Identification of Pathogenic Structural Variants in Rare Disease Patients through Genome Sequencing

10.1101/627661 ◽

2019 ◽

Cited By ~ 2

Author(s):

James M. Holt ◽

Camille L. Birch ◽

Donna M. Brown ◽

Manavalan Gajapathy ◽

Nadiya Sosonkina ◽

...

Keyword(s):

Whole Genome Sequencing ◽

Rare Disease ◽

Genome Sequencing ◽

Genomic Analysis ◽

Whole Genome ◽

Structural Variants ◽

Single Nucleotide Variants ◽

Standard Clinical Practice ◽

Wide Range ◽

Genetic Features

Purpose: Clinical whole genome sequencing is becoming more common for determining the molecular diagnosis of rare disease. However, standard clinical practice often focuses on small variants such as single nucleotide variants and small insertions/deletions. This leaves a wide range of larger “structural variants” that are not commonly analyzed in patients. Methods: We developed a pipeline for processing structural variants for patients who received whole genome sequencing through the Undiagnosed Diseases Network (UDN). This pipeline called structural variants, stored them in an internal database, and filtered the variants based on internal frequencies and external annotations. The remaining variants were manually inspected and then interesting findings were reported as research variants to clinical sites in the UDN. Results: Of 477 analyzed UDN cases, 286 cases (≈ 60%) received at least one structural variant as a research finding. The variants in 16 cases (≈ 4%) are considered “Certain” or “Highly likely” molecularly diagnosed and another 4 cases are currently in review. Of those 20 cases, at least 13 were identified originally through our pipeline with one finding leading to identification of a new disease. As part of this paper, we have also released the collection of variant calls identified in our cohort along with heterozygous and homozygous call counts. This data is available at https://github.com/HudsonAlpha/UDN_SV_export. Conclusion: Structural variants are key genetic features that should be analyzed during routine clinical genomic analysis. For our UDN patients, structural variants helped solve ≈ 4% of the total number of cases (≈ 13% of all genome sequencing solves), a success rate we expect to improve with better tools and greater understanding of the human genome.

StrVCTVRE: A supervised learning method to predict the pathogenicity of human structural variants

10.1101/2020.05.15.097048 ◽

2020 ◽

Author(s):

Andrew G. Sharo ◽

Zhiqiang Hu ◽

Steven E. Brenner

Keyword(s):

Whole Genome Sequencing ◽

Genome Sequencing ◽

Diagnostic Methods ◽

Training Dataset ◽

Disease Genes ◽

Whole Genome ◽

Structural Variants ◽

Coding Region ◽

Diagnostic Potential ◽

Long Read

Abstract: Whole genome sequencing resolves clinical cases where standard diagnostic methods have failed. However, preliminary studies show that at least half of these cases still remain unresolved, even after whole genome sequencing. Structural variants (genomic variants larger than 50 base pairs) of uncertain significance may be the genetic cause of a portion of these unresolved cases. Historically, structural variants (SVs) have been difficult to detect with confidence from short-read sequencing. As both detection algorithms and long-read/linked-read sequencing methods become more accessible, clinical researchers will have access to thousands of reliable SVs of unknown disease relevance. Filtering these SVs by overlap with cataloged SVs is an imperfect solution. Innovative methods to predict the pathogenicity of these SVs will be needed to realize the full diagnostic potential of long-read sequencing. To address this emerging need, we developed StrVCTVRE (Structural Variant Classifier Trained on Variants Rare and Exonic), a classifier that can be used to distinguish pathogenic SVs from benign SVs that overlap exons. We made use of features that capture gene importance, coding region, conservation, expression, and exon structure in a random forest classifier. We found that some features, such as expression and conservation, are important but are absent from SV classification guidelines. Although databases of SVs reflect size biases from sequencing techniques, we leveraged multiple databases to construct a size-matched training set of rare, putatively benign and pathogenic SVs. In independent test sets, we found our method performs accurately across a wide SV size range, which will allow clinical researchers to eliminate nearly 60% of SVs from consideration at an elevated sensitivity of 90%. However, our method and its assessment are still constrained by a small training dataset and acquisition bias in databases of pathogenic variants.
StrVCTVRE fills an empty niche in the clinical evaluation of SVs of unknown significance. We anticipate researchers will use it to prioritize SVs in patients where no variant is immediately compelling, empowering deeper investigation into novel SVs and disease genes to resolve cases.
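
The idea of eliminating a fraction of SVs "at a sensitivity of 90%" can be illustrated with a generic score-thresholding sketch. This is not StrVCTVRE's code or API: the function names are hypothetical, the scores and labels are made-up toy values, and the only assumption carried over from the abstract is that a classifier assigns each SV a pathogenicity score where higher means more likely pathogenic.

```python
def cutoff_for_sensitivity(scores, labels, target=0.90):
    """Return the highest score cutoff that retains at least `target`
    of the known pathogenic SVs (label 1). SVs scoring >= cutoff are kept."""
    pathogenic = sorted((s for s, y in zip(scores, labels) if y == 1), reverse=True)
    if not pathogenic:
        raise ValueError("need at least one pathogenic example")
    n = len(pathogenic)
    # Smallest number of pathogenic SVs that must be kept to reach the target.
    needed = next(k for k in range(1, n + 1) if k / n >= target)
    return pathogenic[needed - 1]


def fraction_eliminated(scores, cutoff):
    """Fraction of all SVs scoring below the cutoff (removed from consideration)."""
    return sum(s < cutoff for s in scores) / len(scores)


# Toy data (hypothetical): 10 pathogenic SVs followed by 10 benign SVs.
scores = [0.95, 0.9, 0.88, 0.85, 0.8, 0.75, 0.7, 0.65, 0.6, 0.2,
          0.7, 0.55, 0.5, 0.45, 0.4, 0.35, 0.3, 0.25, 0.15, 0.1]
labels = [1] * 10 + [0] * 10

cutoff = cutoff_for_sensitivity(scores, labels, target=0.90)
print(cutoff, fraction_eliminated(scores, cutoff))
```

On this toy set the cutoff keeps nine of the ten pathogenic SVs while removing half of all SVs; the figure of nearly 60% in the abstract comes from the authors' own test sets, not from this example.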

Whole-genome sequencing reveals progressive versus stable myeloma precursor conditions as two distinct entities

Nature Communications ◽

10.1038/s41467-021-22140-0 ◽

2021 ◽

Vol 12 (1) ◽

Cited By ~ 1

Author(s):

Bénedith Oben ◽

Guy Froyen ◽

Kylee H. Maclachlan ◽

Daniel Leongamornlert ◽

Federico Abascal ◽

...

Keyword(s):

Multiple Myeloma ◽

Whole Genome Sequencing ◽

Genome Sequencing ◽

Genome Sequence ◽

Monoclonal Gammopathy ◽

Whole Genome Sequence ◽

Whole Genome ◽

Driver Genes ◽

Smoldering Myeloma ◽

Undetermined Significance

Abstract: Multiple myeloma (MM) is consistently preceded by precursor conditions recognized clinically as monoclonal gammopathy of undetermined significance (MGUS) or smoldering myeloma (SMM). We interrogate the whole genome sequence (WGS) profile of 18 MGUS and compare them with those from 14 SMMs and 80 MMs. We show that cases with a non-progressing, clinically stable myeloma precursor condition (n = 15) are characterized by later initiation in the patient’s life and by the absence of myeloma-defining genomic events including chromothripsis, templated insertions, mutations in driver genes, aneuploidy, and canonical APOBEC mutational activity. This data provides evidence that WGS can be used to recognize two biologically and clinically distinct myeloma precursor entities that are either progressive or stable.
