High Throughput Transcriptome Sequencing of Pediatric Relapsed Acute Lymphoblastic Leukemia (ALL) Identifies Relapse Specific Mutations and Expression

Blood ◽  
2010 ◽  
Vol 116 (21) ◽  
pp. 3233-3233
Author(s):  
Julia A. Meyer ◽  
Laura E. Hogan ◽  
Jinhua Wang ◽  
Jun J. Yang ◽  
Jay Patel ◽  
...  

Abstract Abstract 3233 Introduction: Relapsed ALL carries a very poor prognosis despite intensive therapy, indicating the need for new insights into disease mechanisms. We have previously used gene expression profiling (Hogan et al. ASH 2009) and copy number analysis (Yang et al. Blood 2008) in paired diagnosis and relapsed ALL samples to better understand the biologic mechanisms leading to recurrent disease. To create an integrated genomic profile of ALL, we have now focused on high throughput RNA sequencing to detect changes in the transcriptome from diagnosis to relapse. Patients/Methods: To date we have sequenced 6 matched diagnosis/relapse pairs (i.e. 12 marrow samples) from B-precursor ALL patients enrolled on Children's Oncology Group (COG) P9906 and AALL0232 trials. RNA libraries were prepared from poly-A selected RNA and sequenced using 54 base pair single end reads using the Illumina Genome Analyzer IIx. Each sample was sequenced in at least 7 lanes, generating an average of 100 million reads per sample. BWA (v0.5.8) was used to align the reads to the human genome, producing an average of 53 million mapped reads. Samtools (v0.1.8) was then used to predict genetic variants across the genome, filtering out variants with a low mapping quality (<Q20), sub-optimal alignment (X:1>0), low coverage (<8X), or overlap with known single nucleotide polymorphisms (SNPs) from dbSNP (r131) or the 1000 Genomes Project. Results: We observed a total of 119,000 genetic variants across all samples, with comparable overall mutational burden at relapse and diagnosis. To identify candidate lesions that may indicate a selection for common chemoresistance pathways, we focused our analysis on relapse-enriched, non-synonymous variants. 8,486 non-synonymous variants (insertions/deletions and single nucleotide variants [SNV]) were identified that occurred more often at relapse compared to diagnosis. Our analysis was focused on relapse-enriched SNVs that coded for non-synonymous changes, of which 154 were prioritized for validation. Validation was completed using matched genomic DNA samples and PCR products were directly sequenced. Mutation calls were made by manual review of tracings using the Mutation Surveyor program from Softgenetics. Thirty-three percent of predicted SNV loci were validated, but upon further sequencing of matched germline samples, five relapse specific mutations were confirmed. Mutations in COBRA1, FAM120A, RGS12, SND2, and SMEK2 were found in individual patient relapse samples. Validation is currently ongoing to confirm additional SNVs and an expanded validation of mutations will be completed in an additional 66 matched diagnosis/relapse pairs from COG 9906 and AALL 0232 and 0331 studies. Relapse specific isoforms identifying alternative exon usage was also detected in 15 genes, all of which were shared amongst multiple patients. In addition, a significant increase (p=6.7×10−6) was observed in the number of poly-adenylation sites in the genes of the relapse samples. Conclusions: While, isoform specific expression was shared amongst patients at relapse, all relapse specific mutations were private and our data to date indicate that a diversity of mechanisms contribute to relapsed disease. Further sequencing analysis of our expanded cohort of samples will determine the mutation and isoform expression prevalence, as well as the functional significance and the potential therapeutic relevance. Disclosures: No relevant conflicts of interest to declare.

2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Gavin W. Wilson ◽  
Mathieu Derouet ◽  
Gail E. Darling ◽  
Jonathan C. Yeung

AbstractIdentifying single nucleotide variants has become common practice for droplet-based single-cell RNA-seq experiments; however, presently, a pipeline does not exist to maximize variant calling accuracy. Furthermore, molecular duplicates generated in these experiments have not been utilized to optimally detect variant co-expression. Herein, we introduce scSNV designed from the ground up to “collapse” molecular duplicates and accurately identify variants and their co-expression. We demonstrate that scSNV is fast, with a reduced false-positive variant call rate, and enables the co-detection of genetic variants and A>G RNA edits across twenty-two samples.


2018 ◽  
Author(s):  
Maxime Garcia ◽  
Szilveszter Juhos ◽  
Malin Larsson ◽  
Pall I. Olason ◽  
Marcel Martin ◽  
...  

AbstractSummaryWhole-genome sequencing (WGS) is a cornerstone of precision medicine, but portable and reproducible open-source workflows for WGS analyses of germline and somatic variants are lacking. We present Sarek, a modular, comprehensive, and easy-to-install workflow, combining a range of software for the identification and annotation of single-nucleotide variants (SNVs), insertion and deletion variants (indels), structural variants, tumor sample heterogeneity, and karyotyping from germline or paired tumor/normal samples. Sarek is implemented in a bioinformatics workflow language (Nextflow) with Docker and Singularity compatible containers, ensuring easy deployment and full reproducibility at any Linux based compute cluster or cloud computing environment. Sarek supports the human reference genomes GRCh37 and GRCh38, and can readily be used both as a core production workflow at sequencing facilities and as a powerful stand-alone tool for individual research groups.AvailabilitySource code and instructions for local installation are available at GitHub (https://github.com/SciLifeLab/Sarek) under the MIT open-source license, and we invite the research community to contribute additional functionality as a collaborative open-source development project.


2019 ◽  
Vol 128 (6) ◽  
pp. 556-562 ◽  
Author(s):  
Huang Xin ◽  
Wang Changchen ◽  
Liu Lei ◽  
Yang Meirong ◽  
Zhang Ye ◽  
...  

Objective: Microtia is a congenital malformation of the external ear. Great progress about the genetic of microtia has been made in recent years. This article was to prioritize the potential candidate pathogenic genes of microtia based on existing studies and reports, with the purpose of narrowing the range of following study scientifically and quickly. Method: A computational tool called Phenolyzer (phenotype-based gene analyzer) was used to prioritize microtia genes. Microtia, as a query term, was input in the interface of Phenolyzer. After several steps, including disease match, gene query, gene score system, seed gene growth, and gene ranking, the final results about genetic information of microtia were provided. Then we tracked details of the top 10 genes ranked by Phenolyzer on the basis of previous reports. Results: We detected 10 348 genes associated with microtia or related syndromes, and 78 genes of those genes belonged to seed genes. Every gene was given a score, and the gene with higher scores was more likely influence microtia. The top 10 ranked genes included HOXA2, CHD7, CDT1, ORC1, ORC4, ORC6, CDC6, MED12, TWIST1, and GLI3. Otherwise, four gene-gene interactions were displayed. Conclusion: This article prioritized candidate genes of microtia for the first time. High-throughput methods provide tens of thousands of single-nucleotide variants, indels, and structural variants, and only a handful are relevant to microtia or associated syndromes. Combine the ranked potential pathogenic genes list from Phenolyzer with the results of samples provided by high-throughput methods, and more precise research directions are presented.


Blood ◽  
2015 ◽  
Vol 125 (26) ◽  
pp. 3977-3987 ◽  
Author(s):  
Stephen P. Hunger ◽  
Charles G. Mullighan

Abstract Acute lymphoblastic leukemia (ALL) is the commonest childhood tumor and remains a leading cause of cancer death in the young. In the last decade, microarray and sequencing analysis of large ALL cohorts has revolutionized our understanding of the genetic basis of this disease. These studies have identified new ALL subtypes, each characterized by constellations of structural and sequence alterations that perturb key cellular pathways, including lymphoid development, cell-cycle regulation, and tumor suppression; cytokine receptor, kinase, and Ras signaling; and chromatin modifications. Several of these pathways, particularly kinase-activating lesions and epigenetic alterations, are logical targets for new precision medicine therapies. Genomic profiling has also identified important interactions between inherited genetic variants that influence the risk of leukemia development and the somatic genetic alterations that are required to establish the leukemic clone. Moreover, sequential sequencing studies at diagnosis, remission, and relapse have provided important insights into the relationship among genetic variants, clonal heterogeneity, and the risk of relapse. Ongoing studies are extending our understanding of coding and noncoding genetic alterations in B-progenitor and T-lineage ALL and using these insights to inform the development of faithful experimental models to test the efficacy of new treatment approaches.


Blood ◽  
2011 ◽  
Vol 118 (21) ◽  
pp. 872-872 ◽  
Author(s):  
Jan Stary ◽  
Martin Zimmermann ◽  
Myriam Campbell ◽  
Luis Castillo ◽  
Eduardo Dibar ◽  
...  

Abstract Abstract 872 Introduction: ALL IC-BFM 2002 is one of the most successful projects developed by the I-BFM-SG, which is one of the world largest societies involving national leukemia groups. The trial evaluated in a randomized manner the impact on outcome of intensified late reinduction in the context of a newly developed risk stratification. The main goal of the trial was improvement of outcome of children with ALL in each of the 3 risk groups (RG). Patients & methods: From Nov 2002-Nov 2007, 5060 eligible pts aged 0–18 yrs with newly diagnosed ALL from Argentina (1270), Chile (558), Croatia (122), Cuba (151), Czech Republic (291), Hong Kong (155), Hungary (259), Israel (292), Poland (908), Serbia (266), Slovakia (137), Slovenia (36), Ukraine (421), Uruguay (96), and Moscow (98) were enrolled in the trial (http:clinicaltrials.gov “NCT00764907”). Stratification into 3 RGs was based on early treatment response (evaluated in PB on day 8 and in BM on days 15 and 33), age, initial WBC, and presence/absence of BCR/ABL or MLL/AF4. Standard Risk (SR) criteria were: < 1,000 blasts/μL in PB day 8 after 7 days of oral prednisone with 1 intrathecal methotrexate (IT-MTX) and age ≥ 1 yr and < 6 yr and initial WBC < 20,000/μL and M1 or M2 marrow on day 15 and M1 marrow on day 33 (all criteria must be fulfilled). Intermediate Risk (IR) criteria were: < 1,000 blasts/μL in PB day 8 and age < 1 yr or ≥ 6 yr and/or WBC ≥ 20,000/μL and M1 or M2 marrow on day 15 and M1 marrow on day 33 (or SR criteria but M3 marrow on day 15 and M1 marrow on day 33). High Risk (HR) criteria were: IR and M3 marrow on day 15, PB on day 8 ≥ 1,000 blasts/μL, M2 or M3 marrow on day 33, translocation t(9;22) [BCR/ABL] or t(4;11) [MLL/AF4] (at least one criterion must be fulfilled). The majority of infants < 1 yr were treated in studies Interfant 99 and Interfant 06. Treatment consisted of protocol I‘/I, SR/IR consolidation with 6-MP and MD MTX 2g/m2 × 4 (with additional IT-MTX in maintenance) for BCP-ALL, 6-MP and HD MTX 5g/m2 × 4 for T-ALL, HR consolidation with 3 HR polychemotherapy blocks. A randomized question was asked in late intensification: SR: would 2 shorter elements (protocol III × 2) be more effective than 1 longer (protocol II × 1) even though the cumulative dose of most drugs is not increased? IR: could the risk of failure be reduced by a third reintensification element (protocol III × 3)? HR: could the use of 3 reintensification elements (protocol III × 3) achieve the same or better results than the HR approach applied in BFM (3 HR blocks + protocol II × 1) or AIEOP (protocol II × 2)? Chemotherapy was concluded by maintenance therapy (6-MP/MTX), making up a total of 2 yrs overall treatment. Prophylactic CNS radiotherapy 12 Gy was applied in T-ALL and HR pts. Results: With median follow-up 4.9 yr, the 5-yr EFS was 74 ± 1% and 5-yr OS 82 ± 1% for the whole group of 5060 pts. The CR rate was 97%, 255 (5%) children died in remission. The 5-yr EFS/OS was 81 ± 1%/90 ± 1% in 1564 SR pts (30.9% of all pts), 75 ± 1%/83 ± 1% in 2650 IR pts (52.4%) and 55 ± 2%/62 ± 2% in 846 HR pts (16.7%). Randomization rate was 79% of those patients who survived for at least 20 weeks (planned timepoint of randomization). None of the experimental arms achieved significantly better EFS compared to standard treatment. CI of relapses at 5 yr was 18 ± 1% overall, CI for isolated BM relapse was 12 ± 1%, isolated and combined CNS relapse 4 ± 0.3%, isolated and combined testicular relapse 2 ± 0.2%. Secondary malignancy was diagnosed so far in 26 patients (5-yr CI 0.6 ± 0.1%). Significantly better EFS was achieved in BCP-ALL(75 ± 1%) in comparison with T-ALL (69 ± 1%), girls vs. boys (76 ± 1% vs 72 ± 1%), children aged < 10 yr vs ≥ 10 yr (77 ± 1% vs 65 ± 1%), M1/M2 BM D15 vs. M3(76 ± 1% vs 50 ± 3%). 140 pts with Ph+ALL achieved a EFS of 47 ± 4% Allogeneic HSCT in CR1 was done in 139 pts with 5 -yr DFS of 64 ± 4%. Conclusions: Although the experimental arm was no better than the traditional one across individual RGs, the majority of participating countries, many of them were new-comers to this intensive therapy, improved their treatment results against previous national studies. The trial confirmed the validity and feasibility of a simple risk stratification of ALL applied in a complex and heterogeneous multinational environment. Despite the great differences between individual countries, the trial set a firm stage for willing national leukemia groups to run collaborative clinical trials in ALL under the umbrella of I-BFM-SG. Supported by MSM0021620813 and MZ0FNM2005. Disclosures: No relevant conflicts of interest to declare.


Blood ◽  
2013 ◽  
Vol 122 (21) ◽  
pp. 3422-3422
Author(s):  
Melinda M Dean ◽  
Katrina Kildey ◽  
Thu V Tran ◽  
Kelly Rooks ◽  
Shoma Baidya ◽  
...  

Abstract Introduction During routine storage packed red blood cells (PRBC) undergo biochemical and biophysical changes collectively referred to as the “RBC storage lesion”. Donor-to-donor variability in the severity of the storage lesion has been reported. The extent to which donor-associated differences in blood component storage affect blood product quality and post-transfusion outcome remains unknown. Murine models with single nucleotide variants (SNV) in gene encoding spectrin-1β were used to investigate the impact of mutations on the RBC storage lesion. Methods Two murine lineages with N-ethyl-N-nitrosourea (ENU) generated single SNV in Spnb1, encoding spectrin-1β (Table 1), were selected from the Australian Phenomics Facility library (http://databases.apf.edu.au/mutations). Using genetic selection, homozygous (HOM), heterozygous (HET) and unaffected (WT) mice from each strain were generated (C57BL/6 background strain). Murine blood was leucoreduced, prepared in SAGM (0.4 HCT) and stored at 4°C for time course assessment of RBC characteristics. At day (D), D2, D7, D14 and D21 of storage, RBC integrity and evidence of storage-related changes were investigated using RBC osmotic fragility and flow cytometric analysis of CD44, CD47, TER119 and phosphatidylserine (PS). Data were generated from analysis of blood from Spnb1 (pedigree spectrin-1β a) homozygous (HOM, n=3), heterozygous (HET, n=3) and unaffected (WT, n=2 ); Spnb1 (pedigree spectrin-1β b) HOM (n=4), HET( n=4); C57BL/6 (n=4). The Mann-Whitney Test and ANOVA were utilised for statistical analyses (95% CI). Results At D2 of storage SNV in Spnb1 did not alter RBC characteristics, with all mice studied demonstrating a similar resistance to osmotic lysis and levels of CD44, CD47, TER119 and PS. By D7 of storage, clear pedigree-related differences in RBC characteristics were evident. At D7, RBC from spectrin-1β(a) HOM mice had significantly increased osmotic fragility and exposure of PS as well as significantly reduced CD44 and TER119 expression compared to unaffected siblings and background strain. Of note, these changes were not evident in the spectrin-1β(b) HOM mice at D7. For both strains at D7, heterozygous SNV did not exhibit altered storage parameters. By D14 both HOM and HET spectrin-1β(a) mice demonstrated a phenotype consistent with an exacerbated RBC storage lesion, characterised by significantly increased osmotic fragility and exposure of PS, and reduced CD44 and CD47 compared to background strain. At D14 there was also evidence of exacerbation of the storage lesion in stored RBC from HOM spectrin-1β(b) mice (significantly increased PS), though this was not to the extent observed in the spectrin-1β(a) mice. By D21 all murine RBC were substantially degraded under these storage conditions. Conclusions SNV in Spnb1,encoding RBC structural protein spectrin-1β, resulted in both early onset and exacerbation of the RBC storage lesion. Further, the degree of storage lesion and the point at which RBC degradation was observed was not only dependent on the homozygous or heterozygous status, but the mutation itself. These data demonstrate that minor genetic variation in genes encoding important RBC proteins contribute to donor related differences in PRBC storage. Disclosures: No relevant conflicts of interest to declare.


Blood ◽  
2013 ◽  
Vol 122 (21) ◽  
pp. 228-228
Author(s):  
Joachim Kunz ◽  
Tobias Rausch ◽  
Obul R Bandapalli ◽  
Martina U. Muckenthaler ◽  
Adrian M Stuetz ◽  
...  

Abstract Acute precursor T-lymphoblastic leukemia (T-ALL) remains a serious challenge in pediatric oncology, because relapses carry a particularly poor prognosis with high rates of induction failure and death despite generally excellent treatment responses of the initial disease. It is critical, therefore, to understand the molecular evolution of pediatric T-ALL and to elucidate the mechanisms leading to T-ALL relapse and to understand the differences in treatment response between the two phases of the disease. We have thus subjected DNA from bone marrow samples obtained at the time of initial diagnosis, remission and relapse of 14 patients to whole exome sequencing (WES). Eleven patients suffered from early relapse (duration of remission 6-19 months) and 3 patients from late relapse (duration of remission 29-46 months).The Agilent SureSelect Target Enrichment Kit was used to capture human exons for deep sequencing. The captured fragments were sequenced as 100 bp paired reads using an Illumina HiSeq2000 sequencing instrument. All sequenced DNA reads were preprocessed using Trimmomatic (Lohse et al., Nucl. Acids Res., 2012) to clip adapter contaminations and to trim reads for low quality bases. The remaining reads greater than 36bp were mapped to build hg19 of the human reference genome with Stampy (Lunter & Goodson, Genome Res. 2011), using default parameters. Following such preprocessing, the number of mapped reads was >95% for all samples. Single-nucleotide variants (SNVs) were called using SAMtools mpileup (Li et al., Bioinformatics, 2009). The number of exonic SNVs varied between 23,741 and 31,418 per sample. To facilitate a fast classification and identification of candidate driver mutations, all identified coding SNVs were comprehensively annotated using the ANNOVAR framework (Wang et al., Nat. Rev. Genet., 2010). To identify possible somatic driver mutations, candidate SNVs were filtered for non-synonymous, stopgain or stoploss SNVs, requiring an SNV quality greater or equal to 50, and requiring absence of segmental duplications. Leukemia-specific mutations were identified by filtering against the corresponding remission sample and validated by Sanger sequencing of the genomic DNA following PCR amplification. We identified on average 9.3 somatic single nucleotide variants (SNV) and 0.6 insertions and deletions (indels) per patient sample at the time of initial diagnosis and 21.7 SNVs and 0.3 indels in relapse. On average, 6.3 SNVs were detected both at the time of initial diagnosis and in relapse. These SNVs were thus defined as leukemia specific. Further to SNVs, we have also estimated the frequency of copy number variations (CNV) at low resolution. Apart from the deletions resulting from T-cell receptor rearrangement, we identified on average for each patient 0.7 copy number gains and 2.2 copy number losses at the time of initial diagnosis and 0.5 copy number gains and 2.4 copy number losses in relapse. We detected 24/27 copy number alterations both in initial diagnosis and in relapse. The most common CNV detected was the CDKN2A/B deletion on chromosome 9p. Nine genes were recurrently mutated in 2 or more patients thus indicating the functional leukemogenic potential of these SNVs in T-ALL. These recurrent mutations included known oncogenes (Notch1), tumor suppressor genes (FBXW7, PHF6, WT1) and genes conferring drug resistance (NT5C2). In several patients one gene (such as Notch 1, PHF6, WT1) carried different mutations either at the time of initial diagnosis and or in relapse, indicating that the major leukemic clone had been eradicated by primary treatment, but that a minor clone had persisted and expanded during relapse. The types of mutations did not differ significantly between mutations that were either already present at diagnosis or those that were newly acquired in relapse, indicating that the treatment did not cause specific genomic damage. We will further characterize the clonal evolution of T-ALL into relapse by targeted re-sequencing at high depth of genes with either relapse specific or initial-disease specific mutations. In conclusion, T-ALL relapse differs from primary disease by a higher number of leukemogenic SNVs without gross genomic instability resulting in large CNVs. Disclosures: No relevant conflicts of interest to declare.


F1000Research ◽  
2014 ◽  
Vol 2 ◽  
pp. 217 ◽  
Author(s):  
Guillermo Barturen ◽  
Antonio Rueda ◽  
José L. Oliver ◽  
Michael Hackenberg

Whole genome methylation profiling at a single cytosine resolution is now feasible due to the advent of high-throughput sequencing techniques together with bisulfite treatment of the DNA. To obtain the methylation value of each individual cytosine, the bisulfite-treated sequence reads are first aligned to a reference genome, and then the profiling of the methylation levels is done from the alignments. A huge effort has been made to quickly and correctly align the reads and many different algorithms and programs to do this have been created. However, the second step is just as crucial and non-trivial, but much less attention has been paid to the final inference of the methylation states. Important error sources do exist, such as sequencing errors, bisulfite failure, clonal reads, and single nucleotide variants.We developed MethylExtract, a user friendly tool to: i) generate high quality, whole genome methylation maps and ii) detect sequence variation within the same sample preparation. The program is implemented into a single script and takes into account all major error sources. MethylExtract detects variation (SNVs – Single Nucleotide Variants) in a similar way to VarScan, a very sensitive method extensively used in SNV and genotype calling based on non-bisulfite-treated reads. The usefulness of MethylExtract is shown by means of extensive benchmarking based on artificial bisulfite-treated reads and a comparison to a recently published method, called Bis-SNP.MethylExtract is able to detect SNVs within High-Throughput Sequencing experiments of bisulfite treated DNA at the same time as it generates high quality methylation maps. This simultaneous detection of DNA methylation and sequence variation is crucial for many downstream analyses, for example when deciphering the impact of SNVs on differential methylation. An exclusive feature of MethylExtract, in comparison with existing software, is the possibility to assess the bisulfite failure in a statistical way. The source code, tutorial and artificial bisulfite datasets are available at http://bioinfo2.ugr.es/MethylExtract/ and http://sourceforge.net/projects/methylextract/, and also permanently accessible from 10.5281/zenodo.7144.


2019 ◽  
Vol 4 ◽  
pp. 145
Author(s):  
Matthew N. Wakeling ◽  
Thomas W. Laver ◽  
Kevin Colclough ◽  
Andrew Parish ◽  
Sian Ellard ◽  
...  

Multiple Nucleotide Variants (MNVs) are miscalled by the most widely utilised next generation sequencing analysis (NGS) pipelines, presenting the potential for missing diagnoses that would previously have been made by standard Sanger (dideoxy) sequencing. These variants, which should be treated as a single insertion-deletion mutation event, are commonly called as separate single nucleotide variants. This can result in misannotation, incorrect amino acid predictions and potentially false positive and false negative diagnostic results. This risk will be increased as confirmatory Sanger sequencing of Single Nucleotide variants (SNVs) ceases to be standard practice. Using simulated data and re-analysis of sequencing data from a diagnostic targeted gene panel, we demonstrate that the widely adopted pipeline, GATK best practices, results in miscalling of MNVs and that alternative tools can call these variants correctly. The adoption of calling methods that annotate MNVs correctly would present a solution for individual laboratories, however GATK best practices are the basis for important public resources such as the gnomAD database. We suggest integrating a solution into these guidelines would be the optimal approach.


Sign in / Sign up

Export Citation Format

Share Document