scholarly journals Enabling transparent and collaborative computational analysis of 12 tumor types within The Cancer Genome Atlas

2013 ◽  
Vol 45 (10) ◽  
pp. 1121-1126 ◽  
Author(s):  
Larsson Omberg ◽  
Kyle Ellrott ◽  
Yuan Yuan ◽  
Cyriac Kandoth ◽  
Chris Wong ◽  
...  
2019 ◽  
Author(s):  
Roozbeh Dehghannasiri ◽  
Donald Eric Freeman ◽  
Milos Jordanski ◽  
Gillian L. Hsieh ◽  
Ana Damljanovic ◽  
...  

Short AbstractThe extent to which gene fusions function as drivers of cancer remains a critical open question. Current algorithms do not sufficiently identify false-positive fusions arising during library preparation, sequencing, and alignment. Here, we introduce a new algorithm, DEEPEST, that uses statistical modeling to minimize false-positives while increasing the sensitivity of fusion detection. In 9,946 tumor RNA-sequencing datasets from The Cancer Genome Atlas (TCGA) across 33 tumor types, DEEPEST identifies 31,007 fusions, 30% more than identified by other methods, while calling ten-fold fewer false-positive fusions in non-transformed human tissues. We leverage the increased precision of DEEPEST to discover new cancer biology. For example, 888 new candidate oncogenes are identified based on over-representation in DEEPEST-Fusion calls, and 1,078 previously unreported fusions involving long intergenic noncoding RNAs partners, demonstrating a previously unappreciated prevalence and potential for function. Specific protein domains are enriched in DEEPEST calls, demonstrating a global selection for fusion functionality: kinase domains are nearly 2-fold more enriched in DEEPEST calls than expected by chance, as are domains involved in (anaerobic) metabolism and DNA binding. DEEPEST also reveals a high enrichment for fusions involving known and novel oncogenes in diseases including ovarian cancer, which has had minimal treatment advances in recent decades, finding that more than 50% of tumors harbor gene fusions predicted to be oncogenic. The statistical algorithms, population-level analytic framework, and the biological conclusions of DEEPEST call for increased attention to gene fusions as drivers of cancer and for future research into using fusions for targeted therapy.SignificanceGene fusions are tumor-specific genomic aberrations and are among the most powerful biomarkers and drug targets in translational cancer biology. The advent of RNA-Seq technologies over the past decade has provided a unique opportunity for detecting novel fusions via deploying computational algorithms on public sequencing databases. Yet, precise fusion detection algorithms are still out of reach. We develop DEEPEST, a highly specific and efficient statistical pipeline specially designed for mining massive sequencing databases, and apply it to all 33 tumor types and 10,500 samples in The Cancer Genome Atlas database. We systematically profile the landscape of detected fusions via employing classic statistical models and identify several signatures of selection for fusions in tumors.Software availabilityDEEPEST-Fusion workflow with a detailed readme file is available as a Github repository:https://github.com/salzmanlab/DEEPEST-Fusion. In addition to the main workflow, which is based on CWL, example input and batch scripts (for job submission on local clusters), and codes for building the SBT files and SBT querying are provided in the repository. All custom scripts used for systematic analysis of fusions are also available in the same repository.


2017 ◽  
pp. 1-12
Author(s):  
Manish R. Sharma ◽  
James T. Auman ◽  
Nirali M. Patel ◽  
Juneko E. Grilley-Olson ◽  
Xiaobei Zhao ◽  
...  

Purpose A 73-year-old woman with metastatic colon cancer experienced a complete response to chemotherapy with dose-intensified irinotecan that has been durable for 5 years. We sequenced her tumor and germ line DNA and looked for similar patterns in publicly available genomic data from patients with colorectal cancer. Patients and Methods Tumor DNA was obtained from a biopsy before therapy, and germ line DNA was obtained from blood. Tumor and germline DNA were sequenced using a commercial panel with approximately 250 genes. Whole-genome amplification and exome sequencing were performed for POLE and POLD1. A POLD1 mutation was confirmed by Sanger sequencing. The somatic mutation and clinical annotation data files from the colon (n = 461) and rectal (n = 171) adenocarcinoma data sets were downloaded from The Cancer Genome Atlas data portal and analyzed for patterns of mutations and clinical outcomes in patients with POLE- and/or POLD1-mutated tumors. Results The pattern of alterations included APC biallelic inactivation and microsatellite instability high (MSI-H) phenotype, with somatic inactivation of MLH1 and hypermutation (estimated mutation rate > 200 per megabase). The extremely high mutation rate led us to investigate additional mechanisms for hypermutation, including loss of function of POLE. POLE was unaltered, but a related gene not typically associated with somatic mutation in colon cancer, POLD1, had a somatic mutation c.2171G>A [p.Gly724Glu]. Additionally, we noted that the high mutation rate was largely composed of dinucleotide deletions. A similar pattern of hypermutation (dinucleotide deletions, POLD1 mutations, MSI-H) was found in tumors from The Cancer Genome Atlas. Conclusion POLD1 mutation with associated MSI-H and hyper-indel–hypermutated cancer genome characterizes a previously unrecognized variant of colon cancer that was found in this patient with an exceptional response to chemotherapy.


2018 ◽  
Vol Volume 11 ◽  
pp. 1-11 ◽  
Author(s):  
Chundi Gao ◽  
Huayao Li ◽  
Jing Zhuang ◽  
HongXiu Zhang ◽  
Kejia Wang ◽  
...  

2018 ◽  
Vol 17 (2) ◽  
pp. 476-487 ◽  
Author(s):  
Fengju Chen ◽  
Yiqun Zhang ◽  
Sooryanarayana Varambally ◽  
Chad J. Creighton

Sign in / Sign up

Export Citation Format

Share Document