IDENTIFY CANCER DRIVER GENES THROUGH SHARED MENDELIAN DISEASE PATHOGENIC VARIANTS AND CANCER SOMATIC MUTATIONS

Author(s):  
MENG MA ◽  
CHANGCHANG WANG ◽  
BENJAMIN S. GLICKSBERG ◽  
ERIC E. SCHADT ◽  
SHUYU D. LI ◽  
...  
2016 ◽  
Vol 113 (50) ◽  
pp. 14330-14335 ◽  
Author(s):  
Collin J. Tokheim ◽  
Nickolas Papadopoulos ◽  
Kenneth W. Kinzler ◽  
Bert Vogelstein ◽  
Rachel Karchin

Sequencing has identified millions of somatic mutations in human cancers, but distinguishing cancer driver genes remains a major challenge. Numerous methods have been developed to identify driver genes, but evaluation of the performance of these methods is hindered by the lack of a gold standard, that is, bona fide driver gene mutations. Here, we establish an evaluation framework that can be applied to driver gene prediction methods. We used this framework to compare the performance of eight such methods. One of these methods, described here, incorporated a machine-learning–based ratiometric approach. We show that the driver genes predicted by each of the eight methods vary widely. Moreover, the P values reported by several of the methods were inconsistent with the uniform values expected, thus calling into question the assumptions that were used to generate them. Finally, we evaluated the potential effects of unexplained variability in mutation rates on false-positive driver gene predictions. Our analysis points to the strengths and weaknesses of each of the currently available methods and offers guidance for improving them in the future.


2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Ege Ülgen ◽  
O. Uğur Sezerman

Abstract Background Cancer develops due to “driver” alterations. Numerous approaches exist for predicting cancer drivers from cohort-scale genomics data. However, methods for personalized analysis of driver genes are underdeveloped. In this study, we developed a novel personalized/batch analysis approach for driver gene prioritization utilizing somatic genomics data, called driveR. Results Combining genomics information and prior biological knowledge, driveR accurately prioritizes cancer driver genes via a multi-task learning model. Testing on 28 different datasets, this study demonstrates that driveR performs adequately, achieving a median AUC of 0.684 (range 0.651–0.861) on the 28 batch analysis test datasets, and a median AUC of 0.773 (range 0–1) on the 5157 personalized analysis test samples. Moreover, it outperforms existing approaches, achieving a significantly higher median AUC than all of MutSigCV (Wilcoxon rank-sum test p < 0.001), DriverNet (p < 0.001), OncodriveFML (p < 0.001) and MutPanning (p < 0.001) on batch analysis test datasets, and a significantly higher median AUC than DawnRank (p < 0.001) and PRODIGY (p < 0.001) on personalized analysis datasets. Conclusions This study demonstrates that the proposed method is an accurate and easy-to-utilize approach for prioritizing driver genes in cancer genomes in personalized or batch analyses. driveR is available on CRAN: https://cran.r-project.org/package=driveR.


EBioMedicine ◽  
2018 ◽  
Vol 27 ◽  
pp. 156-166 ◽  
Author(s):  
Magali Champion ◽  
Kevin Brennan ◽  
Tom Croonenborghs ◽  
Andrew J. Gentles ◽  
Nathalie Pochet ◽  
...  

2021 ◽  
Author(s):  
Samah El Ghamrasni ◽  
Rene Quevedo ◽  
James R Hawley ◽  
Parisa Mazrooei ◽  
Youstina Hanna ◽  
...  

Whole-genome sequencing of primary breast tumors enabled the identification of cancer driver genes and non-coding cancer driver plexuses from somatic mutations. However, differentiating driver and passenger events among non-coding genetic variants remains a challenge to understand the etiology of cancer and inform the delivery of personalized cancer medicine. Herein, we reveal an enrichment of non-coding mutations in cis-regulatory elements that cover a subset of transcription factors linked to tumor progression in luminal breast cancers. Using a cohort of 26 primary luminal ER+PR+ breast tumors, we compiled a catalogue of ~100,000 unique cis-regulatory elements from ATACseq data. Integrating this catalogue with somatic mutations from 350 publicly available breast tumor whole genomes, we identified four recurrently mutated individual cis-regulatory elements. By then partitioning the non-coding genome into cistromes, defined as the sum of binding sites for a transcription factor, we uncovered cancer driver cistromes for ten transcription factors in luminal breast cancer, namely CTCF, ELF1, ESR1, FOSL2, FOXA1, FOXM1 GATA3, JUND, TFAP2A, and TFAP2C in luminal breast cancer. Nine of these ten transcription factors were shown to be essential for growth in breast cancer, with four exclusive to the luminal subtype. Collectively, we present a strategy to find cancer driver cistromes relying on quantifying the enrichment of non-coding mutations over cis-regulatory elements concatenated into a functional unit drawn from an accessible chromatin catalogue derived from primary cancer tissues.


2013 ◽  
Vol 3 (1) ◽  
Author(s):  
David Tamborero ◽  
Abel Gonzalez-Perez ◽  
Christian Perez-Llamas ◽  
Jordi Deu-Pons ◽  
Cyriac Kandoth ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document