A CATH domain functional family based approach to identify putative cancer driver genes and driver mutations

Tumour sequencing identifies highly recurrent point mutations in cancer driver genes, but rare functional mutations are hard to distinguish from large numbers of passengers. We developed a novel computational platform applying a multi-modal approach to filter out passengers and more robustly identify putative driver genes. The primary filter identifies enrichment of cancer mutations in CATH functional families (CATH-FunFams), which are structurally and functionally coherent sets of evolutionarily related domains. Using structural representatives from CATH-FunFams, we subsequently seek enrichment of mutations in 3D and show that these mutation clusters have a very significant tendency to lie close to known functional sites or conserved sites predicted using CATH-FunFams. Our third filter identifies enrichment of putative driver genes in functionally coherent protein network modules confirmed by literature analysis to be cancer associated. Our approach is complementary to other domain enrichment approaches exploiting Pfam families, but benefits from more functionally coherent groupings of domains. Using a set of mutations from 22 cancers we detect 151 putative cancer drivers, of which 79 are not listed in cancer resources and include recently validated cancer genes EPHA7, DCC netrin-1 receptor and zinc-finger protein ZNF479.
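The idea behind the primary filter, testing whether mutations are enriched in a domain family, can be illustrated with a simple one-sided binomial test. This is only a minimal sketch under a uniform-background assumption, not the authors' actual statistic; the function names `binomial_sf` and `domain_enrichment_pvalue` and the length-fraction null model are assumptions made for illustration.

```python
from math import comb


def binomial_sf(k, n, p):
    """Return P(X >= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


def domain_enrichment_pvalue(muts_in_domain, total_muts, domain_len, total_len):
    """One-sided p-value for an excess of mutations inside a domain.

    Assumed null model (hypothetical, for illustration only): each mutation
    lands uniformly along the sequence, so the chance of hitting the domain
    equals the fraction of residues the domain covers.
    """
    if not 0 < domain_len <= total_len:
        raise ValueError("domain_len must be in (0, total_len]")
    if not 0 <= muts_in_domain <= total_muts:
        raise ValueError("muts_in_domain must be in [0, total_muts]")
    p_null = domain_len / total_len
    return binomial_sf(muts_in_domain, total_muts, p_null)
```

With toy numbers, a domain covering 10% of the residues but carrying 8 of 10 mutations yields a very small p-value, whereas 1 of 10 does not. A real analysis would also need a background model that accounts for mutation-rate variation and would correct for testing many families.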


Diversity spectrum analysis identifies mutation-specific effects of cancer driver genes

Communications Biology ◽

10.1038/s42003-019-0736-4 ◽

2020 ◽

Vol 3 (1) ◽

Cited By ~ 2

Author(s):

Xiaobao Dong ◽

Dandan Huang ◽

Xianfu Yi ◽

Shijie Zhang ◽

Zhao Wang ◽

...

Keyword(s):

Clinical Trials ◽

Spectrum Analysis ◽

Driver Mutations ◽

Driver Gene ◽

Cancer Type ◽

Driver Genes ◽

Drug Responses ◽

Cancer Driver ◽

Specific Effects ◽

Cancer Driver Genes

Abstract: Mutation-specific effects of cancer driver genes influence drug responses and the success of clinical trials. We reasoned that these effects could unbalance the distribution of each mutation across different cancer types; as a result, the cancer preference can be used to distinguish the effects of the causal mutation. Here, we developed a network-based framework to systematically measure cancer diversity for each driver mutation. We found that half of the driver genes harbor cancer type-specific and pancancer mutations simultaneously, suggesting pervasive functional heterogeneity among mutations, even within the same driver gene. We further demonstrated that the specificity of the mutations could influence patient drug responses. Moreover, we observed that diversity was generally increased in advanced tumors. Finally, we scanned for potentially novel cancer driver genes based on the diversity spectrum. Diversity spectrum analysis provides a new approach to define driver mutations and optimize off-label clinical trials.
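The notion of "cancer diversity" for a single mutation can be made concrete with a simple stand-in measure. The abstract describes a network-based framework whose details are not given here, so the sketch below swaps in Shannon entropy over cancer-type counts purely as an illustration; the function name `shannon_diversity` and the toy labels are assumptions, not part of the published method.

```python
from math import log


def shannon_diversity(counts_by_cancer_type):
    """Shannon entropy (in nats) of a mutation's distribution over cancer types.

    A value of 0 means the mutation was seen in a single cancer type
    (type-specific); larger values mean it is spread more evenly across
    types (pan-cancer). This is an illustrative stand-in, not the
    network-based measure used in the paper.
    """
    total = sum(counts_by_cancer_type.values())
    if total <= 0:
        raise ValueError("at least one observation is required")
    entropy = 0.0
    for count in counts_by_cancer_type.values():
        if count > 0:
            p = count / total
            entropy -= p * log(p)
    return entropy


# Toy counts with invented labels: one type-specific and one evenly spread mutation.
specific = shannon_diversity({"type_A": 12})
spread = shannon_diversity({"type_A": 3, "type_B": 3, "type_C": 3, "type_D": 3})
```

Under this stand-in, `specific` is zero and `spread` equals the natural log of the number of cancer types, which is the maximum for four types.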


WITER: A powerful method for the estimation of cancer-driver genes using a weighted iterative regression accurately modelling background mutation rate

10.1101/437061 ◽

2018 ◽

Author(s):

Lin Jiang ◽

Jingjing Zheng ◽

Johnny Sheung Him Kwan ◽

Sheng Dai ◽

Cong Li ◽

...

Keyword(s):

Negative Binomial ◽

Negative Binomial Regression ◽

Alternative Methods ◽

Driver Mutations ◽

Driver Genes ◽

Cancer Driver ◽

Binomial Regression ◽

Background Mutation Rate ◽

Technical Advances ◽

Cancer Driver Genes

Abstract: Genomic identification of driver mutations and genes in cancer cells is critical for precision medicine. Due to the difficulty of modeling the distribution of background mutations, existing statistical methods are often underpowered to discriminate driver genes from passenger genes. Here we propose a novel statistical approach, weighted iterative zero-truncated negative-binomial regression (WITER), to detect cancer-driver genes showing an excess of somatic mutations. By solving the problem of inaccurately modeling background mutations, this approach works even in small or moderate samples. Compared to alternative methods, it detected more significant and cancer-consensus genes in all tested cancers. Applying this approach, we estimated 178 driver genes in 26 different cancer types. In silico validation confirmed 90.5% of predicted genes as likely known drivers and 7 genes unique for individual cancers as likely new drivers. The technical advances of WITER enable the detection of driver genes in TCGA datasets as small as 30 subjects, rescuing more genes missed by alternative tools.


WITER: a powerful method for estimation of cancer-driver genes using a weighted iterative regression modelling background mutation counts

Nucleic Acids Research ◽

10.1093/nar/gkz566 ◽

2019 ◽

Vol 47 (16) ◽

pp. e96-e96 ◽

Cited By ~ 6

Author(s):

Lin Jiang ◽

Jingjing Zheng ◽

Johnny S H Kwan ◽

Sheng Dai ◽

Cong Li ◽

...

Keyword(s):

Negative Binomial ◽

Negative Binomial Regression ◽

Alternative Methods ◽

Small Samples ◽

Driver Mutations ◽

Driver Genes ◽

Cancer Driver ◽

Binomial Regression ◽

Technical Advances ◽

Cancer Driver Genes

Abstract: Genomic identification of driver mutations and genes in cancer cells is critical for precision medicine. Due to the difficulty of modelling the distribution of background mutation counts, existing statistical methods are often underpowered to discriminate cancer-driver genes from passenger genes. Here we propose a novel statistical approach, weighted iterative zero-truncated negative-binomial regression (WITER, http://grass.cgs.hku.hk/limx/witer or KGGSeq, http://grass.cgs.hku.hk/limx/kggseq/), to detect cancer-driver genes showing an excess of somatic mutations. By fitting the distribution of background mutation counts properly, this approach works well even in small or moderate samples. Compared to alternative methods, it detected more significant and cancer-consensus genes in most tested cancers. Applying this approach, we estimated 229 driver genes in 26 different types of cancers. In silico validation confirmed 78% of predicted genes as likely known drivers and many other genes as very likely new drivers for corresponding cancers. The technical advances of WITER enable the detection of driver genes in TCGA datasets as small as 30 subjects and rescue of more genes missed by alternative tools in moderate or small samples.
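For readers unfamiliar with the model family named above, the standard zero-truncated negative-binomial distribution can be written as follows. This is the textbook form only; the symbols (per-gene mean \mu_g, dispersion \theta, covariates x_{gj}, coefficients \beta_j) are notation assumed here, and the abstract does not specify WITER's exact parameterisation, covariates, or weighting scheme.

```latex
% Negative-binomial probability of observing y mutations in gene g
P(Y_g = y) = \frac{\Gamma(y + \theta)}{\Gamma(\theta)\, y!}
             \left(\frac{\theta}{\theta + \mu_g}\right)^{\theta}
             \left(\frac{\mu_g}{\theta + \mu_g}\right)^{y},
             \qquad y = 0, 1, 2, \dots

% Zero truncation: condition on the gene having at least one mutation
P(Y_g = y \mid Y_g > 0) =
    \frac{P(Y_g = y)}{1 - \left(\frac{\theta}{\theta + \mu_g}\right)^{\theta}},
    \qquad y = 1, 2, \dots

% Regression part: log link between the mean and gene-level covariates
\log \mu_g = \beta_0 + \sum_{j} \beta_j x_{gj}
```

The truncation reflects that genes with no observed mutations do not enter the fit, so the zero class is removed and the remaining probabilities are renormalised. Genes whose observed counts greatly exceed the fitted background mean are the candidates for an excess of somatic mutations.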


iCAGES: integrated CAncer GEnome Score for comprehensively prioritizing cancer driver genes in personal genomes

10.1101/015008 ◽

2015 ◽

Author(s):

Chengliang Dong ◽

Hui Yang ◽

Zeyu He ◽

Xiaoming Liu ◽

Kai Wang

Keyword(s):

Driver Mutations ◽

Patient Specific ◽

Support Vector ◽

Biological Knowledge ◽

Driver Genes ◽

Cancer Driver ◽

Prior Biological Knowledge ◽

Specific Cancer ◽

Cancer Driver Genes ◽

Better Than

All cancers arise as a result of the acquisition of somatic mutations that drive disease progression. A number of computational tools have been developed to identify driver genes for a specific cancer from a group of cancer samples. However, it remains a challenge to identify driver mutations/genes for an individual patient and design drug therapies. We developed iCAGES, a novel statistical framework to rapidly analyze patient-specific cancer genomic data, prioritize personalized cancer driver events and predict personalized therapies. iCAGES includes three consecutive layers. The first layer integrates contributions from coding, non-coding and structural variations to infer driver variants. For coding mutations, we developed a radial support vector machine using manually curated mutations to predict their driver potential. The second layer identifies driver genes by using information from the first layer and integrating prior biological knowledge on gene-gene and gene-phenotype networks. The third layer prioritizes personalized drug treatment by classifying potential driver genes into different categories and querying drug-gene databases. Compared to currently available tools, iCAGES achieves better performance by correctly classifying point coding driver mutations (AUC=0.97, 95% CI: 0.97-0.97, significantly better than the second best tool with P=0.01) and genes (AUC=0.93, 95% CI: 0.93-0.94, significantly better than MutSigCV with P<1×10^-15). We also illustrated two examples where iCAGES correctly nominated two targeted drugs for two advanced cancer patients with exceptional response, based on their somatic mutation profiles. iCAGES leverages personal genomic information and prior biological knowledge, effectively identifies cancer driver genes and predicts treatment strategies. iCAGES is available at http://icages.usc.edu.


Ontology-based prediction of cancer driver genes

10.1101/561480 ◽

2019 ◽

Cited By ~ 1

Author(s):

Sara Althubaiti ◽

Andreas Karwath ◽

Ashraf Dallol ◽

Adeeb Noor ◽

Shadi Salem Alkhayyat ◽

...

Keyword(s):

Nasopharyngeal Cancer ◽

Whole Body ◽

Driver Mutations ◽

Driver Genes ◽

Accurate Identification ◽

Low Frequencies ◽

Cancer Driver ◽

Cancer Driver Genes ◽

Types Of Information ◽

Cellular Phenotypes

Abstract: Identifying and distinguishing cancer driver genes among thousands of candidate mutations remains a major challenge. Accurate identification of driver genes and driver mutations is critical for advancing cancer research and personalizing treatment based on accurate stratification of patients. Due to inter-tumor genetic heterogeneity, many driver mutations within a gene occur at low frequencies, which makes it challenging to distinguish them from non-driver mutations. We have developed a novel method for identifying cancer driver genes. Our approach utilizes multiple complementary types of information, specifically cellular phenotypes, cellular locations, functions, and whole body physiological phenotypes as features. We demonstrate that our method can accurately identify known cancer driver genes and distinguish between their roles in different types of cancer. In addition to confirming known driver genes, we identify several novel candidate driver genes. We demonstrate the utility of our method by validating its predictions in nasopharyngeal cancer and colorectal cancer using whole exome and whole genome sequencing.


Specific clonal expansion at disease progression (PD) in solid cancers pinpointed by cell free DNA analysis.

Journal of Clinical Oncology ◽

10.1200/jco.2019.37.15_suppl.e13144 ◽

2019 ◽

Vol 37 (15_suppl) ◽

pp. e13144-e13144

Author(s):

Elisa Frullanti ◽

Maria Palmieri ◽

Margherita Baldassarri ◽

Francesca Fava ◽

Alessandra Fabbiani ◽

...

Keyword(s):

Solid Tumors ◽

Primary Tumor ◽

Dna Analysis ◽

Copy Number Variations ◽

Driver Mutations ◽

Molecular Profile ◽

Sequencing Analysis ◽

Driver Genes ◽

Cancer Driver ◽

Cancer Driver Genes

e13144 Background: More than 50% of solid cancers sooner or later escape control of standard treatments. Detection and analysis of cell free circulating DNA (cfDNA) now offer the possibility to detect key mutations of cancer driver genes which may play a major role in the therapy escaping mechanism. We sought to identify clones of solid tumors escaping standard treatments in order to assess personalized treatment at PD. Methods: A cohort of patients with 10 different solid tumors progressing after standard therapy were selected. CfDNA analysis was performed using PAXgene blood ccfDNA tubes (QIAGEN), MagMAX cell-free total nucleic acid isolation kit, and ION PROTON platform (ThermoFisher Scientific). Results: Next generation sequencing analysis of 52 cancer-driver genes of cfDNA samples of 39 patients allowed for picking up clones plausibly involved in the PD mechanism in 60% of cases. A mean of 1.3 mutated genes (range 1-3) for each tumor was found. Point mutations in TP53, PIK3CA, and CNV in FGFR3 were the most commonly observed, with a rate of 41%, 16%, and 13%, respectively. Increased copy number variations of FGF receptors were identified in patients with non-small cell lung, pancreatic, and gastric cancer, and cholangiocarcinoma. Other clones had mutations in ESR1 (breast), CTNNB1 (uterus), KRAS and CCND2 (pancreas), EGFR and BRAF (lung). Interestingly, retinoblastomas resistant to Melphalan showed expanding mutated clones in PTEN or SMAD4. Increased levels of cfDNA were observed in the plasma of all patients. Conclusions: The results presented here show that irrespective of the primary tumor mutational burden and subsequent complex clonal evolution, a simplified mutational load is present at PD. One or few “sniper” clones drive progression and the molecular profile has a weak correlation with the primary tumor. 
Single driver mutations in TP53 remain the main target of a not yet developed specific therapy in most tumors, such as breast, ovarian, uterine, lung and gastric cancers and glioblastoma. Among the actionable mutations, PIK3CA mutations were found not only in breast cancers but also in uterine carcinoma, Sezary syndrome and glioblastoma, pinpointing the need for specific trials in these tumors.


Ontology-based prediction of cancer driver genes

Scientific Reports ◽

10.1038/s41598-019-53454-1 ◽

2019 ◽

Vol 9 (1) ◽

Cited By ~ 4

Author(s):

Sara Althubaiti ◽

Andreas Karwath ◽

Ashraf Dallol ◽

Adeeb Noor ◽

Shadi Salem Alkhayyat ◽

...

Keyword(s):

Nasopharyngeal Cancer ◽

Whole Body ◽

Driver Mutations ◽

Driver Genes ◽

Accurate Identification ◽

Low Frequencies ◽

Cancer Driver ◽

Cancer Driver Genes ◽

Types Of Information ◽

Cellular Phenotypes

Abstract: Identifying and distinguishing cancer driver genes among thousands of candidate mutations remains a major challenge. Accurate identification of driver genes and driver mutations is critical for advancing cancer research and personalizing treatment based on accurate stratification of patients. Due to inter-tumor genetic heterogeneity, many driver mutations within a gene occur at low frequencies, which makes it challenging to distinguish them from non-driver mutations. We have developed a novel method for identifying cancer driver genes. Our approach utilizes multiple complementary types of information, specifically cellular phenotypes, cellular locations, functions, and whole body physiological phenotypes as features. We demonstrate that our method can accurately identify known cancer driver genes and distinguish between their roles in different types of cancer. In addition to confirming known driver genes, we identify several novel candidate driver genes. We demonstrate the utility of our method by validating its predictions in nasopharyngeal cancer and colorectal cancer using whole exome and whole genome sequencing.


Discovery of cancer driver genes based on nucleotide context

10.1101/485292 ◽

2018 ◽

Author(s):

Felix Dietlein ◽

Donate Weghorn ◽

Amaro Taylor-Weiner ◽

André Richters ◽

Brendan Reardon ◽

...

Keyword(s):

Mutation Rates ◽

Driver Mutations ◽

Supporting Evidence ◽

Cancer Genes ◽

Driver Genes ◽

High Background ◽

Cancer Driver ◽

Nucleotide Context ◽

Cancer Driver Genes ◽

Passenger Mutations

Many cancer genomes contain large numbers of somatic mutations, but few of these mutations drive tumor development. Current approaches to identify cancer driver genes are largely based on mutational recurrence, i.e. they search for genes with an increased number of nonsynonymous mutations relative to the local background mutation rate. Multiple studies have noted that the sensitivity of recurrence-based methods is limited in tumors with high background mutation rates, because passenger mutations dilute their statistical power. Here, we observe that passenger mutations tend to occur in characteristic nucleotide sequence contexts, while driver mutations follow a different distribution pattern determined by the location of functionally relevant genomic positions along the protein-coding sequence. To discover new cancer genes, we searched for genes with an excess of mutations in unusual nucleotide contexts that deviate from the characteristic context around passenger mutations. By applying this statistical framework to whole-exome sequencing data from 12,004 tumors, we discovered a long tail of novel candidate cancer genes with mutation frequencies as low as 1% and functional supporting evidence. Our results show that considering both the number and the nucleotide context around mutations helps identify novel cancer driver genes, particularly in tumors with high background mutation rates.
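To make "nucleotide context" concrete, the sketch below tallies mutations by the reference base plus its two flanking bases (a trinucleotide context) together with the alternate base. It only illustrates how a context spectrum can be counted; it is not the statistical framework described in the abstract, and the function names, the 0-based coordinate convention and the toy sequence are assumptions.

```python
from collections import Counter


def trinucleotide_context(sequence, position):
    """Return the base at `position` (0-based) together with its two neighbours."""
    if position < 1 or position > len(sequence) - 2:
        raise ValueError("position needs one flanking base on each side")
    return sequence[position - 1 : position + 2].upper()


def context_spectrum(sequence, mutations):
    """Count mutations per (trinucleotide context, alternate base).

    `mutations` is an iterable of (position, alternate_base) pairs, with
    positions given as 0-based indices into `sequence`.
    """
    counts = Counter()
    for position, alt in mutations:
        key = (trinucleotide_context(sequence, position), alt.upper())
        counts[key] += 1
    return counts


# Toy example with an invented sequence and invented substitutions.
toy_spectrum = context_spectrum("ACGTTCGA", [(1, "T"), (5, "T"), (2, "A")])
```

Comparing such a spectrum for one gene against the spectrum pooled over presumed passenger mutations is one way to see which mutations fall in unusual contexts; the choice of background and the significance test are left open here.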


Faculty Opinions recommendation of Evaluating the evaluation of cancer driver genes.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature ◽

10.3410/f.727060594.793535346 ◽

2017 ◽

Author(s):

Ron Shamir

Keyword(s):

Driver Genes ◽

Cancer Driver ◽

Cancer Driver Genes
