Inferring perturbation profiles of cancer samples

Mapping Intimacies ◽

10.1101/2020.12.10.419077 ◽

2020 ◽

Author(s):

Martin Pirkl ◽

Niko Beerenwinkel

Keyword(s):

Indirect Evidence ◽

R Package ◽

The Cancer Genome Atlas ◽

Patient Specific ◽

Driver Genes ◽

Cancer Driver ◽

Molecular Alterations ◽

Incomplete Coverage ◽

Cancer Genome Atlas ◽

Gene Perturbations

AbstractMotivationCancer is one of the most prevalent diseases in the world. Tumors arise due to important genes changing their activity, e.g., when inhibited or over-expressed. But these gene perturbations are difficult to observe directly. Molecular profiles of tumors can provide indirect evidence of gene perturbations. However, inferring perturbation profiles from molecular alterations is challenging due to error-prone molecular measurements and incomplete coverage of all possible molecular causes of gene perturbations.ResultsWe have developed a novel mathematical method to analyze cancer driver genes and their patient-specific perturbation profiles. We combine genetic aberrations with gene expression data in a causal network derived across patients to infer unobserved perturbations. We show that our method can predict perturbations in simulations, CRISPR perturbation screens, and breast cancer samples from The Cancer Genome Atlas.AvailabilityThe method is available as the R-package nempi at https://github.com/cbg-ethz/[email protected], [email protected]

Download Full-text

Inferring perturbation profiles of cancer samples

Bioinformatics ◽

10.1093/bioinformatics/btab113 ◽

2021 ◽

Author(s):

Martin Pirkl ◽

Niko Beerenwinkel

Keyword(s):

Indirect Evidence ◽

R Package ◽

The Cancer Genome Atlas ◽

Supplementary Information ◽

Patient Specific ◽

Driver Genes ◽

Cancer Driver ◽

Molecular Alterations ◽

Incomplete Coverage ◽

Gene Perturbations

Abstract Motivation Cancer is one of the most prevalent diseases in the world. Tumors arise due to important genes changing their activity, e.g. when inhibited or over-expressed. But these gene perturbations are difficult to observe directly. Molecular profiles of tumors can provide indirect evidence of gene perturbations. However, inferring perturbation profiles from molecular alterations is challenging due to error-prone molecular measurements and incomplete coverage of all possible molecular causes of gene perturbations. Results We have developed a novel mathematical method to analyze cancer driver genes and their patient-specific perturbation profiles. We combine genetic aberrations with gene expression data in a causal network derived across patients to infer unobserved perturbations. We show that our method can predict perturbations in simulations, CRISPR perturbation screens and breast cancer samples from The Cancer Genome Atlas. Availability and implementation The method is available as the R-package nempi at https://github.com/cbg-ethz/nempi and http://bioconductor.org/packages/nempi. Supplementary information Supplementary data are available at Bioinformatics online.

Download Full-text

An integrative analysis of the age-associated multi-omic landscape across cancers

Nature Communications ◽

10.1038/s41467-021-22560-y ◽

2021 ◽

Vol 12 (1) ◽

Author(s):

Kasit Chatsirisupachai ◽

Tom Lesluyes ◽

Luminita Paraoan ◽

Peter Van Loo ◽

João Pedro de Magalhães

Keyword(s):

The Cancer Genome Atlas ◽

Driver Genes ◽

Cancer Driver ◽

Molecular Alterations ◽

Age Related ◽

Incidence And Mortality ◽

Cancer Genome Atlas ◽

Cancer Types ◽

Using Data ◽

Transcriptomic Changes

AbstractAge is the most important risk factor for cancer, as cancer incidence and mortality increase with age. However, how molecular alterations in tumours differ among patients of different age remains largely unexplored. Here, using data from The Cancer Genome Atlas, we comprehensively characterise genomic, transcriptomic and epigenetic alterations in relation to patients’ age across cancer types. We show that tumours from older patients present an overall increase in genomic instability, somatic copy-number alterations (SCNAs) and somatic mutations. Age-associated SCNAs and mutations are identified in several cancer-driver genes across different cancer types. The largest age-related genomic differences are found in gliomas and endometrial cancer. We identify age-related global transcriptomic changes and demonstrate that these genes are in part regulated by age-associated DNA methylation changes. This study provides a comprehensive, multi-omics view of age-associated alterations in cancer and underscores age as an important factor to consider in cancer research and clinical practice.

Download Full-text

An Integrative Analysis of the Age-Associated Genomic, Transcriptomic and Epigenetic Landscape across Cancers

10.1101/2020.08.25.266403 ◽

2020 ◽

Author(s):

Kasit Chatsirisupachai ◽

Tom Lesluyes ◽

Luminita Paraoan ◽

Peter Van Loo ◽

João Pedro de Magalhães

Keyword(s):

The Cancer Genome Atlas ◽

Driver Genes ◽

Cancer Driver ◽

Molecular Alterations ◽

Age Related ◽

Incidence And Mortality ◽

Cancer Genome Atlas ◽

Cancer Types ◽

Using Data ◽

Transcriptomic Changes

AbstractAge is the most important risk factor for cancer, as cancer incidence and mortality increase with age. However, how molecular alterations in tumours differ among patients of different age remains largely unexplored. Here, using data from The Cancer Genome Atlas, we comprehensively characterised genomic, transcriptomic and epigenetic alterations in relation to patients’ age across cancer types. We showed that tumours from older patients present an overall increase in genomic instability, somatic copy-number alterations (SCNAs) and somatic mutations. Age-associated SCNAs and mutations were identified in several cancer-driver genes across different cancer types. The largest age-related genomic differences were found in gliomas and endometrial cancer. We identified age-related global transcriptomic changes and demonstrated that these genes are controlled by age-associated DNA methylation changes. This study provides a comprehensive view of age-associated alterations in cancer and underscores age as an important factor to consider in cancer research and clinical practice.

Download Full-text

Identification of Potential Driver Genes Based on Multi-Genomic Data in Cervical Cancer

Frontiers in Genetics ◽

10.3389/fgene.2021.598304 ◽

2021 ◽

Vol 12 ◽

Author(s):

Yuexun Xu ◽

Hui Luo ◽

Qunchao Hu ◽

Haiyan Zhu

Keyword(s):

Cervical Cancer ◽

Molecular Classification ◽

The Cancer Genome Atlas ◽

Consensus Clustering ◽

Driver Genes ◽

Molecular Alterations ◽

Wide Range ◽

Number Variation ◽

Cancer Genome Atlas ◽

Significant Expression

Background: Cervical cancer became the third most common cancer among women, and genome characterization of cervical cancer patients has revealed the extensive complexity of molecular alterations. However, identifying driver mutation and depicting molecular classification in cervical cancer remain a challenge.Methods: We performed an integrative multi-platform analysis of a cervical cancer cohort from The Cancer Genome Atlas (TCGA) based on 284 clinical cases and identified the driver genes and possible molecular classification of cervical cancer.Results: Multi-platform integration showed that cervical cancer exhibited a wide range of mutation. The top 10 mutated genes were TTN, PIK3CA, MUC4, KMT2C, MUC16, KMT2D, SYNE1, FLG, DST, and EP300, with a mutation rate from 12 to 33%. Applying GISTIC to detect copy number variation (CNV), the most frequent chromosome arm-level CNVs included losses in 4p, 11p, and 11q and gains in 20q, 3q, and 1q. Then, we performed unsupervised consensus clustering of tumor CNV profiles and methylation profiles and detected four statistically significant expression subtypes. Finally, by combining the multidimensional datasets, we identified 10 potential driver genes, including GPR107, CHRNA5, ZBTB20, Rb1, NCAPH2, SCA1, SLC25A5, RBPMS, DDX3X, and H2BFM.Conclusions: This comprehensive analysis described the genetic characteristic of cervical cancer and identified novel driver genes in cervical cancer. These results provide insight into developing precision treatment in cervical cancer.

Download Full-text

A Pan-cancer catalogue of driver protein interaction interfaces

10.1101/015883 ◽

2015 ◽

Cited By ~ 1

Author(s):

Eduard Porta-Pardo ◽

Thomas Hrabe ◽

Adam Godzik

Keyword(s):

Protein Interaction ◽

Specific Protein ◽

The Cancer Genome Atlas ◽

Driver Genes ◽

Cancer Driver ◽

Protein Protein Interaction ◽

Cancer Mutations ◽

Cancer Genome Atlas ◽

Mutation Pattern ◽

Pan Cancer

Despite their critical importance in maintaining the integrity of all cellular pathways, the specific role of mutations on protein-protein interaction (PPI) interfaces as cancer drivers, though known for some specific examples, has not been systematically studied. We analyzed missense somatic mutations in a pan-cancer cohort of 5,989 tumors from 23 projects of The Cancer Genome Atlas (TCGA) for enrichment on PPI interfaces using e-Driver, an algorithm to analyze the mutation pattern of specific protein regions such as PPI interfaces. We identified 128 PPI interfaces enriched in somatic cancer mutations. Our results support the notion that many mutations in well-established cancer driver genes, particularly those in critical network positions, act by altering PPI interfaces. Finally, focusing on individual interfaces we are also able to show how tumors driven by the same gene can have different behaviors, including patient outcomes, depending on whether specific interfaces are mutated or not.

Download Full-text

Utilizing patient information to identify subtype heterogeneity of cancer driver genes

Statistical Methods in Medical Research ◽

10.1177/09622802211055854 ◽

2021 ◽

pp. 096228022110558

Author(s):

Ho-Hsiang Wu ◽

Xing Hua ◽

Jianxin Shi ◽

Nilanjan Chatterjee ◽

Bin Zhu

Keyword(s):

Type I Error ◽

Smoking Status ◽

The Cancer Genome Atlas ◽

Type I ◽

Driver Genes ◽

Cancer Subtypes ◽

Cancer Driver ◽

The Status ◽

Cancer Genome Atlas ◽

Cancer Driver Genes

Identifying cancer driver genes is essential for understanding the mechanisms of carcinogenesis and designing therapeutic strategies. Although driver genes have been identified for many cancer types, it is still not clear whether the selection pressure of driver genes is homogeneous across cancer subtypes. We propose a statistical framework MutScot to improve the identification of driver genes and to investigate the heterogeneity of driver genes across cancer subtypes. Through simulation studies, we show that MutScot properly controls the type I error in detecting driver genes. In addition, we demonstrate that MutScot can identify subtype heterogeneity of driver genes. Applications to three studies in The Cancer Genome Atlas (TCGA) project showcase that MutScot has a desirable sensitivity for detecting driver genes and that MutScot identifies subtype heterogeneity of driver genes in breast cancer and lung cancer with regards to the status of hormone receptor and to the smoking status, respectively.

Download Full-text

Ranking cancer drivers via betweenness-based outlier detection and random walks

BMC Bioinformatics ◽

10.1186/s12859-021-03989-w ◽

2021 ◽

Vol 22 (1) ◽

Author(s):

Cesim Erten ◽

Aissa Houdjedj ◽

Hilal Kazan

Keyword(s):

Cancer Genomics ◽

Interaction Network ◽

Molecular Data ◽

Alternative Methods ◽

Patient Specific ◽

Cancer Genes ◽

Driver Genes ◽

Cancer Driver ◽

Protein Protein Interaction ◽

Genomic Studies

Abstract Background Recent cancer genomic studies have generated detailed molecular data on a large number of cancer patients. A key remaining problem in cancer genomics is the identification of driver genes. Results We propose BetweenNet, a computational approach that integrates genomic data with a protein-protein interaction network to identify cancer driver genes. BetweenNet utilizes a measure based on betweenness centrality on patient specific networks to identify the so-called outlier genes that correspond to dysregulated genes for each patient. Setting up the relationship between the mutated genes and the outliers through a bipartite graph, it employs a random-walk process on the graph, which provides the final prioritization of the mutated genes. We compare BetweenNet against state-of-the art cancer gene prioritization methods on lung, breast, and pan-cancer datasets. Conclusions Our evaluations show that BetweenNet is better at recovering known cancer genes based on multiple reference databases. Additionally, we show that the GO terms and the reference pathways enriched in BetweenNet ranked genes and those that are enriched in known cancer genes overlap significantly when compared to the overlaps achieved by the rankings of the alternative methods.

Download Full-text

An R Package for Divergence Analysis of Omics Data

10.1101/720391 ◽

2019 ◽

Author(s):

Wikum Dinalankara ◽

Qian Ke ◽

Donald Geman ◽

Luigi Marchionni

Keyword(s):

High Throughput Sequencing ◽

R Package ◽

The Cancer Genome Atlas ◽

High Dimensional ◽

Omics Data ◽

Sequencing Data ◽

High Throughput Sequencing Data ◽

Ternary Code ◽

Cancer Genome Atlas ◽

Level Analysis

AbstractGiven the ever-increasing amount of high-dimensional and complex omics data becoming available, it is increasingly important to discover simple but effective methods of analysis. Divergence analysis transforms each entry of a high-dimensional omics profile into a digitized (binary or ternary) code based on the deviation of the entry from a given baseline population. This is a novel framework that is significantly different from existing omics data analysis methods: it allows digitization of continuous omics data at the univariate or multivariate level, facilitates sample level analysis, and is applicable on many different omics platforms. The divergence package, available on the R platform through the Bioconductor repository collection, provides easy-to-use functions for carrying out this transformation. Here we demonstrate how to use the package with sample high throughput sequencing data from the Cancer Genome Atlas.

Download Full-text

Novel cancer subtyping method based on patient-specific gene regulatory network

10.1101/2021.03.24.436731 ◽

2021 ◽

Author(s):

Mai Adachi Nakazawa ◽

Yoshinori Tamada ◽

Yoshihisa Tanaka ◽

Marie Ikeguchi ◽

Kako Higashihara ◽

...

Keyword(s):

Gene Networks ◽

Regulatory Networks ◽

The Cancer Genome Atlas ◽

Patient Specific ◽

Specific Gene ◽

Omics Data ◽

Cancer Subtypes ◽

Molecular Systems ◽

Molecular Features ◽

Cancer Genome Atlas

The identification of cancer subtypes is important for the understanding of tumor heterogeneity. In recent years, numerous computational methods have been proposed for this problem based on the multi-omics data of patients. It is widely accepted that different cancer subtypes are induced by different molecular regulatory networks. However, only a few incorporate the differences between their molecular systems into the classification processes. In this study, we present a novel method to classify cancer subtypes based on patient-specific molecular systems. Our method quantifies patient-specific gene networks, which are estimated from their transcriptome data. By clustering their quantified networks, our method allows for cancer subtyping, taking into consideration the differences in the molecular systems of patients. Comprehensive analyses of The Cancer Genome Atlas (TCGA) datasets applied to our method confirmed that they were able to identify more clinically meaningful cancer subtypes than the existing subtypes and found that the identified subtypes comprised different molecular features. Our findings show that the proposed method, based on a simple classification using the patient-specific molecular systems, can identify cancer subtypes even with single omics data, which cannot otherwise be captured by existing methods using multi-omics data.

Download Full-text

An R package for divergence analysis of omics data

PLoS ONE ◽

10.1371/journal.pone.0249002 ◽

2021 ◽

Vol 16 (4) ◽

pp. e0249002

Author(s):

Wikum Dinalankara ◽

Qian Ke ◽

Donald Geman ◽

Luigi Marchionni

Keyword(s):

R Package ◽

The Cancer Genome Atlas ◽

High Dimensional ◽

Omics Data ◽

Ternary Code ◽

Cancer Genome Atlas ◽

Level Analysis ◽

Data Analysis Methods ◽

Genome Atlas ◽

Omics Data Analysis

Given the ever-increasing amount of high-dimensional and complex omics data becoming available, it is increasingly important to discover simple but effective methods of analysis. Divergence analysis transforms each entry of a high-dimensional omics profile into a digitized (binary or ternary) code based on the deviation of the entry from a given baseline population. This is a novel framework that is significantly different from existing omics data analysis methods: it allows digitization of continuous omics data at the univariate or multivariate level, facilitates sample level analysis, and is applicable on many different omics platforms. The divergence package, available on the R platform through the Bioconductor repository collection, provides easy-to-use functions for carrying out this transformation. Here we demonstrate how to use the package with data from the Cancer Genome Atlas.

Download Full-text