scholarly journals Uncovering Potential Therapeutic Targets in Colorectal Cancer by Deciphering Mutational Status and Expression of Druggable Oncogenes

Cancers ◽  
2019 ◽  
Vol 11 (7) ◽  
pp. 983 ◽  
Author(s):  
Otília Menyhart ◽  
Tatsuhiko Kakisaka ◽  
Lőrinc Sándor Pongor ◽  
Hiroyuki Uetake ◽  
Ajay Goel ◽  
...  

Background: Numerous driver mutations have been identified in colorectal cancer (CRC), but their relevance to the development of targeted therapies remains elusive. The secondary effects of pathogenic driver mutations on downstream signaling pathways offer a potential approach for the identification of therapeutic targets. We aimed to identify differentially expressed genes as potential drug targets linked to driver mutations. Methods: Somatic mutations and the gene expression data of 582 CRC patients were utilized, incorporating the mutational status of 39,916 and the expression levels of 20,500 genes. To uncover candidate targets, the expression levels of various genes in wild-type and mutant cases for the most frequent disruptive mutations were compared with a Mann–Whitney test. A survival analysis was performed in 2100 patients with transcriptomic gene expression data. Up-regulated genes associated with worse survival were filtered for potentially actionable targets. The most significant hits were validated in an independent set of 171 CRC patients. Results: Altogether, 426 disruptive mutation-associated upregulated genes were identified. Among these, 95 were linked to worse recurrence-free survival (RFS). Based on the druggability filter, 37 potentially actionable targets were revealed. We selected seven genes and validated their expression in 171 patient specimens. The best independently validated combinations were DUSP4 (p = 2.6 × 10−12) in ACVR2A mutated (7.7%) patients; BMP4 (p = 1.6 × 10−04) in SOX9 mutated (8.1%) patients; TRIB2 (p = 1.35 × 10−14) in ACVR2A mutated patients; VSIG4 (p = 2.6 × 10−05) in ANK3 mutated (7.6%) patients, and DUSP4 (p = 7.1 × 10−04) in AMER1 mutated (8.2%) patients. Conclusions: The results uncovered potentially druggable genes in colorectal cancer. The identified mutations could enable future patient stratification for targeted therapy.

2010 ◽  
Vol 9 (1) ◽  
pp. 100 ◽  
Author(s):  
Marianne Berg ◽  
Trude H Agesen ◽  
Espen Thiis-Evensen ◽  
INFAC-study group [infac] ◽  
Marianne A Merok ◽  
...  

2021 ◽  
Author(s):  
Huan-Huan Wei ◽  
Hui Lu ◽  
Hongyu Zhao

AbstractMany computational methods have been developed for inferring causality among genes using cross-sectional gene expression data, such as single-cell RNA sequencing (scRNA-seq) data. However, due to the limitations of scRNA-seq technologies, time-lagged causal relationships may be missed by existing methods. In this work, we propose a method, called causal inference with time-lagged information (CITL), to infer time-lagged causal relationships from scRNA-seq data by assessing conditional independence between the changing and current expression levels of genes. CITL estimates the changing expression levels of genes by “RNA velocity”. We demonstrate the accuracy and stability of CITL for inferring time-lagged causality on simulation data against other leading approaches. We have applied CITL to real scRNA data and inferred 878 pairs of time-lagged causal relationships, with many of these inferred results supported by the literature.Author summaryComputational causal inference is a promising way to survey causal relationships between genes efficiently. Though many causal inference methods have been applied to gene expression data, none considers the time-lagged causal relationship, which means that some genes may take some time to affect their target genes with several reactions. If relationships between genes are time-lagged, the existing methods’ assumptions will be violated. The relationships will be challenging to recognize. We demonstrate that this is indeed the case through simulation. Therefore, we develop a method for inferring time-lagged causal relationships of single-cell gene expression data. We assume that a time-lagged causal relationship should present a strong association between the cause and the effect’s changing. To calculate such correlation, we first estimate the derivative of gene expression using the information from unspliced transcripts. Then, we use conditional independent tests to search gene pairs satisfying our assumption. Our results suggest that we could accurately infer time-lagged causal gene pairs validated by published literature. This method may complement gene regulatory analysis and provide candidate gene pairs for further controlled experiments.


BMC Genomics ◽  
2019 ◽  
Vol 20 (1) ◽  
Author(s):  
Yuanyuan Li ◽  
David M. Umbach ◽  
Adrienna Bingham ◽  
Qi-Jing Li ◽  
Yuan Zhuang ◽  
...  

Abstract Background Tumor purity is the percent of cancer cells present in a sample of tumor tissue. The non-cancerous cells (immune cells, fibroblasts, etc.) have an important role in tumor biology. The ability to determine tumor purity is important to understand the roles of cancerous and non-cancerous cells in a tumor. Methods We applied a supervised machine learning method, XGBoost, to data from 33 TCGA tumor types to predict tumor purity using RNA-seq gene expression data. Results Across the 33 tumor types, the median correlation between observed and predicted tumor-purity ranged from 0.75 to 0.87 with small root mean square errors, suggesting that tumor purity can be accurately predicted υσινγ expression data. We further confirmed that expression levels of a ten-gene set (CSF2RB, RHOH, C1S, CCDC69, CCL22, CYTIP, POU2AF1, FGR, CCL21, and IL7R) were predictive of tumor purity regardless of tumor type. We tested whether our set of ten genes could accurately predict tumor purity of a TCGA-independent data set. We showed that expression levels from our set of ten genes were highly correlated (ρ = 0.88) with the actual observed tumor purity. Conclusions Our analyses suggested that the ten-gene set may serve as a biomarker for tumor purity prediction using gene expression data.


2019 ◽  
Vol 15 (2) ◽  
pp. e1006826 ◽  
Author(s):  
David G. P. van IJzendoorn ◽  
Karoly Szuhai ◽  
Inge H. Briaire-de Bruijn ◽  
Marie Kostine ◽  
Marieke L. Kuijjer ◽  
...  

Blood ◽  
2013 ◽  
Vol 122 (21) ◽  
pp. 2779-2779 ◽  
Author(s):  
Andrea Pellagatti ◽  
Moritz Gerstung ◽  
Elli Papaemmanuil ◽  
Luca Malcovati ◽  
Aristoteles Giagounidis ◽  
...  

Abstract A particular profile of gene expression can reflect an underlying molecular abnormality in malignancy. Distinct gene expression profiles and deregulated gene pathways can be driven by specific gene mutations and may shed light on the biology of the disease and lead to the identification of new therapeutic targets. We selected 143 cases from our large-scale gene expression profiling (GEP) dataset on bone marrow CD34+ cells from patients with myelodysplastic syndromes (MDS), for which matching genotyping data were obtained using next-generation sequencing of a comprehensive list of 111 genes involved in myeloid malignancies (including the spliceosomal genes SF3B1, SRSF2, U2AF1 and ZRSR2, as well as TET2, ASXL1and many other). The GEP data were then correlated with the mutational status to identify significantly differentially expressed genes associated with each of the most common gene mutations found in MDS. The expression levels of the mutated genes analyzed were generally lower in patients carrying a mutation than in patients wild-type for that gene (e.g. SF3B1, ASXL1 and TP53), with the exception of RUNX1 for which patients carrying a mutation showed higher expression levels than patients without mutation. Principal components analysis showed that the main directions of gene expression changes (principal components) tend to coincide with some of the common gene mutations, including SF3B1, SRSF2 and TP53. SF3B1 and STAG2 were the mutated genes showing the highest number of associated significantly differentially expressed genes, including ABCB7 as differentially expressed in association with SF3B1 mutation and SULT2A1 in association with STAG2 mutation. We found distinct differentially expressed genes associated with the four most common splicing gene mutations (SF3B1, SRSF2, U2AF1 and ZRSR2) in MDS, suggesting that different phenotypes associated with these mutations may be driven by different effects on gene expression and that the target gene may be different. We have also evaluated the prognostic impact of the GEP data in comparison with that of the genotype data and importantly we have found a larger contribution of gene expression data in predicting progression free survival compared to mutation-based multivariate survival models. In summary, this analysis correlating gene expression data with genotype data has revealed that the mutational status shapes the gene expression landscape. We have identified deregulated genes associated with the most common gene mutations in MDS and found that the prognostic power of gene expression data is greater than the prognostic power provided by mutation data. AP and MG contributed equally to this work. JB and PJC are co-senior authors. Disclosures: No relevant conflicts of interest to declare.


2013 ◽  
Vol 31 (4_suppl) ◽  
pp. 383-383
Author(s):  
Martin K. H. Maus ◽  
Craig Stephens ◽  
Stephanie H. Astrow ◽  
Peter Philipp Grimminger ◽  
Dongyun Yang ◽  
...  

383 Background: Gene expression levels of ERCC1, TS, EGFR and VEGFR2 may have predictive value for the personalized use of standard chemotherapeutics as well as agents targeting the EGFR and VEGF pathways and the efficacy of EGFR directed monoclonal antibodies like panitumumab and cetuximab has been confirmed to be dependent on wt KRAS and wt BRAF in patients with advanced colorectal cancer. We investigated the correlations between KRAS/BRAF mutational status and the mRNA expression levels of these genes. Methods: Formalin-fixed paraffin-embedded tumor specimens from 600 patients with advanced colorectal adenocarcinoma were microdissected and DNA and RNA was extracted. Specifically designed primers and probes were used to detect 7 different base substitutions in codon 12 and 13 of KRAS, V600E mutations in BRAF and the expression levels of ERCC1, TS, EGFR and VEGFR2 by RT-PCR. Results: Mt KRAS tumors had significantly lower TS and EGFR gene expression levels compared with wt KRAS (p<0,001), whereas mt BRAF tumors showed significantly increased TS and EGFR mRNA levels compared to wt BRAF (p<0,001). Mt BRAF tumors showed significantly higher mRNA levels than mt KRAS tumors (p<0,001). ERCC1 and VEGFR2 mRNA levels were significantly down-regulated in mt KRAS specimen (p<0,001), but showed no significant correlation with BRAF mutational status. Conclusions: KRAS and BRAF mutations are associated with opposite mRNA expression levels for TS and EGFR. Recently, resistance to BRAF inhibition in mt BRAF colorectal tumors has been shown in preclinical models to be associated with up-regulation of EGFR. Our data suggests that BRAF mutants are associated with high EGFR levels at the time of diagnosis, and not necessarily part of an acquired mechanism of resistance. Significantly lower mRNA expression levels of VEGFR2 in mt KRAS tumors may explain lower response to angiogenesis inhibition seen in the TML study.


2020 ◽  
Vol 38 (15_suppl) ◽  
pp. e13506-e13506
Author(s):  
Li Chen ◽  
Rajesh Patidar ◽  
Biswajit Das ◽  
Yvonne A Evrard ◽  
Chris Alan Karlovich ◽  
...  

e13506 Background: The National Cancer Institute has developed a repository of preclinical models [Patient-Derived Models Repository (NCI PDMR, https://pdmr.cancer.gov )] including patient derived xenografts (PDXs), organoids (PDOrgs) and in vitro tumor cultures (PDCs) from patients with solid tumor cancer histologies. A subset of these preclinical models is derived from post-mortem collections from rapid autopsies representing the end point in disease progression. Clinical annotations and genomic datasets associated with these models provide a unique opportunity to study tumor evolution, mechanistic insights into the metastatic process, and treatment resistance. Methods: To date, 43 PDXs, 21 PDCs, and 23 PDOrgs using rapid autopsy specimens from 8 primary and 35 metastatic sites of 18 patients have been developed by the Biological Testing Branch (DTP, DCTD, NCI Frederick, MD) for the PDMR. Whole exome (WES) and total transcriptome (RNASeq) data were processed to generate mutation, copy number alteration (CNA) and gene expression data. Multi-model lineage trees were reconstructed based on putative somatic variants for all the models derived from all patients. The fraction of the genome affected by CNA was compared both within and across PDX models. Results: Most of the rapid autopsy PDX models (32/43) are derived from pancreatic adenocarcinoma (PAAD) patients (13/18), with metastatic specimens originating from sites including liver, colon, omentum, and lung. Driver mutations are present in all preclinical model specimens derived from the same patient. For instance, KRAS p.G12D is present in all patient-derived model specimens derived from PAAD patient 521955. The fraction of the genome affected by CNA remains stable within a PDX model across passages (n = 24, mean = 6.39%, sd = 5.90%). However, we found that this increased when comparing PDX models derived from metastatic sites versus the primary site (n = 19, mean = 16.92%, sd = 10.46%). This indicates presence of tumor heterogeneity between metastatic and primary sites. The lineage tree for models from patient 521955 indicates that one liver metastasis has a unique seeding event compared to the other 4 metastatic sites. Unsupervised clustering analysis on gene expression data also confirms the observed tumor site relationships. Conclusions: Our data demonstrate the potential use of these preclinical models available from the NCI PDMR. These models provide a unique resource for preclinical studies in tumor evolution, metastatic spread mediators, and drug resistance.


2017 ◽  
Author(s):  
Kathleen M. Chen ◽  
Jie Tan ◽  
Gregory P. Way ◽  
Georgia Doing ◽  
Deborah A. Hogan ◽  
...  

AbstractBackgroundInvestigators often interpret genome-wide data by analyzing the expression levels of genes within pathways. While this within-pathway analysis is routine, the products of any one pathway can affect the activity of other pathways. Past efforts to identify relationships between biological processes have evaluated overlap in knowledge bases or evaluated changes that occur after specific treatments. Individual experiments can highlight condition-specific pathway-pathway relationships; however, constructing a complete network of such relationships across many conditions requires analyzing results from many studies.ResultsWe developed PathCORE-T framework by implementing existing methods to identify pathway-pathway transcriptional relationships evident across a broad data compendium. PathCORE-T is applied to the output of feature construction algorithms; it identifies pairs of pathways observed in features more than expected by chance as functionally co-occurring. We demonstrate PathCORE-T by analyzing an existing eADAGE model of a microbial compendium and building and analyzing NMF features from the TCGA dataset of 33 cancer types. The PathCORE-T framework includes a demonstration web interface, with source code, that users can launch to (1) visualize the network and (2) review the expression levels of associated genes in the original data. PathCORE-T creates and displays the network of globally co-occurring pathways based on features observed in a machine learning analysis of gene expression data.ConclusionsThe PathCORE-T framework identifies transcriptionally co-occurring pathways from the results of unsupervised analysis of gene expression data and visualizes the relationships between pathways as a network. PathCORE-T recapitulated previously described pathway-pathway relationships and suggested experimentally testable additional hypotheses that remain to be explored.


2018 ◽  
Author(s):  
Ferenc Tajti ◽  
Christoph Kuppe ◽  
Asier Antoranz ◽  
Mahmoud M. Ibrahim ◽  
Hyojin Kim ◽  
...  

AbstractTo develop efficient therapies and identify novel early biomarkers for chronic kidney disease an understanding of the molecular mechanisms orchestrating it is essential. We here set out to understand how differences in CKD origin are reflected in gene expression. To this end, we integrated publicly available human glomerular microarray gene expression data for nine kidney disease entities that account for a majority of CKD worldwide. We included data from five distinct studies and compared glomerular gene expression profiles to that of non-tumor parts of kidney cancer nephrectomy tissues. A major challenge was the integration of the data from different sources, platforms and conditions, that we mitigated with a bespoke stringent procedure. This allowed us to perform a global transcriptome-based delineation of different kidney disease entities, obtaining a landscape of their similarities and differences based on the genes that acquire a consistent differential expression between each kidney disease entity and nephrectomy tissue. Furthermore, we derived functional insights by inferring activity of signaling pathways and transcription factors from the collected gene expression data, and identified potential drug candidates based on expression signature matching. We validated representative findings by immunostaining in human kidney biopsies indicating e.g. that the transcription factor FOXM1 is significantly and specifically expressed in parietal epithelial cells in RPGN whereas not expressed in control kidney tissue. These results provide a foundation to comprehend the specific molecular mechanisms underlying different kidney disease entities, that can pave the way to identify biomarkers and potential therapeutic targets. To facilitate this, we provide our results as a free interactive web application: https://saezlab.shinyapps.io/ckd_landscape/.Translational StatementChronic kidney disease is a combination of entities with different etiologies. We integrate and analyse transcriptomics analysis of glomerular from different entities to dissect their different pathophysiology, what might help to identify novel entity-specific therapeutic targets.


2015 ◽  
Author(s):  
Andrew Anand Brown ◽  
Zhihao Ding ◽  
Ana Viñuela ◽  
Dan Glass ◽  
Leopold Parts ◽  
...  

Statistical factor analysis methods have previously been used to remove noise components from high dimensional data prior to genetic association mapping, and in a guided fashion to summarise biologically relevant sources of variation. Here we show how the derived factors summarising pathway expression can be used to analyse the relationships between expression, heritability and ageing. We used skin gene expression data from 647 twins from the MuTHER Consortium and applied factor analysis to concisely summarise patterns of gene expression, both to remove broad confounding influences and to produce concise pathway-level phenotypes. We derived 930 "pathway phenotypes" which summarised patterns of variation across 186 KEGG pathways (five phenotypes per pathway). We identified 69 significant associations of age with phenotype from 57 distinct KEGG pathways at a stringent Bonferroni threshold (P<5.38E-5). These phenotypes are more heritable (h^2=0.32) than gene expression levels. On average, expression levels of 16% of genes within these pathways are associated with age. Several significant pathways relate to metabolising sugars and fatty acids, others with insulin signalling. We have demonstrated that factor analysis methods combined with biological knowledge can produce more reliable phenotypes with less stochastic noise than the individual gene expression levels, which increases our power to discover biologically relevant associations. These phenotypes could also be applied to discover associations with other environmental factors.


Sign in / Sign up

Export Citation Format

Share Document