scholarly journals Discovering novel driver mutations from pan-cancer analysis of mutational and gene expression profiles

PLoS ONE ◽  
2020 ◽  
Vol 15 (11) ◽  
pp. e0242780
Author(s):  
Houriiyah Tegally ◽  
Kevin H. Kensler ◽  
Zahra Mungloo-Dilmohamud ◽  
Anisah W. Ghoorah ◽  
Timothy R. Rebbeck ◽  
...  

As the genomic profile across cancers varies from person to person, patient prognosis and treatment may differ based on the mutational signature of each tumour. Thus, it is critical to understand genomic drivers of cancer and identify potential mutational commonalities across tumors originating at diverse anatomical sites. Large-scale cancer genomics initiatives, such as TCGA, ICGC and GENIE have enabled the analysis of thousands of tumour genomes. Our goal was to identify new cancer-causing mutations that may be common across tumour sites using mutational and gene expression profiles. Genomic and transcriptomic data from breast, ovarian, and prostate cancers were aggregated and analysed using differential gene expression methods to identify the effect of specific mutations on the expression of multiple genes. Mutated genes associated with the most differentially expressed genes were considered to be novel candidates for driver mutations, and were validated through literature mining, pathway analysis and clinical data investigation. Our driver selection method successfully identified 116 probable novel cancer-causing genes, with 4 discovered in patients having no alterations in any known driver genes: MXRA5, OBSCN, RYR1, and TG. The candidate genes previously not officially classified as cancer-causing showed enrichment in cancer pathways and in cancer diseases. They also matched expectations pertaining to properties of cancer genes, for instance, showing larger gene and protein lengths, and having mutation patterns suggesting oncogenic or tumor suppressor properties. Our approach allows for the identification of novel putative driver genes that are common across cancer sites using an unbiased approach without any a priori knowledge on pathways or gene interactions and is therefore an agnostic approach to the identification of putative common driver genes acting at multiple cancer sites.

BMC Genomics ◽  
2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Yanan Ren ◽  
Ting-You Wang ◽  
Leah C. Anderton ◽  
Qi Cao ◽  
Rendong Yang

Abstract Background Long non-coding RNAs (lncRNAs) are a growing focus in cancer research. Deciphering pathways influenced by lncRNAs is important to understand their role in cancer. Although knock-down or overexpression of lncRNAs followed by gene expression profiling in cancer cell lines are established approaches to address this problem, these experimental data are not available for a majority of the annotated lncRNAs. Results As a surrogate, we present lncGSEA, a convenient tool to predict the lncRNA associated pathways through Gene Set Enrichment Analysis of gene expression profiles from large-scale cancer patient samples. We demonstrate that lncGSEA is able to recapitulate lncRNA associated pathways supported by literature and experimental validations in multiple cancer types. Conclusions LncGSEA allows researchers to infer lncRNA regulatory pathways directly from clinical samples in oncology. LncGSEA is written in R, and is freely accessible at https://github.com/ylab-hi/lncGSEA.


PeerJ ◽  
2019 ◽  
Vol 7 ◽  
pp. e8135 ◽  
Author(s):  
Salma Begum Bhyan ◽  
Li Zhao ◽  
YongKiat Wee ◽  
Yining Liu ◽  
Min Zhao

Endometriosis is a chronic disease occurring during the reproductive stage of women. Although there is only limited association between endometriosis and gynecological cancers with regard to clinical features, the molecular basis of the relationship between these diseases is unexplored. We conducted a systematic study by integrating literature-based evidence, gene expression and large-scale cancer genomics data in order to reveal any genetic relationships between endometriosis and cancers in women. We curated 984 endometriosis-related genes from 3270 PubMed articles and then conducted a meta-analysis of the two public gene expression profiles related to endometriosis which identified Differential Expression of Genes (DEGs). Following an overlapping analysis, we identified 39 key endometriosis-related genes common in both literature and DEG analysis. Finally, the functional analysis confirmed that all the 39 genes were associated with the vital processes of tumour formation and cancer progression and that two genes (PGR and ESR1) were common to four cancers of women. From network analysis, we identified a novel linker gene, C3AR1, which had not been implicated previously in endometriosis. The shared genetic mechanisms of endometriosis and cancers in women identified in this study provided possible new avenues of multiple disease management and treatments through early diagnosis.


BioTechniques ◽  
2019 ◽  
Vol 67 (4) ◽  
pp. 172-176 ◽  
Author(s):  
Mohammed Khurshed ◽  
Remco J Molenaar ◽  
Cornelis JF van Noorden

In biomedical research, large-scale profiling of gene expression has become routine and offers a valuable means to evaluate changes in onset and progression of diseases, in particular cancer. An overwhelming amount of cancer genomics data has become publicly available, and the complexity of these data makes it a challenge to perform in silico data exploration, integration and analysis, in particular for scientists lacking a background in computational programming or informatics. Many web interface tools make these large datasets accessible but are limited to process large datasets. To accelerate the translation of genomic data into new insights, we provide a simple method to explore and select data from cancer genomic datasets to generate gene-expression profiles of subsets that are of specific genetic, biological or clinical interest.


2021 ◽  
Vol 14 (1) ◽  
pp. 41
Author(s):  
Hana Votavova ◽  
Zuzana Urbanova ◽  
David Kundrat ◽  
Michaela Dostalova Merkerova ◽  
Martin Vostry ◽  
...  

Deferasirox (DFX) is an oral iron chelator used to reduce iron overload (IO) caused by frequent blood cell transfusions in anemic myelodysplastic syndrome (MDS) patients. To study the molecular mechanisms by which DFX improves outcome in MDS, we analyzed the global gene expression in untreated MDS patients and those who were given DFX treatment. The gene expression profiles of bone marrow CD34+ cells were assessed by whole-genome microarrays. Initially, differentially expressed genes (DEGs) were determined between patients with normal ferritin levels and those with IO to address the effect of excessive iron on cellular pathways. These DEGs were annotated to Gene Ontology terms associated with cell cycle, apoptosis, adaptive immune response and protein folding and were enriched in cancer-related pathways. The deregulation of multiple cancer pathways in iron-overloaded patients suggests that IO is a cofactor favoring the progression of MDS. The DEGs between patients with IO and those treated with DFX were involved predominantly in biological processes related to the immune response and inflammation. These data indicate DFX modulates the immune response mainly via neutrophil-related genes. Suppression of negative regulators of blood cell differentiation essential for cell maturation and upregulation of heme metabolism observed in DFX-treated patients may contribute to the hematopoietic improvement.


Neurology ◽  
2017 ◽  
Vol 89 (16) ◽  
pp. 1676-1683 ◽  
Author(s):  
Ron Shamir ◽  
Christine Klein ◽  
David Amar ◽  
Eva-Juliane Vollstedt ◽  
Michael Bonin ◽  
...  

Objective:To examine whether gene expression analysis of a large-scale Parkinson disease (PD) patient cohort produces a robust blood-based PD gene signature compared to previous studies that have used relatively small cohorts (≤220 samples).Methods:Whole-blood gene expression profiles were collected from a total of 523 individuals. After preprocessing, the data contained 486 gene profiles (n = 205 PD, n = 233 controls, n = 48 other neurodegenerative diseases) that were partitioned into training, validation, and independent test cohorts to identify and validate a gene signature. Batch-effect reduction and cross-validation were performed to ensure signature reliability. Finally, functional and pathway enrichment analyses were applied to the signature to identify PD-associated gene networks.Results:A gene signature of 100 probes that mapped to 87 genes, corresponding to 64 upregulated and 23 downregulated genes differentiating between patients with idiopathic PD and controls, was identified with the training cohort and successfully replicated in both an independent validation cohort (area under the curve [AUC] = 0.79, p = 7.13E–6) and a subsequent independent test cohort (AUC = 0.74, p = 4.2E–4). Network analysis of the signature revealed gene enrichment in pathways, including metabolism, oxidation, and ubiquitination/proteasomal activity, and misregulation of mitochondria-localized genes, including downregulation of COX4I1, ATP5A1, and VDAC3.Conclusions:We present a large-scale study of PD gene expression profiling. This work identifies a reliable blood-based PD signature and highlights the importance of large-scale patient cohorts in developing potential PD biomarkers.


2008 ◽  
Vol 5 (2) ◽  
Author(s):  
Li Teng ◽  
Laiwan Chan

SummaryTraditional analysis of gene expression profiles use clustering to find groups of coexpressed genes which have similar expression patterns. However clustering is time consuming and could be diffcult for very large scale dataset. We proposed the idea of Discovering Distinct Patterns (DDP) in gene expression profiles. Since patterns showing by the gene expressions reveal their regulate mechanisms. It is significant to find all different patterns existing in the dataset when there is little prior knowledge. It is also a helpful start before taking on further analysis. We propose an algorithm for DDP by iteratively picking out pairs of gene expression patterns which have the largest dissimilarities. This method can also be used as preprocessing to initialize centers for clustering methods, like K-means. Experiments on both synthetic dataset and real gene expression datasets show our method is very effective in finding distinct patterns which have gene functional significance and is also effcient.


2005 ◽  
Vol 289 (4) ◽  
pp. L545-L553 ◽  
Author(s):  
Joseph Zabner ◽  
Todd E. Scheetz ◽  
Hakeem G. Almabrazi ◽  
Thomas L. Casavant ◽  
Jian Huang ◽  
...  

Cystic fibrosis (CF) is caused by mutations in the cystic fibrosis transmembrane conductance regulator (CFTR), an epithelial chloride channel regulated by phosphorylation. Most of the disease-associated morbidity is the consequence of chronic lung infection with progressive tissue destruction. As an approach to investigate the cellular effects of CFTR mutations, we used large-scale microarray hybridization to contrast the gene expression profiles of well-differentiated primary cultures of human CF and non-CF airway epithelia grown under resting culture conditions. We surveyed the expression profiles for 10 non-CF and 10 ΔF508 homozygote samples. Of the 22,283 genes represented on the Affymetrix U133A GeneChip, we found evidence of significant changes in expression in 24 genes by two-sample t-test ( P < 0.00001). A second, three-filter method of comparative analysis found no significant differences between the groups. The levels of CFTR mRNA were comparable in both groups. There were no significant differences in the gene expression patterns between male and female CF specimens. There were 18 genes with significant increases and 6 genes with decreases in CF relative to non-CF samples. Although the function of many of the differentially expressed genes is unknown, one transcript that was elevated in CF, the KCl cotransporter (KCC4), is a candidate for further study. Overall, the results indicate that CFTR dysfunction has little direct impact on airway epithelial gene expression in samples grown under these conditions.


Author(s):  
Gustavo Deco ◽  
Kevin Aquino ◽  
Aurina Arnatkevičiūtė ◽  
Stuart Oldham ◽  
Kristina Sabaroedin ◽  
...  

AbstractBrain regions vary in their molecular and cellular composition, but how this heterogeneity shapes neuronal dynamics is unclear. Here, we investigate the dynamical consequences of regional heterogeneity using a biophysical model of whole-brain functional magnetic resonance imaging (MRI) dynamics in humans. We show that models in which transcriptional variations in excitatory and inhibitory receptor (E:I) gene expression constrain regional heterogeneity more accurately reproduce the spatiotemporal structure of empirical functional connectivity estimates than do models constrained by global gene expression profiles and MRI-derived estimates of myeloarchitecture. We further show that regional heterogeneity is essential for yielding both ignition-like dynamics, which are thought to support conscious processing, and a wide variance of regional activity timescales, which supports a broad dynamical range. We thus identify a key role for E:I heterogeneity in generating complex neuronal dynamics and demonstrate the viability of using transcriptional data to constrain models of large-scale brain function.


2021 ◽  
Vol 12 ◽  
Author(s):  
Kaisong Bai ◽  
Tong Zhao ◽  
Yilong Li ◽  
Xinjian Li ◽  
Zhantian Zhang ◽  
...  

Pancreatic adenocarcinoma (PAAD) is one of the deadliest malignancies and mortality for PAAD have remained increasing under the conditions of substantial improvements in mortality for other major cancers. Although multiple of studies exists on PAAD, few studies have dissected the oncogenic mechanisms of PAAD based on genomic variation. In this study, we integrated somatic mutation data and gene expression profiles obtained by high-throughput sequencing to characterize the pathogenesis of PAAD. The mutation profile containing 182 samples with 25,470 somatic mutations was obtained from The Cancer Genome Atlas (TCGA). The mutation landscape was generated and somatic mutations in PAAD were found to have preference for mutation location. The combination of mutation matrix and gene expression profiles identified 31 driver genes that were closely associated with tumor cell invasion and apoptosis. Co-expression networks were constructed based on 461 genes significantly associated with driver genes and the hub gene FAM133A in the network was identified to be associated with tumor metastasis. Further, the cascade relationship of somatic mutation-Long non-coding RNA (lncRNA)-microRNA (miRNA) was constructed to reveal a new mechanism for the involvement of mutations in post-transcriptional regulation. We have also identified prognostic markers that are significantly associated with overall survival (OS) of PAAD patients and constructed a risk score model to identify patients’ survival risk. In summary, our study revealed the pathogenic mechanisms and prognostic markers of PAAD providing theoretical support for the development of precision medicine.


2020 ◽  
Author(s):  
Mengying Sun ◽  
Rama Shankar ◽  
Meehyun Ko ◽  
Christopher Daniel Chang ◽  
Shan-Ju Yeh ◽  
...  

Abstract Epidemiological studies suggest that men exhibit a higher mortality rate to COVID-19 than women, yet the underlying biology is largely unknown. Here, we seek to delineate sex differences in the gene expression of viral entry proteins ACE2 and TMPRSS2, and host transcriptional responses to SARS-CoV-2 through large-scale analysis of genomic and clinical data. We first compiled 220,000 human gene expression profiles from three databases and completed the meta-information through machine learning and manual annotation. Large scale analysis of these profiles indicated that male samples show higher expression levels of ACE2 and TMPRSS2 than female samples, especially in the older group (>60 years) and in the kidney. Subsequent analysis of 6,031 COVID-19 patients at Mount Sinai Health System revealed that men have significantly higher creatinine levels, an indicator of impaired kidney function. Further analysis of 782 COVID-19 patient gene expression profiles taken from upper airway and blood suggested men and women present distinct expression changes. Computational deconvolution analysis of these profiles revealed male COVID-19 patients have enriched kidney-specific mesangial cells in blood compared to healthy patients. Together, this study suggests biological differences in the kidney between sexes may contribute to sex disparity in COVID-19.


Sign in / Sign up

Export Citation Format

Share Document