scholarly journals Co-expression analysis reveals interpretable gene modules controlled by trans-acting genetic variants

eLife ◽  
2020 ◽  
Vol 9 ◽  
Author(s):  
Liis Kolberg ◽  
Nurlan Kerimov ◽  
Hedi Peterson ◽  
Kaur Alasoo

Understanding the causal processes that contribute to disease onset and progression is essential for developing novel therapies. Although trans-acting expression quantitative trait loci (trans-eQTLs) can directly reveal cellular processes modulated by disease variants, detecting trans-eQTLs remains challenging due to their small effect sizes. Here, we analysed gene expression and genotype data from six blood cell types from 226 to 710 individuals. We used co-expression modules inferred from gene expression data with five methods as traits in trans-eQTL analysis to limit multiple testing and improve interpretability. In addition to replicating three established associations, we discovered a novel trans-eQTL near SLC39A8 regulating a module of metallothionein genes in LPS-stimulated monocytes. Interestingly, this effect was mediated by a transient cis-eQTL present only in early LPS response and lost before the trans effect appeared. Our analyses highlight how co-expression combined with functional enrichment analysis improves the identification and prioritisation of trans-eQTLs when applied to emerging cell-type-specific datasets.

2020 ◽  
Vol 48 (21) ◽  
pp. e124-e124
Author(s):  
Yuzhu Duan ◽  
Daniel S Evans ◽  
Richard A Miller ◽  
Nicholas J Schork ◽  
Steven R Cummings ◽  
...  

Abstract signatureSearch is an R/Bioconductor package that integrates a suite of existing and novel algorithms into an analysis environment for gene expression signature (GES) searching combined with functional enrichment analysis (FEA) and visualization methods to facilitate the interpretation of the search results. In a typical GES search (GESS), a query GES is searched against a database of GESs obtained from large numbers of measurements, such as different genetic backgrounds, disease states and drug perturbations. Database matches sharing correlated signatures with the query indicate related cellular responses frequently governed by connected mechanisms, such as drugs mimicking the expression responses of a disease. To identify which processes are predominantly modulated in the GESS results, we developed specialized FEA methods combined with drug-target network visualization tools. The provided analysis tools are useful for studying the effects of genetic, chemical and environmental perturbations on biological systems, as well as searching single cell GES databases to identify novel network connections or cell types. The signatureSearch software is unique in that it provides access to an integrated environment for GESS/FEA routines that includes several novel search and enrichment methods, efficient data structures, and access to pre-built GES databases, and allowing users to work with custom databases.


2020 ◽  
Vol 20 (1) ◽  
Author(s):  
Haoran Jia ◽  
Zibo Zhang ◽  
Ehsan Sadeghnezhad ◽  
Qianqian Pang ◽  
Shangyun Li ◽  
...  

Abstract Background Grape buds and leaves are directly associated with the physiology and metabolic activities of the plant, which is monitored by epigenetic modifications induced by environment and endogenous factors. Methylation is one of the epigenetic regulators that could be involved in DNA levels and affect gene expression in response to stimuli. Therefore, changes of gene expression profile in leaves and bud through inhibitors of DNA methylation provide a deep understanding of epigenetic effects in regulatory networks. Results In this study, we carried out a transcriptome analysis of ‘Kyoho’ buds and leaves under 5-azacytidine (5-azaC) exposure and screened a large number of differentially expressed genes (DEGs). GO and KEGG annotations showed that they are mainly involved in photosynthesis, flavonoid synthesis, glutathione metabolism, and other metabolic processes. Functional enrichment analysis also provided a holistic perspective on the transcriptome profile when 5-azaC bound to methyltransferase and induced demethylation. Enrichment analysis of transcription factors (TFs) also showed that the MYB, C2H2, and bHLH families are involved in the regulation of responsive genes under epigenetic changes. Furthermore, hormone-related genes have also undergone significant changes, especially gibberellin (GA) and abscisic acid (ABA)-related genes that responded to bud germination. We also used protein-protein interaction network to determine hub proteins in response to demethylation. Conclusions These findings provide new insights into the establishment of molecular regulatory networks according to how methylation as an epigenetic modification alters transcriptome patterns in bud and leaves of grape.


2020 ◽  
Vol 2020 ◽  
pp. 1-16 ◽  
Author(s):  
Yun Zhong ◽  
Zhe Liu ◽  
Dangchi Li ◽  
Qinyuan Liao ◽  
Jingao Li

Background. An increasing number of studies have indicated that the abnormal expression of certain long noncoding RNAs (lncRNAs) is linked to the overall survival (OS) of patients with myeloma. Methods. Gene expression data of myeloma patients were downloaded from the Gene Expression Omnibus (GEO) database (GSE4581 and GSE57317). Cox regression analysis, Kaplan-Meier, and receiver operating characteristic (ROC) analysis were performed to construct and validate the prediction model. Single sample gene set enrichment (ssGSEA) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis were used to predict the function of a specified lncRNA. Results. In this study, a seven-lncRNA signature was identified and used to construct a risk score system for myeloma prognosis. This system was used to stratify patients with different survival rates in the training set into high-risk and low-risk groups. Test set, the entire test set, the external validation set, and the myeloma subtype achieved the authentication of the results. In addition, functional enrichment analysis indicated that 7 prognostic lncRNAs may be involved in the tumorigenesis of myeloma through cancer-related pathways and biological processes. The results of the immune score showed that IF_I was negatively correlated with the risk score. Compared with the published gene signature, the 7-lncRNA model has a higher C-index (above 0.8). Conclusion. In summary, our data provide evidence that seven lncRNAs could be used as independent biomarkers to predict the prognosis of myeloma, which also indicated that these 7 lncRNAs may be involved in the progression of myeloma.


Author(s):  
Liis Kolberg ◽  
Nurlan Kerimov ◽  
Hedi Peterson ◽  
Kaur Alasoo

AbstractBackgroundDeveloping novel therapies for complex disease requires better understanding of the causal processes that contribute to disease onset and progression. Although trans-acting gene expression quantitative trait loci (trans-eQTLs) can be a powerful approach to directly reveal cellular processes modulated by disease variants, detecting trans-eQTLs remains challenging due to their small effect sizes and large number of genes tested. However, if a single trans-eQTL controls a group of co-regulated genes, then multiple testing burden can be greatly reduced by summarising gene expression at the level of co-expression modules prior to trans-eQTL analysis.ResultsWe analysed gene expression and genotype data from six blood cell types from 226 to 710 individuals. We inferred gene co-expression modules with five methods on the full dataset, as well as in each cell type separately. We detected a number of established co-expression module trans-eQTLs, such as the monocyte-specific associations at the IFNB1 and LYZ loci, as well as a platelet-specific ARHGEF3 locus associated with mean platelet volume. We also discovered a novel trans association near the SLC39A8 gene in LPS-stimulated monocytes. Here, we linked an early-response cis-eQTL of the SLC39A8 gene to a module of co-expressed metallothionein genes upregulated more than 20 hours later and used motif analysis to identify zinc-induced activation of the MTF1 transcription factor as a likely mediator of this effect.ConclusionsOur analysis provides a rare detailed characterisation of a trans-eQTL effect cascade from a proximal cis effect to the affected signalling pathway, transcription factor, and target genes. This highlights how co-expression analysis combined with functional enrichment analysis can greatly improve the identification and prioritisation of trans-eQTLs when applied to emerging cell-type specific datasets.


2021 ◽  
Author(s):  
Mohib kakar ◽  
Muhammad Mehboob ◽  
Muhammad Akram ◽  
Imran Iqbal ◽  
Hafza Ijaz ◽  
...  

Abstract Objective The goal of this study was to understand possible core genes associated with hepatocellular carcinoma (HCC) pathogenesis and prognosis. Methods GEO contains datasets of gene expression, miRNA and methylation patterns of diseased and healthy/control patients. GSE62232 Dataset was selected by employing the server Gene Expression Omnibus. A total of 91 samples were collected, including 81 HCC samples and 10 healthy samples as control. GSE62232 was analyzed through GEO2R, and Functional Enrichment Analysis was performed to extract rational information from a set of DEGs. The Protein-Protein Relationship Networking search method has been used for extracting genes interacting. MCC method was used to calculate the top 10 genes according to their importance. Hub genes in the network were analyzed using GEPIA to estimate the effect of their differential expression on cancer progression. Results We identified the top 10 hub genes through Cytohubba plugin. These genes include Cell Cycle Regulatory Cyclins and Cyclin-dependent proteins CCNA2, CCNB1 and CDK1. The pathogenesis and prognosis of HCC may be directly linked with the aforementioned genes. Conclusion In this analysis, we found critical genes for HCC that showed recommendations for more diagnostic and predictive biomarkers studies that could promote selective molecular therapy for HCC.


2021 ◽  
Author(s):  
Mohib kakar ◽  
Muhammad Mehboob ◽  
Muhammad Akram ◽  
Imran Iqbal ◽  
Hafza Ijaz ◽  
...  

Abstract The goal of this study was to understand possible core genes associated with hepatocellular carcinoma (HCC) pathogenesis and prognosis. Gene Expression Omnibus (GEO) contains datasets of gene expression, miRNA and methylation patterns of diseased and healthy/control patients. GSE62232 Dataset was selected by employing the server GEO. A total of 91 samples were collected, including 81 HCC samples and 10 healthy samples as control. GSE62232 was analyzed through GEO2R, and functional enrichment analysis was performed to extract rational information from a set of DEGs. The protein-protein relationship networking search method was used for extracting interacting genes. MCC method was used to calculate the top 10 genes according to their importance. Hub genes in the network were analyzed using GEPIA to estimate the effect of their differential expression on cancer progression. We identified the top 10 hub genes through Cytohubba plugin. These genes include cell cycle regulatory cyclins and cyclin-dependent proteins CCNA2, CCNB1 and CDK1. The pathogenesis and prognosis of HCC may be directly linked with the aforementioned genes. In this analysis, we found critical genes for HCC that showed recommendations for more diagnostic and predictive biomarker studies that could promote selective molecular therapy for HCC.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Stefano Manzini ◽  
Marco Busnelli ◽  
Alice Colombo ◽  
Elsa Franchi ◽  
Pasquale Grossano ◽  
...  

AbstractFunctional enrichment analysis is an analytical method to extract biological insights from gene expression data, popularized by the ever-growing application of high-throughput techniques. Typically, expression profiles are generated for hundreds to thousands of genes/proteins from samples belonging to two experimental groups, and after ad-hoc statistical tests, researchers are left with lists of statistically significant entities, possibly lacking any unifying biological theme. Functional enrichment tackles the problem of putting overall gene expression changes into a broader biological context, based on pre-existing knowledge bases of reference: database collections of known expression regulation, relationships and molecular interactions. STRING is among the most popular tools, providing both protein–protein interaction networks and functional enrichment analysis for any given set of identifiers. For complex experimental designs, manually retrieving, interpreting, analyzing and abridging functional enrichment results is a daunting task, usually performed by hand by the average wet-biology researcher. We have developed reString, a cross-platform software that seamlessly retrieves from STRING functional enrichments from multiple user-supplied gene sets, with just a few clicks, without any need for specific bioinformatics skills. Further, it aggregates all findings into human-readable table summaries, with built-in features to easily produce user-customizable publication-grade clustermaps and bubble plots. Herein, we outline a complete reString protocol, showcasing its features on a real use-case.


2021 ◽  
Vol 8 ◽  
Author(s):  
Ningyuan Chen ◽  
Liu Miao ◽  
Wei Lin ◽  
Donghua Zou ◽  
Ling Huang ◽  
...  

Background: To explore the association of DNA methylation and gene expression in the pathology of obesity.Methods: (1) Genomic DNA methylation and mRNA expression profile of visceral adipose tissue (VAT) were performed in a comprehensive database of gene expression in obese and normal subjects. (2) Functional enrichment analysis and construction of differential methylation gene regulatory networks were performed. (3) Validation of the two different methylation sites and corresponding gene expression was done in a separate microarray dataset. (4) Correlation analysis was performed on DNA methylation and mRNA expression data.Results: A total of 77 differentially expressed mRNAs matched with differentially methylated genes. Analysis revealed two different methylation sites corresponding to two unique genes—s100a8-cg09174555 and s100a9-cg03165378. Through the verification test of two interesting different expression positions [differentially methylated positions (DMPs)] and their corresponding gene expression, we found that methylation in these genes was negatively correlated to gene expression in the obesity group. Higher S100A8 and S100A9 expressions in obese subjects were validated in a separate microarray dataset.Conclusion: This study confirmed the relationship between DNA methylation and gene expression and emphasized the important role of S100A8 and S100A9 in the pathogenesis of obesity.


2019 ◽  
Vol 47 (W1) ◽  
pp. W206-W211 ◽  
Author(s):  
Shaojuan Li ◽  
Changxin Wan ◽  
Rongbin Zheng ◽  
Jingyu Fan ◽  
Xin Dong ◽  
...  

AbstractCharacterizing the ontologies of genes directly regulated by a transcription factor (TF), can help to elucidate the TF’s biological role. Previously, we developed a widely used method, BETA, to integrate TF ChIP-seq peaks with differential gene expression (DGE) data to infer direct target genes. Here, we provide Cistrome-GO, a website implementation of this method with enhanced features to conduct ontology analyses of gene regulation by TFs in human and mouse. Cistrome-GO has two working modes: solo mode for ChIP-seq peak analysis; and ensemble mode, which integrates ChIP-seq peaks with DGE data. Cistrome-GO is freely available at http://go.cistrome.org/.


2020 ◽  
Vol 19 ◽  
pp. 153303382097748
Author(s):  
Shao-wei Zhang ◽  
Nan Zhang ◽  
Na Wang

Background: Esophageal cancer (EC) is a primary malignant tumor originating from the esophageal of the epithelium. Surgical resection is a potential treatment for EC, but this is only appropriate for patients who have locally resectable lesions suitable for surgery. However, most patients with EC are at a late stage when diagnosed. Therefore, there is an urgent need to further explore the pathogenesis of EC to enable early diagnosis and treatment. Methods: Our study downloaded 2 expression spectrum datasets (GSE92396 and GSE100942) in the Gene Expression Omnibus (GEO) database. GEO2 R was used to identify the Differentially expressed genes (DEGs) between the samples of EC and control. Using the DAVID tool to make the Functional enrichment analysis. Constructing A protein–protein interaction (PPI) network. Identifying the Hub genes. The impact of hub gene expression on overall survival and their expression based on immunohistochemistry were analyzed. Associated microRNAs were also predicted. Results: There were 36 common DEGs identified. The analysis of GO and KEGG results shown that the variations were predominantly concentrated in the extracellular matrix (ECM), ECM organization, DNA binding, platelet activation, and ECM-receptor interactions. COL3A1 and POSTN had high expression in EC tissues which was compared with their expression in healthy tissues. Analysis of pathologic stages showed that when COL3A1 and POSTN were highly expressed, the stage of the pathologic of EC patients was relatively high (P < 0.005). Conclusions: COL3A1 and POSTN may play an important role in the advancement and occurrence of EC. These genes could provide some novel ideas and basis for the diagnosis and targeted treatment of EC.


Sign in / Sign up

Export Citation Format

Share Document