scholarly journals Interpretation of biological experiments changes with evolution of Gene Ontology and its annotations

2017 ◽  
Author(s):  
Aurelie Tomczak ◽  
Jonathan M. Mortensen ◽  
Rainer Winnenburg ◽  
Charles Liu ◽  
Dominique T. Alessi ◽  
...  

ABSTRACTGene Ontology (GO) enrichment analysis is ubiquitously used for interpreting high throughput molecular data and generating hypotheses about underlying biological phenomena of experiments. However, the two building blocks of this analysis — the ontology and the annotations — evolve rapidly. We used gene signatures derived from 104 disease analyses to systematically evaluate how enrichment analysis results were affected by evolution of the GO over a decade. We found low consistency between enrichment analyses results obtained with early and more recent GO versions. Furthermore, there continues to be strong annotation bias in the GO annotations where 58% of the annotations are for 16% of the human genes. Our analysis suggests that GO evolution may have affected the interpretation and possibly reproducibility of experiments over time. Hence, researchers must exercise caution when interpreting GO enrichment analyses and should reexamine previous analyses with the most recent GO version.

2020 ◽  
Author(s):  
Vijayakrishna Kolur ◽  
Basavaraj Vastrad ◽  
Chanabasayya Vastrad ◽  
Iranna Kotturshetti ◽  
Anandkumar Tengli

Abstract BackgroundCoronary artery disease (CAD) is one of the most common disorders in the cardiovascular system. This study aims to explore potential signaling pathways and important biomarkers that drive CAD development. MethodsThe CAD GEO Dataset GSE113079 was featured to screen differentially expressed genes (DEGs). The pathway and Gene Ontology (GO) enrichment analysis of DEGs were analyzed using the ToppGene. We screened hub and target genes from protein-protein interaction (PPI) networks, target gene - miRNA regulatory network and target gene - TF regulatory network, and Cytoscape software. Validations of hub genes were performed to evaluate their potential prognostic and diagnostic value for CAD. Results1,036 DEGs were captured according to screening criteria (525upregulated genes and 511downregulated genes). Pathway and Gene Ontology (GO) enrichment analysis of DEGs revealed that these up and down regulated genes are mainly enriched in thyronamine and iodothyronamine metabolism, cytokine-cytokine receptor interaction, nervous system process, cell cycle and nuclear membrane. Hub genes were validated to find out potential prognostic biomarkers, diagnostic biomarkers and novel therapeutic target for CAD. ConclusionsIn summary, our findings discovered pivotal gene expression signatures and signaling pathways in the progression of CAD. CAPN13, ACTBL2, ERBB3, GATA4, GNB4, NOTCH2, EXOSC10, RNF2, PSMA1 and PRKAA1 might contribute to the progression of CAD, which could have potential as biomarkers or therapeutic targets for CAD.


2019 ◽  
Vol 20 (S18) ◽  
Author(s):  
Jiajie Peng ◽  
Guilin Lu ◽  
Hansheng Xue ◽  
Tao Wang ◽  
Xuequn Shang

Abstract Background The Gene Ontology (GO) knowledgebase is the world’s largest source of information on the functions of genes. Since the beginning of GO project, various tools have been developed to perform GO enrichment analysis experiments. GO enrichment analysis has become a commonly used method of gene function analysis. Existing GO enrichment analysis tools do not consider tissue-specific information, although this information is very important to current research. Results In this paper, we built an easy-to-use web tool called TS−GOEA that allows users to easily perform experiments based on tissue-specific GO enrichment analysis. TS−GOEA uses strict threshold statistical method for GO enrichment analysis, and provides statistical tests to improve the reliability of the analysis results. Meanwhile, TS−GOEA provides tools to compare different experimental results, which is convenient for users to compare the experimental results. To evaluate its performance, we tested the genes associated with platelet disease with TS−GOEA. Conclusions TS−GOEA is an effective GO analysis tool with unique features. The experimental results show that our method has better performance and provides a useful supplement for the existing GO enrichment analysis tools. TS−GOEA is available at http://120.77.47.2:5678.


2020 ◽  
Author(s):  
Vijayakrishna Kolur ◽  
Basavaraj Vastrad ◽  
Anandkumar Tengli ◽  
Chanabasayya Vastrad ◽  
Iranna Kotturshetti

Abstract Coronary artery disease (CAD) is one of the most common disorders in the cardiovascular system. This study aims to explore potential signaling pathways and important biomarkers that drive CAD development. The CAD GEO Dataset GSE113079 was featured to screen differentially expressed genes (DEGs). The pathway and Gene Ontology (GO) enrichment analysis of DEGs were analyzed using the ToppGene. We screened hub and target genes from protein-protein interaction (PPI) networks, target gene - miRNA regulatory network and target gene - TF regulatory network, and Cytoscape software. Validations of hub genes were performed to evaluate their potential prognostic and diagnostic value for CAD. A final, molecular docking study was performed. 1,036 DEGs were captured according to screening criteria (525upregulated genes and 511downregulated genes). Pathway and Gene Ontology (GO) enrichment analysis of DEGs revealed that these up and down regulated genes are mainly enriched in thyronamine and iodothyronamine metabolism, cytokine-cytokine receptor interaction, nervous system process, cell cycle and nuclear membrane. Hub genes were validated to find out potential prognostic biomarkers, diagnostic biomarkers and novel therapeutic target for CAD. A small drug molecule was predicted. In summary, our findings discovered pivotal gene expression signatures and signaling pathways in the progression of CAD. CAPN13, ACTBL2, ERBB3, GATA4, GNB4, NOTCH2, EXOSC10, RNF2, PSMA1 and PRKAA1 might contribute to the progression of CAD, which could have potential as biomarkers or therapeutic targets for CAD.


2020 ◽  
Author(s):  
Basavaraj Vastrad ◽  
Chanabasayya Vastrad ◽  
Iranna Kotturshetti

AbstractHepatoblastoma is the childhood liver cancer. Profound efforts have been made to illuminate the pathology, but the molecular mechanisms of hepatoblastoma are still not well understood. To identify the candidate genes in the carcinogenesis and progression of hepatoblastoma, microarray dataset GSE131329 was downloaded from Gene Expression Omnibus (GEO) database. The differentially expressed genes (DEGs) were identified, and pathway and Gene Ontology (GO) enrichment analysis were performed. The protein-protein interaction network (PPI), module analysis, target gene - miRNA regulatory network and target gene - TF regulatory network were constructed and analyzed. A total of 996 DEGs were identified, consisting of 499 up regulated genes and 497 down regulated genes. The pathway and Gene Ontology (GO) enrichment analysis of the DEGs include proline biosynthesis, superpathway of tryptophan utilization, chromosome organization and organic acid metabolic process. Twenty-four hub genes were identified and biological process analysis revealed that these genes were mainly enriched in cell cycle, chromosome organization, lipid metabolic process and oxidation-reduction process. Validation of hub genes showed that TP53, PLK1, AURKA, CDK1, ANLN, ESR1, FGB, ACAT1, GOT1 and ALAS1 may be involved in the carcinogenesis, invasion or recurrence of hepatoblastoma. In conclusion, DEGs and hub genes identified in the present study help us understand the molecular mechanisms underlying the carcinogenesis and progression of hepatoblastoma, and provide candidate targets for diagnosis and treatment of hepatoblastoma.


2019 ◽  
Author(s):  
Robert Ietswaart ◽  
Benjamin M. Gyori ◽  
John A. Bachman ◽  
Peter K. Sorger ◽  
L. Stirling Churchman

AbstractThe primary bottleneck in high-throughput genomics experiments is identifying the most important genes and their relevant functions from a list of gene hits. Existing methods such as Gene Ontology (GO) enrichment analysis provide insight at the gene set level. For individual genes, GO annotations are static and biological context can only be added by manual literature searches. Here, we introduce GeneWalk (github.com/churchmanlab/genewalk), a method that identifies individual genes and their relevant functions under a particular experimental condition. After automatic assembly of an experiment-specific gene regulatory network, GeneWalk quantifies the similarity between vector representations of each gene and its GO annotations through representation learning, yielding annotation significance scores that reflect their functional relevance for the experimental context. We demonstrate the use of GeneWalk analysis of RNA-seq and nascent transcriptome (NET-seq) data from human cells and mouse brains, validating the methodology. By performing gene- and condition-specific functional analysis that converts a list of genes into data-driven hypotheses, GeneWalk accelerates the interpretation of high-throughput genetics experiments.


2020 ◽  
Author(s):  
Vikrant Ghatnatti ◽  
Basavaraj Vastrad ◽  
Swetha Patil ◽  
Chanabasayya Vastrad ◽  
Iranna Kotturshetti

AbstractPituitary prolactinoma is one of the most complicated and fatally pathogenic pituitary adenomas. Therefore, there is an urgent need to improve our understanding of the underlying molecular mechanism that drives the initiation, progression, and metastasis of pituitary prolactinoma. The aim of the present study was to identify the key genes and signaling pathways associated with pituitary prolactinoma using bioinformatics analysis. Transcriptome microarray dataset GSE119063 was acquired from Gene Expression Omnibus datasets, which included 5 pituitary prolactinoma samples and 4 normal pituitaries samples. We screened differentially expressed genes (DEGs) with limma and investigated their biological function by pathway and Gene Ontology (GO) enrichment analysis. A protein-protein interaction (PPI) network of the up and down DEGs were constructed and analyzed by HIPPIE and Cytoscape software. Module analyses were performed. In addition, a target gene - miRNA network and target gene - TF network of the up and down DEGs were constructed by NetworkAnalyst and Cytoscape software. The set of DEGs exhibited an intersection consisting of 989 genes (461 up-regulated and 528 down-regulated), which may be associated with pituitary prolactinoma. Pathway enrichment analysis showed that the 989 DEGs were significantly enriched in the retinoate biosynthesis II, signaling pathways regulating pluripotency of stem cells, ALK2 signaling events, vitamin D3 biosynthesis, cell cycle and aurora B signaling. Gene Ontology (GO) enrichment analysis also showed that sensory organ morphogenesis, extracellular matrix, hormone activity, nuclear division, condensed chromosome and microtubule binding. In the PPI network and modules, SOX2, PRSS45, CLTC, PLK1, B4GALT6, RUNX1 and GTSE1 were considered as hub genes. In the target gene miRNA network and target gene - TF network, LINC00598, SOX4, IRX1 and UNC13A were considered as hub genes. Using integrated bioinformatics analysis, we identified candidate genes in pituitary prolactinoma, which may improve our understanding of the mechanisms of the pathogenesis and integration; genes may be therapeutic targets and prognostic markers for pituitary prolactinoma.


2021 ◽  
Author(s):  
Hang Zhang ◽  
Wenhan Zhou ◽  
Xiaoyi Yang ◽  
Shuzhan Wen ◽  
Baicheng Zhao ◽  
...  

Abstract Background PTEN is a multifunctional tumor suppressor gene mutating at high frequency in a variety of cancers. However, its expression in pan-cancer, correlated genes, survival prognosis, and regulatory pathways are not completely described. Here, we aimed to conduct a comprehensive analysis from the above perspectives in order to provide reference for clinical application. Methods we studied the expression levels in cancers by using data from TCGA and GTEx database. Obtain expression box plot from UALCAN database. Perform mutation analysis on the cBioportal website. Obtain correlation genes on the GEPIA website. Construct protein network and perform KEGG and GO enrichment analysis on the STRING database. Perform prognostic analysis on the Kaplan-Meier Plotter website. We also performed transcription factor prediction on the PROMO database and performed RNA-RNA association and RNA-protein interaction on the RNAup Web server and RPISEq. The gene 3D structure, protein sequence and conserved domain were obtained in NCBI respectively. Results PTEN was underexpressed in all cancers we studied. It was closely related to the clinical stage of tumors, suggesting PTEN may involved in cancer development and progression. The mutations of PTEN were present in a variety of cancers, most of which were truncation mutations and missense mutations. Among cancers (KIRC, LUAD, THYM, UCEC, Gastric Cancer, Liver Cancer, Lung Cancer, Breast Cancer), patients with low expression of PTEN had a shorter OS time and poorer OS prognosis. The low expression of PTEN can cause the deterioration of RFS in certain cancers (TGCT, UCEC, LIHC, LUAD, THCA), suggesting that the expression of PTEN was related to the clinical prognosis. Our study identified genes correlated with PTEN and performed GO enrichment analysis on 100 PTEN-related genes obtained from the GEPIA website. Conclusions The understanding of PTEN gene and the in-depth exploration of its related regulatory pathways may provide insight for the discovery of tumor-specific biomarkers and clinical potential therapeutic targets.


Diagnostics ◽  
2019 ◽  
Vol 9 (2) ◽  
pp. 39
Author(s):  
◽  
Chanabasayya Vastrad ◽  
◽  

: Epithelial ovarian cancer (EOC) is the18th most common cancer worldwide and the 8th most common in women. The aim of this study was to diagnose the potential importance of, as well as novel genes linked with, EOC and to provide valid biological information for further research. The gene expression profiles of E-MTAB-3706 which contained four high-grade ovarian epithelial cancer samples, four normal fallopian tube samples and four normal ovarian epithelium samples were downloaded from the ArrayExpress database. Pathway enrichment and Gene Ontology (GO) enrichment analysis of differentially expressed genes (DEGs) were performed, and protein-protein interaction (PPI) network, microRNA-target gene regulatory network and TFs (transcription factors ) -target gene regulatory network for up- and down-regulated were analyzed using Cytoscape. In total, 552 DEGs were found, including 276 up-regulated and 276 down-regulated DEGs. Pathway enrichment analysis demonstrated that most DEGs were significantly enriched in chemical carcinogenesis, urea cycle, cell adhesion molecules and creatine biosynthesis. GO enrichment analysis showed that most DEGs were significantly enriched in translation, nucleosome, extracellular matrix organization and extracellular matrix. From protein-protein interaction network (PPI) analysis, modules, microRNA-target gene regulatory network and TFs-target gene regulatory network for up- and down-regulated, and the top hub genes such as E2F4, SRPK2, A2M, CDH1, MAP1LC3A, UCHL1, HLA-C (major histocompatibility complex, class I, C) , VAT1, ECM1 and SNRPN (small nuclear ribonucleoprotein polypeptide N) were associated in pathogenesis of EOC. The high expression levels of the hub genes such as CEBPD (CCAAT enhancer binding protein delta) and MID2 in stages 3 and 4 were validated in the TCGA (The Cancer Genome Atlas) database. CEBPD andMID2 were associated with the worst overall survival rates in EOC. In conclusion, the current study diagnosed DEGs between normal and EOC samples, which could improve our understanding of the molecular mechanisms in the progression of EOC. These new key biomarkers might be used as therapeutic targets for EOC.


Sign in / Sign up

Export Citation Format

Share Document