scholarly journals Minimum redundancy maximal relevance gene selection of apoptosis pathway genes in peripheral blood mononuclear cells of HIV-infected patients with antiretroviral therapy-associated mitochondrial toxicity

2021 ◽  
Vol 14 (1) ◽  
Author(s):  
Eliezer Bose ◽  
Elijah Paintsil ◽  
Musie Ghebremichael

Abstract Background We previously identified differentially expressed genes on the basis of false discovery rate adjusted P value using empirical Bayes moderated tests. However, that approach yielded a subset of differentially expressed genes without accounting for redundancy between the selected genes. Methods This study is a secondary analysis of a case–control study of the effect of antiretroviral therapy on apoptosis pathway genes comprising of 16 cases (HIV infected with mitochondrial toxicity) and 16 controls (uninfected). We applied the maximum relevance minimum redundancy (mRMR) algorithm on the genes that were differentially expressed between the cases and controls. The mRMR algorithm iteratively selects features (genes) that are maximally relevant for class prediction and minimally redundant. We implemented several machine learning classifiers and tested the prediction accuracy of the two mRMR genes. We next used network analysis to estimate and visualize the association among the differentially expressed genes. We employed Markov Random Field or undirected network models to identify gene networks related to mitochondrial toxicity. The Spinglass model was used to identify clusters of gene communities. Results The mRMR algorithm ranked DFFA and TNFRSF1A, two of the upregulated proapoptotic genes, on the top. The overall prediction accuracy was 86%, the two mRMR genes correctly classified 86% of the participants into their respective groups. The estimated network models showed different patterns of gene networks. In the network of the cases, FASLG was the most central gene. However, instead of FASLG, ABL1 and LTBR had the highest centrality in controls. Conclusion The mRMR algorithm and network analysis revealed a new correlation of genes associated with mitochondrial toxicity.

2021 ◽  
Author(s):  
Eliezer Bose ◽  
Elijah Paintsil ◽  
Musie Ghebremichael

Abstract Background: We previously identified differentially expressed genes on the basis of false discovery rate adjusted P value using empirical Bayes moderated tests. However, that approach yielded a subset of differentially expressed genes without accounting for redundancy between the selected genes. Methods: This study is a secondary analysis of a case-control study of the effect of antiretroviral therapy (ART) on apoptosis pathway genes comprising of 16 cases (HIV infected with mitochondrial toxicity) and 16 controls (uninfected). We applied the maximum relevance minimum redundancy (mRMR) algorithm on the genes that were differentially expressed between the cases and controls. The mRMR algorithm iteratively selects features (genes) that are maximally relevant for class prediction and minimally redundant. We implemented m-fold cross validation using machine learning classifiers and tested the prediction accuracy of the two mRMR genes. We next used network analysis to estimate the association between the differentially expressed genes using partial correlations, relative importance estimates, and centrality measures of each item. The Spinglass algorithm was used to identify clusters of gene communities.Results: The mRMR algorithm ranked DFFA and TNFRSF1A, two of the upregulated proapoptotic genes, on the top. The overall prediction accuracy was 90%, which clearly show that the mRMR gene sets outperforms the performance of the gene sets based on gene expression analyses. FASLG had highest centrality in cases with ABL1 in controls.Conclusion: The mRMR algorithm and network analysis revealed a new correlation of genes associated with mitochondrial toxicity.


2020 ◽  
Author(s):  
Zhengzhong Gu ◽  
Xiaohan Cui ◽  
Xudong Wang

Abstract Background: Prognostic prediction models have been developed to detect new biomarkers of gastric cancer (GC). The identification of new biomarkers could provide theoretical foundations for the application of molecular targeted therapy in advanced GC. The aim of this study was to construct a prognostic prediction model for stomach adenocarcinoma (STAD) based on The Cancer Genome Atlas (TCGA) database. Methods: First, we used the "limma" package to screen differentially expressed genes (DEGs) based on TCGA database. Gene ontology (GO) analysis was performed using the "ClusterProfiler" package. The interactions between proteins and the relationships between differentially expressed genes and clinical features were analyzed by protein-protein interaction (PPI) network analysis and weighted gene coexpression network analysis (WGCNA), respectively. Then, gene set enrichment analysis (GSEA) and gene set variation analysis (GSVA) were used to identify differentially enriched pathways. The GenVisR package and CIBERSORT were used to identify mutations and assess immune infiltration. Finally, the expression of COL3A1 in STAD tissues was verified by reverse transcription quantitative polymerase chain reaction (RT-qPCR) and western blotting.Results: Six differentially expressed genes were screened out, namely, COL3A1, ADAMTS12, BGN, FNDC1, AEBP1 and HTRA3. The enrichment results showed that differentially expressed genes were involved in multiple pathways in STAD, such as those related to the extracellular matrix, extracellular structure organization, and extracellular matrix organization. The differentially expressed genes were related to immune infiltration via the mitogen-activated protein kinase (MAPK) and phosphatidylinositol 3-kinase/protein kinase B (PI3K/AKT) pathways. The western blotting and RT-qPCR results suggested that COL3A1 was overexpressed in STAD tissues compared with normal tissues.Conclusion: COL3A1, ADAMTS12, BGN, FNDC1, AEBP1 and HTRA3 could play important roles in the tumorigenesis and progression of STAD via various pathways, including those involving the extracellular matrix, extracellular structure organization, and extracellular matrix organization. COL3A1, ADAMTS12, BGN, FNDC1, AEBP1, and HTRA3 act as oncogenes in most cancers and may be biomarkers. Additionally, the identification of COL3A1 as a candidate biomarker provides a direction for further research on the role of tumor immunity in gastric cancer.


2020 ◽  
Author(s):  
Na Li ◽  
Ru-feng Bai ◽  
Chun Li ◽  
Li-hong Dang ◽  
Qiu-xiang Du ◽  
...  

Abstract Background: Muscle trauma frequently occurs in daily life. However, the molecular mechanisms of muscle healing, which partly depend on the extent of the damage, are not well understood. This study aimed to investigate gene expression profiles following mild and severe muscle contusion, and to provide more information about the molecular mechanisms underlying the repair process.Methods: A total of 33 rats were divided randomly into control (n = 3), mild contusion (n = 15), and severe contusion (n = 15) groups; the contusion groups were further divided into five subgroups (1, 3, 24, 48, and 168 h post-injury; n = 3 per subgroup). Then full genome microarray of RNA isolated from muscle tissue was performed to access the gene expression changes during healing process.Results: A total of 2,844 and 2,298 differentially expressed genes were identified in the mild and severe contusion groups, respectively. The analysis of the overlapping differentially expressed genes showed that there are common mechanisms of transcriptomic repair of mild and severe contusion within 48 h post-contusion. This was supported by the results of principal component analysis, hierarchical clustering, and weighted gene co‐expression network analysis of the 1,620 coexpressed genes in mildly and severely contused muscle. From these analyses, we discovered that the gene profiles in functional modules and temporal clusters were similar between the mild and severe contusion groups; moreover, the genes showed time-dependent patterns of expression, which allowed us to identify useful markers of wound age. We then performed an analysis of the functions of genes (including Gene Ontology and Kyoto Encyclopedia of Genes and Genomes pathway annotation, and protein–protein interaction network analysis) in the functional modules and temporal clusters, and the hub genes in each module–cluster pair were identified. Interestingly, we found that genes downregulated within 24−48 h of the healing process were largely associated with metabolic processes, especially oxidative phosphorylation of reduced nicotinamide adenine dinucleotide phosphate, which has been rarely reported. Conclusions: These results improve our understanding of the molecular mechanisms underlying muscle repair, and provide a basis for further studies of wound age estimation.


2021 ◽  
Vol 17 ◽  
Author(s):  
Hui Zhang ◽  
Qidong Liu ◽  
Xiaoru Sun ◽  
Yaru Xu ◽  
Yiling Fang ◽  
...  

Background: The pathophysiology of Alzheimer's disease (AD) is still not fully studied. Objective: This study aimed to explore the differently expressed key genes in AD and build a predictive model of diagnosis and treatment. Methods: Gene expression data of the entorhinal cortex of AD, asymptomatic AD, and control samples from the GEO database were analyzed to explore the relevant pathways and key genes in the progression of AD. Differentially expressed genes between AD and the other two groups in the module were selected to identify biological mechanisms in AD through KEGG and PPI network analysis in Metascape. Furthermore, genes with a high connectivity degree by PPI network analysis were selected to build a predictive model using different machine learning algorithms. Besides, model performance was tested with five-fold cross-validation to select the best fitting model. Results: A total of 20 co-expression gene clusters were identified after the network was constructed. Module 1 (in black) and module 2 (in royal blue) were most positively and negatively correlated with AD, respectively. Total 565 genes in module 1 and 215 genes in module 2, respectively, overlapped in two differentially expressed genes lists. They were enriched in the G protein-coupled receptor signaling pathway, immune-related processes, and so on. 11 genes were screened by using lasso logistic regression, and they were considered to play an important role in predicting AD samples. The model built by the support vector machine algorithm with 11 genes showed the best performance. Conclusion: This result shed light on the diagnosis and treatment of AD.


2019 ◽  
Vol 16 (1) ◽  
Author(s):  
Mohadeseh Zarei Ghobadi ◽  
Sayed-Hamidreza Mozhgani ◽  
Mahdieh Farzanehpour ◽  
Farida Behzadian

Abstract Background Despite the high yearly prevalence of Influenza, the pathogenesis mechanism and involved genes have not been fully known. Finding the patterns and mapping the complex interactions between different genes help us to find the possible biomarkers and treatment targets. Methods Herein, weighted gene co-expression network analysis (WGCNA) was employed to construct a co-expression network among genes identified by microarray analysis of the pediatric influenza-infected samples. Results Three of the 38 modules were found as the most related modules to influenza infection. At a functional level, we found that the genes in these modules regulate the immune responses, protein targeting, and defense to virus. Moreover, the analysis of differentially expressed genes disclosed 719 DEGs between the normal and infected subjects. The comprehensive investigation of genes in the module involved in immune system and viral defense (yellow module) revealed that SP110, HERC5, SAMD9L, RTP4, C19orf66, HELZ2, EPSTI1, and PHF11 which were also identified as DEGs (except C19orf66) have the potential to be as the biomarkers and also drug targeting for the treatment of pediatric influenza. Conclusions The WGCN analysis revealed co-expressed genes which were involved in the innate immune system and defense to virus. The differentially expressed genes in the identified modules can be considered for designing drug targets. Moreover, modules can help to find pathogenesis routes in the future.


2019 ◽  
Vol 2019 ◽  
pp. 1-11 ◽  
Author(s):  
Yan Li ◽  
Xiao_nan He ◽  
Chao Li ◽  
Ling Gong ◽  
Min Liu

Background. Identification of potential molecular targets of acute myocardial infarction is crucial to our comprehensive understanding of the disease mechanism. However, studies of gene coexpression analysis via jointing multiple microarray data of acute myocardial infarction still remain restricted. Methods. Microarray data of acute myocardial infarction (GSE48060, GSE66360, GSE97320, and GSE19339) were downloaded from Gene Expression Omnibus database. Three data sets without heterogeneity (GSE48060, GSE66360, and GSE97320) were subjected to differential expression analysis using MetaDE package. Differentially expressed genes having upper 25% variation across samples were imported in weighted gene coexpression network analysis. Functional and pathway enrichment analyses were conducted for genes in the most significant module using DAVID. The predicted microRNAs to regulate target genes in the most significant module were identified using TargetScan. Moreover, subpathway analyses using iSubpathwayMiner package and GenCLiP 2.0 were performed on hub genes with high connective weight in the most significant module. Results. A total of 1027 differentially expressed genes and 33 specific modules were screened out between acute myocardial infarction patients and control samples. Ficolin (collagen/fibrinogen domain containing) 1 (FCN1), CD14 molecule (CD14), S100 calcium binding protein A9 (S100A9), and mitochondrial aldehyde dehydrogenase 2 (ALDH2) were identified as critical target molecules; hsa-let-7d, hsa-let-7b, hsa-miR-124-3, and hsa-miR-9-1 were identified as potential regulators of the expression of the key genes in the two biggest modules. Conclusions. FCN1, CD14, S100A9, ALDH2, hsa-let-7d, hsa-let-7b, hsa-miR-124-3, and hsa-miR-9-1 were identified as potential candidate regulators in acute myocardial infarction. These findings might provide new comprehension into the underlying molecular mechanism of disease.


2017 ◽  
Vol 15 (05) ◽  
pp. 1750020 ◽  
Author(s):  
Na You ◽  
Xueqin Wang

The microarray technology is widely used to identify the differentially expressed genes due to its high throughput capability. The number of replicated microarray chips in each group is usually not abundant. It is an efficient way to borrow information across different genes to improve the parameter estimation which suffers from the limited sample size. In this paper, we use a hierarchical model to describe the dispersion of gene expression profiles and model the variance through the gene expression level via a link function. A heuristic algorithm is proposed to estimate the hyper-parameters and link function. The differentially expressed genes are identified using a multiple testing procedure. Compared to SAM and LIMMA, our proposed method shows a significant superiority in term of detection power as the false discovery rate being controlled.


PeerJ ◽  
2019 ◽  
Vol 7 ◽  
pp. e7968 ◽  
Author(s):  
Jingwei Liu ◽  
Weixin Liu ◽  
Hao Li ◽  
Qiuping Deng ◽  
Meiqi Yang ◽  
...  

Background As the most frequently occurred tumor in biliary tract, cholangiocarcinoma (CCA) is mainly characterized by its late diagnosis and poor outcome. It is therefore urgent to identify specific genes and pathways associated with its progression and prognosis. Materials and Methods The differentially expressed genes in The Cancer Genome Atlas were analyzed to build the co-expression network by Weighted gene co-expression network analysis (WGCNA). Gene ontology (GO) as well as Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis were conducted for the selected genes. Module–clinical trait relationships were analyzed to explore the association with clinicopathological parameters. Log-rank tests and cox regression were used to identify the prognosis-related genes. Results The most related modules with CCA development were tan module containing 181 genes and salmon module with 148 genes. GO analysis suggested enrichment terms of digestion, hormone transport and secretion, epithelial cell proliferation, signal release, fibroblast activation, response to acid chemical, wnt, Nicotinamide adenine dinucleotide phosphate metabolism. KEGG analysis demonstrated 15 significantly altered pathways including glutathione metabolism, wnt, central carbon metabolism, mTOR, pancreatic secretion, protein digestion, axon guidance, retinol metabolism, insulin secretion, salivary secretion, fat digestion. Key genes of SOX2, KIT, PRSS56, WNT9A, SLC4A4, PRRG4, PANX2, PIR, RASSF8, MFSD4A, INS, RNF39, IL1R2, CST1, and PPP3CA might be potential prognostic markers for CCA, of which RNF39 and PRSS56 also showed significant correlation with clinical stage. Discussion Differentially expressed genes and key modules contributing to CCA development were identified by WGCNA. Our results offer novel insights into the characteristics in the etiology, prognosis, and treatment of CCA.


Sign in / Sign up

Export Citation Format

Share Document