scholarly journals Identification of Diagnostic Markers for Breast Cancer Based on Differential Gene Expression and Pathway Network

Author(s):  
Shumei Zhang ◽  
Haoran Jiang ◽  
Bo Gao ◽  
Wen Yang ◽  
Guohua Wang

Background: Breast cancer is the second largest cancer in the world, the incidence of breast cancer continues to rise worldwide, and women’s health is seriously threatened. Therefore, it is very important to explore the characteristic changes of breast cancer from the gene level, including the screening of differentially expressed genes and the identification of diagnostic markers.Methods: The gene expression profiles of breast cancer were obtained from the TCGA database. The edgeR R software package was used to screen the differentially expressed genes between breast cancer patients and normal samples. The function and pathway enrichment analysis of these genes revealed significant enrichment of functions and pathways. Next, download these pathways from KEGG website, extract the gene interaction relations, construct the KEGG pathway gene interaction network. The potential diagnostic markers of breast cancer were obtained by combining the differentially expressed genes with the key genes in the network. Finally, these markers were used to construct the diagnostic prediction model of breast cancer, and the predictive ability of the model and the diagnostic ability of the markers were verified by internal and external data.Results: 1060 differentially expressed genes were identified between breast cancer patients and normal controls. Enrichment analysis revealed 28 significantly enriched pathways (p < 0.05). They were downloaded from KEGG website, and the gene interaction relations were extracted to construct the gene interaction network of KEGG pathway, which contained 1277 nodes and 7345 edges. The key nodes with a degree greater than 30 were extracted from the network, containing 154 genes. These 154 key genes shared 23 genes with differentially expressed genes, which serve as potential diagnostic markers for breast cancer. The 23 genes were used as features to construct the SVM classification model, and the model had good predictive ability in both the training dataset and the validation dataset (AUC = 0.960 and 0.907, respectively).Conclusion: This study showed that the difference of gene expression level is important for the diagnosis of breast cancer, and identified 23 breast cancer diagnostic markers, which provides valuable information for clinical diagnosis and basic treatment experiments.

PeerJ ◽  
2020 ◽  
Vol 8 ◽  
pp. e9119
Author(s):  
Priyansh Srivastava ◽  
Alakto Choudhury ◽  
Mehak Talwar ◽  
Sabyasachi Mohanty ◽  
Priyanka Narad ◽  
...  

Vitiligo is a chronic asymptomatic disorder affecting melanocytes from the basal layer of the epidermis which leads to a patchy loss of skin color. Even though it is one of the neglected disease conditions, people suffering from vitiligo are more prone to psychological disorders. As of now, various studies have been done in order to project auto-immune implications as the root cause. To understand the complexity of vitiligo, we propose the Vitiligo Information Resource (VIRdb) that integrates both the drug-target and systems approach to produce a comprehensive repository entirely devoted to vitiligo, along with curated information at both protein level and gene level along with potential therapeutics leads. These 25,041 natural compounds are curated from Natural Product Activity and Species Source Database. VIRdb is an attempt to accelerate the drug discovery process and laboratory trials for vitiligo through the computationally derived potential drugs. It is an exhaustive resource consisting of 129 differentially expressed genes, which are validated through gene ontology and pathway enrichment analysis. We also report 22 genes through enrichment analysis which are involved in the regulation of epithelial cell differentiation. At the protein level, 40 curated protein target molecules along with their natural hits that are derived through virtual screening. We also demonstrate the utility of the VIRdb by exploring the Protein–Protein Interaction Network and Gene–Gene Interaction Network of the target proteins and differentially expressed genes. For maintaining the quality and standard of the data in the VIRdb, the gold standard in bioinformatics toolkits like Cytoscape, Schrödinger’s GLIDE, along with the server installation of MATLAB, are used for generating results. VIRdb can be accessed through “http://www.vitiligoinfores.com/”.


2020 ◽  
Vol 83 (5) ◽  
pp. 458-467
Author(s):  
Guanchuan Lin ◽  
Kaiyuan Ji ◽  
Shiyu Li ◽  
Wenli Ma ◽  
Xinghua Pan

<b><i>Introduction:</i></b> The molecular pathogenesis of Alzheimer’s disease (AD) is still not clear, and the relationship between gene expression profile for different brain regions has not been studied. <b><i>Objective:</i></b> Bioinformatic analysis at the genetic level has become the best way for the pathogenesis research of AD, which can analyze the abovementioned relationship. <b><i>Methods:</i></b> In this study, the datasets of AD were obtained from the Gene Expression Omnibus (GEO), and Qlucore Omics Explorer (QOE) software was used to screen differentially expressed genes of GSE36980 and GSE9770 and verify gene expression of GSE63060. The Gene Ontology (GO) function enrichment analysis of these selected genes was conducted by Database for Annotation, Visualization, and Integrated Discovery (DAVID), and then the gene/protein interaction network was established by STRING to find the related proteins. R language was used for drafting maps and plots. <b><i>Results:</i></b> There were 20 differentially expressed genes related to AD selected from GSE36980 (<i>p</i> = 6.2e<sup>−6</sup>, <i>q</i> = 2.9422e<sup>−4</sup>) and GSE9770 (<i>p</i> = 3.3e<sup>−4</sup>, <i>q</i> = 0.016606). Their expression levels of the AD group were lower than those in the control group and varied among different brain regions. Cellular morphogenesis and establishment or maintenance of cell polarity were enriched, and <i>LRRTM1</i> and <i>RASAL1</i> were identified by the integration network. Moreover, the analysis of GSE63060 verified the expression level of <i>LRRTM1</i> and <i>RASAL1</i> in Alzheimer’s patients, which was much lower than that in normal people aged &#x3e;65 years. <b><i>Conclusions:</i></b> The pathogenesis of AD at molecular levels may link to cell membrane structures and signal transduction; hence, a list of 20 genes, including <i>LRRTM1</i> and <i>RASAL1,</i>potentially are important for the discovery of treatment target or molecular marker of AD.


2020 ◽  
Author(s):  
Jingdi Yang ◽  
Bo Peng ◽  
Xianzheng Qin ◽  
Tian Zhou

Abstract Background: Although the morbidity and mortality of gastric cancer are declining, gastric cancer is still one of the most common causes of death. Early detection of gastric cancer is of great help to improve the survival rate, but the existing biomarkers are not sensitive to diagnose early gastric cancer. The aim of this study is to identify the novel biomarkers for gastric cancer.Methods: Three gene expression profiles (GSE27342, GSE63089, GSE33335) were downloaded from Gene Expression Omnibus database to select differentially expressed genes. Then, Gene Ontology and Kyoto Encyclopedia of Genes and Genomes analysis were performed to explore the biological functions of differentially expressed genes. Cytoscape was utilized to construct protein-protein interaction network and hub genes were analyzed by plugin cytoHubba of Cytoscape. Furthermore, Gene Expression Profiling Interactive Analysis and Kaplan-Meier plotter were used to verify the identified hub genes.Results: 35 overlapping differentially expressed genes were screened from gene expression datasets, which consisted of 11 up-regulated genes and 24 down-regulated genes. Gene Ontology functional enrichment analysis revealed that differentially expressed genes were significantly enriched in digestion, regulation of biological quality, response to hormone and steroid hormone, and homeostatic process. Kyoto Encyclopedia of Genes and Genomes pathway enrichment analysis showed differentially expressed genes were enriched in the secretion of gastric acid and collecting duct acid, leukocyte transendothelial migration and ECM-receptor interaction. According to protein-protein interaction network, 10 hub genes were identified by Maximal Clique Centrality method.Conclusion: By using bioinformatics analysis, COL1A1, BGN, THY1, TFF2 and SST were identified as the potential biomarkers for early detection of gastric cancer.


Vaccines ◽  
2021 ◽  
Vol 9 (12) ◽  
pp. 1427
Author(s):  
Mumdooh J. Sabir ◽  
Ross Low ◽  
Neil Hall ◽  
Majid Rasool Kamli ◽  
Md. Zubbair Malik

Cryptosporidium parvum (C. parvum) is a protozoan parasite known for cryptosporidiosis in pre-weaned calves. Animals and patients with immunosuppression are at risk of developing the disease, which can cause potentially fatal diarrhoea. The present study aimed to construct a network biology framework based on the differentially expressed genes (DEGs) of C. parvum infected subjects. In this way, the gene expression profiling analysis of C. parvum infected individuals can give us a snapshot of actively expressed genes and transcripts under infection conditions. In the present study, we have analyzed microarray data sets and compared the gene expression profiles of the patients with the different data sets of the healthy control. Using a network medicine approach to identify the most influential genes in the gene interaction network, we uncovered essential genes and pathways related to C. parvum infection. We identified 164 differentially expressed genes (109 up- and 54 down-regulated DEGs) and allocated them to pathway and gene set enrichment analysis. The results underpin the identification of seven significant hub genes with high centrality values: ISG15, MX1, IFI44L, STAT1, IFIT1, OAS1, IFIT3, RSAD2, IFITM1, and IFI44. These genes are associated with diverse biological processes not limited to host interaction, type 1 interferon production, or response to IL-gamma. Furthermore, four genes (IFI44, IFIT3, IFITM1, and MX1) were also discovered to be involved in innate immunity, inflammation, apoptosis, phosphorylation, cell proliferation, and cell signaling. In conclusion, these results reinforce the development and implementation of tools based on gene profiles to identify and treat Cryptosporidium parvum-related diseases at an early stage.


Author(s):  
Yuanyuan Chen ◽  
Yu Gu ◽  
Zixi Hu ◽  
Xiao Sun

Abstract Breast cancer is a highly heterogeneous disease, and there are many forms of categorization for breast cancer based on gene expression profiles. Gene expression profiles are variables and may show differences if measured at different time points or under different conditions. In contrast, biological networks are relatively stable over time and under different conditions. In this study, we used a gene interaction network from a new point of view to explore the subtypes of breast cancer based on individual-specific edge perturbations measured by relative gene expression value. Our study reveals that there are four breast cancer subtypes based on gene interaction perturbations at the individual level. The new network-based subtypes of breast cancer show strong heterogeneity in prognosis, somatic mutations, phenotypic changes and enriched pathways. The network-based subtypes are closely related to the PAM50 subtypes and immunohistochemistry index. This work helps us to better understand the heterogeneity and mechanisms of breast cancer from a network perspective.


2021 ◽  
Vol 80 (Suppl 1) ◽  
pp. 1037.2-1038
Author(s):  
X. Sun ◽  
S. X. Zhang ◽  
S. Song ◽  
T. Kong ◽  
C. Zheng ◽  
...  

Background:Psoriasis is an immune-mediated, genetic disease manifesting in the skin or joints or both, and also has a strong genetic predisposition and autoimmune pathogenic traits1. The hallmark of psoriasis is sustained inflammation that leads to uncontrolled keratinocyte proliferation and dysfunctional differentiation. And it’s also a chronic relapsing disease, which often necessitates a long-term therapy2.Objectives:To investigate the molecular mechanisms of psoriasis and find the potential gene targets for diagnosis and treating psoriasis.Methods:Total 334 gene expression data of patients with psoriasis research (GSE13355 GSE14905 and GSE30999) were obtained from the Gene Expression Omnibus database. After data preprocessing and screening of differentially expressed genes (DEGs) by R software. Online toll Metascape3 was used to analyze Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis of DEGs. Interactions of proteins encoded by DEGs were discovered by Protein-protein interaction network (PPI) using STRING online software. Cytoscape software was utilized to visualize PPI and the degree of each DEGs was obtained by analyzing the topological structure of the PPI network.Results:A total of 611 DEGs were found to be differentially expressed in psoriasis. GO analysis revealed that up-regulated DEGs were mostly associated with defense and response to external stimulus while down-regulated DEGs were mostly associated with metabolism and synthesis of lipids. KEGG enrichment analysis suggested they were mainly enriched in IL-17 signaling, Toll-like receptor signaling and PPAR signaling pathways, Cytokine-cytokine receptor interaction and lipid metabolism. In addition, top 9 key genes (CXCL10, OASL, IFIT1, IFIT3, RSAD2, MX1, OAS1, IFI44 and OAS2) were identified through Cytoscape.Conclusion:DEGs of psoriasis may play an essential role in disease development and may be potential pathogeneses of psoriasis.References:[1]Boehncke WH, Schon MP. Psoriasis. Lancet 2015;386(9997):983-94. doi: 10.1016/S0140-6736(14)61909-7 [published Online First: 2015/05/31].[2]Zhang YJ, Sun YZ, Gao XH, et al. Integrated bioinformatic analysis of differentially expressed genes and signaling pathways in plaque psoriasis. Mol Med Rep 2019;20(1):225-35. doi: 10.3892/mmr.2019.10241 [published Online First: 2019/05/23].[3]Zhou Y, Zhou B, Pache L, et al. Metascape provides a biologist-oriented resource for the analysis of systems-level datasets. Nat Commun 2019;10(1):1523. doi: 10.1038/s41467-019-09234-6 [published Online First: 2019/04/05].Acknowledgements:This project was supported by National Science Foundation of China (82001740), Open Fund from the Key Laboratory of Cellular Physiology (Shanxi Medical University) (KLCP2019) and Innovation Plan for Postgraduate Education in Shanxi Province (2020BY078).Disclosure of Interests:None declared


Genes ◽  
2021 ◽  
Vol 12 (1) ◽  
pp. 82
Author(s):  
Yunxiao Wei ◽  
Guoliang Li ◽  
Shujiang Zhang ◽  
Shifan Zhang ◽  
Hui Zhang ◽  
...  

Allopolyploidy is an evolutionary and mechanistically intriguing process involving the reconciliation of two or more sets of diverged genomes and regulatory interactions, resulting in new phenotypes. In this study, we explored the gene expression patterns of eight F2 synthetic Brassica napus using RNA sequencing. We found that B. napus allopolyploid formation was accompanied by extensive changes in gene expression. A comparison between F2 and the parent shows a certain proportion of differentially expressed genes (DEG) and activation\silent gene, and the two genomes (female parent (AA)\male parent (CC) genomes) showed significant differences in response to whole-genome duplication (WGD); non-additively expressed genes represented a small portion, while Gene Ontology (GO) enrichment analysis showed that it played an important role in responding to WGD. Besides, genome-wide expression level dominance (ELD) was biased toward the AA genome, and the parental expression pattern of most genes showed a high degree of conservation. Moreover, gene expression showed differences among eight individuals and was consistent with the results of a cluster analysis of traits. Furthermore, the differential expression of waxy synthetic pathways and flowering pathway genes could explain the performance of traits. Collectively, gene expression of the newly formed allopolyploid changed dramatically, and this was different among the selfing offspring, which could be a prominent cause of the trait separation. Our data provide novel insights into the relationship between the expression of differentially expressed genes and trait segregation and provide clues into the evolution of allopolyploids.


2021 ◽  
Vol 20 ◽  
pp. 153303382098329
Author(s):  
Yujie Weng ◽  
Wei Liang ◽  
Yucheng Ji ◽  
Zhongxian Li ◽  
Rong Jia ◽  
...  

Human epidermal growth factor 2 (HER2)+ breast cancer is considered the most dangerous type of breast cancers. Herein, we used bioinformatics methods to identify potential key genes in HER2+ breast cancer to enable its diagnosis, treatment, and prognosis prediction. Datasets of HER2+ breast cancer and normal tissue samples retrieved from Gene Expression Omnibus and The Cancer Genome Atlas databases were subjected to analysis for differentially expressed genes using R software. The identified differentially expressed genes were subjected to gene ontology and Kyoto Encyclopedia of Genes and Genomes pathway enrichment analyses followed by construction of protein-protein interaction networks using the STRING database to identify key genes. The genes were further validated via survival and differential gene expression analyses. We identified 97 upregulated and 106 downregulated genes that were primarily associated with processes such as mitosis, protein kinase activity, cell cycle, and the p53 signaling pathway. Visualization of the protein-protein interaction network identified 10 key genes ( CCNA2, CDK1, CDC20, CCNB1, DLGAP5, AURKA, BUB1B, RRM2, TPX2, and MAD2L1), all of which were upregulated. Survival analysis using PROGgeneV2 showed that CDC20, CCNA2, DLGAP5, RRM2, and TPX2 are prognosis-related key genes in HER2+ breast cancer. A nomogram showed that high expression of RRM2, DLGAP5, and TPX2 was positively associated with the risk of death. TPX2, which has not previously been reported in HER2+ breast cancer, was associated with breast cancer development, progression, and prognosis and is therefore a potential key gene. It is hoped that this study can provide a new method for the diagnosis and treatment of HER2 + breast cancer.


2021 ◽  
Author(s):  
Li Guoquan ◽  
Du Junwei ◽  
He Qi ◽  
Fu Xinghao ◽  
Ji Feihong ◽  
...  

Abstract BackgroundHashimoto's thyroiditis (HT), also known as chronic lymphocytic thyroiditis, is a common autoimmune disease, which mainly occurs in women. The early manifestation was hyperthyroidism, however, hypothyroidism may occur if HT was not controlled for a long time. Numerous studies have shown that multiple factors, including genetic, environmental, and autoimmune factors, were involved in the pathogenesis of the disease, but the exact mechanisms were not yet clear. The aim of this study was to identify differentially expressed genes (DEGs) by comprehensive analysis and to provide specific insights into HT. MethodsTwo gene expression profiles (GSE6339, GSE138198) about HT were downloaded from the Gene Expression Omnibus (GEO) database. The DEGs were assessed between the HT and normal groups using the GEO2R. The DEGs were then sent to the Gene Ontology (GO) and the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis. The hub genes were discovered using Cytoscape and CytoHubba. Finally, NetworkAnalyst was utilized to create the hub genes' targeted microRNAs (miRNAs). ResultsA total of 62 DEGs were discovered, including 60 up-regulated and 2 down-regulated DEGs. The signaling pathways were mainly engaged in cytokine interaction and cytotoxicity, and the DEGs were mostly enriched in immunological and inflammatory responses. IL2RA, CXCL9, IL10RA, CCL3, CCL4, CCL2, STAT1, CD4, CSF1R, and ITGAX were chosen as hub genes based on the results of the protein-protein interaction (PPI) network and CytoHubba. Five miRNAs, including mir-24-3p, mir-223-3p, mir-155-5p, mir-34a-5p, mir-26b-5p, and mir-6499-3p, were suggested as likely important miRNAs in HT. ConclusionsThese hub genes, pathways and miRNAs contribute to a better understanding of the pathophysiology of HT and offer potential treatment options for HT.


PeerJ ◽  
2020 ◽  
Vol 8 ◽  
pp. e8831 ◽  
Author(s):  
Xiaojiao Guan ◽  
Yao Yao ◽  
Guangyao Bao ◽  
Yue Wang ◽  
Aimeng Zhang ◽  
...  

Esophageal cancer is a common malignant tumor in the world, and the aim of this study was to screen key genes related to the development of esophageal cancer using a variety of bioinformatics analysis tools and analyze their biological functions. The data of esophageal squamous cell carcinoma from the Gene Expression Omnibus (GEO) were selected as the research object, processed and analyzed to screen differentially expressed microRNAs (miRNAs) and differential methylation genes. The competing endogenous RNAs (ceRNAs) interaction network of differentially expressed genes was constructed by bioinformatics tools DAVID, String, and Cytoscape. Biofunctional enrichment analysis was performed using Gene Ontology (GO) and the Kyoto Encyclopedia of Genes and Genomes (KEGG). The expression of the screened genes and the survival of the patients were verified. By analyzing GSE59973 and GSE114110, we found three down-regulated and nine up-regulated miRNAs. The gene expression matrix of GSE120356 was calculated by Pearson correlation coefficient, and the 11696 pairs of ceRNA relation were determined. In the ceRNA network, 643 lncRNAs and 147 mRNAs showed methylation difference. Functional enrichment analysis showed that these differentially expressed genes were mainly concentrated in the FoxO signaling pathway and were involved in the corresponding cascade of calcineurin. By analyzing the clinical data in The Cancer Genome Atlas (TCGA) database, it was found that four lncRNAs had an important impact on the survival and prognosis of esophageal carcinoma patients. QRT-PCR was also conducted to identify the expression of the key lncRNAs (RNF217-AS1, HCP5, ZFPM2-AS1 and HCG22) in ESCC samples. The selected key genes can provide theoretical guidance for further research on the molecular mechanism of esophageal carcinoma and the screening of molecular markers.


Sign in / Sign up

Export Citation Format

Share Document