scholarly journals Identification of a prognostic gene signature of colon cancer using integrated bioinformatics analysis

2021 ◽  
Vol 19 (1) ◽  
Author(s):  
Zhengyu Fang ◽  
Sumei Xu ◽  
Yiwen Xie ◽  
Wenxi Yan

Abstract Background Colon cancer is a worldwide leading cause of cancer-related mortality, and the prognosis of colon cancer is still needed to be improved. This study aimed to construct a prognostic model for predicting the prognosis of colon cancer. Methods The gene expression profile data of colon cancer were obtained from the TCGA, GSE44861, and GSE44076 datasets. The WGCNA module genes and common differentially expressed genes (DEGs) were used to screen out the prognosis-associated DEGs, which were used to construct a prognostic model. The performance of the prognostic model was assessed and validated in the TCGA training and microarray validation sets (GSE38832 and GSE17538). At last, the model and prognosis-associated clinical factors were used for the construction of the nomogram. Results Five colon cancer-related WGCNA modules (including 1160 genes) and 1153 DEGs between tumor and normal tissues were identified, inclusive of 556 overlapping DEGs. Stepwise Cox regression analyses identified there were 14 prognosis-associated DEGs, of which 12 DEGs were included in the optimized prognostic gene signature. This prognostic model presented a high forecast ability for the prognosis of colon cancer both in the TCGA training dataset and the validation datasets (GSE38832 and GSE17538; AUC > 0.8). In addition, patients’ age, T classification, recurrence status, and prognostic risk score were associated with the prognosis of TCGA patients with colon cancer. The nomogram was constructed using the above factors, and the predictive 3- and 5-year survival probabilities had high compliance with the actual survival proportions. Conclusions The 12-gene signature prognostic model had a high predictive ability for the prognosis of colon cancer.

2019 ◽  
Vol 2019 ◽  
pp. 1-9 ◽  
Author(s):  
Ziming Hou ◽  
Jun Yang ◽  
Hao Wang ◽  
Dongyuan Liu ◽  
Hongbing Zhang

Objective. This study aimed to screen prognostic gene signature of glioblastoma (GBM) to construct prognostic model.Methods. Based on the GBM information in the Cancer Genome Atlas (TCGA, training set), prognostic genes (Set X) were screened by Cox regression. Then, the optimized prognostic gene signature (Set Y) was further screened by the Cox-Proportional Hazards (Cox-PH). Next, two prognostic models were constructed: model A was based on the Set Y; model B was based on part of the Set X. The samples were divided into low- and high-risk groups according to the median prognosis index (PI). GBM datasets in Gene Expression Ominous (GEO, GSE13041) and Chinese Glioma Genome Atlas (CGGA) were used as the testing datasets to confirm the prognostic models constructed based on TCGA.Results. We identified that the prognostic 14-gene signature was significantly associated with the overall survival (OS) in the TCGA. In model A, patients in high- and low-risk groups showed the significantly different OS (P = 7.47 × 10−9, area under curve (AUC) 0.995) and the prognostic ability were also confirmed in testing sets (P=0.0098 and 0.037). The model B in training set was significant but failed in testing sets.Conclusion. The prognostic model which was constructed based on the prognostic 14-gene signature presented a high predictive ability for GBM. The 14-gene signature may have clinical implications in the subclassification of GBM.


2020 ◽  
Author(s):  
Zhengyu Fang ◽  
Sumei Xu ◽  
Yiwen Xie ◽  
Wenxi Yan

Abstract Background This study aimed to construct prognostic model by screening prognostic gene signature of colon cancer. Methods The gene expression profile data of colon cancer were obtained from The Cancer Genome Atlas (TCGA) and gene expression omnibus (GEO) and differently expressed genes (DEGs) between tumor and control samples were identified. Prognosis-associated genes were then identified and used for the construction of prognostic model. The independent factors that associated with the prognosis of colon in the TCGA cohort was identified. Results Totally, 1153 consistent DEGs were screened out between tumor and normal tissues in the TCGA cohort, GSE44861 and GSE44076 datasets. Among these genes, 12 DEGs were related to the prognosis of colon cancer and were used for constructing the prognostic model. This model presented a high predictive power for the prognosis of colon cancer both in the training dataset and in the validation datasets (AUC > 0.8). Statistical analysis showed that age, pathological T, tumor recurrence, and model status were the independent factors for prognosis of patients with colon cancer in TCGA. Conclusions The 12-gene signature prognostic model had a high predictive power for colon cancer prognosis.


2020 ◽  
Author(s):  
Cuiyun Wu ◽  
Yaosheng Luo ◽  
Yinghui Chen ◽  
Hongling Qu ◽  
Lin Zheng ◽  
...  

Abstract Background: Accurate prediction of overall survival is important for prognosis and the assignment of appropriate personalized clinical treatment in hepatocellular carcinoma (HCC) patients. The aim of the present study was to establish an optimal gene model for the independent prediction of prognosis associated with common clinical patterns.Methods: Gene expression profiles and the corresponding clinical information of the LIHC cohort were obtained from The Cancer Genome Atlas. Differentially expressed genes were found using the R package “limma”. Subsequently, a prognostic gene signature was developed using the LASSO Cox regression model. Kaplan–Meier, log-rank, and receiver operating characteristic (ROC) analyses were performed to verify the predictive accuracy of the prognostic model. Finally, a nomogram and calibration plot were created using the “rms” package.Results: Differentially expressed genes were screened with threshold criteria (FDR < 0.01 and |log FC|>3) and 563 differentially expressed genes were obtained, including 448 downregulated and 115 upregulated genes. Using the LASSO Cox regression model, a prognostic gene signature was developed based on nine genes,IQGAP3, BIRC5, PTTG1, STC2, CDKN3, PBK, EXO1, NEIL3, and HOXD9, the expression levels of which were quantitated using RT-qPCR. According to the risk scores, patients were separated into high-risk and low-risk groups. Patients with lower risk scores generally had a better survival rate than those with higher risk scores. The mortality rate in the high-risk group was 42.02%, while that in the low-risk group was 12.50%. Results of the log-rank test showed significant differences in mortality between the two groups (HR: 4.86; 95% CI: 2.72–8.69; P = 1.01E-08). Subsequently, we assessed the prognostic accuracy of the gene signature using an ROC curve and the results show good sensitivity and specificity, with an average area under the curve (AUC) of 0.81 at 5 years (P < 0.01). Following multivariate adjustment for conventional clinical patterns, the prognostic gene signature remained a powerful and independent factor (HR: 4.70; 95% CI: 2.61–8.38; P = 2.06E-07), confirming its robust predictive ability of overall survival in HCC patients. Finally, a nomogram was established based on the gene signature and four clinicopathological features, which demonstrated an advantageous discriminating ability with the potential to facilitate clinical decision-making in HCC.Conclusion: Our prognostic gene signature can be used as a combined biomarker for the independent prediction of overall survival in HCC patients. Moreover, we created a nomogram that can be used to infer prognosis and aid individualized decisions regarding treatment and surveillance.


2021 ◽  
Vol 22 (4) ◽  
pp. 1632
Author(s):  
Eskezeia Yihunie Dessie ◽  
Siang-Jyun Tu ◽  
Hui-Shan Chiang ◽  
Jeffrey J.P. Tsai ◽  
Ya-Sian Chang ◽  
...  

Hepatocellular carcinoma (HCC) is one of the most common lethal cancers worldwide and is often related to late diagnosis and poor survival outcome. More evidence is demonstrating that gene-based prognostic models can be used to predict high-risk HCC patients. Therefore, our study aimed to construct a novel prognostic model for predicting the prognosis of HCC patients. We used multivariate Cox regression model with three hybrid penalties approach including least absolute shrinkage and selection operator (Lasso), adaptive lasso and elastic net algorithms for informative prognostic-related genes selection. Then, the best subset regression was used to identify the best prognostic gene signature. The prognostic gene-based risk score was constructed using the Cox coefficient of the prognostic gene signature. The model was evaluated by Kaplan–Meier (KM) and receiver operating characteristic curve (ROC) analyses. A novel four-gene signature associated with prognosis was identified and the risk score was constructed based on the four-gene signature. The risk score efficiently distinguished the patients into a high-risk group with poor prognosis. The time-dependent ROC analysis revealed that the risk model had a good performance with an area under the curve (AUC) of 0.780, 0.732, 0.733 in 1-, 2- and 3-year prognosis prediction in The Cancer Genome Atlas (TCGA) dataset. Moreover, the risk score revealed a high diagnostic performance to classify HCC from normal samples. The prognosis and diagnosis prediction performances of risk scores were verified in external validation datasets. Functional enrichment analysis of the four-gene signature and its co-expressed genes involved in the metabolic and cell cycle pathways was constructed. Overall, we developed a novel-gene-based prognostic model to predict high-risk HCC patients and we hope that our findings can provide promising insight to explore the role of the four-gene signature in HCC patients and aid risk classification.


2021 ◽  
Author(s):  
Jun Liu ◽  
Jianjun Lu ◽  
Wenli Li

Abstract Background This study aims to construct a new prognostic gene signature based on cancer hallmarks for patients with Head and neck squamous cell carcinoma (HNSCC). Method The transcriptome profiling data and hallmark gene sets in the Molecular Signatures Database was used to explore the cancer hallmarks most relevant to the prognosis of HNSCC patients. Differential gene expression analysis, weighted gene co-expression network analysis, univariate COX regression analysis, random forest algorithm and multiple combinatorial screening were used to construct the prognostic gene signature. The predictive ability of gene signature was verified in the TCGA HNSCC cohort as the training set and the GEO HNSCC cohorts (GSE41613 and GSE42743) as the validation sets, respectively. Moreover, the correlations between risk scores and immune infiltration patterns, as well as risk scores and genomic changes were explored. Results A total of 3391 differentially expressed genes in HNSCC were screened. Glycolysis and hypoxia were screened as the main risk factors for OS in HNSCC. Using univariate Cox analysis, 97 prognostic candidates were identified (P<0.05). Top 10 important genes were then screened out by random forest. Using multiple combinatorial screening, a combination with less genes and more significant P value was used to construct the prognostic gene signature (RNF144A, STC1, P4HA1, FMNL3, ANO1, BASP1, MME, PLEKHG2 and DKK1). Kaplan-Meier analysis showed that patients with higher risk scores had worse overall survival (p <0.001). The ROC curve showed that the risk score had a good predictive efficiency (AUC> 0.66). Subsequently, the predictive ability of the risk score was verified in the validation sets. Moreover, the two-factor survival analysis combining the cancer hallmarks and risk scores suggested that HNSCC patients with the high hypoxia or glycolysis & high risk-score showed the worst prognosis. Besides, a nomogram based on the nine-gene signature was established for clinical practice. Furthermore, the risk score was significantly related to tumor immune infiltration profiles and genome changes. Conclusion This nine-gene signature associated with glycolysis and hypoxia can not only be used for prognosis prediction and risk stratification, but also may be a potential therapeutic target for patients with HNSCC.


PeerJ ◽  
2020 ◽  
Vol 8 ◽  
pp. e9624 ◽  
Author(s):  
Zhuo Zeng ◽  
Daxing Xie ◽  
Jianping Gong

Background Gastric cancer (GC) is one of the most fatal cancers in the world. Results of previous studies on the association of the CpG island methylator phenotype (CIMP) with GC prognosis are conflicting and mainly based on selected CIMP markers. The current study attempted to comprehensively assess the association between CIMP status and GC survival and to develop a CIMP-related prognostic gene signature of GC. Methods We used a hierarchical clustering method based on 2,082 GC-related methylation sites to stratify GC patients from the cancer genome atlas into three different CIMP subgroups according to the CIMP status. Gene set enrichment analysis, tumor-infiltrating immune cells, and DNA somatic mutations analysis were conducted to reveal the genomic characteristics in different CIMP-related patients. Cox regression analysis and the least absolute shrinkage and selection operator were performed to develop a CIMP-related prognostic signature. Analyses involving a time-dependent receiver operating characteristic (ROC) curve and calibration plot were adopted to assess the performance of the prognostic signature. Results We found a positive relationship between CIMP and prognosis in GC. Gene set enrichment analysis indicated that cancer-progression-related pathways were enriched in the CIMP-L group. High abundances of CD8+ T cells and M1 macrophages were found in the CIMP-H group, meanwhile more plasma cells, regulatory T cells and CD4+ memory resting T cells were detected in the CIMP-L group. The CIMP-H group showed higher tumor mutation burden, more microsatellite instability-H, less lymph node metastasis, and more somatic mutations favoring survival. We then established a CIMP-related prognostic gene signature comprising six genes (CST6, SLC7A2, RAB3B, IGFBP1, VSTM2L and EVX2). The signature was capable of classifying patients into high‐and low‐risk groups with significant difference in overall survival (OS; p < 0.0001). To assess performance of the prognostic signature, the area under the ROC curve (AUC) for OS was calculated as 0.664 at 1 year, 0.704 at 3 years and 0.667 at 5 years. When compared with previously published gene-based signatures, our CIMP-related signature was comparable or better at predicting prognosis. A multivariate Cox regression analysis indicated the CIMP-related prognostic gene signature was an independent prognostic indicator of GC. In addition, Gene ontology analysis indicated that keratinocyte differentiation and epidermis development were enriched in the high-risk group. Conclusion Collectively, we described a positive association between CIMP status and prognosis in GC and proposed a CIMP-related gene signature as a promising prognostic biomarker for GC.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Feng Jiang ◽  
Chuyan Wu ◽  
Ming Wang ◽  
Ke Wei ◽  
Jimei Wang

AbstractOne of the most frequently identified tumors and a contributing cause of death in women is breast cancer (BC). Many biomarkers associated with survival and prognosis were identified in previous studies through database mining. Nevertheless, the predictive capabilities of single-gene biomarkers are not accurate enough. Genetic signatures can be an enhanced prediction method. This research analyzed data from The Cancer Genome Atlas (TCGA) for the detection of a new genetic signature to predict BC prognosis. Profiling of mRNA expression was carried out in samples of patients with TCGA BC (n = 1222). Gene set enrichment research has been undertaken to classify gene sets that vary greatly between BC tissues and normal tissues. Cox models for additive hazards regression were used to classify genes that were strongly linked to overall survival. A subsequent Cox regression multivariate analysis was used to construct a predictive risk parameter model. Kaplan–Meier survival predictions and log-rank validation have been used to verify the value of risk prediction parameters. Seven genes (PGK1, CACNA1H, IL13RA1, SDC1, AK3, NUP43, SDC3) correlated with glycolysis were shown to be strongly linked to overall survival. Depending on the 7-gene-signature, 1222 BC patients were classified into subgroups of high/low-risk. Certain variables have not impaired the prognostic potential of the seven-gene signature. A seven-gene signature correlated with cellular glycolysis was developed to predict the survival of BC patients. The results include insight into cellular glycolysis mechanisms and the detection of patients with poor BC prognosis.


2020 ◽  
Author(s):  
Dai Zhang ◽  
Si Yang ◽  
Yiche Li ◽  
Meng Wang ◽  
Jia Yao ◽  
...  

Abstract Background: Ovarian cancer (OV) is deemed as the most lethal gynecological cancer in women. The aim of this study was construct an effective gene prognostic model for OV patients.Methods: The expression profiles of glycolysis-related genes (GRGs) and clinical data of patients with OV were extracted from The Cancer Genome Atlas (TCGA) database. Univariate, multivariate, and least absolute shrinkage and selection operator Cox regression analyses were conducted, and a prognostic signature based on GRGs was constructed. The predictive ability of the signature was analyzed in training and test sets.Results: Based on nine GRGs (ISG20, CITED2, PYGB, IRS2, ANGPTL4, TGFBI, LHX9, PC, and DDIT4), a gene risk signature was identified to predict the outcome of patients with OV. The signature showed a good prognostic ability for OV, particularly high-grade OV, in the TCGA dataset, with areas under the curve of 0.709, 0.762, and 0.808 for 3-, 5- and 10-year survival, respectively. Similar results were found in the test sets, and the signature was also an independent prognostic factor. Moreover, a nomogram combining the prediction model and clinical factors was constructed.Conclusion: Our study established a nine-GRG risk model and a nomogram to better perform on OV patients’ survival prediction. The risk model represents a promising and independent prognostic predictor for OV patients. Moreover, our study of GRGs could offer guidances for underlying mechanisms explorations in the future.


PeerJ ◽  
2021 ◽  
Vol 9 ◽  
pp. e12304
Author(s):  
Zhengyuan Wu ◽  
Leilei Chen ◽  
Chaojie Jin ◽  
Jing Xu ◽  
Xingqun Zhang ◽  
...  

Background Cutaneous melanoma (CM) is a life-threatening destructive malignancy. Pyroptosis significantly correlates with programmed tumor cell death and its microenvironment through active host-tumor crosstalk. However, the prognostic value of pyroptosis-associated gene signatures in CM remains unclear. Methods Gene profiles and clinical data of patients with CM were downloaded from The Cancer Genome Atlas (TCGA) to identify differentially expressed genes associated with pyroptosis and overall survival (OS). We constructed a prognostic gene signature using LASSO analysis, then applied immune cell infiltration scores and Kaplan-Meier, Cox, and pathway enrichment analyses to determine the roles of the gene signature in CM. A validation cohort was collected from the Gene Expression Omnibus (GEO) database. Results Four pyroptosis-associated genes were identified and incorporated into a prognostic gene signature. Integrated bioinformatics findings showed that the signature correlated with patient survival and was associated with tumor growth and metastasis. The results of Gene Set Enrichment Analysis of a risk signature indicated that several enriched pathways are associated with cancer and immunity. The risk signature for immune status significantly correlated with tumor stem cells, the immune microenvironment, immune cell infiltration and immune subtypes. The expression of four pyroptosis genes significantly correlated with the OS of patients with CM and was related to the sensitivity of cancer cells to several antitumor drugs. A signature comprising four genes associated with pyroptosis offers a novel approach to the prognosis and survival of patients with CM and will facilitate the development of individualized therapy.


Sign in / Sign up

Export Citation Format

Share Document