scholarly journals Development of a novel embryonic germline gene-related prognostic model of lung adenocarcinoma

PeerJ ◽  
2021 ◽  
Vol 9 ◽  
pp. e12257
Author(s):  
Linjun Liu ◽  
Ke Xu ◽  
Yubai Zhou

Background Emerging evidence implicates the correlation of embryonic germline genes with the tumor progress and patient’s outcome. However, the prognostic value of these genes in lung adenocarcinoma (LUAD) has not been fully studied. Here we systematically evaluated this issue, and constructed a novel signature and a nomogram associated with embryonic germline genes for predicting the outcomes of lung adenocarcinoma. Methods The LUAD cohorts retrieved from The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO) database were used as training set and testing set, respectively. The embryonic germline genes were downloaded from the website https://venn.lodder.dev. Then, the differentially expressed embryonic germline genes (DEGGs) between the tumor and normal samples were identified by limma package. The functional enrichment and pathway analyses were also performed by clusterProfiler package. The prognostic model was constructed by the least absolute shrinkage and selection operator (LASSO)-Cox regression method. Survival and Receiver Operating Characteristic (ROC) analyses were performed to validate the model using training set and four testing GEO datasets. Finally, a prognostic nomogram based on the signature genes was constructed using multivariate regression method. Results Among the identified 269 DEGGs, 249 were up-regulated and 20 were down-regulated. GO and KEGG analyses revealed that these DEGGs were mainly enriched in the process of cell proliferation and DNA damage repair. Then, 103 DEGGs with prognostic value were identified by univariate Cox regression and further filtered by LASSO method. The resulting sixteen DEGGs were included in step multivariate Cox regression and an eleven embryonic germline gene related signature (EGRS) was constructed. The model could robustly stratify the LUAD patients into high-risk and low-risk groups in both training and testing sets, and low-risk patients had much better outcomes. The multi-ROC analysis also showed that the EGRS model had the best predictive efficacy compared with other common clinicopathological factors. The EGRS model also showed robust predictive ability in four independent external datasets, and the area under curve (AUC) was 0.726 (GSE30219), 0.764 (GSE50081), 0.657 (GSE37745) and 0.668 (GSE72094). More importantly, the expression level of some genes in EGRS has a significant correlation with the progression of LUAD clinicopathology, suggesting these genes might play an important role in the progression of LUAD. Finally, based on EGRS genes, we built and calibrated a nomogram for conveniently evaluating patients’ outcomes.

Author(s):  
Yongmei Wang ◽  
Guimin Zhang ◽  
Ruixian Wang

Background: This study aims to explore the prognostic values of CT83 and CT83-related genes in lung adenocarcinoma (LUAD). Methods: We downloaded the mRNA profiles of 513 LUAD patients (RNA sequencing data) and 246 NSCLC patients (Affymetrix Human Genome U133 Plus 2.0 Array) from TCGA and GEO databases. According to the median expression of CT83, the TCGA samples were divided into high and low expression groups, and differential expression analysis between them was performed. Functional enrichment analysis of differential expression genes (DEGs) was conducted. Univariate Cox regression analysis and LASSO Cox regression analysis were performed to screen the optimal prognostic DEGs. Then we established the prognostic model. A Nomogram model was constructed to predict the overall survival (OS) probability of LUAD patients. Results: CT83 expression was significantly correlated to the prognosis of LUAD patients. A total of 59 DEGs were identified, and a predictive model was constructed based on six optimal CT83-related DEGs, including CPS1, RHOV, TNNT1, FAM83A, IGF2BP1, and GRIN2A, could effectively predict the prognosis of LUAD patients. The nomogram could reliably predict the OS of LUAD patients. Moreover, the six important immune checkpoints (CTLA4, PD1, IDO1, TDO2, LAG3, and TIGIT) were closely correlated with the Risk Score, which was also differentially expressed between the LUAD samples with high and low-Risk Scores, suggesting that the poor prognosis of LUAD patients with high-Risk Score might be due to the immunosuppressive microenvironments. Conclusion: A prognostic model based on six optimal CT83 related genes could effectively predict the prognosis of LUAD patients.


PeerJ ◽  
2019 ◽  
Vol 7 ◽  
pp. e8128 ◽  
Author(s):  
Cheng Yue ◽  
Hongtao Ma ◽  
Yubai Zhou

Background Lung cancer has the highest morbidity and mortality worldwide, and lung adenocarcinoma (LADC) is the most common pathological subtype. Accumulating evidence suggests the tumor microenvironment (TME) is correlated with the tumor progress and the patient’s outcome. As the major components of TME, the tumor-infiltrated immune cells and stromal cells have attracted more and more attention. In this study, differentially expressed immune and stromal signature genes were used to construct a TME-related prognostic model for predicting the outcomes of LADC patients. Methods The expression profiles of LADC samples with clinical information were obtained from The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO). The differentially expressed genes (DEGs) related to the TME of LADC were identified using TCGA dataset by Wilcoxon rank sum test. The prognostic effects of TME-related DEGs were analyzed using univariate Cox regression. Then, the least absolute shrinkage and selection operator (LASSO) regression was performed to reduce the overfit and the number of genes for further analysis. Next, the prognostic model was constructed by step multivariate Cox regression and risk score of each sample was calculated. Then, survival and Receiver Operating Characteristic (ROC) analyses were conducted to validate the model using TCGA and GEO datasets, respectively. The Kyoto Encyclopedia of Genes and Genomes analysis of gene signature was performed using Gene Set Enrichment Analysis (GSEA). Finally, the overall immune status, tumor purity and the expression profiles of HLA genes of high- and low-risk samples was further analyzed to reveal the potential mechanisms of prognostic effects of the model. Results A total of 93 TME-related DEGs were identified, of which 23 DEGs were up-regulated and 70 DEGs were down-regulated. The univariate cox analysis indicated that 23 DEGs has the prognostic effects, the hazard ratio ranged from 0.65 to 1.25 (p < 0.05). Then, seven genes were screened out from the 23 DEGs by LASSO regression method and were further analyzed by step multivariate Cox regression. Finally, a three-gene (ADAM12, Bruton Tyrosine Kinase (BTK), ERG) signature was constructed, and ADAM12, BTK can be used as independent prognostic factors. The three-gene signature well stratified the LADC patients in both training (TCGA) and testing (GEO) datasets as high-risk and low-risk groups, the 3-year area under curve (AUC) of ROC curves of three GEO sets were 0.718 (GSE3141), 0.646 (GSE30219) and 0.643 (GSE50081). The GSEA analysis indicated that highly expressed ADAM12, BTK, ERG mainly correlated with the activation of pathways involving in focal adhesion, immune regulation. The immune analysis indicated that the low-risk group has more immune activities and higher expression of HLA genes than that of the high-risk group. In sum, we identified and constructed a three TME-related DEGs signature, which could be used to predict the prognosis of LADC patients.


2021 ◽  
Vol 11 ◽  
Author(s):  
Lu Lu ◽  
Le-Ping Liu ◽  
Qiang-Qiang Zhao ◽  
Rong Gui ◽  
Qin-Yu Zhao

Lung adenocarcinoma (LUAD) is a highly heterogeneous malignancy, which makes prognosis prediction of LUAD very challenging. Ferroptosis is an iron-dependent cell death mechanism that is important in the survival of tumor cells. Long non-coding RNAs (lncRNAs) are considered to be key regulators of LUAD development and are involved in ferroptosis of tumor cells, and ferroptosis-related lncRNAs have gradually emerged as new targets for LUAD treatment and prognosis. It is essential to determine the prognostic value of ferroptosis-related lncRNAs in LUAD. In this study, we obtained RNA sequencing (RNA-seq) data and corresponding clinical information of LUAD patients from The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO) database and ferroptosis-related lncRNAs by co-expression analysis. The best predictors associated with LUAD prognosis, including C5orf64, LINC01800, LINC00968, LINC01352, PGM5-AS1, LINC02097, DEPDC1-AS1, WWC2-AS2, SATB2-AS1, LINC00628, LINC01537, LMO7DN, were identified by Least Absolute Shrinkage and Selection Operator (LASSO) Cox regression analysis, and the LUAD risk prediction model was successfully constructed. Kaplan-Meier analysis, receiver operating characteristic (ROC) time curve analysis and univariate and multivariate Cox regression analysis and further demonstrated that the model has excellent robustness and predictive ability. Further, based on the risk prediction model, functional enrichment analysis revealed that 12 prognostic indicators involved a variety of cellular functions and signaling pathways, and the immune status was different in the high-risk and low-risk groups. In conclusion, a risk model of 12 ferroptosis related lncRNAs has important prognostic value for LUAD and may be ferroptosis-related therapeutic targets in the clinic.


2022 ◽  
Vol 2022 ◽  
pp. 1-16
Author(s):  
Jin Zhou ◽  
Zheming Liu ◽  
Huibo Zhang ◽  
Tianyu Lei ◽  
Jiahui Liu ◽  
...  

Purpose. Recent researches showed the vital role of BACH1 in promoting the metastasis of lung cancer. We aimed to explore the value of BACH1 in predicting the overall survival (OS) of early-stage (stages I-II) lung adenocarcinoma. Patients and Methods. Lung adenocarcinoma cases were screened from the Cancer Genome Atlas (TCGA) database. Functional enrichment analysis was performed to obtain the biological mechanisms of BACH1. Gene set enrichment analysis (GSEA) was performed to identify the difference of biological pathways between high- and low-BACH1 groups. Univariate and multivariate COX regression analysis had been used to screen prognostic factors, which were used to establish the BACH1 expression-based prognostic model in the TCGA dataset. The C-index and time-dependent AUC curve were used to evaluate predictive power of the model. External validation of prognostic value was performed in two independent datasets from Gene Expression Omnibus (GEO). Decision analysis curve was finally used to evaluate clinical usefulness of the BACH1-based model beyond pathologic stage alone. Results. BACH1 was an independent prognostic factor for lung adenocarcinoma. High-expression BACH1 cases had worse OS. BACH1-based prognostic model showed an ideal C-index and t -AUC and validated by two GEO datasets, independently. More importantly, the BACH1-based model indicated positive clinical applicability by DCA curves. Conclusion. Our research confirmed that BACH1 was an important predictor of prognosis in early-stage lung adenocarcinoma. The higher the expression of BACH1, the worse OS of the patients.


BMC Cancer ◽  
2020 ◽  
Vol 20 (1) ◽  
Author(s):  
Min Wang ◽  
Shan Huang ◽  
Zefeng Chen ◽  
Zhiwei Han ◽  
Kezhi Li ◽  
...  

Abstract Background Hepatocellular carcinoma (HCC) is among the deadliest forms of cancer. While RNA-binding proteins (RBPs) have been shown to be key regulators of oncogenesis and tumor progression, their dysregulation in the context of HCC remains to be fully characterized. Methods Data from the Cancer Genome Atlas - liver HCC (TCGA-LIHC) database were downloaded and analyzed in order to identify RBPs that were differentially expressed in HCC tumors relative to healthy normal tissues. Functional enrichment analyses of these RBPs were then conducted using the GO and KEGG databases to understand their mechanistic roles. Central hub RBPs associated with HCC patient prognosis were then detected through Cox regression analyses, and were incorporated into a prognostic model. The prognostic value of this model was then assessed through the use of Kaplan-Meier curves, time-related ROC analyses, univariate and multivariate Cox regression analyses, and nomograms. Lastly, the relationship between individual hub RBPs and HCC patient overall survival (OS) was evaluated using Kaplan-Meier curves. Finally, find protein-coding genes (PCGs) related to hub RBPs were used to construct a hub RBP-PCG co-expression network. Results In total, we identified 81 RBPs that were differentially expressed in HCC tumors relative to healthy tissues (54 upregulated, 27 downregulated). Seven prognostically-relevant hub RBPs (SMG5, BOP1, LIN28B, RNF17, ANG, LARP1B, and NR0B1) were then used to generate a prognostic model, after which HCC patients were separated into high- and low-risk groups based upon resultant risk score values. In both the training and test datasets, we found that high-risk HCC patients exhibited decreased OS relative to low-risk patients, with time-dependent area under the ROC curve values of 0.801 and 0.676, respectively. This model thus exhibited good prognostic performance. We additionally generated a prognostic nomogram based upon these seven hub RBPs and found that four other genes were significantly correlated with OS. Conclusion We herein identified a seven RBP signature that can reliably be used to predict HCC patient OS, underscoring the prognostic relevance of these genes.


BMC Cancer ◽  
2021 ◽  
Vol 21 (1) ◽  
Author(s):  
Yang Zhai ◽  
Bin Zhao ◽  
Yuzhen Wang ◽  
Lina Li ◽  
Jingjin Li ◽  
...  

Abstract Background Lung adenocarcinoma (LUAD) is the most common pathology subtype of lung cancer. In recent years, immunotherapy, targeted therapy and chemotherapeutics conferred a certain curative effects. However, the effect and prognosis of LUAD patients are different, and the efficacy of existing LUAD risk prediction models is unsatisfactory. Methods The Cancer Genome Atlas (TCGA) LUAD dataset was downloaded. The differentially expressed immune genes (DEIGs) were analyzed with edgeR and DESeq2. The prognostic DEIGs were identified by COX regression. Protein-protein interaction (PPI) network was inferred by STRING using prognostic DEIGs with p value< 0.05. The prognostic model based on DEIGs was established using Lasso regression. Immunohistochemistry was used to assess the expression of FERMT2, FKBP3, SMAD9, GATA2, and ITIH4 in 30 cases of LUAD tissues. Results In total,1654 DEIGs were identified, of which 436 genes were prognostic. Gene functional enrichment analysis indicated that the DEIGs were involved in inflammatory pathways. We constructed 4 models using DEIGs. Finally, model 4, which was constructed using the 436 DEIGs performed the best in prognostic predictions, the receiver operating characteristic curve (ROC) was 0.824 for 3 years, 0.838 for 5 years, 0.834 for 10 years. High levels of FERMT2, FKBP3 and low levels of SMAD9, GATA2, ITIH4 expression are related to the poor overall survival in LUAD (p < 0.05). The prognostic model based on DEIGs reflected infiltration by immune cells. Conclusions In our study, we built an optimal prognostic signature for LUAD using DEIGs and verified the expression of selected genes in LUAD. Our result suggests immune signature can be harnessed to obtain prognostic insights.


2021 ◽  
Vol 2021 ◽  
pp. 1-14
Author(s):  
Xiaotao Jiang ◽  
Qiaofeng Yan ◽  
Linling Xie ◽  
Shijie Xu ◽  
Kailin Jiang ◽  
...  

Background. Gastric cancer (GC), an extremely aggressive tumor with a very different prognosis, is the third leading cause of cancer-related mortality. We aimed to construct a ferroptosis-related prognostic model that can be distinguished prognostically. Methods. The gene expression and the clinical data of GC patients were downloaded from The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus database (GEO). The ferroptosis-related genes were obtained from the FerrDb. Using the “limma” R package and univariate Cox analysis, ferroptosis-related genes with differential expression and prognostic value were identified in the TCGA cohort. Last absolute shrinkage and selection operator (LASSO) Cox regression was applied to shrink ferroptosis-related predictors and construct a prognostic model. Functional enrichment, ESTIMATE algorithm, and single-sample gene set enrichment analysis (ssGSEA) were applied for exploring the potential mechanism. GC patients from the GEO cohort were used for validation. Results. An 8-gene prognostic model was constructed and stratified GC patients from TCGA and meta-GEO cohort into high-risk groups or low-risk groups. GC patients in high-risk groups have significantly poorer OS compared with those in low-risk groups. The risk score was identified as an independent predictor for OS. Functional analysis revealed that the risk score was mainly associated with the biological function of extracellular matrix (ECM) organization and tumor immunity. Conclusion. In conclusion, the ferroptosis-related model can be utilized for the clinical prognostic prediction in GC.


2021 ◽  
Author(s):  
Junqi Qin ◽  
Zhanyu Xu ◽  
Fanglu Qin ◽  
Jiangbo Wei ◽  
Liqiang Yuan ◽  
...  

Abstract Background: There are few studies on the role of iron metabolism genes in predicting the prognosis of lung adenocarcinoma (LUAD). Our research aims to screen key genes and to establish a prognostic signature that can predict the overall survival rate of lung adenocarcinoma patients. Methods: Genes related to iron metabolism were downloaded from the GeneCards database; in addition, RNA-Seq data and corresponding clinical materials of 594 adenocarcinoma patients from The Cancer Genome Atlas(TCGA) were downloaded. GSE42127 of Gene Expression Omnibus (GEO) database was also further verified. The multi-gene prognostic signature was constructed by the Cox regression model of the Least Absolute Shrinkage and Selection Operator (LASSO). The clinical applicability of the model and its connection with immune cell infiltration was then analyzed. Results: We constructed a prediction signature with 12 genes (HAVCR1, SPN, GAPDH, ANGPTL4, PRSS3, KRT8, LDHA, HMMR, SLC2A1, CYP24A1, LOXL2, TIMP1) in the TCGA test set, and counted the patient's risk value based on this 12-gene signature; patients were split into high and low-risk groups. The survival graph results revealed that the survival prognosis between the high and low-risk groups was significantly different (TCGA: P <0.001, GEO: P = 0.001). Univariate and multivariate Cox regression analysis confirmed that the risk value is a predictor of patient OS (P<0.001). The area under the time-dependent ROC curve (AUC) indicated that our signature had a relatively high true positive rate when predicting the 1-year, 3-year, and 5-year OS of the TCGA cohort, which was 0.735, 0.711, and 0.601, respectively. The analysis of the nomogram and calibration curve showed the predictive ability of the gene model. In addition, immune-related pathways were highlighted in the functional enrichment analysis, and immune response between the two risk groups was observed to be significantly different. All of the results proved the reliability of our iron metabolism-related gene risk prognostic model. Conclusion: We developed and verified a 12-gene prognostic signature, which can help predict the prognosis of lung adenocarcinoma and offer a variety of targeted options for the precise treatment of lung cancer.


2020 ◽  
Author(s):  
Min wang ◽  
Shan Huang ◽  
Zefeng Chen ◽  
Zhiwei Han ◽  
Kezhi Li ◽  
...  

Abstract Background: Hepatocellular carcinoma (HCC) is among the deadliest forms of cancer. While RNA-binding proteins (RBPs) have been shown to be key regulators of oncogenesis and tumor progression, their dysregulation in the context of HCC remains to be fully characterized. Methods: Data from the Cancer Genome Atlas - liver HCC (TCGA-LIHC) database were downloaded and analyzed in order to identify RBPs that were differentially expressed in HCC tumors relative to healthy normal tissues. Functional enrichment analyses of these RBPs were then conducted using the GO and KEGG databases to understand their mechanistic roles. Central hub RBPs associated with HCC patient prognosis were then detected through Cox regression analyses, and were incorporated into a prognostic model. The prognostic value of this model was then assessed through the use of Kaplan-Meier curves, time-related ROC analyses, univariate and multivariate Cox regression analyses, and nomograms. Lastly, the relationship between individual hub RBPs and HCC patient overall survival (OS) was evaluated using Kaplan-Meier curves. Results: In total, we identified 81 RBPs that were differentially expressed in HCC tumors relative to healthy tissues (54 upregulated, 27 downregulated). Seven prognostically-relevant hub RBPs (SMG5, BOP1, LIN28B, RNF17, ANG, LARP1B, and NR0B1) were then used to generate a prognostic model, after which HCC patients were separated into high- and low-risk groups based upon resultant risk score values. In both the training and test datasets, we found that high-risk HCC patients exhibited decreased OS relative to low-risk patients, with time-dependent area under the ROC curve values of 0.801 and 0.676, respectively. This model thus exhibited good prognostic performance. We additionally generated a prognostic nomogram based upon these seven hub RBPs and found that four other genes were significantly correlated with OS. Conclusion: We herein identified a seven RBP signature that can reliably be used to predict HCC patient OS, underscoring the prognostic relevance of these genes.


2020 ◽  
Author(s):  
Min wang ◽  
Shan Huang ◽  
Zefeng Chen ◽  
Zhiwei Han ◽  
Kezhi Li ◽  
...  

Abstract Background: Hepatocellular carcinoma (HCC) is among the deadliest forms of cancer. While RNA-binding proteins (RBPs) have been shown to be key regulators of oncogenesis and tumor progression, their dysregulation in the context of HCC remains to be fully characterized. Methods: Data from the Cancer Genome Atlas - liver HCC (TCGA-LIHC) database were downloaded and analyzed in order to identify RBPs that were differentially expressed in HCC tumors relative to healthy normal tissues. Functional enrichment analyses of these RBPs were then conducted using the GO and KEGG databases to understand their mechanistic roles. Central hub RBPs associated with HCC patient prognosis were then detected through Cox regression analyses, and were incorporated into a prognostic model. The prognostic value of this model was then assessed through the use of Kaplan-Meier curves, time-related ROC analyses, univariate and multivariate Cox regression analyses, and nomograms. Lastly, the relationship between individual hub RBPs and HCC patient overall survival (OS) was evaluated using Kaplan-Meier curves. Finally, find protein-coding genes (PCGs) related to hub RBPs were used to construct a hub RBP-PCG co-expression network.Results: In total, we identified 81 RBPs that were differentially expressed in HCC tumors relative to healthy tissues (54 upregulated, 27 downregulated). Seven prognostically-relevant hub RBPs (SMG5, BOP1, LIN28B, RNF17, ANG, LARP1B, and NR0B1) were then used to generate a prognostic model, after which HCC patients were separated into high- and low-risk groups based upon resultant risk score values. In both the training and test datasets, we found that high-risk HCC patients exhibited decreased OS relative to low-risk patients, with time-dependent area under the ROC curve values of 0.801 and 0.676, respectively. This model thus exhibited good prognostic performance. We additionally generated a prognostic nomogram based upon these seven hub RBPs and found that four other genes were significantly correlated with OS.Conclusion: We herein identified a seven RBP signature that can reliably be used to predict HCC patient OS, underscoring the prognostic relevance of these genes.


Sign in / Sign up

Export Citation Format

Share Document