scholarly journals A Prognostic 14-Gene Expression Signature for Lung Adenocarcinoma: A Study Based on TCGA Data Mining

2020 ◽  
Vol 2020 ◽  
pp. 1-11
Author(s):  
Jie Liu ◽  
Shiqiang Hou ◽  
Jinyi Wang ◽  
Zhengjun Chai ◽  
Xuan Hong ◽  
...  

Background. Lung adenocarcinoma (LUAD), a major and fatal subtype of lung cancer, caused lots of mortalities and showed different outcomes in prognosis. This study was to assess key genes and to develop a prognostic signature for the patient therapy with LUAD. Method. RNA expression profile and clinical data from 522 LUAD patients were accessed and downloaded from the Cancer Genome Atlas (TCGA) database. Differentially expressed genes (DEGs) were extracted and analyzed between normal tissues and LUAD samples. Then, a 14-DEG signature was developed and identified for the survival prediction in LUAD patients by means of univariate and multivariate Cox regression analyses. The gene ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis were performed to predict the potential biological functions and pathways of these DEGs. Results. Twenty-two out of 5924 DEGs in the TCGA dataset were screened and associated with the overall survival (OS) of LUAD patients. 14CID="C008" value=" "DEGs were finally selected and included in our development and validation model by risk score analysis. The ROC analysis indicated that the specificity and sensitivity of this profile signature were high. Further functional enrichment analyses indicated that these DEGs might regulate genes that affect the function of release of sequestered calcium ion into cytosol and pathways that associated with vibrio cholerae infection. Conclusion. Our study developed a novel 14-DEG signature providing more efficient and persuasive prognostic information beyond conventional clinicopathological factors for survival prediction of LUAD patients.

2020 ◽  
Author(s):  
Yang Wang ◽  
Chengping Hu

Abstract Background: Long non-coding RNAs (lncRNAs) have been reported to play essential roles in tumorigenesis and cancers prognosis, and they can be a potential cancer prognostic markers. However, in lung adenocarcinoma(LUAD), how lncRNA signatures predict the survival of patients is poorly understood. Our study aims to explore lncRNA signatures and prognostic function in LUAD.Methods: The expression and prognosis data of lncRNAs in LUAD patients was collected from the Cancer Genome Atlas (TCGA) data. All analyses were performed using the R package (version 3.6.2). Metascape, STRING and Cytoscape were used for enrichment analysis and function prediction of the lncRNA co-expressed protein-coding genes.Results: We have collected lncRNA expression data in 466 LUAD tumors, and a six-lncRNA signature(RP11-79H23.3, RP11-309M7.1, CTD-2357A8.3, RP11-108P20.4, U47924.29, LHFPL3-AS2) has been shown to be significantly related to LUAD patients’ overall survival. According to the lncRNA signatures, the high-risk and low-risk groups were divided in LUAD patients with different survival rates. Further multivariable cox regression analysis showed that the prognostic value of this signature was independent of clinical factors. The potential functional roles and hub co-expressed protein-coding genes in the six prognostic lncRNAs are shown in the functional enrichment analysis.Conclusions: These results showed that these six lncRNAs could be independent predicted prognostic biomarkers in LUAD patients.


2022 ◽  
Vol 2022 ◽  
pp. 1-16
Author(s):  
Jin Zhou ◽  
Zheming Liu ◽  
Huibo Zhang ◽  
Tianyu Lei ◽  
Jiahui Liu ◽  
...  

Purpose. Recent researches showed the vital role of BACH1 in promoting the metastasis of lung cancer. We aimed to explore the value of BACH1 in predicting the overall survival (OS) of early-stage (stages I-II) lung adenocarcinoma. Patients and Methods. Lung adenocarcinoma cases were screened from the Cancer Genome Atlas (TCGA) database. Functional enrichment analysis was performed to obtain the biological mechanisms of BACH1. Gene set enrichment analysis (GSEA) was performed to identify the difference of biological pathways between high- and low-BACH1 groups. Univariate and multivariate COX regression analysis had been used to screen prognostic factors, which were used to establish the BACH1 expression-based prognostic model in the TCGA dataset. The C-index and time-dependent AUC curve were used to evaluate predictive power of the model. External validation of prognostic value was performed in two independent datasets from Gene Expression Omnibus (GEO). Decision analysis curve was finally used to evaluate clinical usefulness of the BACH1-based model beyond pathologic stage alone. Results. BACH1 was an independent prognostic factor for lung adenocarcinoma. High-expression BACH1 cases had worse OS. BACH1-based prognostic model showed an ideal C-index and t -AUC and validated by two GEO datasets, independently. More importantly, the BACH1-based model indicated positive clinical applicability by DCA curves. Conclusion. Our research confirmed that BACH1 was an important predictor of prognosis in early-stage lung adenocarcinoma. The higher the expression of BACH1, the worse OS of the patients.


2020 ◽  
Author(s):  
Gaochen Lan ◽  
Xiaoling Yu ◽  
Yanna Zhao ◽  
Jinjian Lan ◽  
Wan Li ◽  
...  

Abstract Background: Breast cancer is the most common malignant disease among women. At present, more and more attention has been paid to long non-coding RNAs (lncRNAs) in the field of breast cancer research. We aimed to investigate the expression profiles of lncRNAs and construct a prognostic lncRNA for predicting the overall survival (OS) of breast cancer.Methods: The expression profiles of lncRNAs and clinical data with breast cancer were obtained from The Cancer Genome Atlas (TCGA). Differentially expressed lncRNAs were screened out by R package (limma). The survival probability was estimated by the Kaplan‑Meier Test. The Cox Regression Model was performed for univariate and multivariate analysis. The risk score (RS) was established on the basis of the lncRNAs’ expression level (exp) multiplied regression coefficient (β) from the multivariate cox regression analysis with the following formula: RS=exp a1 * β a1 + exp a2 * β a2 +……+ exp an * β an. Functional enrichment analysis was performed by Metascape.Results: A total of 3404 differentially expressed lncRNAs were identified. Among them, CYTOR, MIR4458HG and MAPT-AS1 were significantly associated with the survival of breast cancer. Finally, The RS could predict OS of breast cancer (RS=exp CYTOR * β CYTOR + exp MIR4458HG * β MIR4458HG + exp MAPT-AS1 * β MAPT-AS1). Moreover, it was confirmed that the three-lncRNA signature could be an independent prognostic biomarker for breast cancer (HR=3.040, P=0.000).Conclusions: This study established a three-lncRNA signature, which might be a novel prognostic biomarker for breast cancer.


PeerJ ◽  
2021 ◽  
Vol 9 ◽  
pp. e10884
Author(s):  
Xin Yu ◽  
Qian Yang ◽  
Dong Wang ◽  
Zhaoyang Li ◽  
Nianhang Chen ◽  
...  

Applying the knowledge that methyltransferases and demethylases can modify adjacent cytosine-phosphorothioate-guanine (CpG) sites in the same DNA strand, we found that combining multiple CpGs into a single block may improve cancer diagnosis. However, survival prediction remains a challenge. In this study, we developed a pipeline named “stacked ensemble of machine learning models for methylation-correlated blocks” (EnMCB) that combined Cox regression, support vector regression (SVR), and elastic-net models to construct signatures based on DNA methylation-correlated blocks for lung adenocarcinoma (LUAD) survival prediction. We used methylation profiles from the Cancer Genome Atlas (TCGA) as the training set, and profiles from the Gene Expression Omnibus (GEO) as validation and testing sets. First, we partitioned the genome into blocks of tightly co-methylated CpG sites, which we termed methylation-correlated blocks (MCBs). After partitioning and feature selection, we observed different diagnostic capacities for predicting patient survival across the models. We combined the multiple models into a single stacking ensemble model. The stacking ensemble model based on the top-ranked block had the area under the receiver operating characteristic curve of 0.622 in the TCGA training set, 0.773 in the validation set, and 0.698 in the testing set. When stratified by clinicopathological risk factors, the risk score predicted by the top-ranked MCB was an independent prognostic factor. Our results showed that our pipeline was a reliable tool that may facilitate MCB selection and survival prediction.


2021 ◽  
Author(s):  
Boxuan Liu ◽  
Yun Zhao ◽  
Shuanying Yang

Abstract Background: Lung adenocarcinoma is the most occurred pathological type among non-small cell lung cancer. Although huge progress has been made in terms of early diagnosis, precision treatment in recent years, the overall 5-year survival rate of a patient remains low. In our study, we try to construct an autophagy-related lncRNA prognostic signature that may guide clinical practice.Methods: The mRNA and lncRNA expression matrix of lung adenocarcinoma patients were retrieved from TCGA database. Next, we constructed a co-expression network of lncRNAs and autophagy-related genes. Lasso regression and multivariate Cox regression were then applied to establish a prognostic risk model. Subsequently, a risk score was generated to differentiate high and low risk group and a ROC curve and Nomogram to visualize the predictive ability of current signature. Finally, gene ontology and pathway enrichment analysis were executed via GSEA.Results: A total of 1,703 autophagy-related lncRNAs were screened and five autophagy-related lncRNAs (LINC01137, AL691432.2, LINC01116, AL606489.1 and HLA-DQB1-AS1) were finally included in our signature. Judging from univariate(HR=1.075, 95% CI: 1.046–1.104) and multivariate(HR =1.088, 95%CI = 1.057 − 1.120) Cox regression analysis, the risk score is an independent factor for LUAD patients. Further, the AUC value based on the risk score for 1-year, 3-year, 5-year, was 0.735, 0.672 and 0.662 respectively. Finally, the lncRNAs included in our signature were primarily enriched in autophagy process, metabolism, p53 pathway and JAK/STAT pathway. Conclusions: Overall, our study indicated that the prognostic model we generated had certain predictability for LUAD patients’ prognosis.


2021 ◽  
Vol 2021 ◽  
pp. 1-17
Author(s):  
Lixian Chen ◽  
Zhonglu Ren ◽  
Yongming Cai

Increasing evidence has shown that noncoding RNAs play significant roles in the initiation, progression, and metastasis of tumours via participating in competing endogenous RNA (ceRNA) networks. However, the survival-associated ceRNA in lung adenocarcinoma (LUAD) remains poorly understood. In this study, we aimed to investigate the regulatory mechanisms underlying ceRNA in LUAD to identify novel prognostic factors. mRNA, lncRNA, and miRNA sequencing data obtained from the GDC data portal were utilized to identify differentially expressed (DE) RNAs. Survival-related RNAs were recognized using univariate Kaplan-Meier survival analysis. We performed functional enrichment analysis of survival-related mRNAs using the clusterProfiler package of R and STRING. lncRNA-miRNA and miRNA-mRNA interactions were predicted based on miRcode, Starbase, and miRanda. Subsequently, the survival-associated ceRNA network was constructed for LUAD. Multivariate Cox regression analysis was used to identify prognostic factors. Finally, we acquired 15 DE miRNAs, 49 DE lncRNAs, and 843 DE mRNAs associated with significant overall survival. Functional enrichment analysis indicated that survival-related DE mRNAs were enriched in cell cycle. The survival-associated lncRNA-miRNA-mRNA ceRNA network was constructed using five miRNAs, 49 mRNAs, and 21 lncRNAs. Furthermore, seven hub RNAs (LINC01936, miR-20a-5p, miR-31-5p, TNS1, TGFBR2, SMAD7, and NEDD4L) were identified based on the ceRNA network. LINC01936 and miR-31-5p were found to be significant using the multifactorial Cox regression model. In conclusion, we successfully constructed a survival-related lncRNA-miRNA-mRNA ceRNA regulatory network in LUAD and identified seven hub RNAs, which provide novel insights into the regulatory molecular mechanisms associated with survival of LUAD, and identified two independent prognostic predictors for LUAD.


2021 ◽  
Vol 11 ◽  
Author(s):  
Feng Yang ◽  
Shaoyi Cai ◽  
Li Ling ◽  
Haiji Zhang ◽  
Liang Tao ◽  
...  

Colorectal cancer (CRC) is a major cause of cancer deaths worldwide. Unfortunately, many CRC patients are still being diagnosed at an advanced stage of the cancer, and the 5-year survival rate is only ~30%. Effective prognostic markers of CRC are therefore urgently needed. To address this issue, we performed a detailed bioinformatics analysis based on the Cancer Genome Atlas (TCGA), Genotype-Tissue Expression (GTEx), and Gene Expression Omnibus (GEO) databases to identify prognostic biomarkers for CRC, which in turn help in exploring potential drug-repurposing. We identified five hub genes (PGM2, PODXL, RHNO1, SCD, and SEPHS1), which had good performance in survival prediction and might be involved in CRC through three key pathways (“Cell cycle,” “Purine metabolism,” and “Spliceosome” KEGG pathways) identified by a KEGG pathway enrichment analysis. What is more, we performed a co-expression analysis between five hub genes and transcription factors to explore the upstream regulatory region. Furthermore, we screened the potential drug-repurposing for the five hub genes in CRC according to the Binding DB and ZINC15 databases. Taking together, we constructed a five-gene signature to predict overall survival of CRC and found the potential drug-repurposing, which may improve the outcome of CRC in the future.


2021 ◽  
Author(s):  
Xiaoyu Ji ◽  
Guangdi Chu ◽  
Jinwen Jiao ◽  
Teng Lv ◽  
Yulong Chen ◽  
...  

Abstract Objective: Cervical cancer (CC) is one of the most common types of malignant female cancer, and its incidence and mortality are not optimistic. Protein panels can be a powerful prognostic factor for many types of cancer. The purpose of our study was to investigate a proteomic panel to predict survival of patients with common CC. Methods and results: The protein expression and clinicopathological data of CC were downloaded from The Cancer Proteome Atlas (TCPA) and The Cancer Genome Atlas (TCGA) database, respectively. We selected the prognosis-related proteins (PRPs) by univariate Cox regression analysis and found that the results of functional enrichment analysis were mainly related to apoptosis. We used Kaplan–Meier(K-M) analysis and multivariable Cox regression analysis further to screen PRPs to establish a prognostic model, including BCL2, SMAD3, and 4EBP1-pT70. The signature was verified to be independent predictors of OS by Cox regression analysis and the Area Under Curves. Nomogram and subgroup classification were established based on the signature to verify its clinical application. Furthermore, we looked for the co-expressed proteins of three-protein panel as potential prognostic proteins.Conclusion: A proteomic signature independently predicted OS of CC patients, and the predictive ability was better than the clinicopathological characteristics. This signature can help improve prediction for clinical outcome and provides new targets for CC treatment.


BMC Cancer ◽  
2021 ◽  
Vol 21 (1) ◽  
Author(s):  
Yang Zhai ◽  
Bin Zhao ◽  
Yuzhen Wang ◽  
Lina Li ◽  
Jingjin Li ◽  
...  

Abstract Background Lung adenocarcinoma (LUAD) is the most common pathology subtype of lung cancer. In recent years, immunotherapy, targeted therapy and chemotherapeutics conferred a certain curative effects. However, the effect and prognosis of LUAD patients are different, and the efficacy of existing LUAD risk prediction models is unsatisfactory. Methods The Cancer Genome Atlas (TCGA) LUAD dataset was downloaded. The differentially expressed immune genes (DEIGs) were analyzed with edgeR and DESeq2. The prognostic DEIGs were identified by COX regression. Protein-protein interaction (PPI) network was inferred by STRING using prognostic DEIGs with p value< 0.05. The prognostic model based on DEIGs was established using Lasso regression. Immunohistochemistry was used to assess the expression of FERMT2, FKBP3, SMAD9, GATA2, and ITIH4 in 30 cases of LUAD tissues. Results In total,1654 DEIGs were identified, of which 436 genes were prognostic. Gene functional enrichment analysis indicated that the DEIGs were involved in inflammatory pathways. We constructed 4 models using DEIGs. Finally, model 4, which was constructed using the 436 DEIGs performed the best in prognostic predictions, the receiver operating characteristic curve (ROC) was 0.824 for 3 years, 0.838 for 5 years, 0.834 for 10 years. High levels of FERMT2, FKBP3 and low levels of SMAD9, GATA2, ITIH4 expression are related to the poor overall survival in LUAD (p < 0.05). The prognostic model based on DEIGs reflected infiltration by immune cells. Conclusions In our study, we built an optimal prognostic signature for LUAD using DEIGs and verified the expression of selected genes in LUAD. Our result suggests immune signature can be harnessed to obtain prognostic insights.


2021 ◽  
Author(s):  
Yuan Fang ◽  
Yang Yang ◽  
XiaoLi Zhang ◽  
Na Li ◽  
Bo Yuan ◽  
...  

Abstract Background: The mechanistic basis for the relapse of hepatocellular carcinoma (HCC) remains poorly understood. Recent research has highlighted the important roles of long non-coding RNAs (lncRNAs) in HCC. However, there are only a few studies on lncRNAs associated with the relapse of HCC.Methods:We analyzed lncRNA and mRNA profiles in the GSE101432 dataset associated with HCC relapse. The differentially expressed lncRNAs and mRNAs were used to construct an lncRNA-mRNA co-expression network. Weighted gene co-expression network analysis followed by Gene Ontology and Kyoto Encyclopedia of Genes and Genomes enrichment analyses were conducted on the database. Furthermore, correlation and survival analyses were performed using The Cancer Genome Atlas database, and the clinical samples were verified by qRT-PCR.Results:In this study, lncRNAs and mRNAs associated with HCC recurrence were identified. Two gene modules were found to be closely linked to HCC relapse. The functional enrichment analysis results of lncRNAs and co-expression mRNAs indicated that they were closely related to the recurrence of HCC. In addition, we verified that the overall survival and recurrence-free survival of these genes in HCC have survival prediction functions. In total, we identified and validated two lncRNAs (LINC00941 and LINC00668) and six mRNAs (LOX, MICB, OTX1, BAIAP2L2, KCTD17, NDUFA4L2) associated with HCC relapse.Conclusion: In summary, we identified the key gene modules and central genes associated with recurrent HCC, and constructed lncRNA-mRNA networks related to this cancer type. These results provide a foundation for future basic research on the mechanism of recurrent liver cancer.


Sign in / Sign up

Export Citation Format

Share Document