Identification and Validation of Immune-Related Gene Prognostic Signature for breast cancer

Author(s):  
Bing Zhou ◽  
Linyi Zhang ◽  
Qinyu Wang ◽  
Changqin Pu ◽  
Sixuan Guo ◽  
...  

Abstract Background Although the outcome of breast cancer patients has been improved by advances in early detection, diagnosis and treatment. Due to the heterogeneity of the disease, prognostic assessment still faces challenges. The accumulated data indicate that there is a clear correlation between the tumor immune microenvironment and clinical outcomes. Objective Construct immune-related gene pairs to evaluate the prognosis of breast cancer and patient survival rate. Methods From the Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO) database, the Gene expression profiles and clinical data of breast cancer samples were downloaded. TCGA cohort were further divided into a training set (n = 764) and internal validation sets (n = 325). The GEO cohort was analyzed as an external validation cohort (n = 327). In the training set, differently expressed immune-relevant genes (IRGs) were screened firstly, and they were used to construct immune-relevant gene pairs (IRGPs). Then, the prognostic IRGPs were identified via univariate Cox regression analysis. Finally, least absolute shrinkage and selection operator (LASSO) Cox regression analysis was used to constituted the IRGP prognostic signature. Kaplan-Meier (KM) survival curves, receiver operating characteristic (ROC) curve analysis, univariate and multivariate Cox regression analysis were used to estimate the predictive value of the IRGP prognostic signature. And the IRGP prognostic signature was validated in the internal validation cohort and external validation cohort. We used gene set enrichment analysis (GSEA) to elucidate the biological functions of the IRGP prognostic signature. Results A total of 474 differently expressed IRGs and 2942 prognostic IRGPs were identified. Finally, we generated a IRGP prognostic signature consisting of 33 IRGPs. Subsequently, the 33 IRGPs grouped BRCA patients into high- and low-risk groups. Kaplan-Meier curves shown a significantly different overall survival in risk groups. Time-dependent ROC curves indicated that the IRGP prognostic signature possessed a high specificity and sensitivity in all the sets. Univariate and multivariate Cox regression analysis showed a statistical significance for the prognostic value of IRGP prognostic signature and the IRGP prognostic signature was a strong independent risk factor. The functional enrichment analysis indicated that low IRGP value was correlated with biological processes related to immune. Immune cell infiltration analysis indicated a significant difference in percentage of M2 macrophages between high- and low-risk groups. Conclusion The 33-IRGPs prognostic signature was developed to provide new insights for the identification of high-risk breast cancer and the evaluation of the possibility of immunotherapy in personalized breast cancer treatment.

2021 ◽  
Vol 12 ◽  
Author(s):  
Guomin Wu ◽  
Qihao Wang ◽  
Ting Zhu ◽  
Linhai Fu ◽  
Zhupeng Li ◽  
...  

This study aimed to establish a prognostic risk model for lung adenocarcinoma (LUAD). We firstly divided 535 LUAD samples in TCGA-LUAD into high-, medium-, and low-immune infiltration groups by consensus clustering analysis according to immunological competence assessment by single-sample gene set enrichment analysis (ssGSEA). Profile of long non-coding RNAs (lncRNAs) in normal samples and LUAD samples in TCGA was used for a differential expression analysis in the high- and low-immune infiltration groups. A total of 1,570 immune-related differential lncRNAs in LUAD were obtained by intersecting the above results. Afterward, univariate COX regression analysis and multivariate stepwise COX regression analysis were conducted to screen prognosis-related lncRNAs, and an eight-immune-related-lncRNA prognostic signature was finally acquired (AL365181.2, AC012213.4, DRAIC, MRGPRG-AS1, AP002478.1, AC092168.2, FAM30A, and LINC02412). Kaplan–Meier analysis and ROC analysis indicated that the eight-lncRNA-based model was accurate to predict the prognosis of LUAD patients. Simultaneously, univariate COX regression analysis and multivariate COX regression analysis were undertaken on clinical features and risk scores. It was illustrated that the risk score was a prognostic factor independent from clinical features. Moreover, immune data of LUAD in the TIMER database were analyzed. The eight-immune-related-lncRNA prognostic signature was related to the infiltration of B cells, CD4+ T cells, and dendritic cells. GSEA enrichment analysis revealed significant differences in high- and low-risk groups in pathways like pentose phosphate pathway, ubiquitin mediated proteolysis, and P53 signaling pathway. This study helps to treat LUAD patients and explore molecules related to LUAD immune infiltration to deeply understand the specific mechanism.


2021 ◽  
Vol 16 ◽  
Author(s):  
Dongqing Su ◽  
Qianzi Lu ◽  
Yi Pan ◽  
Yao Yu ◽  
Shiyuan Wang ◽  
...  

Background: Breast cancer has plagued women for many years and caused many deaths around the world. Method: In this study, based on the weighted correlation network analysis, univariate Cox regression analysis and least absolute shrinkage and selection operator, 12 immune-related genes were selected to construct the risk score for breast cancer patients. The multivariable Cox regression analysis, gene set enrichment analysis and nomogram were also conducted in this study. Results: Good results were obtained in the survival analysis, enrichment analysis, multivariable Cox regression analysis and immune-related feature analysis. When the risk score model was applied in 22 breast cancer cohorts, the univariate Cox regression analysis demonstrated that the risk score model was significantly associated with overall survival in most of the breast cancer cohorts. Conclusion: Based on these results, we could conclude that the proposed risk score model may be a promising method, and may improve the treatment stratification of breast cancer patients in the future work.


2021 ◽  
Vol 12 ◽  
Author(s):  
Xiaoping Li ◽  
Jishang Chen ◽  
Qihe Yu ◽  
Hui Huang ◽  
Zhuangsheng Liu ◽  
...  

Background: A surge in newly diagnosed breast cancer has overwhelmed the public health system worldwide. Joint effort had beed made to discover the genetic mechanism of these disease globally. Accumulated research has revealed autophagy may act as a vital part in the pathogenesis of breast cancer.Objective: Aim to construct a prognostic model based on autophagy-related lncRNAs and investigate their potential mechanisms in breast cancer.Methods: The transcriptome data and clinical information of patients with breast cancer were obtained from The Cancer Genome Atlas (TCGA) database. Autophagy-related genes were obtained from the Human Autophagy Database (HADb). Long non-coding RNAs (lncRNAs) related to autophagy were acquired through the Pearson correlation analysis. Univariate Cox regression analysis as well as the least absolute shrinkage and selection operator (LASSO) regression analysis were used to identify autophagy-related lncRNAs with prognostic value. We constructed a risk scoring model to assess the prognostic significance of the autophagy-related lncRNAs signatures. The nomogram was then established based on the risk score and clinical indicators. Through the calibration curve, the concordance index (C-index) and receiver operating characteristic (ROC) curve analysis were evaluated to obtain the model's predictive performance. Subgroup analysis was performed to evaluate the differential ability of the model. Subsequently, gene set enrichment analysis was conducted to investigate the potential functions of these lncRNAs.Results: We attained 1,164 breast cancer samples from the TCGA database and 231 autophagy-related genes from the HAD database. Through correlation analysis, 179 autophagy-related lncRNAs were finally identified. Univariate Cox regression analysis and LASSO regression analysis further screened 18 prognosis-associated lncRNAs. The risk scoring model was constructed to divide patients into high-risk and low-risk groups. It was found that the low-risk group had better overall survival (OS) than those of the high-risk group. Then, the nomogram model including age, tumor stage, TNM stage and risk score was established. The evaluation index (C-index: 0.78, 3-year OS AUC: 0.813 and 5-year OS AUC: 0.785) showed that the nomogram had excellent predictive power. Subgroup analysis showed there were difference in OS between high-risk and low-risk patients in different subgroups (stage I-II, ER positive, Her-2 negative and non-TNBC subgroups; all P < 0.05). According to the results of gene set enrichment analysis, these lncRNAs were involved in the regulation of multicellular organismal macromolecule metabolic process in multicellular organisms, nucleotide excision repair, oxidative phosphorylation, and TGF-β signaling pathway.Conclusions: We identified 18 autophagy-related lncRNAs with prognostic value in breast cancer, which may regulate tumor growth and progression in multiple ways.


2021 ◽  
Author(s):  
Yushu Liu ◽  
Jiantao Gong ◽  
Yanyi Huang ◽  
Qunguang Jiang

Abstract Background:Colon cancer is a common malignant cancer with high incidence and poor prognosis. Cell senescence and apoptosis are important mechanisms of tumor occurrence and development, in which aging-related genes(ARGs) play an important role. This study aimed to establish a prognostic risk model based on ARGs for diagnosis and prognosis prediction of colon cancer .Methods: We downloaded transcriptome data and clinical information of colon cancer patients from the Cancer Genome Atlas(TCGA) database and the microarray dataset(GSE39582) from the Gene Expression Omnibus(GEO) database. Univariate COX, least absolute shrinkage and selection operator(LASSO) regression algorithm and multivariate COX regression analysis were used to construct a 6-ARG prognosis model and calculated the riskScore. The prognostic signatures is validated by internal validation cohort and external validation cohort(GSE39582).In addition, functional enrichment pathways and immune microenvironment of aging-related genes(ARGs) were also analyzed. We also analyzed the correlation between rsikScore and clinical features and constructed a nomogram based on riskScore. We are the first to construct prognostic nomogram based on ARGs.Results: Through univariate COX,LASSO regression algorithm and multivariate COX regression analysis,6 prognostic ARGs (PDPK1,RAD52,GSR,IL7,BDNF and SERPINE1) were screened out and riskScore was constructed. We have verified that riskScore has good prognostic value in both internal validation cohort and external validation cohort. Pathway enrichment and immunoanalysis of ARGs provide a direction for the treatment of colon cancer patients. We also found that riskScore was closely related to the clinical characteristics of patients. Based on riskScore and related clinical features, we constructed a nomogram, which has good predictive performance.Conclusion: The 6-ARG prognostic signature we constructed has a certain clinical predictive ability. Its riskScore is also closely related to clinical characteristics, and nomogram based on this has stronger predictive ability than a single indicator. ARGs and the nomogram we constructed may provide a promising treatment for colon cancer patients.


2021 ◽  
Vol 11 ◽  
Author(s):  
Haoran Li ◽  
Fang Zhou ◽  
Zhifei Cao ◽  
Yuchen Tang ◽  
Yujie Huang ◽  
...  

PurposeThis study aimed to develop and validate a nomogram with preoperative nutritional indicators and tumor markers for predicting prognosis of patients with pancreatic ductal adenocarcinoma (PDAC).MethodsWe performed a bicentric, retrospective study including 155 eligible patients with PDAC. Patients were divided into a training group (n = 95), an internal validation group (n = 34), an external validation group (n = 26), and an entire validation group (n = 60). Cox regression analysis was conducted in the training group to identify independent prognostic factors to construct a nomogram for overall survival (OS) prediction. The performance of the nomogram was assessed in validation groups and through comparison with controlling nutritional status (CONUT) and prognostic nutrition index (PNI).ResultsThe least absolute shrinkage and selection operator (LASSO) regression, univariate and multivariate Cox regression analysis revealed that serum albumin and lymphocyte count were independent protective factors while CA19-9 and diabetes were independent risk factors. The concordance index (C-index) of the nomogram in the training, internal validation, external validation and entire validation groups were 0.777, 0.769, 0.759 and 0.774 respectively. The areas under curve (AUC) of the nomogram in each group were 0.861, 0.845, 0.773, and 0.814. C-index and AUC of the nomogram were better than those of CONUT and PNI in the training and validation groups. The net reclassification index (NRI), integrated discrimination improvement (IDI) and decision curve analysis showed improvement of accuracy of the nomogram in predicting OS and better net benefit in guiding clinical decisions in comparison with CONUT and PNI.ConclusionsThe nomogram incorporating four preoperative nutritional and tumor markers including serum albumin concentration, lymphocyte count, CA19-9 and diabetes mellitus could predict the prognosis more accurately than CONUT and PNI and may serve as a clinical decision support tool to determine what treatment options to choose.


2020 ◽  
Author(s):  
Ze-bing Song ◽  
Guo-pei Zhang ◽  
shaoqiang li

Abstract Background: Hepatocellular carcinoma (HCC) is one of the most common malignant tumor in the world which prognosis is poor. Therefore, a precise biomarker is needed to guide treatment and improve prognosis. More and more studies have shown that lncRNAs and immune response are closely related to the prognosis of hepatocellular carcinoma. The aim of this study was to establish a prognostic signature based on immune related lncRNAs for HCC.Methods: Univariate cox regression analysis was performed to identify immune related lncRNAs, which had negative correlation with overall survival (OS) of 370 HCC patients from The Cancer Genome Atlas (TCGA). A prognostic signature based on OS related lncRNAs was identified by using multivariate cox regression analysis. Gene set enrichment analysis (GSEA) and a competing endogenous RNA (ceRNA) network were performed to clarify the potential mechanism of lncRNAs included in prognostic signature. Results: A prognostic signature based on OS related lncRNAs (AC145207.5, AL365203.2, AC009779.2, ZFPM2-AS1, PCAT6, LINC00942) showed moderately in prognosis prediction, and related with pathologic stage (Stage I&II VS Stage III&IV), distant metastasis status (M0 VS M1) and tumor stage (T1-2 VS T3-4). CeRNA network constructed 15 aixs among differentially expressed immune related genes, lncRNAs included in prognostic signature and differentially expressed miRNA. GSEA indicated that these lncRNAs were involved in cancer-related pathways. Conclusion: We constructed a prognostic signature based on immune related lncRNAs which can predict prognosis and guide therapies for HCC.


2021 ◽  
Vol 8 ◽  
Author(s):  
Zhengjie Xu ◽  
Suxiao Jiang ◽  
Juan Ma ◽  
Desheng Tang ◽  
Changsheng Yan ◽  
...  

Background: Breast cancer (BC) is a heterogeneous malignant tumor, leading to the second major cause of female mortality. This study aimed to establish an in-depth relationship between ferroptosis-related LncRNA (FRlncRNA) and the prognosis as well as immune microenvironment of the patients with BC.Methods: We downloaded and integrated the gene expression data and the clinical information of the patients with BC from The Cancer Genome Atlas (TCGA) database. The co-expression network analysis and univariate Cox regression analysis were performed to screen out the FRlncRNAs related to prognosis. A cluster analysis was adopted to explore the difference of immune microenvironment between the clusters. Furthermore, we determined the optimal survival-related FRLncRNAs for final signature by LASSO Cox regression analysis. Afterward, we constructed and validated the prediction models, which were further tested in different subgroups.Results: A total of 31 FRLncRNAs were filtrated as prognostic biomarkers. Two clusters were determined, and C1 showed better prognosis and higher infiltration level of immune cells, such as B cells naive, plasma cells, T cells CD8, and T cells CD4 memory activated. However, there were no significantly different clinical characters between the clusters. Gene Set Enrichment Analysis (GSEA) revealed that some metabolism-related pathways and immune-associated pathways were exposed. In addition, 12 FRLncRNAs were determined by LASSO analysis and used to construct a prognostic signature. In both the training and testing sets, patients in the high-risk group had a worse survival than the low-risk patients. The area under the curves (AUCs) of receiver operator characteristic (ROC) curves were about 0.700, showing positive prognostic capacity. More notably, through the comprehensive analysis of heatmap, we regarded LINC01871, LINC02384, LIPE-AS1, and HSD11B1-AS1 as protective LncRNAs, while LINC00393, AC121247.2, AC010655.2, LINC01419, PTPRD-AS1, AC099329.2, OTUD6B-AS1, and LINC02266 were classified as risk LncRNAs. At the same time, the patients in the low-risk groups were more likely to be assigned to C1 and had a higher immune score, which were consistent with a better prognosis.Conclusion: Our research indicated that the ferroptosis-related prognostic signature could be used as novel biomarkers for predicting the prognosis of BC. The differences in the immune microenvironment exhibited by BC patients with different risks and clusters suggested that there may be a complementary synergistic effect between ferroptosis and immunotherapy.


2021 ◽  
Vol 11 ◽  
Author(s):  
Junyu Huo ◽  
Liqun Wu ◽  
Yunjin Zang

BackgroundThe high mutation rate of TP53 in hepatocellular carcinoma (HCC) makes it an attractive potential therapeutic target. However, the mechanism by which TP53 mutation affects the prognosis of HCC is not fully understood.Material and ApproachThis study downloaded a gene expression profile and clinical-related information from The Cancer Genome Atlas (TCGA) database and the international genome consortium (ICGC) database. We used Gene Set Enrichment Analysis (GSEA) to determine the difference in gene expression patterns between HCC samples with wild-type TP53 (n=258) and mutant TP53 (n=116) in the TCGA cohort. We screened prognosis-related genes by univariate Cox regression analysis and Kaplan–Meier (KM) survival analysis. We constructed a six-gene prognostic signature in the TCGA training group (n=184) by Lasso and multivariate Cox regression analysis. To assess the predictive capability and applicability of the signature in HCC, we conducted internal validation, external validation, integrated analysis and subgroup analysis.ResultsA prognostic signature consisting of six genes (EIF2S1, SEC61A1, CDC42EP2, SRM, GRM8, and TBCD) showed good performance in predicting the prognosis of HCC. The area under the curve (AUC) values of the ROC curve of 1-, 2-, and 3-year survival of the model were all greater than 0.7 in each independent cohort (internal testing cohort, n = 181; TCGA cohort, n = 365; ICGC cohort, n = 229; whole cohort, n = 594; subgroup, n = 9). Importantly, by gene set variation analysis (GSVA) and the single sample gene set enrichment analysis (ssGSEA) method, we found three possible causes that may lead to poor prognosis of HCC: high proliferative activity, low metabolic activity and immunosuppression.ConclusionOur study provides a reliable method for the prognostic risk assessment of HCC and has great potential for clinical transformation.


2020 ◽  
Vol 2020 ◽  
pp. 1-16
Author(s):  
Huiling Wang ◽  
Shuo You ◽  
Meng Fang ◽  
Qian Fang

Background. Breast cancer (BC) is the most common malignant tumor in women. The immunophenotype of tumor microenvironment (TME) has shown great therapeutic potential in tumor. Method. The transcriptome was obtained from TCGA and GEO data. Immune infiltration was analyzed by single-sample gene set enrichment (ssGSEA). The immune feature was constructed by Cox regression analysis. In addition, the coexpression of differential expression genes (DEGs) was identified. Through enrichment analysis, the function and pathway of module genes were identified. The somatic mutations related to immune characteristics were analyzed by Maftools. By using the consistency clustering algorithm, the molecular subtypes were constructed, and the overall survival time (OS) was predicted. Results. Immune landscape can be divided into low immune infiltration and high immune infiltration. Cox regression analysis identified seven immune cells as protective factors of BC. In the coexpression modules for DEGs of high and low immune infiltration, module 1 was related to T cells and high immune infiltration. In particular, the area under the curve (AUC) value of hub gene SASH3 was the highest, and the correlation with T cells was stronger in the high immune infiltration. Enrichment analysis found that oxidative stress, T cell aggregation, and apoptosis were observed in high immune infiltration. In addition, TP53 was identified as the most important somatic gene mutation related to immune characteristics. Importantly, we also constructed seven immune cell-based breast cancer subtypes to predict OS. Conclusion. We evaluated the immune landscape of BC and constructed the gene characteristics related to the immune landscape. The potential of T cells and SASH3 in immunotherapy of BC was revealed, which may guide the development of new clinical treatment strategies.


2021 ◽  
Vol 21 (1) ◽  
Author(s):  
Jianfeng Zheng ◽  
Jialu Guo ◽  
Benben Cao ◽  
Ying Zhou ◽  
Jinyi Tong

Abstract Background Both N6-methyladenosine (m6A) modification and lncRNAs play an important role in the carcinogenesis and cancer inhibition of ovarian cancer (OC). However, lncRNAs involved in m6A regulation (LI-m6As) have never been reported in OC. Herein, we aimed to identify and validate a signature based on LI-m6A for OC. Methods RNA sequencing profiles with corresponding clinical information associated with OC and 23 m6A regulators were extracted from TCGA. The Pearson correlation coefficient (PCC) between lncRNAs and 23 m6A regulators (|PCC|> 0.4 and p < 0.01) was calculated to identify LI-m6As. The LI-m6As with significant prognostic value were screened based on univariate Cox regression analysis to construct a risk model by LASSO Cox regression. Gene Set Enrichment Analysis (GSEA) was implemented to survey the biological functions of the risk groups. Several clinicopathological characteristics were utilized to evaluate their ability to predict prognosis, and a nomogram was constructed to evaluate the accuracy of survival prediction. Besides, immune microenvironment, checkpoint, and drug sensitivity in the two risk groups were compared using comprehensive algorithms. Finally, real-time qPCR analysis and cell counting kit-8 assays were performed on an alternative lncRNA, CACNA1G-AS1. Results The training cohort involving 258 OC patients and the validation cohort involving 111 OC patients were downloaded from TCGA. According to the PCC between the m6A regulators and lncRNAs, 129 LI-m6As were obtained to perform univariate Cox regression analysis and then 10 significant prognostic LI-m6As were identified. A prognostic signature containing four LI-m6As (AC010894.3, ACAP2-IT1, CACNA1G-AS1, and UBA6-AS1) was constructed according to the LASSO Cox regression analysis of the 10 LI-m6As. The prognostic signature was validated to show completely opposite prognostic value in the two risk groups and adverse overall survival (OS) in several clinicopathological characteristics. GSEA indicated that differentially expressed genes in disparate risk groups were enriched in several tumor-related pathways. At the same time, we found significant differences in some immune cells and chemotherapeutic agents between the two groups. An alternative lncRNA, CACNA1G-AS1, was proven to be upregulated in 30 OC specimens and 3 OC cell lines relative to control. Furthermore, knockdown of CACNA1G‐AS1 was proven to restrain the multiplication capacity of OC cells. Conclusions Based on the four LI-m6As (AC010894.3, ACAP2-IT1, CACNA1G-AS1, and UBA6-AS1), the risk model we identified can independently predict the OS and therapeutic value of OC. CACNA1G‐AS1 was preliminarily proved to be a malignant lncRNA.


Sign in / Sign up

Export Citation Format

Share Document