Identification of a prognostic long non-coding RNA signature in breast cancer
Abstract Background: Breast cancer is the most common malignant disease among women. At present, more and more attention has been paid to long non-coding RNAs (lncRNAs) in the field of breast cancer research. We aimed to investigate the expression profiles of lncRNAs and construct a prognostic lncRNA for predicting the overall survival (OS) of breast cancer. Methods: The expression profiles of lncRNAs and clinical data with breast cancer were obtained from The Cancer Genome Atlas (TCGA). Differentially expressed lncRNAs were screened out by R package (limma). The survival probability was estimated by the Kaplan‑Meier Test. The Cox Regression Model was performed for univariate and multivariate analysis. The risk score (RS) was established on the basis of the lncRNAs’ expression level (exp) multiplied regression coefficient (β) from the multivariate cox regression analysis with the following formula: RS=exp a1 * β a1 + exp a2 * β a2 +……+ exp an * β an . Functional enrichment analysis was performed by Metascape. Results: A total of 3404 differentially expressed lncRNAs were identified. Among them, CYTOR , MIR4458HG and MAPT-AS1 were significantly associated with the survival of breast cancer. Finally, The RS could predict OS of breast cancer (RS=exp CYTOR * β CYTOR + exp MIR4458HG * β MIR4458HG + exp MAPT-AS1 * β MAPT-AS1 ). Moreover, it was confirmed that the three-lncRNA signature could be an independent prognostic biomarker for breast cancer (HR=3.040, P=0.000). Conclusions: This study established a three-lncRNA signature, which might be a novel prognostic biomarker for breast cancer.