scholarly journals Combining Genetic Mutation and Expression Profiles Identifies Novel Prognostic Biomarkers of Lung Adenocarcinoma

2020 ◽  
Vol 14 ◽  
pp. 117955492096626
Author(s):  
Yun Liu ◽  
Fu Liu ◽  
Xintong Hu ◽  
Jiaxue He ◽  
Yanfang Jiang

Motivation: Although several prognostic signatures for lung adenocarcinoma (LUAD) have been developed, they are mainly based on a single-omics data set. This article aims to develop a novel set of prognostic signatures by combining genetic mutation and expression profiles of LUAD patients. Methods: The genetic mutation and expression profiles, together with the clinical profiles of a cohort of LUAD patients from The Cancer Genome Atlas (TCGA), were downloaded. Patients were separated into 2 groups, namely, the high-risk and low-risk groups, according to their overall survivals. Then, differential analysis was performed to determine differentially expressed genes (DEGs) and mutated genes (DMGs) in the expression and mutation profiles, respectively, between the 2 groups. Finally, a prognostic model based on the support vector machine (SVM) algorithm was developed by combining the expression values of the DEGs and the mutation times of the DMGs. Results: A total of 13 DEGs and 7 DMGs were recognized between the 2 groups. Their prognostic values were validated using independent cohorts. Compared with several existing signatures, the proposed prognostic signatures exhibited better prediction performance in the testing set. In addition, it is found that 1 of the 7 DMGs, GRIN2B, is mutated much more frequently in the high-risk group, showing a potential value as a therapy target. Conclusions: Combining multi-omics data sets is an applicable manner to identify novel prognostic signatures and to improve the prognostic prediction for LUAD, which will be heuristic to other types of cancers.

2020 ◽  
Vol 10 (1) ◽  
Author(s):  
Bin Baek ◽  
Hyunju Lee

Abstract Predicting the prognosis of pancreatic cancer is important because of the very low survival rates of patients with this particular cancer. Although several studies have used microRNA and gene expression profiles and clinical data, as well as images of tissues and cells, to predict cancer survival and recurrence, the accuracies of these approaches in the prediction of high-risk pancreatic adenocarcinoma (PAAD) still need to be improved. Accordingly, in this study, we proposed two biological features based on multi-omics datasets to predict survival and recurrence among patients with PAAD. First, the clonal expansion of cancer cells with somatic mutations was used to predict prognosis. Using whole-exome sequencing data from 134 patients with PAAD from The Cancer Genome Atlas (TCGA), we found five candidate genes that were mutated in the early stages of tumorigenesis with high cellular prevalence (CP). CDKN2A, TP53, TTN, KCNJ18, and KRAS had the highest CP values among the patients with PAAD, and survival and recurrence rates were significantly different between the patients harboring mutations in these candidate genes and those harboring mutations in other genes (p = 2.39E−03, p = 8.47E−04, respectively). Second, we generated an autoencoder to integrate the RNA sequencing, microRNA sequencing, and DNA methylation data from 134 patients with PAAD from TCGA. The autoencoder robustly reduced the dimensions of these multi-omics data, and the K-means clustering method was then used to cluster the patients into two subgroups. The subgroups of patients had significant differences in survival and recurrence (p = 1.41E−03, p = 4.43E−04, respectively). Finally, we developed a prediction model for prognosis using these two biological features and clinical data. When support vector machines, random forest, logistic regression, and L2 regularized logistic regression were used as prediction models, logistic regression analysis generally revealed the best performance for both disease-free survival (DFS) and overall survival (OS) (accuracy [ACC] = 0.762 and area under the curve [AUC] = 0.795 for DFS; ACC = 0.776 and AUC = 0.769 for OS). Thus, we could classify patients with a high probability of recurrence and at a high risk of poor outcomes. Our study provides insights into new personalized therapies on the basis of mutation status and multi-omics data.


PeerJ ◽  
2021 ◽  
Vol 9 ◽  
pp. e11687
Author(s):  
Guangxu Tu ◽  
Weilin Peng ◽  
Qidong Cai ◽  
Zhenyu Zhao ◽  
Xiong Peng ◽  
...  

Background Ferroptosis is a novel form of programmed cell death characterized by the excessive accumulation of intracellular iron and an increase in reactive oxygen species. Emerging studies have shown that ferroptosis plays a vital role in the progression of lung adenocarcinoma, but the effect of ferroptosis-related genes on prognosis has been poorly studied. The purpose of this study was to explore the prognostic value of ferroptosis-related genes. Methods Lung adenocarcinoma samples were downloaded from The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO) databases. The least absolute shrinkage and selection operator (LASSO) Cox regression algorithm was used to establish a predictive signature for risk stratification. Kaplan–Meier (K–M) survival analysis and receiver operating characteristic (ROC) curve analysis were conducted to evaluate the signature. We further explored the potential correlation between the risk score model and tumor immune status. Results A 15-gene ferroptosis signature was constructed to classify patients into different risk groups. The overall survival (OS) of patients in the high-risk group was significantly shorter than that of patients in the low-risk group. The signature could predict OS independent of other risk factors. Single-sample gene set enrichment analysis (ssGSEA) identified the difference in immune status between the two groups. Patients in the high-risk group had stronger immune suppression, especially in the antigen presentation process. Conclusions The 15-gene ferroptosis signature identified in this study could be a potential biomarker for prognosis prediction in lung adenocarcinoma. Targeting ferroptosis might be a promising therapeutic alternative for lung adenocarcinoma.


PeerJ ◽  
2021 ◽  
Vol 9 ◽  
pp. e11627
Author(s):  
Jiaying Liang ◽  
Yaofeng Zhi ◽  
Wenhui Deng ◽  
Weige Zhou ◽  
Xuejun Li ◽  
...  

Background Hepatocellular carcinoma (HCC) with high heterogeneity is one of the most frequent malignant tumors throughout the world. However, there is no research to establish a ferroptosis-related lncRNAs (FRlncRNAs) signature for the patients with HCC. Therefore, this study was designed to establish a novel FRlncRNAs signature to predict the survival of patients with HCC. Method The expression profiles of lncRNAs were acquired from The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO) database. FRlncRNAs co-expressed with ferroptosis-related genes were utilized to establish a signature. Cox regression was used to construct a novel three FRlncRNAs signature in the TCGA cohort, which was verified in the GEO validation cohort. Results Three differently expressed FRlncRNAs significantly associated with prognosis of HCC were identified, which composed a novel FRlncRNAs signature. According to the FRlncRNAs signature, the patients with HCC could be divided into low- and high-risk groups. Patients with HCC in the high-risk group displayed shorter overall survival (OS) contrasted with those in the low-risk group (P < 0.001 in TCGA cohort and P = 0.045 in GEO cohort). This signature could serve as a significantly independent predictor in Cox regression (multivariate HR > 1, P < 0.001), which was verified to a certain extent in the GEO cohort (univariate HR > 1, P < 0.05). Meanwhile, it was also a useful tool in predicting survival among each stratum of gender, age, grade, stage, and etiology,etc. This signature was connected with immune cell infiltration (i.e., Macrophage, Myeloid dendritic cell, and Neutrophil cell, etc.) and immune checkpoint blockade targets (PD-1, CTLA-4, and TIM-3). Conclusion The three FRlncRNAs might be potential therapeutic targets for patients, and their signature could be utilized for prognostic prediction in HCC.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Dejun Wu ◽  
Zhenhua Yin ◽  
Yisheng Ji ◽  
Lin Li ◽  
Yunxin Li ◽  
...  

AbstractLncRNAs play a pivotal role in tumorigenesis and development. However, the potential involvement of lncRNAs in colon adenocarcinoma (COAD) needs to be further explored. All the data used in this study were obtained from The Cancer Genome Atlas database, and all analyses were conducted using R software. Basing on the seven prognosis-related lncRNAs finally selected, we developed a prognosis-predicting model with powerful effectiveness (training cohort, 1 year: AUC = 0.70, 95% Cl = 0.57–0.78; 3 years: AUC = 0.71, 95% Cl = 0.6–0.8; 5 years: AUC = 0.76, 95% Cl = 0.66–0.87; validation cohort, 1 year: AUC = 0.70, 95% Cl = 0.58–0.8; 3 years: AUC = 0.73, 95% Cl = 0.63–0.82; 5 years: AUC = 0.68, 95% Cl = 0.5–0.85). The VEGF and Notch pathway were analyzed through GSEA analysis, and low immune and stromal scores were found in high-risk patients (immune score, cor =  − 0.15, P < 0.001; stromal score, cor =  − 0.18, P < 0.001) , which may partially explain the poor prognosis of patients in the high-risk group. We screened lncRNAs that are significantly associated with the survival of patients with COAD and possibly participate in autophagy regulation. This study may provide direction for future research.


Cancers ◽  
2021 ◽  
Vol 13 (1) ◽  
pp. 155
Author(s):  
Pankaj Ahluwalia ◽  
Meenakshi Ahluwalia ◽  
Ashis K. Mondal ◽  
Nikhil Sahajpal ◽  
Vamsi Kota ◽  
...  

Lung cancer is one of the leading causes of death worldwide. Cell death pathways such as autophagy, apoptosis, and necrosis can provide useful clinical and immunological insights that can assist in the design of personalized therapeutics. In this study, variations in the expression of genes involved in cell death pathways and resulting infiltration of immune cells were explored in lung adenocarcinoma (The Cancer Genome Atlas: TCGA, lung adenocarcinoma (LUAD), 510 patients). Firstly, genes involved in autophagy (n = 34 genes), apoptosis (n = 66 genes), and necrosis (n = 32 genes) were analyzed to assess the prognostic significance in lung cancer. The significant genes were used to develop the cell death index (CDI) of 21 genes which clustered patients based on high risk (high CDI) and low risk (low CDI). The survival analysis using the Kaplan–Meier curve differentiated patients based on overall survival (40.4 months vs. 76.2 months), progression-free survival (26.2 months vs. 48.6 months), and disease-free survival (62.2 months vs. 158.2 months) (Log-rank test, p < 0.01). Cox proportional hazard model significantly associated patients in high CDI group with a higher risk of mortality (Hazard Ratio: H.R 1.75, 95% CI: 1.28–2.45, p < 0.001). Differential gene expression analysis using principal component analysis (PCA) identified genes with the highest fold change forming distinct clusters. To analyze the immune parameters in two risk groups, cytokines expression (n = 265 genes) analysis revealed the highest association of IL-15RA and IL 15 (> 1.5-fold, p < 0.01) with the high-risk group. The microenvironment cell-population (MCP)-counter algorithm identified the higher infiltration of CD8+ T cells, macrophages, and lower infiltration of neutrophils with the high-risk group. Interestingly, this group also showed a higher expression of immune checkpoint molecules CD-274 (PD-L1), CTLA-4, and T cell exhaustion genes (HAVCR2, TIGIT, LAG3, PDCD1, CXCL13, and LYN) (p < 0.01). Furthermore, functional enrichment analysis identified significant perturbations in immune pathways in the higher risk group. This study highlights the presence of an immunocompromised microenvironment indicated by the higher infiltration of cytotoxic T cells along with the presence of checkpoint molecules and T cell exhaustion genes. These patients at higher risk might be more suitable to benefit from PD-L1 blockade or other checkpoint blockade immunotherapies.


2021 ◽  
Author(s):  
Zhenyu Zhao ◽  
Boxue He ◽  
Qidong Cai ◽  
Pengfei Zhang ◽  
Xiong Peng ◽  
...  

Abstract Background: Lung adenocarcinoma (LUAD) accounts for a majority of cancer-related deaths worldwide annually. A recent study shows that immunotherapy is an effective method of LUAD treatment, and tumor mutation burden (TMB) was associated with the immune microenvironment and affected the immunotherapy. Exploration of the gene signature associated with tumor mutation burden and immune infiltrates in predicting prognosis in lung adenocarcinoma in this study, we explored the correlation of TMB with immune infiltration and prognosis in LUAD.Materials and Methods: In this study, we firstly got mutation data and LUAD RNA-Seq data of the LUAD from The Cancer Genome Atlas (TCGA), and according to the TMB we divided the patients into high/low-TMB levels groups. The gene ontology (GO) pathway enrichment analysis and KOBAS-Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways analysis were utilized to explore the molecular function of the differentially expressed genes (DEGs) between the two groups. The function enrichment analyses of DEGs were related to the immune pathways. Then, the ESTIMATE algorithm, CIBERSORT, and ssGSEA analysis were utilized to identify the relationship between TMB subgroups and immune infiltration. According to the results, Venn analysis was utilized to select the immune-related genes in DEGs. Univariate and Lasso Cox proportional hazards regression analyses were performed to construct the signature which positively associated with the immune infiltration and affected the survival. Finally, we verified the correlation between the signature and immune infiltration. Result: The exploration of the immune infiltration suggested that high-TMB subgroups positively associated with the high level of immune infiltration in LUAD patients. According to the TMB-related immune signature, the patients were divided into High/Low-risk groups, and the high-risk group was positively associated with poor prognostic. The results of the PCA analysis confirmed the validity of the signature. We also verified the effectiveness of the signature in GSE30219 and GSE72094 datasets. The ROC curves and C-index suggested the good clinical application of the TMB-related immune signature in LUAD prognosis. Another result suggested that the patients of the high-risk group were positively associated with higher TMB levels, PD-L1expression, and immune infiltration levels.Conclusion: In conclusion, our signature provides potential biomarkers for studying aspects of the TMB in LUAD such as TMB affected immune microenvironment and prognosis. This signature may provide some biomarkers which could improve the biomarkers of PD-L1 immunotherapy response and were inverted for the clinical application of the TMB in LUAD. LUAD male patients with higher TMB-levels and risk scores may benefit from immunotherapy. The high-risk patients along with higher PD-L1 expression of the signature may suitable for immunotherapy and improve their survival by detecting the TMB of LUAD.


2021 ◽  
Author(s):  
Pingfan Wu ◽  
Xiaowen Zhao ◽  
Ling Xue ◽  
Xiaojing Yang ◽  
Yuxiang Shi ◽  
...  

Abstract Considerable evidence suggests that N6-methyladenosine (m6A) is involved in the regulation of long non-coding RNA (lncRNA), whichparticipates in the occurrence, development and prognosis of tumorscancerBut the relationship between m6A regulators-related lncRNA (mRlncRNA) and lung adenocarcinoma (LUAD) remains unclear. This study aims to determine a feature based on mRlncRNA for prognostic evaluation of LUAD patients. By integrating the gene expression data of LUAD and normal samples from the Cancer Genome Atlas (TCGA) and the Gene Expression Omnibus (GEO) database, the m6A gene and mRlncRNA with imbalanced expression were screened out. Then we used the least absolute shrinkage and selection operator (LASSO) to obtain the 13-lncRNA prognostic signature in the TCGA training cohort. Patients were divided into two risk groups based on the risk score of lncRNAs characteristics, and their overall survival (OS) was significantly different. The predictive power of this signature was verified in TCGA testing cohort and entire TCGA cohort. These landmark lncRNAs were involved in several biologiocal processes and pathways related to cell cycle, DNA replication, P53 signaling pathway and mismatch repair. Besides, the high-risk group was low-response to cisplatin, while high-response to mitomycin, docetaxel and immunotherapy. In conclusion, we identified a 13-mRlncRNA model associated with prognosis and treatment sensitivity in LUAD, which may provide clues about the influence of m6A on lncRNA in LUAD and promote the further improvement of LUAD individualized treatment strategies.


PeerJ ◽  
2019 ◽  
Vol 7 ◽  
pp. e7821 ◽  
Author(s):  
Xiaoming Zhang ◽  
Jing Zhuang ◽  
Lijuan Liu ◽  
Zhengguo He ◽  
Cun Liu ◽  
...  

Background Cumulative evidence suggests that long non-coding RNAs (lncRNAs) play an important role in tumorigenesis. This study aims to identify lncRNAs that can serve as new biomarkers for breast cancer diagnosis or screening. Methods First, the linear fitting method was used to identify differentially expressed genes from the breast cancer RNA expression profiles in The Cancer Genome Atlas (TCGA). Next, the diagnostic value of all differentially expressed lncRNAs was evaluated using a receiver operating characteristic (ROC) curve. Then, the top ten lncRNAs with the highest diagnostic value were selected as core genes for clinical characteristics and prognosis analysis. Furthermore, core lncRNA-mRNA co-expression networks based on weighted gene co-expression network analysis (WGCNA) were constructed, and functional enrichment analysis was performed using the Database for Annotation, Visualization and Integrated Discovery (DAVID). The differential expression level and diagnostic value of core lncRNAs were further evaluated by using independent data set from Gene Expression Omnibus (GEO). Finally, the expression status and prognostic value of core lncRNAs in various tumors were analyzed based on Gene Expression Profiling Interactive Analysis (GEPIA). Results Seven core lncRNAs (LINC00478, PGM5-AS1, AL035610.1, MIR143HG, RP11-175K6.1, AC005550.4, and MIR497HG) have good single-factor diagnostic value for breast cancer. AC093850.2 has a prognostic value for breast cancer. AC005550.4 and MIR497HG can better distinguish breast cancer patients in early-stage from the advanced-stage. Low expression of MAGI2-AS3, LINC00478, AL035610.1, MIR143HG, and MIR145 may be associated with lymph node metastasis in breast cancer. Conclusion Our study provides candidate biomarkers for the diagnosis and prognosis of breast cancer, as well as a bioinformatics basis for the further elucidation of the molecular pathological mechanism of breast cancer.


PeerJ ◽  
2021 ◽  
Vol 9 ◽  
pp. e10884
Author(s):  
Xin Yu ◽  
Qian Yang ◽  
Dong Wang ◽  
Zhaoyang Li ◽  
Nianhang Chen ◽  
...  

Applying the knowledge that methyltransferases and demethylases can modify adjacent cytosine-phosphorothioate-guanine (CpG) sites in the same DNA strand, we found that combining multiple CpGs into a single block may improve cancer diagnosis. However, survival prediction remains a challenge. In this study, we developed a pipeline named “stacked ensemble of machine learning models for methylation-correlated blocks” (EnMCB) that combined Cox regression, support vector regression (SVR), and elastic-net models to construct signatures based on DNA methylation-correlated blocks for lung adenocarcinoma (LUAD) survival prediction. We used methylation profiles from the Cancer Genome Atlas (TCGA) as the training set, and profiles from the Gene Expression Omnibus (GEO) as validation and testing sets. First, we partitioned the genome into blocks of tightly co-methylated CpG sites, which we termed methylation-correlated blocks (MCBs). After partitioning and feature selection, we observed different diagnostic capacities for predicting patient survival across the models. We combined the multiple models into a single stacking ensemble model. The stacking ensemble model based on the top-ranked block had the area under the receiver operating characteristic curve of 0.622 in the TCGA training set, 0.773 in the validation set, and 0.698 in the testing set. When stratified by clinicopathological risk factors, the risk score predicted by the top-ranked MCB was an independent prognostic factor. Our results showed that our pipeline was a reliable tool that may facilitate MCB selection and survival prediction.


2020 ◽  
Author(s):  
Rui Zhang ◽  
Chen Chen ◽  
Qi Li ◽  
Jialu Fu ◽  
Dong Zhang ◽  
...  

Abstract Background: Immune-related genes (IRGs) play a crucial role in the initiation and progression of cholangiocarcinoma (CCA). However, immune signatures have rarely been used to predict prognosis of CCA. The aim of this study was to construct a novel model for CCA to predict survival based on IRGs expression data.Methods: The gene expression profiles and clinical data of CCA patients from The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO) database were integrated to establish and validate prognostic IRG signatures. Differentially expressed immune-related genes were screened. Univariate and multivariate Cox analysis were performed to identify prognostic IRGs, and the risk model that predicts outcomes was constructed. Furthermore, receiver operating characteristic (ROC) and Kaplan-Meier curve were plotted to examine predictive accuracy of the model, and a nomogram was constructed based on IRGs signature, combining with other clinical characteristics. Finally, CIBERSORT was used to analyze the association of immune cells infiltration with risk score.Results: We identified that 223 IRGs were significantly dysregulated in patients with CCA, among which five IRGs (AVPR1B, CST4, TDGF1, RAET1E and IL9R) were identified as robust indicators for overall survival (OS), and a prognostic model was built based on the IRGs signature. Meanwhile, patients with high risk had worse OS in training and validation cohort, and the area under the ROC was 0.898 and 0.846, respectively. Nomogram demonstrated that immune risk score contributed much more points than other clinicopathological variables, with a C-index of 0.819 (95% CI, 0.727-0.911). Finally, we found that IRGs signature was positively correlated with the proportion of CD8+ T cells, neurophils and T gamma delta, while negatively with that of CD4+ memory resting T cells.Conclusions: We established and validated an effective five IRGs-based prediction model for CCA, which could accurately classify patients into groups with low and high risk of poor prognosis.


Sign in / Sign up

Export Citation Format

Share Document