scholarly journals Identification and Validation of the Pyroptosis-Related Molecular Subtypes of Lung Adenocarcinoma by Bioinformatics and Machine Learning

Author(s):  
Le-Ping Liu ◽  
Lu Lu ◽  
Qiang-Qiang Zhao ◽  
Qin-Jie Kou ◽  
Zhen-Zhen Jiang ◽  
...  

Lung cancer remains the leading cause of cancer death globally, with lung adenocarcinoma (LUAD) being its most prevalent subtype. Due to the heterogeneity of LUAD, patients given the same treatment regimen may have different responses and clinical outcomes. Therefore, identifying new subtypes of LUAD is important for predicting prognosis and providing personalized treatment for patients. Pyroptosis-related genes play an essential role in anticancer, but there is limited research investigating pyroptosis in LUAD. In this study, 33 pyroptosis gene expression profiles and clinical information were collected from The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO) databases. By bioinformatics and machine learning analyses, we identified novel subtypes of LUAD based on 10 pyroptosis-related genes and further validated them in the GEO dataset, with machine learning models performing up to an AUC of 1 for classifying in GEO. A web-based tool was established for clinicians to use our clustering model (http://www.aimedicallab.com/tool/aiml-subphe-luad.html). LUAD patients were clustered into 3 subtypes (A, B, and C), and survival analysis showed that B had the best survival outcome and C had the worst survival outcome. The relationships between pyroptosis gene expression and clinical characteristics were further analyzed in the three molecular subtypes. Immune profiling revealed significant differences in immune cell infiltration among the three molecular subtypes. GO enrichment and KEGG pathway analyses were performed based on the differential genes of the three subtypes, indicating that differentially expressed genes (DEGs) were involved in multiple cellular and biological functions, including RNA catabolic process, mRNA catabolic process, and pathways of neurodegeneration-multiple diseases. Finally, we developed an 8-gene prognostic model that accurately predicted 1-, 3-, and 5-year overall survival. In conclusion, pyroptosis-related genes may play a critical role in LUAD, and provide new insights into the underlying mechanisms of LUAD.

2020 ◽  
Vol 38 (5_suppl) ◽  
pp. 25-25
Author(s):  
Yuanyuan Shen ◽  
Justin Hummel ◽  
Isabel Cristina Trindade ◽  
Christos Papageorgiou ◽  
Chi-Ren Shyu ◽  
...  

25 Background: Low cytotoxic T lymphocyte (CTLs) infiltration in colorectal cancer (CRC) tumors is a challenge to treatment with immune checkpoint inhibitors. Consensus molecular subtypes (CMS) classify patients based on tumor attributes, and CMS1 patients include the majority of patients with high CTL infiltration and “inflamed” tumors. Epigenetic modification plays a critical role in gene expression and therapy resistance. Therefore, in this study we compared DNA methylation, gene expression, and CTL infiltration of CMS1 patients to other CMS groups to determine targets for improving immunotherapy in CRC. Methods: RNA-seq (n = 511) and DNA methylation (n = 316) from The Cancer Genome Atlas databases were used to determine gene expression and methylation profiles based on CMSs. CMS1 was used as a reference and compared to other subtypes (CMS2-4). Microenvironment Cell Populations- counter (MCPcounter) was used to determine tumor CTL infiltration. Genes with significantly different expression (p < 0.01, LogFC≥|1.5|) and difference of mean methylation β value ≥|0.25| were integrated for Pearson correlation coefficient analysis with MCPcounter score (r > |0.7|). Results: Comparing CMS1 and CMS2, ARHGAP9, TBX21, and LAG3 were differentially methylated and correlated with CTL scores. ARHGAP9 and TBX21 were decreased and hypomethylated in CMS2. Comparing CMS1 and CMS3, ARHGAP9, TBX21, FMNL1, HLA-DPB1, and STX11 were downregulated in CMS3 and highly correlated with CTL scores. ARHGAP9, FMNL1, HLA-DPB1, and STX11 were hypomethylated in CMS3 and TBX21 was methylated in both, but had a higher methylation ratio in CMS1. Comparing CMS1 and CMS4, TBX21 was the only gene downregulated, hypomethylated, and highly correlated with CTL scores in CMS4 patients. Conclusions: We found six genes differentially expressed, differentially methylated, and highly correlated with CTL infiltration when comparing CMS1 to other CMS groups. Specifically, TBX21 was the only gene highly correlated with CTL scores with differential gene expression and methylation in CMS2-4 when compared to CMS1. Thus, T-bet may be a critical regulator of T cell responses in CRC.


PLoS ONE ◽  
2021 ◽  
Vol 16 (7) ◽  
pp. e0254368
Author(s):  
Gang Liu ◽  
Jian-ying Ma ◽  
Gang Hu ◽  
Huan Jin

Background Ferroptosis is a novel form of regulated cell death that plays a critical role in tumorigenesis. The purpose of this study was to establish a ferroptosis-associated gene (FRG) signature and assess its clinical outcome in gastric cancer (GC). Methods Differentially expressed FRGs were identified using gene expression profiles from The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO) database. Univariate and least absolute shrinkage and selection operator (LASSO) Cox regression analyses were performed to construct a prognostic signature. The model was validated using an independent GEO dataset, and a genomic-clinicopathologic nomogram integrating risk scores and clinicopathological features was established. Results An 8-FRG signature was constructed to calculate the risk score and classify GC patients into two risk groups (high- and low-risk) according to the median value of the risk score. The signature showed a robust predictive capacity in the stratification analysis. A high-risk score was associated with advanced clinicopathological features and an unfavorable prognosis. The predictive accuracy of the signature was confirmed using an independent GSE84437 dataset. Patients in the two groups showed different enrichment of immune cells and immune-related pathways. Finally, we established a genomic-clinicopathologic nomogram (based on risk score, age, and tumor stage) to predict the overall survival (OS) of GC patients. Conclusions The novel FRG signature may be a reliable tool for assisting clinicians in predicting the OS of GC patients and may facilitate personalized treatment.


2021 ◽  
Author(s):  
Ping Yu ◽  
Linlin Tong ◽  
Yujia Song ◽  
Hui Qu ◽  
Ying Chen

Abstract Background: Due to the high heterogeneity of lung adenocarcinoma (LUAD), molecular subtype based on gene expression profiles is of great significance for diagnosis, and prognosis prediction in patients with LUAD.Methods: Invasion-related genes were obtained from the CancerSEA database, and LUAD expression profiles were downloaded from The Cancer Genome Atlas. The ConsensusClusterPlus was used to obtain molecular subtypes based on invasion-related genes. The limma software package was used to identify differentially expressed genes (DEGs). A multi-gene risk model was constructed by Lasso-Cox analysis. A nomogram was also constructed based on risk scores and meaningful clinical features.Results: 3 subtypes (C1, C2, C3) based on the expression of invasion-related genes were obtained. C3 had the worst prognosis. A total of 669 DEGs were identified among the subtypes. Pathway enrichment analysis results showed that the DEGs were mainly enriched in the cell cycle, DNA replication, the p53 signaling pathway, and other tumor-related pathways. A 5-gene signature (KRT6A, MELTF, IRX5, MS4A1, CRTAC1) was identified by using Lasso-Cox analysis. The training, validation, and external independent cohorts proved that the model was robust and had better prediction ability than other lung cancer models. The gene expression results showed that the expression levels of MS4A1 and KRT6A in tumor tissues were higher than in normal tissues, while CRTAC1 expression in tumor tissues was lower than in normal tissues. At the same time, the 5 genes were significantly expressed in pan-cancer immune subtypes. Gene set enrichment analysis showed that MS4A1, KRT6A, and CRAT1 genes were both enriched in the HALLMARK_IL2_STAT5_SIGNALING pathway, and IRX5 and MELTF gene were both enriched in the HALLMARK_EPITHELIAL_MESENCHYMAL_TRANSITION pathway. Conclusion: The 5-gene signature prognostic stratification system based on invasion-related genes could be used to assess prognostic risk in patients with LUAD.


2021 ◽  
Author(s):  
Qian Yan ◽  
Baoqian Ye ◽  
Boqing Wang ◽  
Wenjiang Zheng ◽  
Xiongwen Wang

Abstract The purpose of this study is to analyze the DNA methylation and gene expression profiles of immune-related CpG sites to identify the molecular subtypes and CpG sites related to the prognosis of HCC. In this study, the DNA methylation and gene expression datasets were downloaded from The Cancer Genome Atlas database, together with immune-related genes downloaded from the immunology database and analysis portal database to explore the prognostic molecular subtypes of HCC. By performing consistent clustering analysis on 830 immune-related CpG sites, we identified seven subgroups with significant differences in overall survival. Finally, 16 classifiers of immune-related CpG sites were constructed and used in the testing set to verify the prognosis of DNA methylation subgroups, and the results were consistent with the training set. Using the TIMER database, we analyzed 16 immune-related CpG sites expression with the abundance of six types of immune infiltrating cells and found that most are positively correlated with the level of infiltration of multiple immune cells in HCC. This study screened potential immune-related prognostic methylation sites and established a new prognosis model of HCC based on DNA methylation molecular subtype, which may help in the early diagnosis of HCC and developing more effective personalized treatments.


2021 ◽  
Vol 14 (1) ◽  
Author(s):  
Guoliang Jia ◽  
Zheyu Song ◽  
Zhonghang Xu ◽  
Youmao Tao ◽  
Yuanyu Wu ◽  
...  

Abstract Background Bioinformatics was used to analyze the skin cutaneous melanoma (SKCM) gene expression profile to provide a theoretical basis for further studying the mechanism underlying metastatic SKCM and the clinical prognosis. Methods We downloaded the gene expression profiles of 358 metastatic and 102 primary (nonmetastatic) CM samples from The Cancer Genome Atlas (TCGA) database as a training dataset and the GSE65904 dataset from the National Center for Biotechnology Information database as a validation dataset. Differentially expressed genes (DEGs) were screened using the limma package of R3.4.1, and prognosis-related feature DEGs were screened using Logit regression (LR) and survival analyses. We also used the STRING online database, Cytoscape software, and Database for Annotation, Visualization and Integrated Discovery software for protein–protein interaction network, Gene Ontology, and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analyses based on the screened DEGs. Results Of the 876 DEGs selected, 11 (ZNF750, NLRP6, TGM3, KRTDAP, CAMSAP3, KRT6C, CALML5, SPRR2E, CD3G, RTP5, and FAM83C) were screened using LR analysis. The survival prognosis of nonmetastatic group was better compared to the metastatic group between the TCGA training and validation datasets. The 11 DEGs were involved in 9 KEGG signaling pathways, and of these 11 DEGs, CALML5 was a feature DEG involved in the melanogenesis pathway, 12 targets of which were collected. Conclusion The feature DEGs screened, such as CALML5, are related to the prognosis of metastatic CM according to LR. Our results provide new ideas for exploring the molecular mechanism underlying CM metastasis and finding new diagnostic prognostic markers.


BMC Cancer ◽  
2021 ◽  
Vol 21 (1) ◽  
Author(s):  
Xiaodong Yang ◽  
Yuexin Zheng ◽  
Zhihai Han ◽  
Xiliang Zhang

Abstract Background As a marker of differentiation, Killer cell lectin like receptor G1 (KLRG1) plays an inhibitory role in human NK cells and T cells. However, its clinical role remains inexplicit. This work intended to investigate the predictive ability of KLRG1 on the efficacy of immune-checkpoint inhibitor in the treatment of lung adenocarcinoma (LUAD), as well as contribute to the possible molecular mechanisms of KLRG1 on LUAD development. Methods Using data from the Gene Expression Omnibus, the Cancer Genome Atlas and the Genotype-Tissue Expression, we compared the expression of KLRG1 and its related genes Bruton tyrosine kinase (BTK), C-C motif chemokine receptor 2 (CCR2), Scm polycomb group protein like 4 (SCML4) in LUAD and normal lung tissues. We also established stable LUAD cell lines with KLRG1 gene knockdown and investigated the effect of KLRG1 knockdown on tumor cell proliferation. We further studied the prognostic value of the four factors in terms of overall survival (OS) in LUAD. Using data from the Gene Expression Omnibus, we further investigated the expression of KLRG1 in the patients with different responses after immunotherapy. Results The expression of KLRG1, BTK, CCR2 and SCML4 was significantly downregulated in LUAD tissues compared to normal controls. Knockdown of KLRG1 promoted the proliferation of A549 and H1299 tumor cells. And low expression of these four factors was associated with unfavorable overall survival in patients with LUAD. Furthermore, low expression of KLRG1 also correlated with poor responses to immunotherapy in LUAD patients. Conclusion Based on these findings, we inferred that KLRG1 had significant correlation with immunotherapy response. Meanwhile, KLRG1, BTK, CCR2 and SCML4 might serve as valuable prognostic biomarkers in LUAD.


PeerJ ◽  
2018 ◽  
Vol 6 ◽  
pp. e5285 ◽  
Author(s):  
Mei Sze Tan ◽  
Siow-Wee Chang ◽  
Phaik Leng Cheah ◽  
Hwa Jen Yap

Although most of the cervical cancer cases are reported to be closely related to the Human Papillomavirus (HPV) infection, there is a need to study genes that stand up differentially in the final actualization of cervical cancers following HPV infection. In this study, we proposed an integrative machine learning approach to analyse multiple gene expression profiles in cervical cancer in order to identify a set of genetic markers that are associated with and may eventually aid in the diagnosis or prognosis of cervical cancers. The proposed integrative analysis is composed of three steps: namely, (i) gene expression analysis of individual dataset; (ii) meta-analysis of multiple datasets; and (iii) feature selection and machine learning analysis. As a result, 21 gene expressions were identified through the integrative machine learning analysis which including seven supervised and one unsupervised methods. A functional analysis with GSEA (Gene Set Enrichment Analysis) was performed on the selected 21-gene expression set and showed significant enrichment in a nine-potential gene expression signature, namely PEG3, SPON1, BTD and RPLP2 (upregulated genes) and PRDX3, COPB2, LSM3, SLC5A3 and AS1B (downregulated genes).


2021 ◽  
Author(s):  
Pingfan Wu ◽  
Xiaowen Zhao ◽  
Ling Xue ◽  
Xiaojing Yang ◽  
Yuxiang Shi ◽  
...  

Abstract Considerable evidence suggests that N6-methyladenosine (m6A) is involved in the regulation of long non-coding RNA (lncRNA), whichparticipates in the occurrence, development and prognosis of tumorscancerBut the relationship between m6A regulators-related lncRNA (mRlncRNA) and lung adenocarcinoma (LUAD) remains unclear. This study aims to determine a feature based on mRlncRNA for prognostic evaluation of LUAD patients. By integrating the gene expression data of LUAD and normal samples from the Cancer Genome Atlas (TCGA) and the Gene Expression Omnibus (GEO) database, the m6A gene and mRlncRNA with imbalanced expression were screened out. Then we used the least absolute shrinkage and selection operator (LASSO) to obtain the 13-lncRNA prognostic signature in the TCGA training cohort. Patients were divided into two risk groups based on the risk score of lncRNAs characteristics, and their overall survival (OS) was significantly different. The predictive power of this signature was verified in TCGA testing cohort and entire TCGA cohort. These landmark lncRNAs were involved in several biologiocal processes and pathways related to cell cycle, DNA replication, P53 signaling pathway and mismatch repair. Besides, the high-risk group was low-response to cisplatin, while high-response to mitomycin, docetaxel and immunotherapy. In conclusion, we identified a 13-mRlncRNA model associated with prognosis and treatment sensitivity in LUAD, which may provide clues about the influence of m6A on lncRNA in LUAD and promote the further improvement of LUAD individualized treatment strategies.


2021 ◽  
Vol 9 (Suppl 3) ◽  
pp. A954-A955
Author(s):  
Jacob Kaufman ◽  
Doug Cress ◽  
Theresa Boyle ◽  
David Carbone ◽  
Neal Ready ◽  
...  

BackgroundLKB1 (STK11) is a commonly disrupted tumor suppressor in NSCLC. Its loss promotes an immune exclusion phenotype with evidence of low expression of interferon stimulated genes (ISG) and decreased microenvironment immune infiltration.1 2 Clinically, LKB1 loss induces primary immunotherapy resistance.3 LKB1 is a master regulator of a complex downstream kinase network and has pleiotropic effects on cell biology. Understanding the heterogeneous phenotypes associated with LKB1 loss and their influence on tumor-immune biology will help define and overcome mechanisms of immunotherapy resistance within this subset of lung cancer.MethodsWe applied multi-omic analyses across multiple lung adenocarcinoma datasets2 4–6 (>1000 tumors) to define transcriptional and genetic features enriched in LKB1-deficient lung cancer. Top scoring phenotypes exhibited heterogeneity across LKB1-loss tumors, and were further interrogated to determine association with increased or decreased markers of immune activity. Further, immune cell-types were estimated by Cibersort to identify effects of LKB1 loss on the immune microenvironment. Key conclusions were confirmed by blinded pathology review.ResultsWe show that LKB1 loss significantly affects differentiation patterns, with enrichment of ASCL1-expressing tumors with putative neuroendocrine differentiation. LKB1-deficient neuroendocrine tumors had lower expression of Interferon Stimulated Genes (ISG), MHC1 and MHC2 components, and immune infiltration compared to LKB1-WT and non-neuroendocrine LKB1-deficient tumors (figure 1).The abundances of 22 immune cell types assessed by Cibersort were compared between LKB1-deficient and LKB1-WT tumors. We observe skewing of immune microenvironmental composition by LKB1 loss, with lower abundance of dendritic cells, monocytes, and macrophages, and increased levels of neutrophils and plasma cells (table 1). These trends were most pronounced among tumors with neuroendocrine differentiation, and were concordant across three independent datasets. In a confirmatory subset of 20 tumors, plasma cell abundance was assessed by a blinded pathologist. Pathologist assessment was 100% concordant with Cibersort prediction, and association with LKB1 loss was confirmed (P=0.001).Abstract 909 Figure 1Immune-associated Gene Expression Profiles Affected by Neuroendocrine Differentiation within LKB1-Deficient Lung Adenocarcinomas. Gene expression profiles corresponding to five immune-associated phenotypes are shown with bars indicating average GEP scores for tumors grouped according to LKB1 and neuroendocrine status as indicated. P-values represent results from Student’s T-test between groups as indicated.Abstract 909 Table 1LKB1 Loss Affects Composition of Immune Microenvironment. Values indicate log10 P-values comparing LKB1-loss to LKB1-WT tumors. Positive (red) indicates increased abundance in LKB1 loss. Negative (blue) indicates decreased abundance.ConclusionsWe conclude that tumor differentiation patterns strongly influence the immune microenvironment and immune exclusion characteristics of LKB1-deficient tumors. Neuroendocrine differentiation is associated with the strongest immune exclusion characteristics and should be evaluated clinically for evidence of immunotherapy resistance. A novel observation of increased plasma cell abundance is observed across multiple datasets and confirmed by pathology. Causal mechanisms linking differentiation status to immune activity is not well understood, and the functional role of plasma cells in the immune biology of LKB1-deficient tumors is undefined. These questions warrant further study to inform precision immuno-oncology treatments for these patients.AcknowledgementsThis work was funded by SITC AZ Immunotherapy in Lung Cancer grant (SPS256666) and DOD Lung Cancer Research Program Concept Award (LC180633).ReferencesSkoulidis F, Byers LA, Diao L, et al. Co-occurring genomic alterations define major subsets of KRAS-mutant lung adenocarcinoma with distinct biology, immune profiles, and therapeutic vulnerabilities. Cancer Discov 2015;5:860–77.Schabath MB, Welsh EA, Fulp WJ, et al. Differential association of STK11 and TP53 with KRAS mutation-associated gene expression, proliferation and immune surveillance in lung adenocarcinoma. Oncogene 2016;35:3209–16.Skoulidis F, Goldberg ME, Greenawalt DM, et al. STK11/LKB1 mutations and PD-1 inhibitor resistance in KRAS-mutant lung adenocarcinoma. Cancer Discovery 2018;8:822-835.Cancer Genome Atlas Research Network. Comprehensive molecular profiling of lung adenocarcinoma. Nature 2014;511:543–50.Chitale D, Gong Y, Taylor BS, et al. An integrated genomic analysis of lung cancer reveals loss of DUSP4 in EGFR-mutant tumors. Oncogene 2009;28:2773–83.Shedden K, Taylor JM, Enkemann SA, et al. Gene expression-based survival prediction in lung adenocarcinoma: a multi-site, blinded validation study. Nat Med 2008;14:822–7.


2021 ◽  
Vol 11 ◽  
Author(s):  
Yi Zhang ◽  
Lei Xia ◽  
Dawei Ma ◽  
Jing Wu ◽  
Xinyu Xu ◽  
...  

Cancer of unknown primary (CUP), in which metastatic diseases exist without an identifiable primary location, accounts for about 3–5% of all cancer diagnoses. Successful diagnosis and treatment of such patients are difficult. This study aimed to assess the expression characteristics of 90 genes as a method of identifying the primary site from CUP samples. We validated a 90-gene expression assay and explored its potential diagnostic utility in 44 patients at Jiangsu Cancer Hospital. For each specimen, the expression of 90 tumor-specific genes in malignant tumors was analyzed, and similarity scores were obtained. The types of malignant tumors predicted were compared with the reference diagnosis to calculate the accuracy. In addition, we verified the consistency of the expression profiles of the 90 genes in CUP secondary malignancies and metastatic malignancies in The Cancer Genome Atlas. We also reported a detailed description of the next-generation coding sequences for CUP patients. For each clinical medical specimen collected, the type of malignant tumor predicted and analyzed by the 90-gene expression assay was compared with its reference diagnosis, and the overall accuracy was 95.4%. In addition, the 90-gene expression profile generally accurately classified CUP into the cluster of its primary tumor. Sequencing of the exome transcriptome containing 556 high-frequency gene mutation oncogenes was not significantly related to the 90 genes analysis. Our results demonstrate that the expression characteristics of these 90 genes can be used as a powerful tool to accurately identify the primary sites of CUP. In the future, the inclusion of the 90-gene expression assay in pathological diagnosis will help oncologists use precise treatments, thereby improving the care and outcomes of CUP patients.


Sign in / Sign up

Export Citation Format

Share Document