Cancer gene expression profiles associated with clinical outcomes to chemotherapy treatments

2020 ◽  
Vol 13 (S8) ◽  
Author(s):  
Nicolas Borisov ◽  
Maxim Sorokin ◽  
Victor Tkachev ◽  
Andrew Garazha ◽  
Anton Buzdin

Abstract Background Machine learning (ML) methods still have limited applicability in personalized oncology because few clinically annotated molecular profiles are available. This shortage prevents sufficient training of ML classifiers that could be used to improve molecular diagnostics. Methods We reviewed published datasets of high-throughput gene expression profiles from cancer patients with known responses to chemotherapy treatments. We searched the Gene Expression Omnibus (GEO), The Cancer Genome Atlas (TCGA) and Tumor Alterations Relevant for GEnomics-driven Therapy (TARGET) repositories. Results We identified data collections suitable for building ML models that predict responses to certain chemotherapeutic schemes. We identified 26 datasets, ranging from 41 to 508 cases per dataset. All identified datasets were checked for ML applicability and robustness with leave-one-out cross-validation. Twenty-three datasets, which had balanced numbers of treatment responder and non-responder cases, were found suitable for ML. Conclusions We collected a database of gene expression profiles associated with clinical responses to chemotherapy for 2786 individual cancer cases. Seven of the datasets included RNA sequencing data (for 645 cases) and the others contained microarray expression profiles. The cases represented breast cancer, lung cancer, low-grade glioma, endothelial carcinoma, multiple myeloma, adult leukemia, pediatric leukemia and kidney tumors. Chemotherapeutics included taxanes, bortezomib, vincristine, trastuzumab, letrozole, tipifarnib, temozolomide, busulfan and cyclophosphamide.
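
The leave-one-out cross-validation check mentioned in this abstract can be sketched as follows. This is a minimal illustration and not the authors' pipeline: the abstract does not name a classifier, so a nearest-centroid classifier is assumed here, and the expression profiles are synthetic values generated only for the example.

```python
import random


def nearest_centroid_predict(train_x, train_y, sample):
    """Return the label whose class centroid is closest to `sample`."""
    centroids = {}
    for label in set(train_y):
        rows = [x for x, y in zip(train_x, train_y) if y == label]
        # Column-wise mean of all training profiles with this label.
        centroids[label] = [sum(col) / len(rows) for col in zip(*rows)]

    def squared_distance(a, b):
        return sum((p - q) ** 2 for p, q in zip(a, b))

    return min(centroids, key=lambda lab: squared_distance(centroids[lab], sample))


def leave_one_out_accuracy(xs, ys):
    """Hold out each case once, train on the rest, and score the held-out case."""
    correct = 0
    for i in range(len(xs)):
        train_x = xs[:i] + xs[i + 1:]
        train_y = ys[:i] + ys[i + 1:]
        if nearest_centroid_predict(train_x, train_y, xs[i]) == ys[i]:
            correct += 1
    return correct / len(xs)


# Synthetic, well-separated toy profiles (assumption: 5 genes, 10 cases per class).
rng = random.Random(0)
responders = [[rng.gauss(2.0, 0.3) for _ in range(5)] for _ in range(10)]
non_responders = [[rng.gauss(-2.0, 0.3) for _ in range(5)] for _ in range(10)]
profiles = responders + non_responders
labels = ["responder"] * 10 + ["non-responder"] * 10

accuracy = leave_one_out_accuracy(profiles, labels)
print(f"leave-one-out accuracy: {accuracy:.2f}")
```

With a balanced toy dataset like this one, every held-out case still leaves both classes represented in the training fold, which is one reason balanced responder and non-responder counts matter for small cohorts.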

2021 ◽  
Vol 41 (1) ◽  
Author(s):  
Xianxue Zhang ◽  
Feng Yang ◽  
Zhenbao Wang

Abstract Immunotherapy is strongly affected by the immune environment of the primary tumor. Nonetheless, the clinical relevance of the immune environment in stage IV gastric cancer (GC) is largely unknown. The gene expression profiles of 403 stage IV GC patients from three cohorts were used in the present study: GEO (Gene Expression Omnibus; GSE84437, n=292, and GSE62254, n=77) and TCGA (The Cancer Genome Atlas, n=34). Using four publicly available stage IV GC expression datasets, we profiled the expression of 29 immune signatures and classified stage IV GC on this basis. The classification was conducted using the hierarchical clustering method. Three stage IV GC subtypes, L, M, and H, were identified, representing low, medium, and high immunity, respectively. Immune correlation analysis of these three subtypes revealed that Immunity H exhibited a better prognostic outcome as well as a higher immune score than the other subtypes. HLA gene expression differed noticeably among the three subgroups. Further, compared with the other subtypes, CD86, CD80, CD274, CTLA4, PDCD1, and PDCD1LG2 had higher expression in the Immunity H subtype. In stage IV GC, potentially positive associations between immune and pathway activities were observed, owing to the enrichment of pathways including TNF signaling, Th-17 cell differentiation, and JAK-STAT signaling in the Immunity H versus Immunity L subtypes. An external cohort from TCGA confirmed these results. The identification of stage IV GC subtypes has potential clinical implications for stage IV GC treatment.
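
As a rough illustration of how hierarchical clustering can split samples into low, medium, and high immunity groups, the sketch below runs agglomerative clustering until three clusters remain. The abstract does not state the linkage or distance used, so average linkage with Euclidean distance is an assumption, and the two-signature scores are invented toy values rather than data from the study.

```python
def euclidean(a, b):
    """Euclidean distance between two score vectors."""
    return sum((p - q) ** 2 for p, q in zip(a, b)) ** 0.5


def average_linkage(points, c1, c2):
    """Mean pairwise distance between the members of two clusters."""
    total = sum(euclidean(points[i], points[j]) for i in c1 for j in c2)
    return total / (len(c1) * len(c2))


def agglomerative_clusters(points, n_clusters):
    """Merge the two closest clusters repeatedly until n_clusters remain."""
    clusters = [[i] for i in range(len(points))]
    while len(clusters) > n_clusters:
        best = None
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                d = average_linkage(points, clusters[a], clusters[b])
                if best is None or d < best[0]:
                    best = (d, a, b)
        _, a, b = best
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]  # b > a, so index a is unaffected
    return clusters


# Toy immune-signature scores per sample (assumption: 2 signatures, 8 samples).
scores = [
    [0.10, 0.20], [0.20, 0.10], [0.15, 0.25],  # low
    [1.00, 1.10], [1.10, 0.90],                # medium
    [2.00, 2.10], [2.20, 1.90], [2.10, 2.00],  # high
]

clusters = agglomerative_clusters(scores, 3)
# Order clusters by mean total score so they can be labelled L, M, H.
ranked = sorted(clusters, key=lambda c: sum(sum(scores[i]) for i in c) / len(c))
subtypes = dict(zip(["L", "M", "H"], [sorted(c) for c in ranked]))
print(subtypes)
```

Labelling the clusters by their mean score is a convenience of this sketch; a real analysis would inspect the dendrogram and the signature heatmap before naming subtypes.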


BMC Genomics ◽  
2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Yanan Ren ◽  
Ting-You Wang ◽  
Leah C. Anderton ◽  
Qi Cao ◽  
Rendong Yang

Abstract Background Long non-coding RNAs (lncRNAs) are a growing focus in cancer research. Deciphering the pathways influenced by lncRNAs is important for understanding their role in cancer. Although knock-down or overexpression of lncRNAs followed by gene expression profiling in cancer cell lines is an established approach to this problem, such experimental data are not available for the majority of annotated lncRNAs. Results As a surrogate, we present lncGSEA, a convenient tool to predict lncRNA-associated pathways through Gene Set Enrichment Analysis of gene expression profiles from large-scale cancer patient samples. We demonstrate that lncGSEA is able to recapitulate lncRNA-associated pathways supported by literature and experimental validations in multiple cancer types. Conclusions LncGSEA allows researchers to infer lncRNA regulatory pathways directly from clinical samples in oncology. LncGSEA is written in R, and is freely accessible at https://github.com/ylab-hi/lncGSEA.
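
The core idea of Gene Set Enrichment Analysis can be illustrated with a small running-sum statistic. The sketch below is not the lncGSEA implementation (which is written in R); it is an unweighted enrichment score written in Python under the assumption that genes have already been ranked, for example by their association with a lncRNA's expression. The gene names and gene sets are placeholders.

```python
def enrichment_score(ranked_genes, gene_set):
    """Unweighted running-sum enrichment score for a pre-ranked gene list.

    Walking down the ranking, the running sum rises by 1/n_hits at each gene
    in the set and falls by 1/n_miss otherwise. The score is the running-sum
    value with the largest absolute deviation from zero.
    """
    hits = [gene in gene_set for gene in ranked_genes]
    n_hits = sum(hits)
    n_miss = len(ranked_genes) - n_hits
    if n_hits == 0 or n_miss == 0:
        raise ValueError("gene set must contain some, but not all, ranked genes")

    running = 0.0
    best = 0.0
    for is_hit in hits:
        running += 1.0 / n_hits if is_hit else -1.0 / n_miss
        if abs(running) > abs(best):
            best = running
    return best


# Placeholder ranking, most positively associated gene first (assumption).
ranking = ["G1", "G2", "G3", "G4", "G5", "G6", "G7", "G8"]
top_set = {"G1", "G2", "G3"}     # concentrated at the top of the ranking
bottom_set = {"G6", "G7", "G8"}  # concentrated at the bottom of the ranking

es_top = enrichment_score(ranking, top_set)
es_bottom = enrichment_score(ranking, bottom_set)
print(es_top, es_bottom)
```

A positive score indicates that the set clusters near the top of the ranking and a negative score that it clusters near the bottom. Full GSEA additionally weights steps by the ranking metric and assesses significance by permutation, both of which this sketch omits.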


2021 ◽  
pp. 1-6
Author(s):  
Reza Vafaee ◽  
Mostafa Rezaei Tavirani ◽  
Sina Rezaei Tavirani ◽  
Mohammadreza Razzaghi

The benefits of exercise for human health are well documented. However, although evidence indicates a positive effect of exercise on disease prevention, many aspects of the underlying mechanism need further investigation. This study aimed to determine the critical genes that affect human health. GSE156249, comprising 12 gene expression profiles of vastus lateralis muscle biopsies from healthy individuals before and after a 12-week combined exercise training intervention, was extracted from the Gene Expression Omnibus (GEO) database. The significant differentially expressed genes (DEGs) were assembled into an interactome using Cytoscape software and the STRING database. The network was analyzed to find the central nodes and subnetwork clusters. The nodes of the prominent cluster were assessed via gene ontology using ClueGO. Eight significant DEGs and 100 first neighbors were analyzed via network analysis. The network included 2 clusters, and COL3A1, BGN, and LOX were determined to be central DEGs. The critical DEGs were involved in cancer prevention processes.
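
One simple way to look for central nodes in an interaction network is to rank nodes by degree, that is, by their number of distinct interaction partners. The sketch below assumes degree as the centrality measure, which the abstract does not specify, and uses placeholder gene names and edges rather than the STRING network from the study.

```python
from collections import defaultdict


def degree_ranking(edges):
    """Rank nodes by number of distinct neighbors, highest first.

    Ties are broken alphabetically so the result is deterministic.
    """
    neighbors = defaultdict(set)
    for a, b in edges:
        if a != b:  # ignore self-loops
            neighbors[a].add(b)
            neighbors[b].add(a)
    return sorted(neighbors, key=lambda node: (-len(neighbors[node]), node))


# Placeholder undirected interactions (assumption, for illustration only).
edges = [
    ("GENE_A", "GENE_B"),
    ("GENE_A", "GENE_C"),
    ("GENE_A", "GENE_D"),
    ("GENE_B", "GENE_C"),
    ("GENE_D", "GENE_E"),
]

ranking = degree_ranking(edges)
print(ranking)
```

Tools such as Cytoscape also report betweenness and closeness centrality, which can rank nodes differently from degree.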


2021 ◽  
Vol 12 ◽  
Author(s):  
Kaisong Bai ◽  
Tong Zhao ◽  
Yilong Li ◽  
Xinjian Li ◽  
Zhantian Zhang ◽  
...  

Pancreatic adenocarcinoma (PAAD) is one of the deadliest malignancies, and mortality from PAAD has continued to rise despite substantial improvements in mortality for other major cancers. Although multiple studies exist on PAAD, few have dissected its oncogenic mechanisms based on genomic variation. In this study, we integrated somatic mutation data and gene expression profiles obtained by high-throughput sequencing to characterize the pathogenesis of PAAD. The mutation profile, containing 182 samples with 25,470 somatic mutations, was obtained from The Cancer Genome Atlas (TCGA). The mutation landscape was generated, and somatic mutations in PAAD were found to show a preference for particular locations. Combining the mutation matrix with gene expression profiles identified 31 driver genes that were closely associated with tumor cell invasion and apoptosis. Co-expression networks were constructed from 461 genes significantly associated with the driver genes, and the hub gene FAM133A in the network was found to be associated with tumor metastasis. Further, a somatic mutation-long non-coding RNA (lncRNA)-microRNA (miRNA) cascade was constructed to reveal a new mechanism for the involvement of mutations in post-transcriptional regulation. We also identified prognostic markers that are significantly associated with overall survival (OS) of PAAD patients and constructed a risk score model to identify patients’ survival risk. In summary, our study revealed pathogenic mechanisms and prognostic markers of PAAD, providing theoretical support for the development of precision medicine.


2021 ◽  
Author(s):  
Hongpeng Fang ◽  
Zhansen Huang ◽  
Xianzi Zeng ◽  
Jiaming Wan ◽  
Jieying Wu ◽  
...  

Abstract Background Bladder cancer is a common malignancy of the urinary system, yet its precise molecular mechanisms remain to be elucidated. The purpose of this study was to identify core genes with prognostic value as potential oncogenes for the diagnosis, prognosis or novel therapeutic targeting of bladder cancer. Methods The gene expression profiles GSE3167 and GSE7476 were obtained from the Gene Expression Omnibus (GEO) database. Next, a protein-protein interaction (PPI) network was built with the STRING database and Cytoscape software to filter the hub genes, and GEPIA and Kaplan-Meier plotter analyses were performed. The frequency and type of hub gene alterations and subgroup analyses were examined in the cBioPortal and UALCAN databases. Finally, we used RT-qPCR to confirm our results. Results In total, 251 differentially expressed genes (DEGs) were identified from the two datasets. Only high expression of SMC4, TYMS, CCNB1, CKS1B, NUSAP1 and KPNA2 was associated with worse outcomes in bladder cancer patients, and the hub genes showed high expression in tumors whether assessed by mutation type or at the transcriptional level. However, in our RT-qPCR results only the expression of SMC4, CCNB1 and CKS1B remained changed between the cancer and normal samples. Conclusion These findings indicate that SMC4, CCNB1 and CKS1B may serve as critical biomarkers in the development and poor prognosis of bladder cancer.


2020 ◽  
Author(s):  
Rui Zhang ◽  
Chen Chen ◽  
Qi Li ◽  
Jialu Fu ◽  
Dong Zhang ◽  
...  

Abstract Background: Immune-related genes (IRGs) play a crucial role in the initiation and progression of cholangiocarcinoma (CCA). However, immune signatures have rarely been used to predict the prognosis of CCA. The aim of this study was to construct a novel model that predicts survival in CCA based on IRG expression data. Methods: The gene expression profiles and clinical data of CCA patients from The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO) databases were integrated to establish and validate prognostic IRG signatures. Differentially expressed immune-related genes were screened. Univariate and multivariate Cox analyses were performed to identify prognostic IRGs, and a risk model that predicts outcomes was constructed. Furthermore, receiver operating characteristic (ROC) and Kaplan-Meier curves were plotted to examine the predictive accuracy of the model, and a nomogram was constructed based on the IRG signature combined with other clinical characteristics. Finally, CIBERSORT was used to analyze the association of immune cell infiltration with the risk score. Results: We identified 223 IRGs that were significantly dysregulated in patients with CCA, among which five IRGs (AVPR1B, CST4, TDGF1, RAET1E and IL9R) were identified as robust indicators of overall survival (OS), and a prognostic model was built based on the IRG signature. Patients with high risk had worse OS in both the training and validation cohorts, and the area under the ROC curve was 0.898 and 0.846, respectively. The nomogram demonstrated that the immune risk score contributed many more points than other clinicopathological variables, with a C-index of 0.819 (95% CI, 0.727-0.911). Finally, we found that the IRG signature was positively correlated with the proportions of CD8+ T cells, neutrophils and gamma delta T cells, and negatively correlated with that of resting CD4+ memory T cells. Conclusions: We established and validated an effective five-IRG prediction model for CCA, which could accurately classify patients into groups with low and high risk of poor prognosis.
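
A gene-signature risk score and the C-index reported for it can be sketched as below. This is an illustration under stated assumptions, not the model from the abstract: the gene names, coefficients, expression values, and follow-up times are all hypothetical, the risk score is taken to be a linear combination of expression values weighted by Cox coefficients, and the C-index is computed as Harrell's concordance over comparable patient pairs.

```python
def risk_score(expression, coefficients):
    """Linear risk score: sum of coefficient * expression over signature genes."""
    return sum(coefficients[gene] * expression[gene] for gene in coefficients)


def concordance_index(times, events, scores):
    """Harrell's C: fraction of comparable pairs ordered correctly by risk.

    A pair (i, j) is comparable when patient i had an observed event and a
    shorter follow-up time than patient j. Tied scores count as half.
    """
    concordant = 0.0
    comparable = 0
    for i in range(len(times)):
        if not events[i]:
            continue
        for j in range(len(times)):
            if times[i] < times[j]:
                comparable += 1
                if scores[i] > scores[j]:
                    concordant += 1.0
                elif scores[i] == scores[j]:
                    concordant += 0.5
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable


# Hypothetical two-gene signature and toy cohort (assumptions, not study data).
coefficients = {"GENE_1": 0.8, "GENE_2": -0.5}
patients = [
    {"GENE_1": 2.0, "GENE_2": 0.0},
    {"GENE_1": 1.0, "GENE_2": 0.0},
    {"GENE_1": 0.0, "GENE_2": 1.0},
    {"GENE_1": 0.5, "GENE_2": 2.0},
]
times = [5, 10, 20, 15]  # follow-up time, arbitrary units
events = [1, 1, 0, 1]    # 1 = death observed, 0 = censored

scores = [risk_score(p, coefficients) for p in patients]
c_index = concordance_index(times, events, scores)
print(f"C-index: {c_index:.3f}")
```

A C-index of 0.5 corresponds to random ordering and 1.0 to perfect ordering of patients by risk.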


BMC Genomics ◽  
2019 ◽  
Vol 20 (1) ◽  
Author(s):  
Kyu-Sang Lim ◽  
Qian Dong ◽  
Pamela Moll ◽  
Jana Vitkovska ◽  
Gregor Wiktorin ◽  
...  

Abstract Background Gene expression profiling in blood is a potential source of biomarkers to evaluate or predict phenotypic differences between pigs but is expensive and inefficient because of the high abundance of globin mRNA in porcine blood. These limitations can be overcome by the use of QuantSeq 3’mRNA sequencing (QuantSeq) combined with a method to deplete or block the processing of globin mRNA prior to or during library construction. Here, we validated the effectiveness of QuantSeq using a novel specific globin blocker (GB) that is included in the library preparation step of QuantSeq. Results In data set 1, four concentrations of the GB were applied to RNA samples from two pigs. The GB significantly reduced the proportion of globin reads compared to non-GB (NGB) samples (P = 0.005) and increased the number of detectable non-globin genes. The highest evaluated concentration (C1) of the GB resulted in the largest reduction of globin reads compared to the NGB (from 56.4 to 10.1%). The second highest concentration C2, which showed very similar globin depletion rates (12%) as C1 but a better correlation of the expression of non-globin genes between NGB and GB (r = 0.98), allowed the expression of an additional 1295 non-globin genes to be detected, although 40 genes that were detected in the NGB sample (at a low level) were not present in the GB library. Concentration C2 was applied in the rest of the study. In data set 2, the distribution of the percentage of globin reads for NGB (n = 184) and GB (n = 189) samples clearly showed the effects of the GB on reducing globin reads, in particular for HBB, similar to results from data set 1. Data set 3 (n = 84) revealed that the proportion of globin reads that remained in GB samples was significantly and positively correlated with the reticulocyte count in the original blood sample (P < 0.001). 
Conclusions The effect of the GB on reducing the proportion of globin reads in porcine blood QuantSeq was demonstrated in three data sets. In addition to increasing the efficiency of sequencing non-globin mRNA, the GB for QuantSeq has an advantage that it does not require an additional step prior to or during library creation. Therefore, the GB is a useful tool in the quantification of whole gene expression profiles in porcine blood.


Viruses ◽  
2020 ◽  
Vol 12 (4) ◽  
pp. 404 ◽  
Author(s):  
Claudia Cava ◽  
Gloria Bertoli ◽  
Isabella Castiglioni

Previous studies reported that angiotensin-converting enzyme 2 (ACE2) is the main cell receptor of SARS-CoV and SARS-CoV-2. It plays a key role in the entry of the virus into the cell that leads to infection. In the present study we investigated in silico the basic mechanism of ACE2 in the lung and provided evidence for new potentially effective drugs for Covid-19. Specifically, we used gene expression profiles from public datasets, including The Cancer Genome Atlas, Gene Expression Omnibus and Genotype-Tissue Expression, together with Gene Ontology and pathway enrichment analysis, to investigate the main functions of ACE2-correlated genes. We constructed a protein-protein interaction network containing the genes co-expressed with ACE2. Finally, we focused on the genes in the network that are already associated with known drugs and evaluated their role in a potential treatment of Covid-19. Our results demonstrate that the genes correlated with ACE2 are mainly enriched in the sterol biosynthetic process, aryldialkylphosphatase activity, adenosylhomocysteinase activity, trialkylsulfonium hydrolase activity, and acetate-CoA and CoA ligase activity. We identified a network of 193 genes, 222 interactions and 36 potential drugs that could have a crucial role. Among possibly interesting drugs for Covid-19 treatment, we found Nimesulide, Fluticasone Propionate, Thiabendazole, Photofrin, Didanosine and Flutamide.
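
Selecting genes co-expressed with a target gene such as ACE2 can be sketched with a Pearson correlation filter. The abstract does not give the correlation measure or cutoff, so Pearson correlation, the absolute-value threshold of 0.9, and the toy expression values below are all assumptions made for illustration.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length vectors (0.0 if either is constant)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / sqrt(var_x * var_y)


def coexpressed_genes(expression, target, threshold):
    """Genes whose absolute correlation with `target` meets the threshold."""
    reference = expression[target]
    return sorted(
        gene
        for gene, values in expression.items()
        if gene != target and abs(pearson(reference, values)) >= threshold
    )


# Toy expression values across five samples (assumption, not study data).
expression = {
    "ACE2": [1.0, 2.0, 3.0, 4.0, 5.0],
    "GENE_X": [2.0, 4.0, 6.0, 8.0, 10.5],  # rises with ACE2
    "GENE_Y": [5.0, 1.0, 4.0, 2.0, 3.0],   # weakly related
    "GENE_Z": [10.0, 8.0, 6.0, 4.0, 2.0],  # falls as ACE2 rises
}

selected = coexpressed_genes(expression, "ACE2", threshold=0.9)
print(selected)
```

Using the absolute value keeps both positively and negatively correlated genes; an analysis interested only in genes that rise with the target would compare the signed correlation instead.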

