Mining The Cancer Genome Atlas gene expression data for lineage markers in distinguishing bladder urothelial carcinoma and prostate adenocarcinoma

AbstractDistinguishing bladder urothelial carcinomas from prostate adenocarcinomas for poorly differentiated carcinomas derived from the bladder neck entails the use of a panel of lineage markers to help make this distinction. Publicly available The Cancer Genome Atlas (TCGA) gene expression data provides an avenue to examine utilities of these markers. This study aimed to verify expressions of urothelial and prostate lineage markers in the respective carcinomas and to seek the relative importance of these markers in making this distinction. Gene expressions of these markers were downloaded from TCGA Pan-Cancer database for bladder and prostate carcinomas. Differential gene expressions of these markers were analyzed. Standard linear discriminant analyses were applied to establish the relative importance of these markers in lineage determination and to construct the model best in making the distinction. This study shows that all urothelial lineage genes except for the gene for uroplakin III were significantly expressed in bladder urothelial carcinomas (p < 0.001). In descending order of importance to distinguish from prostate adenocarcinomas, genes for uroplakin II, S100P, GATA3 and thrombomodulin had high discriminant loadings (> 0.3). All prostate lineage genes were significantly expressed in prostate adenocarcinomas(p < 0.001). In descending order of importance to distinguish from bladder urothelial carcinomas, genes for NKX3.1, prostate specific antigen (PSA), prostate-specific acid phosphatase, prostein, and prostate-specific membrane antigen had high discriminant loadings (> 0.3). Combination of gene expressions for uroplakin II, S100P, NKX3.1 and PSA approached 100% accuracy in tumor classification both in the training and validation sets. Mining gene expression data, a combination of four lineage markers helps distinguish between bladder urothelial carcinomas and prostate adenocarcinomas.

Download Full-text

Abstract A46: A comprehensive genomic pan-cancer analysis comparing males and females using The Cancer Genome Atlas gene expression data

10.1158/1557-3265.pmccavuln16-a46 ◽

2017 ◽

Author(s):

YuanYuan Li ◽

David M. Umbach ◽

Leping Li

Keyword(s):

Gene Expression ◽

Gene Expression Data ◽

Cancer Genome ◽

The Cancer Genome Atlas ◽

Expression Data ◽

Cancer Genome Atlas ◽

Males And Females ◽

Pan Cancer ◽

Genome Atlas

Download Full-text

A comprehensive genomic pan-cancer classification using The Cancer Genome Atlas gene expression data

BMC Genomics ◽

10.1186/s12864-017-3906-0 ◽

2017 ◽

Vol 18 (1) ◽

Cited By ~ 49

Author(s):

Yuanyuan Li ◽

Kai Kang ◽

Juno M. Krahn ◽

Nicole Croutwater ◽

Kevin Lee ◽

...

Keyword(s):

Gene Expression ◽

Gene Expression Data ◽

Cancer Genome ◽

Cancer Classification ◽

The Cancer Genome Atlas ◽

Expression Data ◽

Cancer Genome Atlas ◽

Pan Cancer ◽

Genome Atlas

Download Full-text

Explainable autoencoder-based representation learning for gene expression data

10.1101/2021.12.21.473742 ◽

2021 ◽

Author(s):

Yang Yu ◽

Pathum Kossinna ◽

Wenyuan Liao ◽

Qingrun Zhang

Keyword(s):

Gene Expression ◽

Gene Expression Data ◽

Hidden Variables ◽

Representation Learning ◽

The Cancer Genome Atlas ◽

Expression Data ◽

Rna Seq ◽

Gene Expression Data Analysis ◽

Cancer Genome Atlas ◽

Modern Machine

Modern machine learning methods have been extensively utilized in gene expression data analysis. In particular, autoencoders (AE) have been employed in processing noisy and heterogenous RNA-Seq data. However, AEs usually lead to "black-box" hidden variables difficult to interpret, hindering downstream experimental validation and clinical translation. To bridge the gap between complicated models and biological interpretations, we developed a tool, XAE4Exp (eXplainable AutoEncoder for Expression data), which integrates AE and SHapley Additive exPlanations (SHAP), a flagship technique in the field of eXplainable AI (XAI). It quantitatively evaluates the contributions of each gene to the hidden structure learned by an AE, substantially improving the expandability of AE outcomes. By applying XAE4Exp to The Cancer Genome Atlas (TCGA) breast cancer gene expression data, we identified genes that are not differentially expressed, and pathways in various cancer-related classes. This tool will enable researchers and practitioners to analyze high-dimensional expression data intuitively, paving the way towards broader uses of deep learning.

Download Full-text

The Analysis of Gene Expression Data Incorporating Tumor Purity Information

Frontiers in Genetics ◽

10.3389/fgene.2021.642759 ◽

2021 ◽

Vol 12 ◽

Author(s):

Seungjun Ahn ◽

Tyler Grimes ◽

Somnath Datta

Keyword(s):

Gene Expression ◽

Tumor Cells ◽

Gene Expression Data ◽

The Cancer Genome Atlas ◽

Data Sets ◽

Expression Data ◽

Tumor Purity ◽

Robust Model ◽

Differential Network ◽

Cancer Genome Atlas

The tumor microenvironment is composed of tumor cells, stroma cells, immune cells, blood vessels, and other associated non-cancerous cells. Gene expression measurements on tumor samples are an average over cells in the microenvironment. However, research questions often seek answers about tumor cells rather than the surrounding non-tumor tissue. Previous studies have suggested that the tumor purity (TP)—the proportion of tumor cells in a solid tumor sample—has a confounding effect on differential expression (DE) analysis of high vs. low survival groups. We investigate three ways incorporating the TP information in the two statistical methods used for analyzing gene expression data, namely, differential network (DN) analysis and DE analysis. Analysis 1 ignores the TP information completely, Analysis 2 uses a truncated sample by removing the low TP samples, and Analysis 3 uses TP as a covariate in the underlying statistical models. We use three gene expression data sets related to three different cancers from the Cancer Genome Atlas (TCGA) for our investigation. The networks from Analysis 2 have greater amount of differential connectivity in the two networks than that from Analysis 1 in all three cancer datasets. Similarly, Analysis 1 identified more differentially expressed genes than Analysis 2. Results of DN and DE analyses using Analysis 3 were mostly consistent with those of Analysis 1 across three cancers. However, Analysis 3 identified additional cancer-related genes in both DN and DE analyses. Our findings suggest that using TP as a covariate in a linear model is appropriate for DE analysis, but a more robust model is needed for DN analysis. However, because true DN or DE patterns are not known for the empirical datasets, simulated datasets can be used to study the statistical properties of these methods in future studies.

Download Full-text

Abstract PS18-12: Comparative analysis of differential gene expression by ancestry using primary breast cancers from Nigeria and the cancer genome atlas (TCGA)

10.1158/1538-7445.sabcs20-ps18-12 ◽

2021 ◽

Author(s):

Padma Sheila Rajagopal ◽

Yi-Hsuan S Tsai ◽

Ashley Hardeman ◽

Ian Hurley ◽

Aminah Sallam ◽

...

Keyword(s):

Gene Expression ◽

Comparative Analysis ◽

Differential Gene Expression ◽

Cancer Genome ◽

The Cancer Genome Atlas ◽

Breast Cancers ◽

Cancer Genome Atlas ◽

Differential Gene ◽

Genome Atlas

Download Full-text

Building Gene Networks by Analyzing Gene Expression Profiles

Advanced Methodologies and Technologies in Medicine and Healthcare - Advances in Medical Diagnosis, Treatment, and Care ◽

10.4018/978-1-5225-7489-7.ch003 ◽

2019 ◽

pp. 27-44

Author(s):

Crescenzio Gallo

Keyword(s):

Gene Expression ◽

Gene Expression Data ◽

Gene Networks ◽

Dna Microarrays ◽

Expression Profiles ◽

Expression Patterns ◽

Gene Expression Profiles ◽

Expression Data ◽

Gene Expressions ◽

Over Time

The possible applications of modeling and simulation in the field of bioinformatics are very extensive, ranging from understanding basic metabolic paths to exploring genetic variability. Experimental results carried out with DNA microarrays allow researchers to measure expression levels for thousands of genes simultaneously, across different conditions and over time. A key step in the analysis of gene expression data is the detection of groups of genes that manifest similar expression patterns. In this chapter, the authors examine various methods for analyzing gene expression data, addressing the important topics of (1) selecting the most differentially expressed genes, (2) grouping them by means of their relationships, and (3) classifying samples based on gene expressions.

Download Full-text

PABPC1 relevant bioinformatic profiling and prognostic value in gliomas

Future Oncology ◽

10.2217/fon-2019-0268 ◽

2020 ◽

Vol 16 (1) ◽

pp. 4279-4288 ◽

Cited By ~ 1

Author(s):

Qiangwei Wang ◽

Zhiliang Wang ◽

Zhaoshi Bao ◽

Chuanbao Zhang ◽

Zheng Wang ◽

...

Keyword(s):

Biological Process ◽

The Cancer Genome Atlas ◽

Analysis Tool ◽

Expression Data ◽

Clinical Value ◽

Mrna Expression Data ◽

R Language ◽

Molecular Features ◽

Cancer Genome Atlas ◽

Genome Atlas

Aim: We aimed at investigating molecular features and potential clinical value of PABPC1 in gliomas. Materials & methods: We assembled totally 1000 glioma samples with mRNA expression data from Chinese Glioma Genome Atlas and The Cancer Genome Atlas. We utilized R language as the main analysis tool. Gene Ontology was performed for functional analysis. Results: PABPC1 was downregulated in gliomas with higher malignance and PABPC1 may contribute as potential predictor of proneural subtype in gliomas. Higher expression of PABPC1 was significantly related to better prognosis and related to biological process of translation. Conclusion: Our finding improves the understanding of PABPC1 as a novel biomarker with potential therapeutic connotations.

Download Full-text

Identification of Potential Key Genes for Pathogenesis and Prognosis in Prostate Cancer by Integrated Analysis of Gene Expression Profiles and the Cancer Genome Atlas

Frontiers in Oncology ◽

10.3389/fonc.2020.00809 ◽

2020 ◽

Vol 10 ◽

Cited By ~ 2

Author(s):

Shuang Liu ◽

Wenxin Wang ◽

Yan Zhao ◽

Kaige Liang ◽

Yaojiang Huang

Keyword(s):

Gene Expression ◽

Prostate Cancer ◽

Expression Profiles ◽

Gene Expression Profiles ◽

Cancer Genome ◽

The Cancer Genome Atlas ◽

Integrated Analysis ◽

Cancer Genome Atlas ◽

Key Genes ◽

Genome Atlas

Download Full-text

GEDS: A Gene Expression Display Server for mRNAs, miRNAs and Proteins

Cells ◽

10.3390/cells8070675 ◽

2019 ◽

Vol 8 (7) ◽

pp. 675 ◽

Cited By ~ 5

Author(s):

Xia ◽

Liu ◽

Zhang ◽

Guo

Keyword(s):

Gene Expression ◽

Cell Lines ◽

Gene Expression Data ◽

Expression Profiles ◽

Gene Expression Profiles ◽

Cancer Cell Line ◽

Tissue Expression ◽

The Cancer Genome Atlas ◽

Expression Data ◽

Protein Levels

High-throughput technologies generate a tremendous amount of expression data on mRNA, miRNA and protein levels. Mining and visualizing the large amount of expression data requires sophisticated computational skills. An easy to use and user-friendly web-server for the visualization of gene expression profiles could greatly facilitate data exploration and hypothesis generation for biologists. Here, we curated and normalized the gene expression data on mRNA, miRNA and protein levels in 23315, 9009 and 9244 samples, respectively, from 40 tissues (The Cancer Genome Atlas (TCGA) and Genotype-Tissue Expression (GETx)) and 1594 cell lines (Cancer Cell Line Encyclopedia (CCLE) and MD Anderson Cell Lines Project (MCLP)). Then, we constructed the Gene Expression Display Server (GEDS), a web-based tool for quantification, comparison and visualization of gene expression data. GEDS integrates multiscale expression data and provides multiple types of figures and tables to satisfy several kinds of user requirements. The comprehensive expression profiles plotted in the one-stop GEDS platform greatly facilitate experimental biologists utilizing big data for better experimental design and analysis. GEDS is freely available on http://bioinfo.life.hust.edu.cn/web/GEDS/.

Download Full-text