GEDS: A Gene Expression Display Server for mRNAs, miRNAs and Proteins

High-throughput technologies generate a tremendous amount of expression data on mRNA, miRNA and protein levels. Mining and visualizing the large amount of expression data requires sophisticated computational skills. An easy to use and user-friendly web-server for the visualization of gene expression profiles could greatly facilitate data exploration and hypothesis generation for biologists. Here, we curated and normalized the gene expression data on mRNA, miRNA and protein levels in 23315, 9009 and 9244 samples, respectively, from 40 tissues (The Cancer Genome Atlas (TCGA) and Genotype-Tissue Expression (GETx)) and 1594 cell lines (Cancer Cell Line Encyclopedia (CCLE) and MD Anderson Cell Lines Project (MCLP)). Then, we constructed the Gene Expression Display Server (GEDS), a web-based tool for quantification, comparison and visualization of gene expression data. GEDS integrates multiscale expression data and provides multiple types of figures and tables to satisfy several kinds of user requirements. The comprehensive expression profiles plotted in the one-stop GEDS platform greatly facilitate experimental biologists utilizing big data for better experimental design and analysis. GEDS is freely available on http://bioinfo.life.hust.edu.cn/web/GEDS/.

Download Full-text

Predicting drug-induced transcriptome responses of a wide range of human cell lines by a novel tensor-train decomposition algorithm

Bioinformatics ◽

10.1093/bioinformatics/btz313 ◽

2019 ◽

Vol 35 (14) ◽

pp. i191-i199 ◽

Cited By ~ 4

Author(s):

Michio Iwata ◽

Longhao Yuan ◽

Qibin Zhao ◽

Yasuo Tabei ◽

Francois Berenger ◽

...

Keyword(s):

Gene Expression ◽

Cell Lines ◽

Gene Expression Data ◽

Human Cell ◽

Expression Profiles ◽

Gene Expression Profiles ◽

Human Cell Lines ◽

Expression Data ◽

Drug Induced ◽

Wide Range

Abstract Motivation Genome-wide identification of the transcriptomic responses of human cell lines to drug treatments is a challenging issue in medical and pharmaceutical research. However, drug-induced gene expression profiles are largely unknown and unobserved for all combinations of drugs and human cell lines, which is a serious obstacle in practical applications. Results Here, we developed a novel computational method to predict unknown parts of drug-induced gene expression profiles for various human cell lines and predict new drug therapeutic indications for a wide range of diseases. We proposed a tensor-train weighted optimization (TT-WOPT) algorithm to predict the potential values for unknown parts in tensor-structured gene expression data. Our results revealed that the proposed TT-WOPT algorithm can accurately reconstruct drug-induced gene expression data for a range of human cell lines in the Library of Integrated Network-based Cellular Signatures. The results also revealed that in comparison with the use of original gene expression profiles, the use of imputed gene expression profiles improved the accuracy of drug repositioning. We also performed a comprehensive prediction of drug indications for diseases with gene expression profiles, which suggested many potential drug indications that were not predicted by previous approaches. Supplementary information Supplementary data are available at Bioinformatics online.

Download Full-text

Connecting gene expression data from connectivity map and in silico target predictions for small molecule mechanism-of-action analysis

Molecular BioSystems ◽

10.1039/c4mb00328d ◽

2015 ◽

Vol 11 (1) ◽

pp. 86-96 ◽

Cited By ~ 17

Author(s):

Aakash Chavan Ravindranath ◽

Nolen Perualila-Tan ◽

Adetayo Kasim ◽

Georgios Drakakis ◽

Sonia Liggi ◽

...

Keyword(s):

Gene Expression ◽

Ligand Binding ◽

Gene Expression Data ◽

Mechanism Of Action ◽

In Silico ◽

Expression Profiles ◽

Gene Expression Profiles ◽

Expression Data ◽

Connectivity Map ◽

Action Analysis

Integrating gene expression profiles with certain proteins can improve our understanding of the fundamental mechanisms in protein–ligand binding.

Download Full-text

Building Gene Networks by Analyzing Gene Expression Profiles

Advanced Methodologies and Technologies in Medicine and Healthcare - Advances in Medical Diagnosis, Treatment, and Care ◽

10.4018/978-1-5225-7489-7.ch003 ◽

2019 ◽

pp. 27-44

Author(s):

Crescenzio Gallo

Keyword(s):

Gene Expression ◽

Gene Expression Data ◽

Gene Networks ◽

Dna Microarrays ◽

Expression Profiles ◽

Expression Patterns ◽

Gene Expression Profiles ◽

Expression Data ◽

Gene Expressions ◽

Over Time

The possible applications of modeling and simulation in the field of bioinformatics are very extensive, ranging from understanding basic metabolic paths to exploring genetic variability. Experimental results carried out with DNA microarrays allow researchers to measure expression levels for thousands of genes simultaneously, across different conditions and over time. A key step in the analysis of gene expression data is the detection of groups of genes that manifest similar expression patterns. In this chapter, the authors examine various methods for analyzing gene expression data, addressing the important topics of (1) selecting the most differentially expressed genes, (2) grouping them by means of their relationships, and (3) classifying samples based on gene expressions.

Download Full-text

Association Between Gene Expression Profiles and Commonly Mutated Genes In The Hematopoietic Stem Cells Of Patients With Myelodysplastic Syndromes

Blood ◽

10.1182/blood.v122.21.2779.2779 ◽

2013 ◽

Vol 122 (21) ◽

pp. 2779-2779 ◽

Cited By ~ 1

Author(s):

Andrea Pellagatti ◽

Moritz Gerstung ◽

Elli Papaemmanuil ◽

Luca Malcovati ◽

Aristoteles Giagounidis ◽

...

Keyword(s):

Gene Expression ◽

Differentially Expressed Genes ◽

Gene Expression Data ◽

Expression Profiles ◽

Gene Mutations ◽

Gene Expression Profiles ◽

Differentially Expressed ◽

Expression Data ◽

Common Gene ◽

Mutational Status

Abstract A particular profile of gene expression can reflect an underlying molecular abnormality in malignancy. Distinct gene expression profiles and deregulated gene pathways can be driven by specific gene mutations and may shed light on the biology of the disease and lead to the identification of new therapeutic targets. We selected 143 cases from our large-scale gene expression profiling (GEP) dataset on bone marrow CD34+ cells from patients with myelodysplastic syndromes (MDS), for which matching genotyping data were obtained using next-generation sequencing of a comprehensive list of 111 genes involved in myeloid malignancies (including the spliceosomal genes SF3B1, SRSF2, U2AF1 and ZRSR2, as well as TET2, ASXL1and many other). The GEP data were then correlated with the mutational status to identify significantly differentially expressed genes associated with each of the most common gene mutations found in MDS. The expression levels of the mutated genes analyzed were generally lower in patients carrying a mutation than in patients wild-type for that gene (e.g. SF3B1, ASXL1 and TP53), with the exception of RUNX1 for which patients carrying a mutation showed higher expression levels than patients without mutation. Principal components analysis showed that the main directions of gene expression changes (principal components) tend to coincide with some of the common gene mutations, including SF3B1, SRSF2 and TP53. SF3B1 and STAG2 were the mutated genes showing the highest number of associated significantly differentially expressed genes, including ABCB7 as differentially expressed in association with SF3B1 mutation and SULT2A1 in association with STAG2 mutation. We found distinct differentially expressed genes associated with the four most common splicing gene mutations (SF3B1, SRSF2, U2AF1 and ZRSR2) in MDS, suggesting that different phenotypes associated with these mutations may be driven by different effects on gene expression and that the target gene may be different. We have also evaluated the prognostic impact of the GEP data in comparison with that of the genotype data and importantly we have found a larger contribution of gene expression data in predicting progression free survival compared to mutation-based multivariate survival models. In summary, this analysis correlating gene expression data with genotype data has revealed that the mutational status shapes the gene expression landscape. We have identified deregulated genes associated with the most common gene mutations in MDS and found that the prognostic power of gene expression data is greater than the prognostic power provided by mutation data. AP and MG contributed equally to this work. JB and PJC are co-senior authors. Disclosures: No relevant conflicts of interest to declare.

Download Full-text

Bayesian log-normal deconvolution for enhanced in silico microdissection of bulk gene expression data

Nature Communications ◽

10.1038/s41467-021-26328-2 ◽

2021 ◽

Vol 12 (1) ◽

Author(s):

Bárbara Andrade Barbosa ◽

Saskia D. van Asten ◽

Ji Won Oh ◽

Arantza Farina-Sarasqueta ◽

Joanne Verheij ◽

...

Keyword(s):

Gene Expression ◽

Gene Expression Data ◽

Expression Profiles ◽

Gene Expression Profiles ◽

Cell Types ◽

Expression Data ◽

Cell Type ◽

Expression Variability ◽

Variable Nature ◽

Log Normal

AbstractDeconvolution of bulk gene expression profiles into the cellular components is pivotal to portraying tissue’s complex cellular make-up, such as the tumor microenvironment. However, the inherently variable nature of gene expression requires a comprehensive statistical model and reliable prior knowledge of individual cell types that can be obtained from single-cell RNA sequencing. We introduce BLADE (Bayesian Log-normAl Deconvolution), a unified Bayesian framework to estimate both cellular composition and gene expression profiles for each cell type. Unlike previous comprehensive statistical approaches, BLADE can handle > 20 types of cells due to the efficient variational inference. Throughout an intensive evaluation with > 700 simulated and real datasets, BLADE demonstrated enhanced robustness against gene expression variability and better completeness than conventional methods, in particular, to reconstruct gene expression profiles of each cell type. In summary, BLADE is a powerful tool to unravel heterogeneous cellular activity in complex biological systems from standard bulk gene expression data.

Download Full-text

Systematic assessment of multi-gene predictors of pan-cancer cell line sensitivity to drugs exploiting gene expression data

F1000Research ◽

10.12688/f1000research.10529.1 ◽

2016 ◽

Vol 5 ◽

pp. 2927 ◽

Cited By ~ 9

Author(s):

Linh Nguyen ◽

Cuong C Dang ◽

Pedro J. Ballester

Keyword(s):

Gene Expression ◽

Machine Learning ◽

Cell Line ◽

Cell Lines ◽

Gene Expression Data ◽

Single Gene ◽

Cancer Cell Line ◽

Expression Data ◽

Gene Markers ◽

Pan Cancer

Background:Selected gene mutations are routinely used to guide the selection of cancer drugs for a given patient tumour. Large pharmacogenomic data sets were introduced to discover more of these single-gene markers of drug sensitivity. Very recently, machine learning regression has been used to investigate how well cancer cell line sensitivity to drugs is predicted depending on the type of molecular profile. The latter has revealed that gene expression data is the most predictive profile in the pan-cancer setting. However, no study to date has exploited GDSC data to systematically compare the performance of machine learning models based on multi-gene expression data against that of widely-used single-gene markers based on genomics data.Methods:Here we present this systematic comparison using Random Forest (RF) classifiers exploiting the expression levels of 13,321 genes and an average of 501 tested cell lines per drug. To account for time-dependent batch effects in IC50measurements, we employ independent test sets generated with more recent GDSC data than that used to train the predictors and show that this is a more realistic validation than K-fold cross-validation.Results and Discussion:Across 127 GDSC drugs, our results show that the single-gene markers unveiled by the MANOVA analysis tend to achieve higher precision than these RF-based multi-gene models, at the cost of generally having a poor recall (i.e. correctly detecting only a small part of the cell lines sensitive to the drug). Regarding overall classification performance, about two thirds of the drugs are better predicted by multi-gene RF classifiers. Among the drugs with the most predictive of these models, we found pyrimethamine, sunitinib and 17-AAG.Conclusions:We now know that this type of models can predictin vitrotumour response to these drugs. These models can thus be further investigated onin vivotumour models.

Download Full-text

BLADE: Bayesian Log-normAl DEconvolution for enhanced in silico microdissection of bulk gene expression data

10.21203/rs.3.rs-123595/v1 ◽

2020 ◽

Author(s):

Bárbara Andrade Barbosa ◽

Saskia van Asten ◽

Ji-won Oh ◽

Arantza Fariña-Sarasqueta ◽

Joanne Verheij ◽

...

Keyword(s):

Gene Expression ◽

Gene Expression Data ◽

Expression Profiles ◽

Gene Expression Profiles ◽

Cell Types ◽

Expression Data ◽

Cell Type ◽

Expression Variability ◽

Variable Nature ◽

Log Normal

Abstract High-resolution deconvolution of bulk gene expression profiles is pivotal to characterize the complex cellular make-up of tissues, such as tumor microenvironment. Single-cell RNA-seq provides reliable prior knowledge for deconvolution, however, a comprehensive statistical model is required for efficient utilization due to the inherently variable nature of gene expression. We introduce BLADE (Bayesian Log-normAl Deconvolution), a comprehensive probabilistic framework to estimate both cellular make-up and gene expression profiles of each cell type in each sample. Unlike previous comprehensive statistical approaches, BLADE can handle >20 cell types thanks to the efficient variational inference. Throughout an intensive evaluation using >700 datasets, BLADE showed enhanced robustness against gene expression variability and better completeness than conventional methods, in particular to reconstruct gene expression profiles of each cell type. All-in-all, BLADE is a powerful tool to unravel heterogeneous cellular activity in complex biological systems based on standard bulk gene expression data.

Download Full-text

xSyn: A Software Tool for Identifying Sophisticated 3-Way Interactions From Cancer Expression Data

Cancer Informatics ◽

10.1177/1176935117728516 ◽

2017 ◽

Vol 16 ◽

pp. 117693511772851 ◽

Cited By ~ 1

Author(s):

Baishali Bandyopadhyay ◽

Veda Chanda ◽

Yupeng Wang

Keyword(s):

Gene Expression ◽

Gene Expression Data ◽

Expression Profiles ◽

Gene Expression Profiles ◽

Optimization Procedure ◽

Software Tool ◽

Group Method ◽

Expression Data ◽

Linear Regression Models ◽

Expression Levels

Background: Constructing gene co-expression networks from cancer expression data is important for investigating the genetic mechanisms underlying cancer. However, correlation coefficients or linear regression models are not able to model sophisticated relationships among gene expression profiles. Here, we address the 3-way interaction that 2 genes’ expression levels are clustered in different space locations under the control of a third gene’s expression levels. Results: We present xSyn, a software tool for identifying such 3-way interactions from cancer gene expression data based on an optimization procedure involving the usage of UPGMA (Unweighted Pair Group Method with Arithmetic Mean) and synergy. The effectiveness is demonstrated by application to 2 real gene expression data sets. Conclusions: xSyn is a useful tool for decoding the complex relationships among gene expression profiles. xSyn is available at http://www.bdxconsult.com/xSyn.html .

Download Full-text

Novel Epigenetic 12-gene Signature Predictive of Poor Prognosis and MSI-like Phenotype in Human Metastatic Colorectal Carcinomas

10.21203/rs.3.rs-74746/v1 ◽

2020 ◽

Author(s):

Valentina Condelli ◽

Giovanni Calice ◽

Alessandra Cassano ◽

Michele Basso ◽

Maria Grazia Rodriquenz ◽

...

Keyword(s):

Gene Expression ◽

Cell Lines ◽

Gene Expression Data ◽

Poor Prognosis ◽

Expression Profiles ◽

Hct116 Cell ◽

Gene Copy ◽

Expression Data ◽

Global Methylation ◽

Differentially Methylated Genes

Abstract Background. Epigenetic remodeling is responsible for tumor progression and drug resistance in human colorectal carcinoma (CRC). A subgroup of human CRCs exhibits the CIMP status, with extensive hypermethylation events in promoter regions of several genes, even though the prognostic significance of CIMP is controversial. This study addressed the hypothesis that DNA methylation profiling may identify metastatic CRC (mCRC) subtypes with different clinical behavior. Methods. Global methylation profile was comparatively analyzed between 24 first-line primary-resistant and 12 drug-sensitive mCRCs (in-house cohort), two subgroups of tumors with significantly different outcome. Methylation and gene expression data from 33 mCRCs of the TCGA COAD dataset (TCGA COAD cohort) were used to identify, among differentially methylated genes, a prognostic signature of functionally methylated genes. Clusters of mCRCs with different methylation patterns were further characterized for DNA mutational load, gene copy number and gene expression profiles. Human CRC HT29 and HCT116 cell lines were adapted to growth in presence of oxaliplatin and irinotecan and used as in vitro model to validate gene expression data.Results. Twelve functionally methylated genes yielded a hierarchical clustering of patients in two well-defined clusters with hypermethylated tumors characterized by a significantly worse relapse-free and overall survival compared to hypomethylated cancers and this was reproduced in both the in-house and the TCGA COAD cohorts. Interestingly, the hypermethylated poor prognosis cluster was enriched of CIMP-high and MSI-like cases. Furthermore, methylation events were enriched in genes located on q-arm of chromosomes 13 and 20, two chromosomal regions with gain/loss alterations strongly associated with adenoma-to-carcinoma progression. Finally, the expression of the 12-genes signature and MSI-enriching genes was confirmed in two independent oxaliplatin- and irinotecan-resistant CRC cell lines. Conclusions. These data represent the proof of concept that the hypermethylation of specific sets of genes may provide prognostic information being able to identify a subgroup of mCRCs with poor prognosis.

Download Full-text