ImmuCellDB: An Indicative Database of Immune Cell Composition From Different Tissues and Disease Conditions in Mouse and Human

Immune cell composition is highly divergent across different tissues and diseases. A comprehensive resource of tissue immune cells across different conditions in mouse and human will thus provide great understanding of the immune microenvironment of many diseases. Recently, computational methods for estimating immune cell abundance from tissue transcriptome data have been developed and are now widely used. Using these computational tools, large-scale estimation of immune cell composition across tissues and conditions should be possible using gene expression data collected from public databases. In total, 266 tissue types and 706 disease types in humans, as well as 143 tissue types and 61 disease types, and 206 genotypes in mouse had been included in a database we have named ImmuCellDB (http://wap-lab.org:3200/ImmuCellDB/). In ImmuCellDB, users can search and browse immune cell proportions based on tissues, disease or genotype in mouse or humans. Additionally, the variation and correlation of immune cell abundance and gene expression level between different conditions can be compared and viewed in this database. We believe that ImmuCellDB provides not only an indicative view of tissue-dependent or disease-dependent immune cell profiles, but also represents an easy way to pre-determine immune cell abundance and gene expression profiles for specific situations.

Download Full-text

Improved cell composition deconvolution method of bulk gene expression profiles to quantify subsets of immune cells

BMC Medical Genomics ◽

10.1186/s12920-019-0613-5 ◽

2019 ◽

Vol 12 (S8) ◽

Cited By ~ 2

Author(s):

Yen-Jung Chiu ◽

Yi-Hsuan Hsieh ◽

Yen-Hua Huang

Keyword(s):

Gene Expression ◽

Reference Gene ◽

Immune Cells ◽

Immune Cell ◽

Expression Profiles ◽

Gene Expression Profiles ◽

Deconvolution Method ◽

Cell Composition ◽

Reference Gene Expression ◽

Cell Fractions

Abstract Background To facilitate the investigation of the pathogenic roles played by various immune cells in complex tissues such as tumors, a few computational methods for deconvoluting bulk gene expression profiles to predict cell composition have been created. However, available methods were usually developed along with a set of reference gene expression profiles consisting of imbalanced replicates across different cell types. Therefore, the objective of this study was to create a new deconvolution method equipped with a new set of reference gene expression profiles that incorporate more microarray replicates of the immune cells that have been frequently implicated in the poor prognosis of cancers, such as T helper cells, regulatory T cells and macrophage M1/M2 cells. Methods Our deconvolution method was developed by choosing ε-support vector regression (ε-SVR) as the core algorithm assigned with a loss function subject to the L1-norm penalty. To construct the reference gene expression signature matrix for regression, a subset of differentially expressed genes were chosen from 148 microarray-based gene expression profiles for 9 types of immune cells by using ANOVA and minimizing condition number. Agreement analyses including mean absolute percentage errors and Bland-Altman plots were carried out to compare the performances of our method and CIBERSORT. Results In silico cell mixtures, simulated bulk tissues, and real human samples with known immune-cell fractions were used as the test datasets for benchmarking. Our method outperformed CIBERSORT in the benchmarks using in silico breast tissue-immune cell mixtures in the proportions of 30:70 and 50:50, and in the benchmark using 164 human PBMC samples. Our results suggest that the performance of our method was at least comparable to that of a state-of-the-art tool, CIBERSORT. Conclusions We developed a new cell composition deconvolution method and the implementation was entirely based on the publicly available R and Python packages. In addition, we compiled a new set of reference gene expression profiles, which might allow for a more robust prediction of the immune cell fractions from the expression profiles of cell mixtures. The source code of our method could be downloaded from https://github.com/holiday01/deconvolution-to-estimate-immune-cell-subsets.

Download Full-text

LncGSEA: a versatile tool to infer lncRNA associated pathways from large-scale cancer transcriptome sequencing data

BMC Genomics ◽

10.1186/s12864-021-07900-y ◽

2021 ◽

Vol 22 (1) ◽

Author(s):

Yanan Ren ◽

Ting-You Wang ◽

Leah C. Anderton ◽

Qi Cao ◽

Rendong Yang

Keyword(s):

Gene Expression ◽

Large Scale ◽

Expression Profiles ◽

Gene Expression Profiles ◽

Clinical Samples ◽

Sequencing Data ◽

Multiple Cancer ◽

Regulatory Pathways ◽

Cancer Transcriptome ◽

Versatile Tool

Abstract Background Long non-coding RNAs (lncRNAs) are a growing focus in cancer research. Deciphering pathways influenced by lncRNAs is important to understand their role in cancer. Although knock-down or overexpression of lncRNAs followed by gene expression profiling in cancer cell lines are established approaches to address this problem, these experimental data are not available for a majority of the annotated lncRNAs. Results As a surrogate, we present lncGSEA, a convenient tool to predict the lncRNA associated pathways through Gene Set Enrichment Analysis of gene expression profiles from large-scale cancer patient samples. We demonstrate that lncGSEA is able to recapitulate lncRNA associated pathways supported by literature and experimental validations in multiple cancer types. Conclusions LncGSEA allows researchers to infer lncRNA regulatory pathways directly from clinical samples in oncology. LncGSEA is written in R, and is freely accessible at https://github.com/ylab-hi/lncGSEA.

Download Full-text

Predicting Host Immune Cell Dynamics and Key Disease-Associated Genes Using Tissue Transcriptional Profiles

Processes ◽

10.3390/pr7050301 ◽

2019 ◽

Vol 7 (5) ◽

pp. 301

Author(s):

Muying Wang ◽

Satoshi Fukuyama ◽

Yoshihiro Kawaoka ◽

Jason E. Shoemaker

Keyword(s):

Gene Expression ◽

Gene Expression Data ◽

Immune Cell ◽

Mean Squared Error ◽

Expression Profiles ◽

Statistical Tests ◽

Critical Factor ◽

Expression Data ◽

Cell Dynamics ◽

Cell Counts

Motivation: Immune cell dynamics is a critical factor of disease-associated pathology (immunopathology) that also impacts the levels of mRNAs in diseased tissue. Deconvolution algorithms attempt to infer cell quantities in a tissue/organ sample based on gene expression profiles and are often evaluated using artificial, non-complex samples. Their accuracy on estimating cell counts given temporal tissue gene expression data remains not well characterized and has never been characterized when using diseased lung. Further, how to remove the effects of cell migration on transcript counts to improve discovery of disease factors is an open question. Results: Four cell count inference (i.e., deconvolution) tools are evaluated using microarray data from influenza-infected lung sampled at several time points post-infection. The analysis finds that inferred cell quantities are accurate only for select cell types and there is a tendency for algorithms to have a good relative fit (R 2 ) but a poor absolute fit (normalized mean squared error; NMSE), which suggests systemic biases exist. Nonetheless, using cell fraction estimates to adjust gene expression data, we show that genes associated with influenza virus replication and increased infection pathology are more likely to be identified as significant than when applying traditional statistical tests.

Download Full-text

Time Dependent Gene Expression Changes in the Liver of Mice Treated with Benzene

Biomarker Insights ◽

10.4137/bmi.s590 ◽

2008 ◽

Vol 3 ◽

pp. BMI.S590 ◽

Cited By ~ 3

Author(s):

Han-Jin Park ◽

Jung Hwa Oh ◽

Seokjoo Yoon ◽

S.V.S. Rana

Keyword(s):

Gene Expression ◽

Expression Profiles ◽

Gene Expression Profiles ◽

General Purpose ◽

Time Dependent ◽

Expression Data ◽

Benzene Exposure ◽

Microarray Analyses ◽

First Time ◽

Affymetrix Gene Chip

Benzene is used as a general purpose solvent. Benzene metabolism starts from phenol and ends with p-benzoquinone and o-benzoquinone. Liver injury inducted by benzene still remains a toxicologic problem. Tumor related genes and immune responsive genes have been studied in patients suffering from benzene exposure. However, gene expression profiles and pathways related to its hepatotoxicity are not known. This study reports the results obtained in the liver of BALB/C mice (SLC, Inc., Japan) administered 0.05 ml/100 g body weight of 2% benzene for six days. Serum, ALT, AST and ALP were determined using automated analyzer (Fuji., Japan). Histopathological observations were made to support gene expression data. c-DNA microarray analyses were performed using Affymetrix Gene-chip system. After six days of benzene exposure, twenty five genes were down regulated whereas nineteen genes were up-regulated. These gene expression changes were found to be related to pathways of biotransformation, detoxification, apoptosis, oxidative stress and cell cycle. It has been shown for the first time that genes corresponding to circadian rhythms are affected by benzene. Results suggest that gene expression profile might serve as potential biomarkers of hepatotoxicity during benzene exposure.

Download Full-text

Analysis of blood-based gene expression in idiopathic Parkinson disease

Neurology ◽

10.1212/wnl.0000000000004516 ◽

2017 ◽

Vol 89 (16) ◽

pp. 1676-1683 ◽

Cited By ~ 36

Author(s):

Ron Shamir ◽

Christine Klein ◽

David Amar ◽

Eva-Juliane Vollstedt ◽

Michael Bonin ◽

...

Keyword(s):

Gene Expression ◽

Parkinson Disease ◽

Gene Networks ◽

Large Scale ◽

Expression Profiles ◽

Area Under The Curve ◽

Gene Expression Profiles ◽

Gene Signature ◽

Gene Profiles ◽

Independent Test

Objective:To examine whether gene expression analysis of a large-scale Parkinson disease (PD) patient cohort produces a robust blood-based PD gene signature compared to previous studies that have used relatively small cohorts (≤220 samples).Methods:Whole-blood gene expression profiles were collected from a total of 523 individuals. After preprocessing, the data contained 486 gene profiles (n = 205 PD, n = 233 controls, n = 48 other neurodegenerative diseases) that were partitioned into training, validation, and independent test cohorts to identify and validate a gene signature. Batch-effect reduction and cross-validation were performed to ensure signature reliability. Finally, functional and pathway enrichment analyses were applied to the signature to identify PD-associated gene networks.Results:A gene signature of 100 probes that mapped to 87 genes, corresponding to 64 upregulated and 23 downregulated genes differentiating between patients with idiopathic PD and controls, was identified with the training cohort and successfully replicated in both an independent validation cohort (area under the curve [AUC] = 0.79, p = 7.13E–6) and a subsequent independent test cohort (AUC = 0.74, p = 4.2E–4). Network analysis of the signature revealed gene enrichment in pathways, including metabolism, oxidation, and ubiquitination/proteasomal activity, and misregulation of mitochondria-localized genes, including downregulation of COX4I1, ATP5A1, and VDAC3.Conclusions:We present a large-scale study of PD gene expression profiling. This work identifies a reliable blood-based PD signature and highlights the importance of large-scale patient cohorts in developing potential PD biomarkers.

Download Full-text

Discovering Distinct Patterns in Gene Expression Profiles

Journal of Integrative Bioinformatics ◽

10.1515/jib-2008-105 ◽

2008 ◽

Vol 5 (2) ◽

Cited By ~ 1

Author(s):

Li Teng ◽

Laiwan Chan

Keyword(s):

Gene Expression ◽

Large Scale ◽

Expression Profiles ◽

Expression Patterns ◽

Gene Expression Profiles ◽

Clustering Methods ◽

Gene Expressions ◽

Real Gene ◽

Large Scale Dataset ◽

Coexpressed Genes

SummaryTraditional analysis of gene expression profiles use clustering to find groups of coexpressed genes which have similar expression patterns. However clustering is time consuming and could be diffcult for very large scale dataset. We proposed the idea of Discovering Distinct Patterns (DDP) in gene expression profiles. Since patterns showing by the gene expressions reveal their regulate mechanisms. It is significant to find all different patterns existing in the dataset when there is little prior knowledge. It is also a helpful start before taking on further analysis. We propose an algorithm for DDP by iteratively picking out pairs of gene expression patterns which have the largest dissimilarities. This method can also be used as preprocessing to initialize centers for clustering methods, like K-means. Experiments on both synthetic dataset and real gene expression datasets show our method is very effective in finding distinct patterns which have gene functional significance and is also effcient.

Download Full-text

CFTR ΔF508 mutation has minimal effect on the gene expression profile of differentiated human airway epithelia

AJP Lung Cellular and Molecular Physiology ◽

10.1152/ajplung.00065.2005 ◽

2005 ◽

Vol 289 (4) ◽

pp. L545-L553 ◽

Cited By ~ 29

Author(s):

Joseph Zabner ◽

Todd E. Scheetz ◽

Hakeem G. Almabrazi ◽

Thomas L. Casavant ◽

Jian Huang ◽

...

Keyword(s):

Gene Expression ◽

Cystic Fibrosis ◽

Large Scale ◽

Expression Profiles ◽

Expression Patterns ◽

Primary Cultures ◽

Gene Expression Profiles ◽

Filter Method ◽

Tissue Destruction ◽

Airway Epithelia

Cystic fibrosis (CF) is caused by mutations in the cystic fibrosis transmembrane conductance regulator (CFTR), an epithelial chloride channel regulated by phosphorylation. Most of the disease-associated morbidity is the consequence of chronic lung infection with progressive tissue destruction. As an approach to investigate the cellular effects of CFTR mutations, we used large-scale microarray hybridization to contrast the gene expression profiles of well-differentiated primary cultures of human CF and non-CF airway epithelia grown under resting culture conditions. We surveyed the expression profiles for 10 non-CF and 10 ΔF508 homozygote samples. Of the 22,283 genes represented on the Affymetrix U133A GeneChip, we found evidence of significant changes in expression in 24 genes by two-sample t-test ( P < 0.00001). A second, three-filter method of comparative analysis found no significant differences between the groups. The levels of CFTR mRNA were comparable in both groups. There were no significant differences in the gene expression patterns between male and female CF specimens. There were 18 genes with significant increases and 6 genes with decreases in CF relative to non-CF samples. Although the function of many of the differentially expressed genes is unknown, one transcript that was elevated in CF, the KCl cotransporter (KCC4), is a candidate for further study. Overall, the results indicate that CFTR dysfunction has little direct impact on airway epithelial gene expression in samples grown under these conditions.

Download Full-text

Connecting gene expression data from connectivity map and in silico target predictions for small molecule mechanism-of-action analysis

Molecular BioSystems ◽

10.1039/c4mb00328d ◽

2015 ◽

Vol 11 (1) ◽

pp. 86-96 ◽

Cited By ~ 17

Author(s):

Aakash Chavan Ravindranath ◽

Nolen Perualila-Tan ◽

Adetayo Kasim ◽

Georgios Drakakis ◽

Sonia Liggi ◽

...

Keyword(s):

Gene Expression ◽

Ligand Binding ◽

Gene Expression Data ◽

Mechanism Of Action ◽

In Silico ◽

Expression Profiles ◽

Gene Expression Profiles ◽

Expression Data ◽

Connectivity Map ◽

Action Analysis

Integrating gene expression profiles with certain proteins can improve our understanding of the fundamental mechanisms in protein–ligand binding.

Download Full-text

Dynamical consequences of regional heterogeneity in the brain’s transcriptional landscape

10.1101/2020.10.28.359943 ◽

2020 ◽

Cited By ~ 1

Author(s):

Gustavo Deco ◽

Kevin Aquino ◽

Aurina Arnatkevičiūtė ◽

Stuart Oldham ◽

Kristina Sabaroedin ◽

...

Keyword(s):

Gene Expression ◽

Large Scale ◽

Expression Profiles ◽

Gene Expression Profiles ◽

Global Gene Expression ◽

Brain Regions ◽

Biophysical Model ◽

Neuronal Dynamics ◽

Regional Heterogeneity ◽

Magnetic Resonance Imaging Mri

AbstractBrain regions vary in their molecular and cellular composition, but how this heterogeneity shapes neuronal dynamics is unclear. Here, we investigate the dynamical consequences of regional heterogeneity using a biophysical model of whole-brain functional magnetic resonance imaging (MRI) dynamics in humans. We show that models in which transcriptional variations in excitatory and inhibitory receptor (E:I) gene expression constrain regional heterogeneity more accurately reproduce the spatiotemporal structure of empirical functional connectivity estimates than do models constrained by global gene expression profiles and MRI-derived estimates of myeloarchitecture. We further show that regional heterogeneity is essential for yielding both ignition-like dynamics, which are thought to support conscious processing, and a wide variance of regional activity timescales, which supports a broad dynamical range. We thus identify a key role for E:I heterogeneity in generating complex neuronal dynamics and demonstrate the viability of using transcriptional data to constrain models of large-scale brain function.

Download Full-text

909 Differentiation subgroups within LKB1-deficient lung cancer influence both the immune exclusion phenotype and cellular composition of the immune microenvironment

Journal for ImmunoTherapy of Cancer ◽

10.1136/jitc-2021-sitc2021.909 ◽

2021 ◽

Vol 9 (Suppl 3) ◽

pp. A954-A955

Author(s):

Jacob Kaufman ◽

Doug Cress ◽

Theresa Boyle ◽

David Carbone ◽

Neal Ready ◽

...

Keyword(s):

Gene Expression ◽

Lung Cancer ◽

Lung Adenocarcinoma ◽

Plasma Cells ◽

Immune Cell ◽

Expression Profiles ◽

Neuroendocrine Differentiation ◽

Gene Expression Profiles ◽

Immune Microenvironment ◽

Immune Exclusion

BackgroundLKB1 (STK11) is a commonly disrupted tumor suppressor in NSCLC. Its loss promotes an immune exclusion phenotype with evidence of low expression of interferon stimulated genes (ISG) and decreased microenvironment immune infiltration.1 2 Clinically, LKB1 loss induces primary immunotherapy resistance.3 LKB1 is a master regulator of a complex downstream kinase network and has pleiotropic effects on cell biology. Understanding the heterogeneous phenotypes associated with LKB1 loss and their influence on tumor-immune biology will help define and overcome mechanisms of immunotherapy resistance within this subset of lung cancer.MethodsWe applied multi-omic analyses across multiple lung adenocarcinoma datasets2 4–6 (>1000 tumors) to define transcriptional and genetic features enriched in LKB1-deficient lung cancer. Top scoring phenotypes exhibited heterogeneity across LKB1-loss tumors, and were further interrogated to determine association with increased or decreased markers of immune activity. Further, immune cell-types were estimated by Cibersort to identify effects of LKB1 loss on the immune microenvironment. Key conclusions were confirmed by blinded pathology review.ResultsWe show that LKB1 loss significantly affects differentiation patterns, with enrichment of ASCL1-expressing tumors with putative neuroendocrine differentiation. LKB1-deficient neuroendocrine tumors had lower expression of Interferon Stimulated Genes (ISG), MHC1 and MHC2 components, and immune infiltration compared to LKB1-WT and non-neuroendocrine LKB1-deficient tumors (figure 1).The abundances of 22 immune cell types assessed by Cibersort were compared between LKB1-deficient and LKB1-WT tumors. We observe skewing of immune microenvironmental composition by LKB1 loss, with lower abundance of dendritic cells, monocytes, and macrophages, and increased levels of neutrophils and plasma cells (table 1). These trends were most pronounced among tumors with neuroendocrine differentiation, and were concordant across three independent datasets. In a confirmatory subset of 20 tumors, plasma cell abundance was assessed by a blinded pathologist. Pathologist assessment was 100% concordant with Cibersort prediction, and association with LKB1 loss was confirmed (P=0.001).Abstract 909 Figure 1Immune-associated Gene Expression Profiles Affected by Neuroendocrine Differentiation within LKB1-Deficient Lung Adenocarcinomas. Gene expression profiles corresponding to five immune-associated phenotypes are shown with bars indicating average GEP scores for tumors grouped according to LKB1 and neuroendocrine status as indicated. P-values represent results from Student’s T-test between groups as indicated.Abstract 909 Table 1LKB1 Loss Affects Composition of Immune Microenvironment. Values indicate log10 P-values comparing LKB1-loss to LKB1-WT tumors. Positive (red) indicates increased abundance in LKB1 loss. Negative (blue) indicates decreased abundance.ConclusionsWe conclude that tumor differentiation patterns strongly influence the immune microenvironment and immune exclusion characteristics of LKB1-deficient tumors. Neuroendocrine differentiation is associated with the strongest immune exclusion characteristics and should be evaluated clinically for evidence of immunotherapy resistance. A novel observation of increased plasma cell abundance is observed across multiple datasets and confirmed by pathology. Causal mechanisms linking differentiation status to immune activity is not well understood, and the functional role of plasma cells in the immune biology of LKB1-deficient tumors is undefined. These questions warrant further study to inform precision immuno-oncology treatments for these patients.AcknowledgementsThis work was funded by SITC AZ Immunotherapy in Lung Cancer grant (SPS256666) and DOD Lung Cancer Research Program Concept Award (LC180633).ReferencesSkoulidis F, Byers LA, Diao L, et al. Co-occurring genomic alterations define major subsets of KRAS-mutant lung adenocarcinoma with distinct biology, immune profiles, and therapeutic vulnerabilities. Cancer Discov 2015;5:860–77.Schabath MB, Welsh EA, Fulp WJ, et al. Differential association of STK11 and TP53 with KRAS mutation-associated gene expression, proliferation and immune surveillance in lung adenocarcinoma. Oncogene 2016;35:3209–16.Skoulidis F, Goldberg ME, Greenawalt DM, et al. STK11/LKB1 mutations and PD-1 inhibitor resistance in KRAS-mutant lung adenocarcinoma. Cancer Discovery 2018;8:822-835.Cancer Genome Atlas Research Network. Comprehensive molecular profiling of lung adenocarcinoma. Nature 2014;511:543–50.Chitale D, Gong Y, Taylor BS, et al. An integrated genomic analysis of lung cancer reveals loss of DUSP4 in EGFR-mutant tumors. Oncogene 2009;28:2773–83.Shedden K, Taylor JM, Enkemann SA, et al. Gene expression-based survival prediction in lung adenocarcinoma: a multi-site, blinded validation study. Nat Med 2008;14:822–7.

Download Full-text