The effect of tissue composition on gene co-expression

Variable cellular composition of tissue samples represents a significant challenge for the interpretation of genomic profiling studies. Substantial effort has been devoted to modeling and adjusting for compositional differences when estimating differential expression between sample types. However, relatively little attention has been given to the effect of tissue composition on co-expression estimates. In this study, we illustrate the effect of variable cell type composition on correlation-based network estimation and provide a mathematical decomposition of the tissue-level correlation. We show that a class of deconvolution methods developed to separate tumor and stromal signatures can be applied to two component cell type mixtures. In simulated and real data, we identify conditions in which a deconvolution approach would be beneficial. Our results suggest that uncorrelated cell type specific markers are ideally suited to deconvolute both the expression and co expression patterns of an individual cell type. Finally, we provide a Shiny application for users to interactively explore the effect of cell type composition on correlation-based co-expression estimation for any cell types of interest.

Download Full-text

The effect of tissue composition on gene co-expression

Briefings in Bioinformatics ◽

10.1093/bib/bbz135 ◽

2019 ◽

Cited By ~ 6

Author(s):

Yun Zhang ◽

Jonavelle Cuerdo ◽

Marc K Halushka ◽

Matthew N McCall

Keyword(s):

Expression Patterns ◽

Real Data ◽

Cell Types ◽

Tissue Level ◽

Tissue Composition ◽

Cell Type ◽

Tissue Samples ◽

Cell Type Composition ◽

Type Composition ◽

Component Cell

Abstract Variable cellular composition of tissue samples represents a significant challenge for the interpretation of genomic profiling studies. Substantial effort has been devoted to modeling and adjusting for compositional differences when estimating differential expression between sample types. However, relatively little attention has been given to the effect of tissue composition on co-expression estimates. In this study, we illustrate the effect of variable cell-type composition on correlation-based network estimation and provide a mathematical decomposition of the tissue-level correlation. We show that a class of deconvolution methods developed to separate tumor and stromal signatures can be applied to two component cell-type mixtures. In simulated and real data, we identify conditions in which a deconvolution approach would be beneficial. Our results suggest that uncorrelated cell-type-specific markers are ideally suited to deconvolute both the expression and co-expression patterns of an individual cell type. We provide a Shiny application for users to interactively explore the effect of cell-type composition on correlation-based co-expression estimation for any cell types of interest.

Download Full-text

Adipose tissue in health and disease through the lens of its building blocks

10.1101/316083 ◽

2018 ◽

Cited By ~ 2

Author(s):

Michael Lenz ◽

Ilja C.W. Arts ◽

Ralf L.M. Peeters ◽

Theo M. de Kok ◽

Gökhan Ertaylan

Keyword(s):

Adipose Tissue ◽

Cell Types ◽

Cellular Heterogeneity ◽

Tissue Cell ◽

Cell Type ◽

Tissue Samples ◽

Cell Type Composition ◽

Type Composition ◽

Adipose Tissue Cell ◽

Cellular Markers

AbstractBackgroundHighly specialized cells work in synergy forming tissues to perform functions required for the survival of organisms. Understanding this tissue-specific cellular heterogeneity and homeostasis is essential to comprehend the development of diseases within the tissue and also for developing regenerative therapies. Cellular subpopulations in the adipose tissue have been related to disease development, but efforts towards characterizing the adipose tissue cell type composition are limited due to lack of robust cell surface markers, limited access to tissue samples, and the labor-intensive process required to identify them.ResultsWe propose a framework, identifying cellular heterogeneity while providing state-of-the-art cellular markers for each cell type present in tissues using transcriptomics level analysis. We validate our approach with an independent dataset and present the most comprehensive study of adipose tissue cell type composition to date, determining the relative amounts of 21 different cell types in 779 adipose tissue samples detailing differences across four adipose tissue depots, between genders, across ranges of BMI and in different stages of type-2 diabetes. We also highlight the heterogeneity in reported marker-based studies of adipose tissue cell type composition and provide novel cellular markers to distinguish different cell types within the adipose tissue.ConclusionsOur study provides a systematic framework for studying cell type composition in a given tissue and valuable insights into adipose tissue cell type heterogeneity in health and disease.

Download Full-text

Cell Type Aware analysis of RNA-seq data (CARseq) reveals difference and similarities of the molecular mechanisms of Schizophrenia and Autism

10.1101/2020.07.13.201061 ◽

2020 ◽

Author(s):

Chong Jin ◽

Mengjie Chen ◽

Danyu Lin ◽

Wei Sun

Keyword(s):

Differential Expression ◽

Molecular Mechanisms ◽

Differential Expression Analysis ◽

Cell Types ◽

Cell Type ◽

Specific Expression ◽

Tissue Samples ◽

Cell Type Composition ◽

Type Composition ◽

Cell Type Specific Expression

AbstractMost tissue samples are composed of different cell types. Differential expression analysis without accounting for cell type composition cannot separate the changes due to cell type composition or cell type-specific expression. We propose a new framework to address these limitations: Cell Type Aware analysis of RNA-seq (CARseq). After evaluating its performance in simulations, we apply CARseq to compare gene expression of schizophrenia/autism subjects versus controls. Our results show that these two neurodevelopmental disorders differ from each other in terms of cell type composition changes and differential expression associated with different types of neurotransmitter receptors. We also discover overlapping signals of differential expression in microglia, supporting the two diseases’ similarity through immune regulation.

Download Full-text

THUNDER: A reference-free deconvolution method to infer cell type proportions from bulk Hi-C data

10.1101/2020.11.12.379941 ◽

2020 ◽

Author(s):

Bryce Rowland ◽

Ruth Huh ◽

Zoey Hou ◽

Ming Hu ◽

Yin Shen ◽

...

Keyword(s):

Three Dimensional ◽

Real Data ◽

Cell Types ◽

Deconvolution Method ◽

Cell Type ◽

Cell Type Specificity ◽

Organization Studies ◽

Cell Type Composition ◽

Type Composition ◽

Downstream Analysis

AbstractHi-C data provide population averaged estimates of three-dimensional chromatin contacts across cell types and states in bulk samples. To effectively leverage Hi-C data for biological insights, we need to control for the confounding factor of differential cell type proportions across heterogeneous bulk samples. We propose a novel unsupervised deconvolution method for inferring cell type composition from bulk Hi-C data, the Two-step Hi-c UNsupervised DEconvolution appRoach (THUNDER). We conducted extensive real data based simulations to test THUNDER constructed from published single-cell Hi-C (scHi-C) data. THUNDER more accurately estimates the underlying cell type proportions when compared to both supervised and unsupervised deconvolution methods including CIBERSORT, TOAST, and NMF. THUNDER will be a useful tool in adjusting for varying cell type composition in population samples, facilitating valid and more powerful downstream analysis such as differential chromatin organization studies. Additionally, THUNDER estimates cell-type-specific chromatin contact profiles for all cell types in bulk Hi-C mixtures. These estimated contact profiles provide a useful exploratory framework to investigate cell-type-specificity of the chromatin interactome while experimental data is still sparse.

Download Full-text

Transit-amplifying cells coordinate changes in intestinal epithelial cell-type composition

10.1101/840371 ◽

2019 ◽

Author(s):

Laura E. Sanman ◽

Ina W. Chen ◽

Jake M. Bieber ◽

Veronica Steri ◽

Byron Hann ◽

...

Keyword(s):

Quantitative Imaging ◽

Cell Types ◽

Culture Conditions ◽

Tissue Cell ◽

Specific Cell ◽

Cell Type ◽

Cell Type Composition ◽

Type Composition ◽

Coordinate Changes

AbstractRenewing tissues have the remarkable ability to continually produce both proliferative progenitor and specialized differentiated cell-types. How are complex milieus of microenvironmental signals interpreted to coordinate tissue cell-type composition? Here, we develop a high-throughput approach that combines organoid technology and quantitative imaging to address this question in the context of the intestinal epithelium. Using this approach, we comprehensively survey enteroid responses to individual and paired perturbations to eight epithelial signaling pathways. We uncover culture conditions that enrich for specific cell-types, including Lgr5+ stem and enteroendocrine cells. We analyze interactions between perturbations and dissect mechanisms underlying an unexpected mutual antagonism between EGFR and IL-4 signals. Finally, we show that, across diverse perturbations, modulating proliferation of transit-amplifying cells also consistently changes the composition of differentiated secretory and absorptive cell-types. This property is conserved in vivo and can arise from differential amplification of secretory and absorptive progenitor cells. Taken together, the observations highlight an underappreciated role for transit-amplifying cells in which proliferation of these short-lived progenitors provides a lineage-based mechanism for tuning differentiated cell-type composition.

Download Full-text

Systematic evaluation of cell-type deconvolution pipelines for sequencing-based bulk DNA methylomes

10.1101/2021.11.29.470374 ◽

2021 ◽

Author(s):

Yunhee Jeong ◽

Reka Toth ◽

Marlene Ganslmeier ◽

Kersten Breuer ◽

Christoph Plass ◽

...

Keyword(s):

Cell Types ◽

Systematic Evaluation ◽

Cell Type ◽

Factors Affecting ◽

Genome Wide ◽

Cell Type Composition ◽

Type Composition ◽

Level Information ◽

Genomic Regions ◽

The Impact

DNA methylation sequencing is becoming increasingly popular, yielding genome-wide methylome data at single-base pair resolution through the novel cost- and labor-optimized protocols. It has tremendous potential for cell-type heterogeneity analysis, particularly in tumors, due to intrinsic read-level information. Although diverse deconvolution methods were developed to infer cell-type composition based on bulk sequencing-based methylomes, their systematic evaluation has not been performed so far. Here, we thoroughly review and evaluate five previously published deconvolution methods: Bayesian epiallele detection (BED), PRISM, csmFinder + coMethy, ClubCpG and MethylPurify, together with two array-based methods, MeDeCom and Houseman as a comparison group. Sequencing-based deconvolution methods consist of two main steps, informative region selection and cell-type composition estimation. Accordingly, we individually assessed the performance of each step and demonstrated the impact of the former step upon the performance of the following one. In conclusion, we demonstrate the best method showing the highest accuracy in different samples, and infer factors affecting cell-type deconvolution performance according to the number of cell types in the mixture. We found that cell-type deconvolution performance is influenced by different factors according to the number of components in the mixture. Whereas selecting similar genomic regions to DMRs generally contributed to increasing the performance in bi-component mixtures, the uniformity of cell-type distribution showed a high correlation with the performance in five cell-type bulk analyses.

Download Full-text

LRcell: detecting the source of differential expression at the sub-cell type level from bulk RNA-seq data

10.1101/2021.08.10.455821 ◽

2021 ◽

Author(s):

Wenjing Ma ◽

Sumeet Sharma ◽

Peng Jin ◽

Shannon L Gourley ◽

Zhaohui Qin

Keyword(s):

Single Cell ◽

Cell Types ◽

Marker Genes ◽

Bioconductor Package ◽

Rna Seq ◽

Cell Type ◽

Reference Dataset ◽

Cell Type Composition ◽

Type Composition ◽

Differential Gene

The rapid proliferation of single-cell RNA-sequencing (scRNA-seq) datasets have revealed cell heterogeneity at unprecedented scales. Several deconvolution methods have been developed to decompose bulk experiments to reveal cell type contributions. However, these methods lack power in identifying the accurate cell type composition when having a considerable amount of sub-cell types in the reference dataset. Here, we present LRcell, a R Bioconductor package (http://bioconductor.org/packages/release/bioc/html/LRcell.html) aiming to identify specific sub-cell type(s) that drives the changes observed in a bulk RNA-seq differential gene expression experiment. In addition, LRcell provides pre-embedded marker genes computed from putative single-cell RNA-seq experiments as options to execute the analyses.

Download Full-text

Comprehensive benchmarking of computational deconvolution of transcriptomics data

10.1101/2020.01.10.897116 ◽

2020 ◽

Cited By ~ 2

Author(s):

Francisco Avila Cobos ◽

José Alquicira-Hernandez ◽

Joseph Powell ◽

Pieter Mestdagh ◽

Katleen De Preter

Keyword(s):

Single Cell ◽

Cell Types ◽

Cell Type ◽

Factors Affecting ◽

Marker Selection ◽

Cell Type Composition ◽

Type Composition ◽

Comparable Performance ◽

Transcriptomics Data ◽

Combined Impact

AbstractMany computational methods to infer cell type proportions from bulk transcriptomics data have been developed. Attempts comparing these methods revealed that the choice of reference marker signatures is far more important than the method itself. However, a thorough evaluation of the combined impact of data transformation, pre-processing, marker selection, cell type composition and choice of methodology on the results is still lacking.Using different single-cell RNA-sequencing (scRNA-seq) datasets, we generated hundreds of pseudo-bulk mixtures to evaluate the combined impact of these factors on the deconvolution results. Along with methods to perform deconvolution of bulk RNA-seq data we also included five methods specifically designed to infer the cell type composition of bulk data using scRNA-seq data as reference.Both bulk and single-cell deconvolution methods perform best when applied to data in linear scale and the choice of normalization can have a dramatic impact on the performance of some, but not all methods. Overall, single-cell methods have comparable performance to the best performing bulk methods and bulk methods based on semi-supervised approaches showed higher error and lower correlation values between the computed and the expected proportions. Moreover, failure to include cell types in the reference that are present in a mixture always led to substantially worse results, regardless of any of the previous choices. Taken together, we provide a thorough evaluation of the combined impact of the different factors affecting the computational deconvolution task across different datasets and propose general guidelines to maximize its performance.

Download Full-text

DNA Methylation Profiles of Purified Cell Types in Bronchoalveolar Lavage: Applications for Mixed Cell Paediatric Pulmonary Studies

Frontiers in Immunology ◽

10.3389/fimmu.2021.788705 ◽

2021 ◽

Vol 12 ◽

Author(s):

Shivanthan Shanthikumar ◽

Melanie R. Neeland ◽

Richard Saffery ◽

Sarath C. Ranganathan ◽

Alicia Oshlack ◽

...

Keyword(s):

Dna Methylation ◽

Bronchoalveolar Lavage ◽

Association Studies ◽

Cell Types ◽

Alveolar Epithelial Cells ◽

Cell Type ◽

Mixed Cell ◽

Alveolar Epithelial ◽

Cell Type Composition ◽

Type Composition

In epigenome-wide association studies analysing DNA methylation from samples containing multiple cell types, it is essential to adjust the analysis for cell type composition. One well established strategy for achieving this is reference-based cell type deconvolution, which relies on knowledge of the DNA methylation profiles of purified constituent cell types. These are then used to estimate the cell type proportions of each sample, which can then be incorporated to adjust the association analysis. Bronchoalveolar lavage is commonly used to sample the lung in clinical practice and contains a mixture of different cell types that can vary in proportion across samples, affecting the overall methylation profile. A current barrier to the use of bronchoalveolar lavage in DNA methylation-based research is the lack of reference DNA methylation profiles for each of the constituent cell types, thus making reference-based cell composition estimation difficult. Herein, we use bronchoalveolar lavage samples collected from children with cystic fibrosis to define DNA methylation profiles for the four most common and clinically relevant cell types: alveolar macrophages, granulocytes, lymphocytes and alveolar epithelial cells. We then demonstrate the use of these methylation profiles in conjunction with an established reference-based methylation deconvolution method to estimate the cell type composition of two different tissue types; a publicly available dataset derived from artificial blood-based cell mixtures and further bronchoalveolar lavage samples. The reference DNA methylation profiles developed in this work can be used for future reference-based cell type composition estimation of bronchoalveolar lavage. This will facilitate the use of this tissue in studies examining the role of DNA methylation in lung health and disease.

Download Full-text

Notch signalling patterns retinal composition by regulating atoh7 during post-embryonic growth

10.1101/363010 ◽

2018 ◽

Cited By ~ 1

Author(s):

Alicia Pérez Saturnino ◽

Katharina Lust ◽

Joachim Wittbrodt

Keyword(s):

De Novo ◽

Cell Types ◽

Notch Signalling ◽

Cell Type ◽

Lineage Specification ◽

Cell Type Composition ◽

Type Composition ◽

Cell Niche ◽

Functional Relevance ◽

Type Specification

AbstractPatterning of a continuously growing naive field in the context of a life-long growing organ, the teleost eye is of highest functional relevance. Intrinsic and extrinsic signals were proposed to regulate lineage specification in progenitors that exit the stem cell niche in the ciliary marginal zone (CMZ). The proper cell type composition arising from those progenitors is prerequisite for retinal function. Our findings in the teleost medaka (Oryzias latipes) uncover that the Notch–Atoh7 axis continuously patterns the CMZ. The complement of cell-types originating from the two juxtaposed progenitors marked by Notch or Atoh7 activity contains all constituents of a retinal column. Modulation of Notch signalling specifically in Atoh7-expressing cells demonstrates the crucial role of this axis in generating the correct cell type proportions. After transiently blocking Notch signalling, retinal patterning and differentiation is reinitiated de novo. Taken together we show that Notch activity in the CMZ continuously structures the growing retina by juxtaposing Notch and Atoh7 progenitors giving rise to distinct, complementary lineages, revealing a coupling of de novo patterning and cell-type specification in the respective lineages.

Download Full-text