scholarly journals Constructing local cell-specific networks from single-cell data

2021 ◽  
Vol 118 (51) ◽  
pp. e2113178118
Author(s):  
Xuran Wang ◽  
David Choi ◽  
Kathryn Roeder

Gene coexpression networks yield critical insights into biological processes, and single-cell RNA sequencing provides an opportunity to target inquiries at the cellular level. However, due to the sparsity and heterogeneity of transcript counts, it is challenging to construct accurate gene networks. We develop an approach, locCSN, that estimates cell-specific networks (CSNs) for each cell, preserving information about cellular heterogeneity that is lost with other approaches. LocCSN is based on a nonparametric investigation of the joint distribution of gene expression; hence it can readily detect nonlinear correlations, and it is more robust to distributional challenges. Although individual CSNs are estimated with considerable noise, average CSNs provide stable estimates of networks, which reveal gene communities better than traditional measures. Additionally, we propose downstream analysis methods using CSNs to utilize more fully the information contained within them. Repeated estimates of gene networks facilitate testing for differences in network structure between cell groups. Notably, with this approach, we can identify differential network genes, which typically do not differ in gene expression, but do differ in terms of the coexpression networks. These genes might help explain the etiology of disease. Finally, to further our understanding of autism spectrum disorder, we examine the evolution of gene networks in fetal brain cells and compare the CSNs of cells sampled from case and control subjects to reveal intriguing patterns in gene coexpression.

2021 ◽  
Author(s):  
Xuran Wang ◽  
David Choi ◽  
Kathryn Roeder

ABSTRACTGene co-expression networks yield critical insights into biological processes, and single-cell RNA sequencing provides an opportunity to target inquiries at the cellular level. However, due to the sparsity and heterogeneity of transcript counts, it is challenging to construct accurate gene networks. We develop an approach that estimates cell-specific networks (CSN) for each cell using a method inspired by Dai et al. (7). Although individual CSNs are estimated with considerable noise, average CSNs provide stable estimates of network structure, which provide better estimates of gene block structures than traditional measures. The method, called locCSN, is based on a non-parametric investigation of the joint distribution of gene expression, hence it can readily detect nonlinear correlations, and it is more robust to distributional challenges. The individual networks preserve information about the heterogeneity of the cells and having repeated estimates of network structure facilitates testing for difference in network structure between groups of cells. The original CSN algorithm showed promise; however, it had shortcomings which locCSN overcomes. Additionally, we propose new downstream analysis methods using CSNs, to utilize more fully the information contained within them. Finally, to further our understanding of autism spectrum disorder we examined the evolution of gene networks in fetal brain cells and compared the CSNs of cells sampled from case and control subjects to reveal intriguing patterns in gene co-expression changes.


2019 ◽  
Author(s):  
Marcus Alvarez ◽  
Elior Rahmani ◽  
Brandon Jew ◽  
Kristina M. Garske ◽  
Zong Miao ◽  
...  

AbstractSingle-nucleus RNA sequencing (snRNA-seq) measures gene expression in individual nuclei instead of cells, allowing for unbiased cell type characterization in solid tissues. Contrary to single-cell RNA seq (scRNA-seq), we observe that snRNA-seq is commonly subject to contamination by high amounts of extranuclear background RNA, which can lead to identification of spurious cell types in downstream clustering analyses if overlooked. We present a novel approach to remove debris-contaminated droplets in snRNA-seq experiments, called Debris Identification using Expectation Maximization (DIEM). Our likelihood-based approach models the gene expression distribution of debris and cell types, which are estimated using EM. We evaluated DIEM using three snRNA-seq data sets: 1) human differentiating preadipocytes in vitro, 2) fresh mouse brain tissue, and 3) human frozen adipose tissue (AT) from six individuals. All three data sets showed various degrees of extranuclear RNA contamination. We observed that existing methods fail to account for contaminated droplets and led to spurious cell types. When compared to filtering using these state of the art methods, DIEM better removed droplets containing high levels of extranuclear RNA and led to higher quality clusters. Although DIEM was designed for snRNA-seq data, we also successfully applied DIEM to single-cell data. To conclude, our novel method DIEM removes debris-contaminated droplets from single-cell-based data fast and effectively, leading to cleaner downstream analysis. Our code is freely available for use at https://github.com/marcalva/diem.


2021 ◽  
Vol 12 (1) ◽  
Author(s):  
Yuxuan Liu ◽  
Zhimin Gu ◽  
Hui Cao ◽  
Pranita Kaphle ◽  
Junhua Lyu ◽  
...  

AbstractCancers develop from the accumulation of somatic mutations, yet it remains unclear how oncogenic lesions cooperate to drive cancer progression. Using a mouse model harboring NRasG12D and EZH2 mutations that recapitulates leukemic progression, we employ single-cell transcriptomic profiling to map cellular composition and gene expression alterations in healthy or diseased bone marrows during leukemogenesis. At cellular level, NRasG12D induces myeloid lineage-biased differentiation and EZH2-deficiency impairs myeloid cell maturation, whereas they cooperate to promote myeloid neoplasms with dysregulated transcriptional programs. At gene level, NRasG12D and EZH2-deficiency independently and synergistically deregulate gene expression. We integrate results from histopathology, leukemia repopulation, and leukemia-initiating cell assays to validate transcriptome-based cellular profiles. We use this resource to relate developmental hierarchies to leukemia phenotypes, evaluate oncogenic cooperation at single-cell and single-gene levels, and identify GEM as a regulator of leukemia-initiating cells. Our studies establish an integrative approach to deconvolute cancer evolution at single-cell resolution in vivo.


2020 ◽  
Author(s):  
Manuel Göpferich ◽  
Nikhil Oommen George ◽  
Ana Domingo Muelas ◽  
Alex Bizyn ◽  
Rosa Pascual ◽  
...  

SUMMARYAutism spectrum disorder (ASD) is a neurodevelopmental disease affecting social behavior. Many of the high-confident ASD risk genes relate to mRNA translation. Specifically, many of these genes are involved in regulation of gene expression for subcellular compartmentalization of proteins1. Cis-regulatory motifs that often localize to 3’- and 5’-untranslated regions (UTRs) offer an additional path for posttranscriptional control of gene expression. Alternative cleavage and polyadenylation (APA) affect 3’UTR length thereby influencing the presence or absence of regulatory elements. However, APA has not yet been addressed in the context of neurodevelopmental disorders. Here we used single cell 3’end sequencing to examine changes in 3’UTRs along the differentiation from neural stem cells (NSCs) to neuroblasts within the adult brain. We identified many APA events in genes involved in neurodevelopment, many of them being high confidence ASD risk genes. Further, analysis of 3’UTR lengths in single cells from ASD and healthy individuals detected longer 3’UTRs in ASD patients. Motif analysis of modulated 3’UTRs in the mouse adult neurogenic lineage and ASD-patients revealed enrichment of the cytoplasmic and polyadenylation element (CPE). This motif is bound by CPE binding protein 4 (CPEB4). In human and mouse data sets we observed co-regulation of CPEB4 and the CPEB-binding synaptic adhesion molecule amyloid beta precursor-like protein 1 (APLP1). We show that mice deficient in APLP1 show aberrant regulation of APA, decreased number of neural stem cells, and autistic-like traits. Our findings indicate that APA is used for control of gene expression along neuronal differentiation and is altered in ASD patients.


2021 ◽  
Author(s):  
Dmitry Velmeshev ◽  
Manideep Chavali ◽  
Tomasz Jan Nowakowski ◽  
Mohini Bhade ◽  
Simone Mayer ◽  
...  

Cortical interneurons are indispensable for proper function of neocortical circuits. Changes in interneuron development and function are implicated in human disorders, such as autism spectrum disorder and epilepsy. In order to understand human-specific features of cortical development as well as the origins of neurodevelopmental disorders it is crucial to identify the molecular programs underlying human interneuron development and subtype specification. Recent studies have explored gene expression programs underlying mouse interneuron specification and maturation. We applied single-cell RNA sequencing to samples of second trimester human ganglionic eminence and developing cortex to identify molecularly defined subtypes of human interneuron progenitors and immature interneurons. In addition, we integrated this data from the developing human ganglionic eminences and neocortex with single-nucleus RNA-seq of adult cortical interneurons in order to elucidate dynamic molecular changes associated with commitment of progenitors and immature interneurons to mature interneuron subtypes. By comparing our data with published mouse single-cell genomic data, we discover a number of divergent gene expression programs that distinguish human interneuron progenitors from mouse. Moreover, we find that a number of transcription factors expressed during prenatal development become restricted to adult interneuron subtypes in the human but not the mouse, and these adult interneurons express species- and lineage-specific cell adhesion and synaptic genes. Therefore, our study highlights that despite the similarity of main principles of cortical interneuron development and lineage commitment between mouse and human, human interneuron genesis and subtype specification is guided by species-specific gene programs, contributing to human-specific features of cortical inhibitory interneurons.


2018 ◽  
Vol 29 (8) ◽  
pp. 2060-2068 ◽  
Author(s):  
Nikos Karaiskos ◽  
Mahdieh Rahmatollahi ◽  
Anastasiya Boltengagen ◽  
Haiyue Liu ◽  
Martin Hoehne ◽  
...  

Background Three different cell types constitute the glomerular filter: mesangial cells, endothelial cells, and podocytes. However, to what extent cellular heterogeneity exists within healthy glomerular cell populations remains unknown.Methods We used nanodroplet-based highly parallel transcriptional profiling to characterize the cellular content of purified wild-type mouse glomeruli.Results Unsupervised clustering of nearly 13,000 single-cell transcriptomes identified the three known glomerular cell types. We provide a comprehensive online atlas of gene expression in glomerular cells that can be queried and visualized using an interactive and freely available database. Novel marker genes for all glomerular cell types were identified and supported by immunohistochemistry images obtained from the Human Protein Atlas. Subclustering of endothelial cells revealed a subset of endothelium that expressed marker genes related to endothelial proliferation. By comparison, the podocyte population appeared more homogeneous but contained three smaller, previously unknown subpopulations.Conclusions Our study comprehensively characterized gene expression in individual glomerular cells and sets the stage for the dissection of glomerular function at the single-cell level in health and disease.


2020 ◽  
Author(s):  
Matthew N. Bernstein ◽  
Zijian Ni ◽  
Michael Collins ◽  
Mark E. Burkard ◽  
Christina Kendziorski ◽  
...  

AbstractBackgroundSingle-cell RNA-seq (scRNA-seq) enables the profiling of genome-wide gene expression at the single-cell level and in so doing facilitates insight into and information about cellular heterogeneity within a tissue. Perhaps nowhere is this more important than in cancer, where tumor and tumor microenvironment heterogeneity directly impact development, maintenance, and progression of disease. While publicly available scRNA-seq cancer datasets offer unprecedented opportunity to better understand the mechanisms underlying tumor progression, metastasis, drug resistance, and immune evasion, much of the available information has been underutilized, in part, due to the lack of tools available for aggregating and analysing these data.ResultsWe present CHARacterizing Tumor Subpopulations (CHARTS), a computational pipeline and web application for analyzing, characterizing, and integrating publicly available scRNA-seq cancer datasets. CHARTS enables the exploration of individual gene expression, cell type, malignancy-status, differentially expressed genes, and gene set enrichment results in subpopulations of cells across multiple tumors and datasets.ConclusionCHARTS is an easy to use, comprehensive platform for exploring single-cell subpopulations within tumors across the ever-growing collection of public scRNA-seq cancer datasets. CHARTS is freely available at charts.morgridge.org.


2021 ◽  
Author(s):  
Jinyue Liao ◽  
Hoi Ching Suen ◽  
Shitao Rao ◽  
Alfred Chun Shui Luk ◽  
Ruoyu Zhang ◽  
...  

AbstractSpermatogenesis depends on an orchestrated series of developing events in germ cells and full maturation of the somatic microenvironment. To date, the majority of efforts to study cellular heterogeneity in testis has been focused on single-cell gene expression rather than the chromatin landscape shaping gene expression. To advance our understanding of the regulatory programs underlying testicular cell types, we analyzed single-cell chromatin accessibility profiles in more than 25,000 cells from mouse developing testis. We showed that scATAC-Seq allowed us to deconvolve distinct cell populations and identify cis-regulatory elements (CREs) underlying cell type specification. We identified sets of transcription factors associated with cell type-specific accessibility, revealing novel regulators of cell fate specification and maintenance. Pseudotime reconstruction revealed detailed regulatory dynamics coordinating the sequential developmental progressions of germ cells and somatic cells. This high-resolution data also revealed putative stem cells within the Sertoli and Leydig cell populations. Further, we defined candidate target cell types and genes of several GWAS signals, including those associated with testosterone levels and coronary artery disease. Collectively, our data provide a blueprint of the ‘regulon’ of the mouse male germline and supporting somatic cells.


2019 ◽  
Author(s):  
Chiaowen Joyce Hsiao ◽  
PoYuan Tung ◽  
John D. Blischak ◽  
Jonathan E. Burnett ◽  
Kenneth A. Barr ◽  
...  

AbstractCellular heterogeneity in gene expression is driven by cellular processes such as cell cycle and cell-type identity, and cellular environment such as spatial location. The cell cycle, in particular, is thought to be a key driver of cell-to-cell heterogeneity in gene expression, even in otherwise homogeneous cell populations. Recent advances in single-cell RNA-sequencing (scRNA-seq) facilitate detailed characterization of gene expression heterogeneity, and can thus shed new light on the processes driving heterogeneity. Here, we combined fluorescence imaging with scRNA-seq to measure cell cycle phase and gene expression levels in human induced pluripotent stem cells (iPSCs). Using these data, we developed a novel approach to characterize cell cycle progression. While standard methods assign cells to discrete cell cycle stages, our method goes beyond this, and quantifies cell cycle progression on a continuum. We found that, on average, scRNA-seq data from only five genes predicted a cell’s position on the cell cycle continuum to within 14% of the entire cycle, and that using more genes did not improve this accuracy. Our data and predictor of cell cycle phase can directly help future studies to account for cell-cycle-related heterogeneity in iPSCs. Our results and methods also provide a foundation for future work to characterize the effects of the cell cycle on expression heterogeneity in other cell types.


2016 ◽  
Author(s):  
Catalina A Vallejos ◽  
Sylvia Richardson ◽  
John C Marioni

Single-cell RNA sequencing (scRNA-seq) can be used to characterise differences in gene expression patterns between pre-specified populations of cells. Traditionally, differential expression tools are restricted to the study of changes in overall expression between cell populations. However, such analyses do not take full advantage of the rich information provided by scRNA-seq. In this article, we present a Bayesian hierarchical model which can be used to study changes in expression that lie beyond comparisons of means. In particular, our method can highlight genes that undergo changes in cell-to-cell heterogeneity between the populations but whose overall expression is preserved. Evidence supporting these changes is quantified using a probabilistic approach based on tail posterior probabilities, where a probability cut-off is calibrated through the expected false discovery rate. Our method incorporates a built-in normalisation strategy and quantifies technical artefacts by borrowing information from technical spike-in genes. Control experiments validate the performance of our approach. Finally, we compare expression patterns of mouse embryonic stem cells between different stages of the cell cycle, revealing substantial differences in cellular heterogeneity.


Sign in / Sign up

Export Citation Format

Share Document