scholarly journals Codependency and mutual exclusivity for gene community detection from sparse single-cell transcriptome data

2021 ◽  
Author(s):  
Natsu Nakajima ◽  
Tomoatsu Hayashi ◽  
Katsunori Fujiki ◽  
Katsuhiko Shirahige ◽  
Tetsu Akiyama ◽  
...  

Single-cell RNA-seq (scRNA-seq) can be used to characterize cellular heterogeneity in thousands of cells. The reconstruction of a gene network based on coexpression patterns is a fundamental task in scRNA-seq analyses, and the mutual exclusivity of gene expression can be critical for understanding such heterogeneity. Here, we propose an approach for detecting communities from a genetic network constructed on the basis of coexpression properties. The community-based comparison of multiple coexpression networks enables the identification of functionally related gene clusters that cannot be fully captured through differential gene expression-based analysis. We also developed a novel metric referred to as the exclusively expressed index (EEI) that identifies mutually exclusive gene pairs from sparse scRNA-seq data. EEI quantifies and ranks the exclusive expression levels of all gene pairs from binary expression patterns while maintaining robustness against a low sequencing depth. We applied our methods to glioblastoma scRNA-seq data and found that gene communities were partially conserved after serum stimulation despite a considerable number of differentially expressed genes. We also demonstrate that the identification of mutually exclusive gene sets with EEI can improve the sensitivity of capturing cellular heterogeneity. Our methods complement existing approaches and provide new biological insights, even for a large, sparse dataset, in the single-cell analysis field.

2021 ◽  
Vol 12 (1) ◽  
Author(s):  
Rongxin Fang ◽  
Sebastian Preissl ◽  
Yang Li ◽  
Xiaomeng Hou ◽  
Jacinta Lucero ◽  
...  

AbstractIdentification of the cis-regulatory elements controlling cell-type specific gene expression patterns is essential for understanding the origin of cellular diversity. Conventional assays to map regulatory elements via open chromatin analysis of primary tissues is hindered by sample heterogeneity. Single cell analysis of accessible chromatin (scATAC-seq) can overcome this limitation. However, the high-level noise of each single cell profile and the large volume of data pose unique computational challenges. Here, we introduce SnapATAC, a software package for analyzing scATAC-seq datasets. SnapATAC dissects cellular heterogeneity in an unbiased manner and map the trajectories of cellular states. Using the Nyström method, SnapATAC can process data from up to a million cells. Furthermore, SnapATAC incorporates existing tools into a comprehensive package for analyzing single cell ATAC-seq dataset. As demonstration of its utility, SnapATAC is applied to 55,592 single-nucleus ATAC-seq profiles from the mouse secondary motor cortex. The analysis reveals ~370,000 candidate regulatory elements in 31 distinct cell populations in this brain region and inferred candidate cell-type specific transcriptional regulators.


2016 ◽  
Author(s):  
Catalina A Vallejos ◽  
Sylvia Richardson ◽  
John C Marioni

Single-cell RNA sequencing (scRNA-seq) can be used to characterise differences in gene expression patterns between pre-specified populations of cells. Traditionally, differential expression tools are restricted to the study of changes in overall expression between cell populations. However, such analyses do not take full advantage of the rich information provided by scRNA-seq. In this article, we present a Bayesian hierarchical model which can be used to study changes in expression that lie beyond comparisons of means. In particular, our method can highlight genes that undergo changes in cell-to-cell heterogeneity between the populations but whose overall expression is preserved. Evidence supporting these changes is quantified using a probabilistic approach based on tail posterior probabilities, where a probability cut-off is calibrated through the expected false discovery rate. Our method incorporates a built-in normalisation strategy and quantifies technical artefacts by borrowing information from technical spike-in genes. Control experiments validate the performance of our approach. Finally, we compare expression patterns of mouse embryonic stem cells between different stages of the cell cycle, revealing substantial differences in cellular heterogeneity.


2021 ◽  
Author(s):  
Fang Ye ◽  
Guodong Zhang ◽  
Weigao E ◽  
Haide Chen ◽  
Chengxuan Yu ◽  
...  

Abstract The Mexican axolotl (Ambystoma mexicanum) is a promising tetrapod model for regeneration and developmental studies. Remarkably, neotenic axolotls may undergo metamorphosis, during which their regeneration capacity and lifespan gradually decline. However, a system-level single-cell analysis of molecular characteristics in neotenic and metamorphosed axolotls is still lacking. Here, we developed a single-cell RNA-seq method based on combinatorial hybridization to generate a tissue-based transcriptomic atlas of the adult axolotl. We performed gene expression profiling of over 1 million single cells across 19 tissues to construct the first adult axolotl cell atlas. Comparison of single-cell transcriptomes between the tissues of neotenic and metamorphosed axolotls revealed the heterogeneity of structural cells in different tissues and established their regulatory network. Furthermore, we described dynamic gene expression patterns during limb development in neotenic axolotls. These data serve as a resource to explore the molecular identity of the axolotl as well as its metamorphosis.


2021 ◽  
Author(s):  
Manuel Neumann ◽  
Xiaocai Xu ◽  
Cezary Smaczniak ◽  
Julia Schumacher ◽  
Wenhao Yan ◽  
...  

Identity and functions of plant cells are influenced by their precise cellular location within the plant body. Cellular heterogeneity in growth and differentiation trajectories results in organ patterning. Therefore, assessing this heterogeneity at molecular scale is a major question in developmental biology. Single-cell transcriptomics (scRNA-seq) allows to characterize and quantify gene expression heterogeneity in developing organs at unprecedented resolution. However, the original physical location of the cell is lost during the scRNA-seq procedure. To recover the original location of cells is essential to link gene activity with cellular function and morphology. Here, we reconstruct genome-wide gene expression patterns of individual cells in a floral meristem by combining single-nuclei RNA-seq with 3D spatial reconstruction. By this, gene expression differences among meristematic domains giving rise to different tissue and organ types can be determined. As a proof of principle, the data are used to trace the initiation of vascular identity within the floral meristem. Our work demonstrates the power of spatially reconstructed single cell transcriptome atlases to understand plant morphogenesis. The floral meristem 3D gene expression atlas can be accessed at http://threed-flower-meristem.herokuapp.com


2020 ◽  
Vol 176 (2) ◽  
pp. 396-409
Author(s):  
Kelly M Bakulski ◽  
John F Dou ◽  
Robert C Thompson ◽  
Christopher Lee ◽  
Lauren Y Middleton ◽  
...  

Abstract Lead (Pb) exposure is ubiquitous with permanent neurodevelopmental effects. The hippocampus brain region is involved in learning and memory with heterogeneous cellular composition. The hippocampus cell type-specific responses to Pb are unknown. The objective of this study is to examine perinatal Pb treatment effects on adult hippocampus gene expression, at the level of individual cells. In mice perinatally exposed to control water or a human physiologically relevant level (32 ppm in maternal drinking water) of Pb, 2 weeks prior to mating through weaning, we tested for hippocampus gene expression and cellular differences at 5 months of age. We sequenced RNA from 5258 hippocampal cells to (1) test for treatment gene expression differences averaged across all cells, (2) compare cell cluster composition by treatment, and (3) test for treatment gene expression and pathway differences within cell clusters. Gene expression patterns revealed 12 hippocampus cell clusters, mapping to major expected cell types (eg, microglia, astrocytes, neurons, and oligodendrocytes). Perinatal Pb treatment was associated with 12.4% more oligodendrocytes (p = 4.4 × 10−21) in adult mice. Across all cells, Pb treatment was associated with expression of cell cluster marker genes. Within cell clusters, Pb treatment (q < 0.05) caused differential gene expression in endothelial, microglial, pericyte, and astrocyte cells. Pb treatment upregulated protein folding pathways in microglia (p = 3.4 × 10−9) and stress response in oligodendrocytes (p = 3.2 × 10−5). Bulk tissue analysis may be influenced by changes in cell type composition, obscuring effects within vulnerable cell types. This study serves as a biological reference for future single-cell toxicant studies, to ultimately characterize molecular effects on cognition and behavior.


2021 ◽  
Author(s):  
Douglas Arneson ◽  
Guanglin Zhang ◽  
In Sook Ahn ◽  
Zhe Ying ◽  
Graciel Diamante ◽  
...  

Abstract The etiology of mild traumatic brain injury (mTBI) remains elusive due to the tissue and cellular heterogeneity of the affected brain regions that underlie cognitive impairments and subsequent neurological disorders. This complexity is further exacerbated by disrupted circuits within and between cell populations across brain regions and the periphery, which occur at different timescales and in spatial domains. We profiled three tissues (hippocampus, frontal cortex, and blood leukocytes) at the acute (24hr) and chronic (7days) phases of mTBI at single cell resolution and demonstrated that the coordinated gene expression patterns across cell types were disrupted and re-organized by TBI at different timescales with distinct regional and cellular patterns. Gene expression-based network modeling identified astrocytes as a key regulator of the cell-cell coordination following mTBI in both hippocampus and frontal cortex across timepoints, and mt-Rnr2, which encodes the mitochondrial peptide humanin, as a potential target for intervention based on its broad regional and dynamic dysregulation following mTBI. Treatment of a murine mTBI model with humanin reversed cognitive impairment caused by mTBI through the restoration of metabolic pathways within astrocytes. Our results offer a systems-level understanding of the dynamic and spatial regulation of gene programs by mTBI and pinpoint key target genes, pathways, and cell circuits that are amenable to therapeutics.


2021 ◽  
Author(s):  
Nisha R. Iyer ◽  
Junha Shin ◽  
Stephanie Cuskey ◽  
Yucheng Tian ◽  
Noah R. Nichol ◽  
...  

Our inability to derive the vast neuronal diversity of the posterior central nervous system (pCNS) using human pluripotent stem cells (hPSCs) poses a major impediment to understanding human neurodevelopment and disease in the hindbrain and spinal cord. Here we establish a modular differentiation paradigm that recapitulates patterning along both the rostrocaudal (R/C) and dorsoventral (D/V) axes of the pCNS, enabling derivation of any neuronal phenotype with discrete regional specificity. First, neuromesodermal progenitors (NMPs) with discrete Hox profiles are efficiently converted to pCNS progenitors (pCNSPs). Then by tuning D/V signaling, pCNSPs are directed to ventral Shh-dependent MNs (MNs) and locomotor interneurons (INs) or dorsal TGF-β-dependent proprioceptive INs and TGF-β-independent sensory INs. We applied D/V protocols to NMPs spanning the R/C axis for expansive single-cell RNA-sequencing (scRNAseq) analysis. By implementing a novel computational pipeline comprising sparse non-negative matrix factorization, consensus clustering, and combinatorial gene expression pattern identification, we detect hundreds of transcriptional markers within region-specific neuronal phenotypes, enabling discovery of gene expression patterns along the developmental axes. These findings highlight the potential of these resources to advance a mechanistic understanding of pCNS development, expand the potential and accuracy of in vitro models, and inform novel regenerative therapeutic strategies.


2021 ◽  
Vol 22 (S3) ◽  
Author(s):  
Yuanyuan Li ◽  
Ping Luo ◽  
Yi Lu ◽  
Fang-Xiang Wu

Abstract Background With the development of the technology of single-cell sequence, revealing homogeneity and heterogeneity between cells has become a new area of computational systems biology research. However, the clustering of cell types becomes more complex with the mutual penetration between different types of cells and the instability of gene expression. One way of overcoming this problem is to group similar, related single cells together by the means of various clustering analysis methods. Although some methods such as spectral clustering can do well in the identification of cell types, they only consider the similarities between cells and ignore the influence of dissimilarities on clustering results. This methodology may limit the performance of most of the conventional clustering algorithms for the identification of clusters, it needs to develop special methods for high-dimensional sparse categorical data. Results Inspired by the phenomenon that same type cells have similar gene expression patterns, but different types of cells evoke dissimilar gene expression patterns, we improve the existing spectral clustering method for clustering single-cell data that is based on both similarities and dissimilarities between cells. The method first measures the similarity/dissimilarity among cells, then constructs the incidence matrix by fusing similarity matrix with dissimilarity matrix, and, finally, uses the eigenvalues of the incidence matrix to perform dimensionality reduction and employs the K-means algorithm in the low dimensional space to achieve clustering. The proposed improved spectral clustering method is compared with the conventional spectral clustering method in recognizing cell types on several real single-cell RNA-seq datasets. Conclusions In summary, we show that adding intercellular dissimilarity can effectively improve accuracy and achieve robustness and that improved spectral clustering method outperforms the traditional spectral clustering method in grouping cells.


Sign in / Sign up

Export Citation Format

Share Document