Targeted Single-Cell RNA-seq Identifies Minority Cell Types of Kidney Distal Nephron

Background: Proximal tubule cells dominate the kidney parenchyma numerically, although less abundant cell types of the distal nephron have disproportionate roles in water and electrolyte balance. Methods: A FACS-based enrichment protocol coupled with single-cell RNA-seq was used to profile the transcriptomes of 9099 cells from the cortical thick ascending limb (CTAL)/distal convoluted tubule (DCT) region of the mouse nephron. Results: Unsupervised clustering revealed Slc12a3+/Pvalb+ and Slc12a3+/Pvalb− cells, identified as DCT1 and DCT2 cells, respectively. DCT1 cells appear to be heterogeneous, with orthogonally variable expression of Slc8a1, Calb1, and Ckb. An additional DCT1 subcluster showed marked enrichment of cell cycle–/cell proliferation–associated mRNAs (e.g., Mki67, Stmn1, and Top2a), which fits with the known plasticity of DCT cells. No DCT2-specific transcripts were found. DCT2 cells differ from DCT1 cells in their expression of epithelial sodium channel β- and γ-subunits and much stronger expression of transcripts associated with calcium transport (Trpv5, Calb1, S100g, and Slc8a1). Additionally, scRNA-seq identified three distinct CTAL (Slc12a1+) cell subtypes. One of these expressed Nos1 and Avpr1a, consistent with macula densa cells. The other two CTAL clusters were distinguished by Cldn10 and Ptger3 in one and Cldn16 and Foxq1 in the other. These two CTAL cell types were also distinguished by expression of alternative Iroquois homeobox transcription factors, with Irx1 and Irx2 in the Cldn10+ CTAL cells and Irx3 in the Cldn16+ CTAL cells. Conclusions: Single-cell transcriptomics revealed unexpected diversity among the cells of the distal nephron in mouse. Web-based data resources are provided for the single-cell data.
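
The marker combinations named in this abstract can be read as a small decision rule. The Python sketch below is only an illustration of that logic: the study itself relied on unsupervised clustering rather than per-cell thresholds, and the function name `label_distal_cell`, the `threshold` parameter, and the "unresolved"/"unassigned" fallback labels are assumptions introduced here.

```python
def label_distal_cell(expr, threshold=0.0):
    """Assign a coarse label from marker presence.

    expr: dict mapping gene symbol -> expression value for one cell.
    threshold: hypothetical cutoff above which a gene counts as expressed.
    """
    def on(gene):
        return expr.get(gene, 0.0) > threshold

    # Slc12a3+ cells are DCT; Pvalb separates DCT1 from DCT2.
    if on("Slc12a3"):
        return "DCT1" if on("Pvalb") else "DCT2"

    # Slc12a1+ cells are CTAL; three subtypes are described.
    if on("Slc12a1"):
        if on("Nos1") and on("Avpr1a"):
            return "macula densa"
        if on("Cldn10"):
            return "CTAL (Cldn10+)"
        if on("Cldn16"):
            return "CTAL (Cldn16+)"
        return "CTAL (unresolved)"

    return "unassigned"


# Example call with made-up expression values.
example = {"Slc12a1": 4.2, "Cldn16": 1.1, "Irx3": 0.8}
print(label_distal_cell(example))
```

A rule like this is useful mainly as a sanity check on cluster annotations; it ignores dropout, so a cell with a missed Pvalb read would be labeled DCT2.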

ASAP: A web-based platform for the analysis and interactive visualization of single-cell RNA-seq data

10.1101/096222 ◽

2016 ◽

Cited By ~ 5

Author(s):

Vincent Gardeux ◽

Fabrice David ◽

Adrian Shajkofci ◽

Petra C Schwalie ◽

Bart Deplancke

Keyword(s):

Single Cell ◽

Single Cell Analysis ◽

Transcriptome Profiling ◽

Cell Types ◽

Complete Analysis ◽

Marker Genes ◽

Specific Marker ◽

Rna Seq ◽

Web Based ◽

Wide Range

Abstract. Motivation: Single-cell RNA-sequencing (scRNA-seq) allows whole transcriptome profiling of thousands of individual cells, enabling the molecular exploration of tissues at the cellular level. Such analytical capacity is of great interest to many research groups in the world, yet these groups often lack the expertise to handle complex scRNA-seq data sets. Results: We developed a fully integrated, web-based platform aimed at the complete analysis of scRNA-seq data post genome alignment: from the parsing, filtering, and normalization of the input count data files, to the visual representation of the data, identification of cell clusters, differentially expressed genes (including cluster-specific marker genes), and functional gene set enrichment. This Automated Single-cell Analysis Pipeline (ASAP) combines a wide range of commonly used algorithms with sophisticated visualization tools. Compared with existing scRNA-seq analysis platforms, researchers (including those lacking computational expertise) are able to interact with the data in a straightforward fashion and in real time. Furthermore, given the overlap between scRNA-seq and bulk RNA-seq analysis workflows, ASAP should conceptually be broadly applicable to any RNA-seq dataset. As a validation, we demonstrate how we can use ASAP to simply reproduce the results from a single-cell study of 91 mouse cells involving five distinct cell types. Availability: The tool is freely available at http://[email protected]
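
The first stages this abstract lists (filtering and normalization of count data) can be sketched in a few lines of Python. This is not ASAP's code: the platform offers several algorithms, and the abstract does not say which are used by default. The sketch assumes one common choice, counts-per-million scaling followed by a log(1 + x) transform, and the names `filter_and_normalize`, `min_total`, and `scale` are hypothetical.

```python
import math


def filter_and_normalize(counts, min_total=1, scale=1_000_000):
    """Drop low-count cells, then log-normalize the rest.

    counts: dict mapping cell id -> dict of gene -> raw count.
    min_total: hypothetical minimum total count for a cell to be kept.
    scale: library-size scaling factor (1e6 gives counts per million).
    """
    normalized = {}
    for cell, genes in counts.items():
        total = sum(genes.values())
        if total < min_total:
            continue  # filtering step: discard cells with too few counts
        # normalization step: scale by library size, then log1p
        normalized[cell] = {
            gene: math.log1p(count / total * scale)
            for gene, count in genes.items()
        }
    return normalized


# Made-up toy input: cell "b" has no counts and is filtered out.
toy = {"a": {"g1": 1, "g2": 3}, "b": {"g1": 0, "g2": 0}}
result = filter_and_normalize(toy)
```

Clustering, differential expression, and enrichment would then operate on the normalized values.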

Targeted single-cell RNA-seq identifies minority cell types of kidney distal nephron that regulate blood pressure and calcium balance

10.1101/2020.07.19.209627 ◽

2020 ◽

Author(s):

Lihe Chen ◽

Chun-Lin Chou ◽

Mark A. Knepper

Keyword(s):

Single Cell ◽

Collecting Duct ◽

Cell Types ◽

Macula Densa ◽

Calcium Balance ◽

Rna Seq ◽

Modern Biology ◽

Data Resource ◽

Variable Expression ◽

Cell Diversity

Abstract: A major objective in modern biology is generation of comprehensive atlases of various organs identifying all cell types and their expressed genes. In kidney, extensive data exist for proximal tubule and collecting duct cells, but not for non-abundant intermediate epithelial cell types. Here, we coupled a FACS-enrichment protocol with single-cell RNA-seq analysis to profile the transcriptomes of 9099 cells from the nephron region adjacent to the macula densa. Clusters containing Slc12a3+/Pvalb+ and Slc12a3+/Pvalb- cells were identified as DCT1 and DCT2 cells. The DCT1 cells appear to be heterogeneously associated with variable expression of Slc8a1, Calb1, and Ckb among other mRNAs. No DCT2-specific transcripts were found. The analysis also identified two distinct cell types in the Slc12a1+ portion of Henle’s loop as well as Nos1+/Avpr1a+ macula densa cells. Thus, we identify unexpected cell diversity in the intermediate region of the nephron and create a web-based data resource for these cells.

treeclimbR pinpoints the data-dependent resolution of hierarchical hypotheses

Genome Biology ◽

10.1186/s13059-021-02368-1 ◽

2021 ◽

Vol 22 (1) ◽

Author(s):

Ruizhu Huang ◽

Charlotte Soneson ◽

Pierre-Luc Germain ◽

Thomas S.B. Schmidt ◽

Christian Von Mering ◽

...

Keyword(s):

Single Cell ◽

Synthetic Data ◽

Cell Types ◽

Data Driven ◽

Rna Seq ◽

Hierarchical Trees

Abstract: treeclimbR is a method for analyzing hierarchical trees of entities, such as phylogenies or cell types, at different resolutions. It proposes multiple candidates that capture the latent signal and pinpoints branches or leaves that contain features of interest, in a data-driven way. It outperforms currently available methods on synthetic data, and we highlight the approach on various applications, including microbiome and microRNA surveys as well as single-cell cytometry and RNA-seq datasets. With the emergence of various multi-resolution genomic datasets, treeclimbR provides a thorough inspection of entities across resolutions and gives additional flexibility to uncover biological associations.

Annotating cell types in human single-cell RNA-seq data with CellO

STAR Protocols ◽

10.1016/j.xpro.2021.100705 ◽

2021 ◽

Vol 2 (3) ◽

pp. 100705

Author(s):

Matthew N. Bernstein ◽

Colin N. Dewey

Keyword(s):

Single Cell ◽

Cell Types ◽

Rna Seq

Systematic comparison of high-throughput single-cell RNA-seq methods for immune cell profiling

BMC Genomics ◽

10.1186/s12864-020-07358-4 ◽

2021 ◽

Vol 22 (1) ◽

Author(s):

Tracy M. Yamawaki ◽

Daniel R. Lu ◽

Daniel C. Ellwanger ◽

Dev Bhatt ◽

Paolo Manzanillo ◽

...

Keyword(s):

Single Cell ◽

High Throughput ◽

Immune Cell ◽

Cell Types ◽

Data Interpretation ◽

Detection Sensitivity ◽

Rna Seq ◽

Cell Recovery

Abstract. Background: Elucidation of immune populations with single-cell RNA-seq has greatly benefited the field of immunology by deepening the characterization of immune heterogeneity and leading to the discovery of new subtypes. However, single-cell methods inherently suffer from limitations in the recovery of complete transcriptomes due to the prevalence of cellular and transcriptional dropout events. This issue is often compounded by limited sample availability and limited prior knowledge of heterogeneity, which can confound data interpretation. Results: Here, we systematically benchmarked seven high-throughput single-cell RNA-seq methods. We prepared 21 libraries under identical conditions of a defined mixture of two human and two murine lymphocyte cell lines, simulating heterogeneity across immune-cell types and cell sizes. We evaluated methods by their cell recovery rate, library efficiency, sensitivity, and ability to recover expression signatures for each cell type. We observed higher mRNA detection sensitivity with the 10x Genomics 5′ v1 and 3′ v3 methods. We demonstrate that these methods have fewer dropout events, which facilitates the identification of differentially expressed genes and improves the concordance of single-cell profiles to immune bulk RNA-seq signatures. Conclusion: Overall, our characterization of immune cell mixtures provides useful metrics, which can guide selection of a high-throughput single-cell RNA-seq method for profiling more complex immune-cell heterogeneity usually found in vivo.

SC1: A web-based single cell RNA-seq analysis pipeline

2018 IEEE 8th International Conference on Computational Advances in Bio and Medical Sciences (ICCABS) ◽

10.1109/iccabs.2018.8542088 ◽

2018 ◽

Cited By ~ 2

Author(s):

Marmar Moussa ◽

Ion I. Mandoiu

Keyword(s):

Single Cell ◽

Rna Seq ◽

Analysis Pipeline ◽

Web Based

JIND: Joint Integration and Discrimination for Automated Single-Cell Annotation

10.1101/2020.10.06.327601 ◽

2020 ◽

Author(s):

Mohit Goyal ◽

Guillermo Serrano ◽

Ilan Shomorony ◽

Mikel Hernaez ◽

Idoia Ochoa

Keyword(s):

Single Cell ◽

Cell Types ◽

Marker Genes ◽

Specific Marker ◽

Rna Seq ◽

Batch Effects ◽

Cell Type ◽

Latent Space ◽

Cell Type Specific ◽

Low Dimensional

Abstract: Single-cell RNA-seq is a powerful tool in the study of the cellular composition of different tissues and organisms. A key step in the analysis pipeline is the annotation of cell types based on the expression of specific marker genes. Since manual annotation is labor-intensive and does not scale to large datasets, several methods for automated cell-type annotation have been proposed based on supervised learning. However, these methods generally require feature extraction and batch alignment prior to classification, and their performance may become unreliable in the presence of cell types with very similar transcriptomic profiles, such as differentiating cells. We propose JIND, a framework for automated cell-type identification based on neural networks that directly learns a low-dimensional representation (latent code) in which cell types can be reliably determined. To account for batch effects, JIND performs a novel asymmetric alignment in which the transcriptomic profile of unseen cells is mapped onto the previously learned latent space, hence avoiding the need to retrain the model whenever a new dataset becomes available. JIND also learns cell-type-specific confidence thresholds to identify and reject cells that cannot be reliably classified. We show on datasets with and without batch effects that JIND classifies cells more accurately than previously proposed methods while rejecting only a small proportion of cells. Moreover, JIND batch alignment is parallelizable, being more than five or six times faster than Seurat integration. Availability: https://github.com/mohit1997/JIND.
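
The rejection step described in this abstract (cell-type-specific confidence thresholds) can be illustrated with a short Python sketch. It does not reproduce JIND's API or its way of learning thresholds; here both the class probabilities and the thresholds are simply supplied by the caller, and the names `classify_with_rejection`, `thresholds`, and `reject_label` are hypothetical.

```python
def classify_with_rejection(probs, thresholds, reject_label="Unassigned"):
    """Return the most probable cell type, or reject the cell.

    probs: dict mapping cell type -> predicted probability for one cell.
    thresholds: dict mapping cell type -> minimum confidence required
        before a prediction of that type is accepted (assumed given).
    """
    best = max(probs, key=probs.get)
    # A type without a threshold is never accepted (cutoff defaults to 1.0
    # and is compared strictly in that case via the >= on a value < 1).
    if probs[best] >= thresholds.get(best, 1.0):
        return best
    return reject_label


# Made-up example: the same confidence passes for one type, not the other.
cutoffs = {"T cell": 0.6, "Progenitor": 0.9}
print(classify_with_rejection({"T cell": 0.7, "Progenitor": 0.3}, cutoffs))
print(classify_with_rejection({"T cell": 0.3, "Progenitor": 0.7}, cutoffs))
```

Per-type cutoffs allow a stricter bar for types that are easily confused, such as differentiating cells, without rejecting well-separated types.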

Enhancing droplet-based single-nucleus RNA-seq resolution using the semi-supervised machine learning classifier DIEM

10.1101/786285 ◽

2019 ◽

Cited By ~ 4

Author(s):

Marcus Alvarez ◽

Elior Rahmani ◽

Brandon Jew ◽

Kristina M. Garske ◽

Zong Miao ◽

...

Keyword(s):

Gene Expression ◽

Single Cell ◽

Cell Types ◽

Supervised Machine Learning ◽

Data Sets ◽

Rna Seq ◽

Novel Approach ◽

Single Nucleus ◽

Downstream Analysis

Abstract: Single-nucleus RNA sequencing (snRNA-seq) measures gene expression in individual nuclei instead of cells, allowing for unbiased cell type characterization in solid tissues. In contrast to single-cell RNA-seq (scRNA-seq), we observe that snRNA-seq is commonly subject to contamination by high amounts of extranuclear background RNA, which can lead to identification of spurious cell types in downstream clustering analyses if overlooked. We present a novel approach to remove debris-contaminated droplets in snRNA-seq experiments, called Debris Identification using Expectation Maximization (DIEM). Our likelihood-based approach models the gene expression distribution of debris and cell types, which are estimated using EM. We evaluated DIEM using three snRNA-seq data sets: 1) human differentiating preadipocytes in vitro, 2) fresh mouse brain tissue, and 3) human frozen adipose tissue (AT) from six individuals. All three data sets showed various degrees of extranuclear RNA contamination. We observed that existing methods failed to account for contaminated droplets and led to spurious cell types. When compared to filtering using these state-of-the-art methods, DIEM better removed droplets containing high levels of extranuclear RNA and led to higher quality clusters. Although DIEM was designed for snRNA-seq data, we also successfully applied DIEM to single-cell data. To conclude, our novel method DIEM removes debris-contaminated droplets from single-cell-based data fast and effectively, leading to cleaner downstream analysis. Our code is freely available for use at https://github.com/marcalva/diem.
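
The idea of modeling debris and cell types with distributions fitted by EM can be sketched as a plain multinomial mixture. The Python code below is a generic textbook EM, not DIEM's implementation: the abstract does not specify the exact likelihood, initialization, or stopping rule, so the function name `em_multinomial_mixture`, the `pseudocount`, the fixed iteration count, and the toy initialization by total counts are all assumptions.

```python
import math


def em_multinomial_mixture(counts, init_resp, n_iter=25, pseudocount=0.01):
    """Fit a K-component multinomial mixture to droplet count vectors.

    counts: list of per-droplet gene count lists (same length each).
    init_resp: list of per-droplet initial responsibilities (length K).
    Returns (responsibilities, gene profiles, mixing weights).
    """
    n_comp = len(init_resp[0])
    n_genes = len(counts[0])
    resp = [list(row) for row in init_resp]
    for _ in range(n_iter):
        # M-step: mixing weights and per-component gene proportions.
        weights = [sum(r[k] for r in resp) / len(resp) for k in range(n_comp)]
        profiles = []
        for k in range(n_comp):
            totals = [
                pseudocount + sum(r[k] * c[g] for r, c in zip(resp, counts))
                for g in range(n_genes)
            ]
            norm = sum(totals)
            profiles.append([t / norm for t in totals])
        # E-step: posterior of each component per droplet, in log space.
        # The multinomial coefficient is the same for every component
        # and cancels on normalization, so it is omitted.
        for i, c in enumerate(counts):
            log_post = [
                math.log(weights[k] + 1e-12)
                + sum(c[g] * math.log(profiles[k][g]) for g in range(n_genes))
                for k in range(n_comp)
            ]
            top = max(log_post)
            unnorm = [math.exp(lp - top) for lp in log_post]
            norm = sum(unnorm)
            resp[i] = [u / norm for u in unnorm]
    return resp, profiles, weights


# Made-up toy data: component 0 plays the role of debris, 1 of nuclei.
droplets = [[9, 1, 0], [8, 2, 0], [10, 0, 1], [1, 2, 30], [0, 3, 40], [2, 1, 25]]
# Hypothetical initialization: low-total droplets start as debris.
start = [[1.0, 0.0] if sum(d) <= 15 else [0.0, 1.0] for d in droplets]
resp, profiles, weights = em_multinomial_mixture(droplets, start)
```

Droplets whose posterior favors the debris component would then be removed before clustering.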

CellO: Comprehensive and hierarchical cell type classification of human cells with the Cell Ontology

10.1101/634097 ◽

2019 ◽

Cited By ~ 1

Author(s):

Matthew N. Bernstein ◽

Zhongjie Ma ◽

Michael Gleicher ◽

Colin N. Dewey

Keyword(s):

Single Cell ◽

Web Application ◽

Cell Types ◽

Rna Seq ◽

Cell Type ◽

Training Set ◽

Sequence Read Archive ◽

Cell Ontology ◽

Cell Type Specific ◽

Type Classification

Summary: Cell type annotation is a fundamental task in the analysis of single-cell RNA-sequencing data. In this work, we present CellO, a machine learning-based tool for annotating human RNA-seq data with the Cell Ontology. CellO enables accurate and standardized cell type classification by considering the rich hierarchical structure of known cell types, a source of prior knowledge that is not utilized by existing methods. Furthermore, CellO comes pre-trained on a novel, comprehensive dataset of human, healthy, untreated primary samples in the Sequence Read Archive, which, to the best of our knowledge, is the most diverse curated collection of primary cell data to date. CellO’s comprehensive training set enables it to run out-of-the-box on diverse cell types and achieves superior or competitive performance when compared to existing state-of-the-art methods. Lastly, CellO’s linear models are easily interpreted, thereby enabling exploration of cell type-specific expression signatures across the ontology. To this end, we also present the CellO Viewer: a web application for exploring CellO’s models across the ontology.

Highlights: We present CellO, a tool for hierarchically classifying cell type from single-cell RNA-seq data against the graph-structured Cell Ontology. CellO is pre-trained on a comprehensive dataset comprising nearly all bulk RNA-seq primary cell samples in the Sequence Read Archive. CellO achieves superior or comparable performance with existing methods while featuring a more comprehensive pre-packaged training set. CellO is built with easily interpretable models which we expose through a novel web application, the CellO Viewer, for exploring cell type-specific signatures across the Cell Ontology.

CellMap: Characterizing the types and composition of iPSC-derived cells from RNA-seq data

10.1101/2021.05.24.445360 ◽

2021 ◽

Author(s):

Zhengyu Ouyang ◽

Nathanael Bourgeois ◽

Eugenia Lyashenko ◽

Paige Cundiff ◽

Patrick F Cullen ◽

...

Keyword(s):

Single Cell ◽

Induced Pluripotent Stem Cell ◽

Cell Types ◽

Model Systems ◽

Rna Seq ◽

Cell Type ◽

Fine Grained ◽

Single Nucleus ◽

Induced Pluripotent

Induced pluripotent stem cell (iPSC) derived cell types are increasingly employed as in vitro model systems for drug discovery. For these studies to be meaningful, it is important to understand the reproducibility of the iPSC-derived cultures and their similarity to equivalent endogenous cell types. Single-cell and single-nucleus RNA sequencing (RNA-seq) are useful for gaining such understanding, but they are expensive and time-consuming, while bulk RNA-seq data can be generated more quickly and at lower cost. In silico cell type decomposition is an efficient, inexpensive, and convenient alternative that can leverage bulk RNA-seq to derive more fine-grained information about these cultures. We developed CellMap, a computational tool that derives cell type profiles from publicly available single-cell and single-nucleus datasets to infer cell types in bulk RNA-seq data from iPSC-derived cell lines.
