scholarly journals Sierra: discovery of differential transcript usage from polyA-captured single-cell RNA-seq data

2019 ◽  
Author(s):  
Ralph Patrick ◽  
David T. Humphreys ◽  
Vaibhao Janbandhu ◽  
Alicia Oshlack ◽  
Joshua W.K. Ho ◽  
...  

AbstractHigh-throughput single-cell RNA-seq (scRNA-seq) is a powerful tool for studying gene expression in single cells. Most current scRNA-seq bioinformatics tools focus on analysing overall expression levels, largely ignoring alternative mRNA isoform expression. We present a computational pipeline, Sierra, that readily detects differential transcript usage from data generated by commonly used polyA-captured scRNA-seq technology. We validate Sierra by comparing cardiac scRNA-seq cell-types to bulk RNA-seq of matched populations, finding significant overlap in differential transcripts. Sierra detects differential transcript usage across human peripheral blood mononuclear cells and the Tabula Muris, and 3’UTR shortening in cardiac fibroblasts. Sierra is available at https://github.com/VCCRI/Sierra.

2021 ◽  
Author(s):  
Michael Hagemann-Jensen ◽  
Christoph Ziegenhain ◽  
Rickard Sandberg

Plate-based single-cell RNA-sequencing methods with full-transcript coverage typically excel at sensitivity but are more resource and time-consuming. Here, we miniaturized and streamlined the Smart-seq3 protocol for drastically reduced cost and increased throughput. Applying Smart-seq3xpress to 16,349 human peripheral blood mononuclear cells revealed a highly granular atlas complete with both common and rare cell types whose identification previously relied on additional protein measurements or the integration with a reference atlas.


2017 ◽  
Author(s):  
Navpreet Ranu ◽  
Alexandra-Chloé Villani ◽  
Nir Hacohen ◽  
Paul C. Blainey

There is rising interest in applying single-cell transcriptome analysis and other single-cell sequencing methods to resolve differences between cells. Pooled processing of thousands of single cells is now routinely practiced by introducing cell-specific DNA barcodes early in cell processing protocols1-5. However, researchers must sequence a large number of cells to sample rare subpopulations6-8, even when fluorescence-activated cell sorting (FACS) is used to pre-enrich rare cell populations. Here, a new molecular enrichment method is used in conjunction with FACS enrichment to enable efficient sampling of rare dendritic cell (DC) populations, including the recently identified AXL+SIGLEC6+ (AS DCs) subset7, within a 10X Genomics single-cell RNA-Seq library. DC populations collectively represent 1-2% of total peripheral blood mononuclear cells (PBMC), with AS DC representing only 1-3% of human blood DCs and 0.01-0.06% of total PBMCs.


2019 ◽  
Author(s):  
Florian Wagner

AbstractClustering of cells by cell type is arguably the most common and repetitive task encountered during the analysis of single-cell RNA-Seq data. However, as popular clustering methods operate largely independently of visualization techniques, the fine-tuning of clustering parameters can be unintuitive and time-consuming. Here, I propose Galapagos, a simple and effective clustering workflow based on t-SNE and DBSCAN that does not require a gene selection step. In practice, Galapagos only involves the fine-tuning of two parameters, which is straightforward, as clustering is performed directly on the t-SNE visualization results. Using peripheral blood mononuclear cells as a model tissue, I validate the effectiveness of Galapagos in different ways. First, I show that Galapagos generates clusters corresponding to all main cell types present. Then, I demonstrate that the t-SNE results are robust to parameter choices and initialization points. Next, I employ a simulation approach to show that clustering with Galapagos is accurate and robust to the high levels of technical noise present. Finally, to demonstrate Galapagos’ accuracy on real data, I compare clustering results to true cell type identities established using CITE-Seq data. In this context, I also provide an example of the primary limitation of Galapagos, namely the difficulty to resolve related cell types in cases where t-SNE fails to clearly separate the cells. Galapagos helps to make clustering scRNA-Seq data more intuitive and reproducible, and can be implemented in most programming languages with only a few lines of code.


2021 ◽  
Vol 12 (1) ◽  
Author(s):  
Karol Nienałtowski ◽  
Rachel E. Rigby ◽  
Jarosław Walczak ◽  
Karolina E. Zakrzewska ◽  
Edyta Głów ◽  
...  

AbstractAlthough we can now measure single-cell signaling responses with multivariate, high-throughput techniques our ability to interpret such measurements is still limited. Even interpretation of dose–response based on single-cell data is not straightforward: signaling responses can differ significantly between cells, encompass multiple signaling effectors, and have dynamic character. Here, we use probabilistic modeling and information-theory to introduce fractional response analysis (FRA), which quantifies changes in fractions of cells with given response levels. FRA can be universally performed for heterogeneous, multivariate, and dynamic measurements and, as we demonstrate, quantifies otherwise hidden patterns in single-cell data. In particular, we show that fractional responses to type I interferon in human peripheral blood mononuclear cells are very similar across different cell types, despite significant differences in mean or median responses and degrees of cell-to-cell heterogeneity. Further, we demonstrate that fractional responses to cytokines scale linearly with the log of the cytokine dose, which uncovers that heterogeneous cellular populations are sensitive to fold-changes in the dose, as opposed to additive changes.


2020 ◽  
Author(s):  
Karol Nienałtowski ◽  
Rachel E. Rigby ◽  
Jarosław Walczak ◽  
Karolina E. Zakrzewska ◽  
Jan Rehwinkel ◽  
...  

ABSTRACTAlthough we can now measure single-cell signaling responses with multivariate, high-throughput techniques our ability to interpret such measurements is still limited. Even interpretation of dose-response based on single-cell data is not straightforward: signaling responses can differ significantly between cells, encompass multiple signaling effectors, and have dynamic character. Here, we use probabilistic modeling and information-theory to introduce fractional response analysis (FRA), which quantifies changes in fractions of cells with given response levels. FRA can be universally performed for heterogeneous, multivariate, and dynamic measurements and, as we demonstrate, uncovers otherwise hidden patterns in single-cell data. In particular, we show that fractional responses to type I interferon in human peripheral blood mononuclear cells are very similar across different cell types, despite significant differences in mean or median responses and degrees of cell-to-cell heterogeneity. Further, we demonstrate that fractional responses to cytokines scale linearly with the log of the cytokine dose, which uncovers that cellular populations are sensitive to fold-changes in the dose, as opposed to additive changes.


2022 ◽  
Vol 13 (1) ◽  
Author(s):  
Georgios Theocharidis ◽  
Beena E. Thomas ◽  
Debasree Sarkar ◽  
Hope L. Mumme ◽  
William J. R. Pilcher ◽  
...  

AbstractDiabetic foot ulceration (DFU) is a devastating complication of diabetes whose pathogenesis remains incompletely understood. Here, we profile 174,962 single cells from the foot, forearm, and peripheral blood mononuclear cells using single-cell RNA sequencing. Our analysis shows enrichment of a unique population of fibroblasts overexpressing MMP1, MMP3, MMP11, HIF1A, CHI3L1, and TNFAIP6 and increased M1 macrophage polarization in the DFU patients with healing wounds. Further, analysis of spatially separated samples from the same patient and spatial transcriptomics reveal preferential localization of these healing associated fibroblasts toward the wound bed as compared to the wound edge or unwounded skin. Spatial transcriptomics also validates our findings of higher abundance of M1 macrophages in healers and M2 macrophages in non-healers. Our analysis provides deep insights into the wound healing microenvironment, identifying cell types that could be critical in promoting DFU healing, and may inform novel therapeutic approaches for DFU treatment.


2021 ◽  
Vol 17 (11) ◽  
pp. e1009548
Author(s):  
Qunlun Shen ◽  
Shihua Zhang

With the rapid accumulation of biological omics datasets, decoding the underlying relationships of cross-dataset genes becomes an important issue. Previous studies have attempted to identify differentially expressed genes across datasets. However, it is hard for them to detect interrelated ones. Moreover, existing correlation-based algorithms can only measure the relationship between genes within a single dataset or two multi-modal datasets from the same samples. It is still unclear how to quantify the strength of association of the same gene across two biological datasets with different samples. To this end, we propose Approximate Distance Correlation (ADC) to select interrelated genes with statistical significance across two different biological datasets. ADC first obtains the k most correlated genes for each target gene as its approximate observations, and then calculates the distance correlation (DC) for the target gene across two datasets. ADC repeats this process for all genes and then performs the Benjamini-Hochberg adjustment to control the false discovery rate. We demonstrate the effectiveness of ADC with simulation data and four real applications to select highly interrelated genes across two datasets. These four applications including 21 cancer RNA-seq datasets of different tissues; six single-cell RNA-seq (scRNA-seq) datasets of mouse hematopoietic cells across six different cell types along the hematopoietic cell lineage; five scRNA-seq datasets of pancreatic islet cells across five different technologies; coupled single-cell ATAC-seq (scATAC-seq) and scRNA-seq data of peripheral blood mononuclear cells (PBMC). Extensive results demonstrate that ADC is a powerful tool to uncover interrelated genes with strong biological implications and is scalable to large-scale datasets. Moreover, the number of such genes can serve as a metric to measure the similarity between two datasets, which could characterize the relative difference of diverse cell types and technologies.


2021 ◽  
Author(s):  
Zhibin Li ◽  
chengcheng Sun ◽  
Fei Wang ◽  
Xiran Wang ◽  
Jiacheng Zhu ◽  
...  

Background: Immune cells play important roles in mediating immune response and host defense against invading pathogens. However, insights into the molecular mechanisms governing circulating immune cell diversity among multiple species are limited. Methods: In this study, we compared the single-cell transcriptomes of 77 957 immune cells from 12 species using single-cell RNA-sequencing (scRNA-seq). Distinct molecular profiles were characterized for different immune cell types, including T cells, B cells, natural killer cells, monocytes, and dendritic cells. Results: The results revealed the heterogeneity and compositions of circulating immune cells among 12 different species. Additionally, we explored the conserved and divergent cellular cross-talks and genetic regulatory networks among vertebrate immune cells. Notably, the ligand and receptor pair VIM-CD44 was highly conserved among the immune cells. Conclusions: This study is the first to provide a comprehensive analysis of the cross-species single-cell atlas for peripheral blood mononuclear cells (PBMCs). This research should advance our understanding of the cellular taxonomy and fundamental functions of PBMCs, with important implications in evolutionary biology, developmental biology, and immune system disorders


eLife ◽  
2021 ◽  
Vol 10 ◽  
Author(s):  
Elliott Swanson ◽  
Cara Lord ◽  
Julian Reading ◽  
Alexander T Heubeck ◽  
Palak C Genge ◽  
...  

Single-cell measurements of cellular characteristics have been instrumental in understanding the heterogeneous pathways that drive differentiation, cellular responses to signals, and human disease. Recent advances have allowed paired capture of protein abundance and transcriptomic state, but a lack of epigenetic information in these assays has left a missing link to gene regulation. Using the heterogeneous mixture of cells in human peripheral blood as a test case, we developed a novel scATAC-seq workflow that increases signal-to-noise and allows paired measurement of cell surface markers and chromatin accessibility: integrated cellular indexing of chromatin landscape and epitopes, called ICICLE-seq. We extended this approach using a droplet-based multiomics platform to develop a trimodal assay that simultaneously measures transcriptomics (scRNA-seq), epitopes, and chromatin accessibility (scATAC-seq) from thousands of single cells, which we term TEA-seq. Together, these multimodal single-cell assays provide a novel toolkit to identify type-specific gene regulation and expression grounded in phenotypically defined cell types.


Sign in / Sign up

Export Citation Format

Share Document