scholarly journals clustifyr: an R package for automated single-cell RNA sequencing cluster classification

F1000Research ◽  
2020 ◽  
Vol 9 ◽  
pp. 223 ◽  
Author(s):  
Rui Fu ◽  
Austin E. Gillen ◽  
Ryan M. Sheridan ◽  
Chengzhe Tian ◽  
Michelle Daya ◽  
...  

Assignment of cell types from single-cell RNA sequencing (scRNA-seq) data remains a time-consuming and error-prone process. Current packages for identity assignment use limited types of reference data and often have rigid data structure requirements. We developed the clustifyr R package to leverage several external data types, including gene expression profiles to assign likely cell types using data from scRNA-seq, bulk RNA-seq, microarray expression data, or signature gene lists. We benchmark various parameters of a correlation-based approach and implement gene list enrichment methods. clustifyr is a lightweight and effective cell-type assignment tool developed for compatibility with various scRNA-seq analysis workflows. clustifyr is publicly available at https://github.com/rnabioco/clustifyr

F1000Research ◽  
2020 ◽  
Vol 9 ◽  
pp. 223
Author(s):  
Rui Fu ◽  
Austin E. Gillen ◽  
Ryan M. Sheridan ◽  
Chengzhe Tian ◽  
Michelle Daya ◽  
...  

Assignment of cell types from single-cell RNA sequencing (scRNA-seq) data remains a time-consuming and error-prone process. Current packages for identity assignment use limited types of reference data and often have rigid data structure requirements. We developed the clustifyr R package to leverage several external data types, including gene expression profiles to assign likely cell types using data from scRNA-seq, bulk RNA-seq, microarray expression data, or signature gene lists. We benchmark various parameters of a correlation-based approach and implement gene list enrichment methods. clustifyr is a lightweight and effective cell-type assignment tool developed for compatibility with various scRNA-seq analysis workflows. clustifyr is publicly available at https://github.com/rnabioco/clustifyr


2019 ◽  
Author(s):  
Rui Fu ◽  
Austin E. Gillen ◽  
Ryan M. Sheridan ◽  
Chengzhe Tian ◽  
Michelle Daya ◽  
...  

ABSTRACTBackgroundIn single-cell RNA sequencing (scRNA-seq) analysis, assignment of likely cell types remains a time-consuming, error-prone, and biased process. Current packages for identity assignment use limited types of reference data, and often have rigid data structure requirements. As such, a more flexible tool, capable of handling multiple types of reference data and data structures, would be beneficial.FindingsTo address difficulties in cluster identity assignment, we developed the clustifyr R package. The package leverages external datasets, including gene expression profiles from scRNA-seq, bulk RNA-seq, microarray expression data, and/or signature gene lists, to assign likely cell types. We benchmark various parameters of a correlation-based approach, and also implement a variety of gene list enrichment methods. By providing tools for exploratory data analysis, we demonstrate the feasibility of a simple and effective data-driven approach for cell type assignment in scRNA-seq cell clusters.Conclusionsclustifyr is a lightweight and effective cell type assignment tool developed for compatibility with various scRNA-seq analysis workflows. clustifyr is publicly available at https://github.com/rnabioco/clustifyr


2020 ◽  
Author(s):  
Xiangru Shen ◽  
Xuefei Wang ◽  
Shan Chen ◽  
Hongyi Liu ◽  
Ni Hong ◽  
...  

Abstract Single cell RNA sequencing (scRNA-seq) clusters cells using genome-wide gene expression profiles. The relationship between scRNA-seq Clustered-Populations (scCPops) and cell surface marker-defined classic T cell subsets is unclear. Here, we interrogated 6 bead-enriched T cell subsets with 62,235 single cell transcriptomes and re-grouped them into 9 scCPops. Bead-enriched CD4 Naïve, CD8 Naïve and CD4 memory were mainly clustered into their scCPop counterparts, while the other T cell subsets were clustered into multiple scCPops including unexpected mucosal-associated invariant T cell and natural killer T cell. Most interestingly, we discovered a new T cell type that highly expressed Interferon Signaling Associated Genes (ISAGs), namely IFNhi T. We further enriched IFNhi T for scRNA-seq analyses. IFNhi T cluster disappeared on tSNE after removing ISAGs, and IFNhi T cluster showed up by tSNE analyses of ISAGs alone, indicating ISAGs are the major contributor of IFNhi T cluster. BST2+ cells and BST2- cells showing different efficiencies of T cell activation indicates high ISAGs may contribute to quick immune responses.


2021 ◽  
Author(s):  
Li Han ◽  
Carlos P Jara ◽  
Ou Wang ◽  
Sandra Thibivilliers ◽  
Rafał K. Wóycicki ◽  
...  

AbstractThe Pigskin architecture and physiology are similar to these of humans. Thus, the pig model is valuable for studying skin biology and testing therapeutics for skin diseases. The single-cell RNA sequencing technology allows quantitatively analyzing cell types, cell states, signaling, and receptor-ligand interactome at single-cell resolution and at high throughput. scRNA-Seq has been used to study mouse and human skins. However, studying pigskin with scRNA-Seq is still rare. Here we described a robust method for isolating and cryo-preserving pig single cells for scRNA-Seq. We showed that pigskin could be efficiently dissociated into single cells with high cell viability using the Miltenyi Human Whole Skin Dissociation kit and the Miltenyi gentleMACS Dissociator. Also, we showed that the subsequent single cells could be cryopreserved using DMSO without causing additional cell death, cell aggregation, or changes in gene expression profiles. Using the developed protocol, we were able to identify all the major skin cell types. The protocol and results from this study will be very valuable for the skin research scientific community.


Cephalalgia ◽  
2018 ◽  
Vol 38 (13) ◽  
pp. 1976-1983 ◽  
Author(s):  
William Renthal

Background Migraine is a debilitating disorder characterized by severe headaches and associated neurological symptoms. A key challenge to understanding migraine has been the cellular complexity of the human brain and the multiple cell types implicated in its pathophysiology. The present study leverages recent advances in single-cell transcriptomics to localize the specific human brain cell types in which putative migraine susceptibility genes are expressed. Methods The cell-type specific expression of both familial and common migraine-associated genes was determined bioinformatically using data from 2,039 individual human brain cells across two published single-cell RNA sequencing datasets. Enrichment of migraine-associated genes was determined for each brain cell type. Results Analysis of single-brain cell RNA sequencing data from five major subtypes of cells in the human cortex (neurons, oligodendrocytes, astrocytes, microglia, and endothelial cells) indicates that over 40% of known migraine-associated genes are enriched in the expression profiles of a specific brain cell type. Further analysis of neuronal migraine-associated genes demonstrated that approximately 70% were significantly enriched in inhibitory neurons and 30% in excitatory neurons. Conclusions This study takes the next step in understanding the human brain cell types in which putative migraine susceptibility genes are expressed. Both familial and common migraine may arise from dysfunction of discrete cell types within the neurovascular unit, and localization of the affected cell type(s) in an individual patient may provide insight into to their susceptibility to migraine.


Science ◽  
2020 ◽  
Vol 371 (6531) ◽  
pp. eaba5257 ◽  
Author(s):  
Anna Kuchina ◽  
Leandra M. Brettner ◽  
Luana Paleologu ◽  
Charles M. Roco ◽  
Alexander B. Rosenberg ◽  
...  

Single-cell RNA sequencing (scRNA-seq) has become an essential tool for characterizing gene expression in eukaryotes, but current methods are incompatible with bacteria. Here, we introduce microSPLiT (microbial split-pool ligation transcriptomics), a high-throughput scRNA-seq method for Gram-negative and Gram-positive bacteria that can resolve heterogeneous transcriptional states. We applied microSPLiT to >25,000 Bacillus subtilis cells sampled at different growth stages, creating an atlas of changes in metabolism and lifestyle. We retrieved detailed gene expression profiles associated with known, but rare, states such as competence and prophage induction and also identified unexpected gene expression states, including the heterogeneous activation of a niche metabolic pathway in a subpopulation of cells. MicroSPLiT paves the way to high-throughput analysis of gene expression in bacterial communities that are otherwise not amenable to single-cell analysis, such as natural microbiota.


GigaScience ◽  
2020 ◽  
Vol 9 (10) ◽  
Author(s):  
Francesca Pia Caruso ◽  
Luciano Garofano ◽  
Fulvio D'Angelo ◽  
Kai Yu ◽  
Fuchou Tang ◽  
...  

ABSTRACT Background Single-cell RNA sequencing is the reference technique for characterizing the heterogeneity of the tumor microenvironment. The composition of the various cell types making up the microenvironment can significantly affect the way in which the immune system activates cancer rejection mechanisms. Understanding the cross-talk signals between immune cells and cancer cells is of fundamental importance for the identification of immuno-oncology therapeutic targets. Results We present a novel method, single-cell Tumor–Host Interaction tool (scTHI), to identify significantly activated ligand–receptor interactions across clusters of cells from single-cell RNA sequencing data. We apply our approach to uncover the ligand–receptor interactions in glioma using 6 publicly available human glioma datasets encompassing 57,060 gene expression profiles from 71 patients. By leveraging this large-scale collection we show that unexpected cross-talk partners are highly conserved across different datasets in the majority of the tumor samples. This suggests that shared cross-talk mechanisms exist in glioma. Conclusions Our results provide a complete map of the active tumor–host interaction pairs in glioma that can be therapeutically exploited to reduce the immunosuppressive action of the microenvironment in brain tumor.


2020 ◽  
Author(s):  
Weimiao Wu ◽  
Qile Dai ◽  
Yunqing Liu ◽  
Xiting Yan ◽  
Zuoheng Wang

AbstractSingle-cell RNA sequencing provides an opportunity to study gene expression at single-cell resolution. However, prevalent dropout events result in high data sparsity and noise that may obscure downstream analyses. We propose a novel method, G2S3, that imputes dropouts by borrowing information from adjacent genes in a sparse gene graph learned from gene expression profiles across cells. We applied G2S3 and other existing methods to seven single-cell datasets to compare their performance. Our results demonstrated that G2S3 is superior in recovering true expression levels, identifying cell subtypes, improving differential expression analyses, and recovering gene regulatory relationships, especially for mildly expressed genes.


Author(s):  
Meichen Dong ◽  
Aatish Thennavan ◽  
Eugene Urrutia ◽  
Yun Li ◽  
Charles M Perou ◽  
...  

Abstract Recent advances in single-cell RNA sequencing (scRNA-seq) enable characterization of transcriptomic profiles with single-cell resolution and circumvent averaging artifacts associated with traditional bulk RNA sequencing (RNA-seq) data. Here, we propose SCDC, a deconvolution method for bulk RNA-seq that leverages cell-type specific gene expression profiles from multiple scRNA-seq reference datasets. SCDC adopts an ENSEMBLE method to integrate deconvolution results from different scRNA-seq datasets that are produced in different laboratories and at different times, implicitly addressing the problem of batch-effect confounding. SCDC is benchmarked against existing methods using both in silico generated pseudo-bulk samples and experimentally mixed cell lines, whose known cell-type compositions serve as ground truths. We show that SCDC outperforms existing methods with improved accuracy of cell-type decomposition under both settings. To illustrate how the ENSEMBLE framework performs in complex tissues under different scenarios, we further apply our method to a human pancreatic islet dataset and a mouse mammary gland dataset. SCDC returns results that are more consistent with experimental designs and that reproduce more significant associations between cell-type proportions and measured phenotypes.


2021 ◽  
Vol 129 (Suppl_1) ◽  
Author(s):  
Benjamin Kopecky ◽  
Junedh Amrute ◽  
Hao Dun ◽  
C. Corbin Frye ◽  
DANIEL KREISEL ◽  
...  

Heart transplant rejection is common and is associated with significant morbidity and mortality. Current immunosuppressive therapies primarily target recipient T-cells and have a multitude of untoward effects including infections, malignancies, and end-organ damage. Recent studies implicate the roles of antigen presenting cells towards pathogenesis of allograft rejection through recruitment and activation of T-cells. The importance of antigen presenting cell origin, identity, and functional importance remains unknown. Using complimentary imaging and single cell RNA sequencing techniques, we show that donor and recipient monocytes and macrophages co-exist after heart transplantation. These myeloid populations have diverse transcriptional signatures that evolve throughout ongoing rejection. Donor macrophages can be defined ontologically and based on their expression of C-C chemokine receptor 2 (CCR2) and expression of MHC-II. Donor CCR2+ and CCR2- populations can be further defined based on their gene expression profiles, highlighting the marked heterogeneity in the donor macrophage population. Selective depletion of CCR2+ macrophages result in prolonged allograft survival. We use longitudinal single cell RNA sequencing to show that donor CCR2+ and CCR2- macrophages have distinct activation mechanisms such that donor CCR2+ macrophages signal through MyD88/NF-kB. Conditional depletion of MyD88 in donor macrophages recapitulates the donor CCR2+ depletion phenotype. Further interrogation of MyD88 conditionally depleted allografts shows reduced T-cell alloreactivity, holding promise for a potential therapeutic target pathway. Together, we show the molecular identity, diversity, and evolution of donor and recipient monocytes and macrophages as well as the functional relevance and activation pathways of donor macrophages in cardiac allografts.


Sign in / Sign up

Export Citation Format

Share Document