scholarly journals Strategies for Integrating Single-Cell RNA Sequencing Results With Multiple Species

2019 ◽  
Author(s):  
Ronald P. Hart

Single-cell RNA sequencing (scRNAseq) is a robust technology for parsing gene expression in individual cells from a tissue or other complex source. One application involves experiments where cells from multiple species are recovered from a single sample, such as when human cells are transplanted into an animal model. We transplanted microglial precursor cells into newborn mouse brain and then recovered unenriched cortical tissue six months later. Dissociated cells were assessed by scRNAseq. The default method for analyzing these results begins by aligning sequencing reads with a mixture of both mouse and human reference genomes. While this clearly identifies the human cells as a distinct cluster, the clustering is artificially driven by expression from non-comparable gene identifiers from different species. We devised a method for translating expression counts from human to mouse and evaluated four algorithms for parsing mixed-species scRNAseq data. Our optimal approach split raw sequencing reads according to the best alignment score in each genome, and then re-aligned reads only with the appropriate genome. After gene symbol translation, pooled results indicate that cell types are more appropriately clustered and that differential expression analysis identifies species-specific patterns. This method should be applicable to any mixed-species scRNAseq experiment.Summary of optimal strategyMixed-species scRNAseq data are aligned with mixture of mouse and human reference genomesThe BAM file is scanned to find the best alignment score for each sequencing read identifier; these are used to split the paired FASTQ files into two sets of filesEach set of species-specific, paired FASTQ files is re-aligned with only the appropriate reference genomeRaw counts imported into SeuratThe human counts table is translated to mouse gene symbols using a custom HomoloGene translation tableResults are merged and analyzed


2020 ◽  
Author(s):  
Tito Candelli ◽  
Pauline Schneider ◽  
Patricia Garrido Castro ◽  
Luke A. Jones ◽  
Rob Pieters ◽  
...  

AbstractInfants with MLL-rearranged acute lymphoblastic leukemia (ALL) undergo intense therapy to counter a highly aggressive leukemia with survival rates of only 30-40%. The majority of patients initially show therapy response, but in two-thirds of cases the leukemia returns, typically during treatment. Accurate relapse prediction would enable treatment strategies that take relapse risk into account, with potential benefits for all patients. Through analysis of diagnostic bone marrow biopsies, we show that single-cell RNA sequencing can predict future relapse occurrence. By analysing gene modules derived from an independent study of the gene expression response to the key drug prednisone, individual leukemic cells are predicted to be either resistant or sensitive to treatment. Quantification of the proportion of cells classified by single-cell transcriptomics as resistant or sensitive, accurately predicts the occurrence of future relapse in individual patients. Strikingly, the single-cell based classification is even consistent with the order of relapse timing. These results lay the foundation for risk-based treatment of MLL-rearranged infant ALL, through single-cell classification. This work also sheds light on the subpopulation of cells from which leukemic relapse arises. Leukemic cells associated with high relapse risk are characterized by a smaller size and a quiescent gene expression program. These cells have significantly fewer transcripts, thereby also demonstrating why single-cell analyses may outperform bulk mRNA studies for risk stratification. This study indicates that single-cell RNA sequencing will be a valuable tool for risk stratification of MLL-rearranged infant ALL, and shows how clinically relevant information can be derived from single-cell genomics.Key PointsSingle-cell RNA sequencing accurately predicts relapse in MLL-rearranged infant ALLIdentification of cells from which MLL-rearranged infant ALL relapses occur



2021 ◽  
Author(s):  
Ralf Schulze Brüning ◽  
Lukas Tombor ◽  
Marcel H. Schulz ◽  
Stefanie Dimmeler ◽  
David John

AbstractWith the rise of single cell RNA sequencing new bioinformatic tools became available to handle specific demands, such as quantifying unique molecular identifiers and correcting cell barcodes. Here, we analysed several datasets with the most common alignment tools for scRNA-seq data. We evaluated differences in the whitelisting, gene quantification, overall performance and potential variations in clustering or detection of differentially expressed genes.We compared the tools Cell Ranger, STARsolo, Kallisto and Alevin on three published datasets for human and mouse, sequenced with different versions of the 10X sequencing protocol.Striking differences have been observed in the overall runtime of the mappers. Besides that Kallisto and Alevin showed variances in the number of valid cells and detected genes per cell. Kallisto reported the highest number of cells, however, we observed an overrepresentation of cells with low gene content and unknown celtype. Conversely, Alevin rarely reported such low content cells.Further variations were detected in the set of expressed genes. While STARsolo, Cell Ranger and Alevin released similar gene sets, Kallisto detected additional genes from the Vmn and Olfr gene family, which are likely mapping artifacts. We also observed differences in the mitochondrial content of the resulting cells when comparing a prefiltered annotation set to the full annotation set that includes pseudogenes and other biotypes.Overall, this study provides a detailed comparison of common scRNA-seq mappers and shows their specific properties on 10X Genomics data.Key messagesMapping and gene quantifications are the most resource and time intensive steps during the analysis of scRNA-Seq data.The usage of alternative alignment tools reduces the time for analysing scRNA-Seq data.Different mapping strategies influence key properties of scRNA-SEQ e.g. total cell counts or genes per cellA better understanding of advantages and disadvantages for each mapping algorithm might improve analysis results.



2021 ◽  
Author(s):  
Kayt Scott ◽  
Rebecca O’Rourke ◽  
Caitlin C. Winkler ◽  
Christina A. Kearns ◽  
Bruce Appel

AbstractVentral spinal cord progenitor cells, which express the basic helix loop helix transcription factor Olig2, sequentially produce motor neurons and oligodendrocyte precursor cells (OPCs). Following specification some OPCs differentiate as myelinating oligodendrocytes while others persist as OPCs. Though a considerable amount of work has described the molecular profiles that define motor neurons, OPCs, and oligodendrocytes, less is known about the progenitors that produce them. To identify the developmental origins and transcriptional profiles of motor neurons and OPCs, we performed single-cell RNA sequencing on isolated pMN cells from embryonic zebrafish trunk tissue at stages that encompassed motor neurogenesis, OPC specification, and initiation of oligodendrocyte differentiation. Downstream analyses revealed two distinct pMN progenitor populations: one that appears to produce neurons and one that appears to produce OPCs. This latter population, called Pre-OPCs, is marked by expression of GS Homeobox 2 (gsx2), a gene that encodes a homeobox transcription factor. Using fluorescent in situ hybridizations, we identified gsx2-expressing Pre-OPCs in the spinal cord prior to expression of canonical OPC marker genes. Our data therefore reveal heterogeneous gene expression profiles among pMN progenitors, supporting prior fate mapping evidence.HighlightsSingle-cell RNA sequencing reveals the developmental trajectories of neurons and glia that arise from spinal cord pMN progenitor cells in zebrafish embryosTranscriptionally distinct subpopulations of pMN progenitors are the apparent sources of neurons or oligodendrocytes, consistent with fate mapping datagsx2 expression marks pMN progenitors that produce oligodendrocyte lineage cells



2018 ◽  
Author(s):  
Matthew D Young ◽  
Sam Behjati

AbstractBackgroundDroplet based single-cell RNA sequence analyses assume all acquired RNAs are endogenous to cells. However, any cell free RNAs contained within the input solution are also captured by these assays. This sequencing of cell free RNA constitutes a background contamination that confounds the biological interpretation of single-cell transcriptomic data.ResultsWe demonstrate that contamination from this ‘soup’ of cell free RNAs is ubiquitous, with experiment-specific variations in composition and magnitude. We present a method, SoupX, for quantifying the extent of the contamination and estimating ‘background corrected’ cell expression profiles that seamlessly integrate with existing downstream analysis tools. Applying this method to several datasets using multiple droplet sequencing technologies, we demonstrate that its application improves biological interpretation of otherwise misleading data, as well as improving quality control metrics.ConclusionsWe present ‘SoupX’, a tool for removing ambient RNA contamination from droplet based single cell RNA sequencing experiments. This tool has broad applicability and its application can improve the biological utility of existing and future data sets.Key PointsThe signal from droplet based single cell RNA sequencing is ubiquitously contaminated by capture of ambient mRNA.SoupX is a method to quantify the abundance of these ambient mRNAs and remove them.Correcting for ambient mRNA contamination improves biological interpretation.



2018 ◽  
Author(s):  
Qingyun Li ◽  
Zuolin Cheng ◽  
Lu Zhou ◽  
Spyros Darmanis ◽  
Norma Neff ◽  
...  

SummaryMicroglia are increasingly recognized for their major contributions during brain development and neurodegenerative disease. It is currently unknown if these functions are carried out by subsets of microglia during different stages of development and adulthood or within specific brain regions. Here, we performed deep single-cell RNA sequencing (scRNA-seq) of microglia and related myeloid cells sorted from various regions of embryonic, postnatal, and adult mouse brains. We found that the majority of adult microglia with homeostatic signatures are remarkably similar in transcriptomes, regardless of brain region. By contrast, postnatal microglia represent a more heterogeneous population. We discovered that postnatal white matter-associated microglia (WAM) are strikingly different from microglia in other regions and express genes enriched in degenerative disease-associated microglia. These postnatal WAM have distinct amoeboid morphology, are metabolically active, and phagocytose newly formed oligodendrocytes. This scRNA-seq atlas will be a valuable resource for dissecting innate immune functions in health and disease.HighlightsMyeloid scRNA-seq atlas across brain regions and developmental stagesLimited transcriptomic heterogeneity of homeostatic microglia in the adult brainPhase-specific gene sets of proliferating microglia along cell cycle pseudotimePhagocytic postnatal white matter-associated microglia sharing DAM gene signatures



Diabetes ◽  
2020 ◽  
Vol 69 (Supplement 1) ◽  
pp. 41-OR
Author(s):  
FARNAZ SHAMSI ◽  
MARY PIPER ◽  
LI-LUN HO ◽  
TIAN LIAN HUANG ◽  
YU-HUA TSENG


Sign in / Sign up

Export Citation Format

Share Document