The Application of Single-Cell RNA Sequencing in Mammalian Meiosis Studies

Meiosis is a cellular division process that produces gametes for sexual reproduction. Disruption of complex events throughout meiosis, such as synapsis and homologous recombination, can lead to infertility and aneuploidy. To reveal the molecular mechanisms of these events, transcriptome studies of specific substages must be conducted. However, conventional methods, such as bulk RNA-seq and RT-qPCR, are not able to detect the transcriptional variations effectively and precisely, especially for identifying cell types and stages with subtle differences. In recent years, mammalian meiotic transcriptomes have been intensively studied at the single-cell level by using single-cell RNA-seq (scRNA-seq) approaches, especially through two widely used platforms, Smart-seq2 and Drop-seq. The scRNA-seq protocols along with their downstream analysis enable researchers to accurately identify cell heterogeneities and investigate meiotic transcriptomes at a higher resolution. In this review, we compared bulk RNA-seq and scRNA-seq to show the advantages of the scRNA-seq in meiosis studies; meanwhile, we also pointed out the challenges and limitations of the scRNA-seq. We listed recent findings from mammalian meiosis (male and female) studies where scRNA-seq applied. Next, we summarized the scRNA-seq analysis methods and the meiotic marker genes from spermatocytes and oocytes. Specifically, we emphasized the different features of the two scRNA-seq protocols (Smart-seq2 and Drop-seq) in the context of meiosis studies and discussed their strengths and weaknesses in terms of different research purposes. Finally, we discussed the future applications of scRNA-seq in the meiosis field.

Download Full-text

SDImpute: A statistical block imputation method based on cell-level and gene-level information for dropouts in single-cell RNA-seq data

PLoS Computational Biology ◽

10.1371/journal.pcbi.1009118 ◽

2021 ◽

Vol 17 (6) ◽

pp. e1009118

Author(s):

Jing Qi ◽

Yang Zhou ◽

Zicen Zhao ◽

Shuilin Jin

Keyword(s):

Gene Expression ◽

Single Cell ◽

Differential Expression Analysis ◽

Cell Types ◽

Rna Seq ◽

Cell Level ◽

Gene Level ◽

Level Information ◽

Downstream Analysis ◽

Gene Expression Levels

The single-cell RNA sequencing (scRNA-seq) technologies obtain gene expression at single-cell resolution and provide a tool for exploring cell heterogeneity and cell types. As the low amount of extracted mRNA copies per cell, scRNA-seq data exhibit a large number of dropouts, which hinders the downstream analysis of the scRNA-seq data. We propose a statistical method, SDImpute (Single-cell RNA-seq Dropout Imputation), to implement block imputation for dropout events in scRNA-seq data. SDImpute automatically identifies the dropout events based on the gene expression levels and the variations of gene expression across similar cells and similar genes, and it implements block imputation for dropouts by utilizing gene expression unaffected by dropouts from similar cells. In the experiments, the results of the simulated datasets and real datasets suggest that SDImpute is an effective tool to recover the data and preserve the heterogeneity of gene expression across cells. Compared with the state-of-the-art imputation methods, SDImpute improves the accuracy of the downstream analysis including clustering, visualization, and differential expression analysis.

Download Full-text

Single-cell transcriptomic analysis elucidates APOE genotype specific changes across cell types in two brain regions in Alzheimer’s disease

10.21203/rs.3.rs-291648/v1 ◽

2021 ◽

Author(s):

Stella Belonwu ◽

Yaqiao Li ◽

Daniel Bunis ◽

Arjun Arkal Rao ◽

Caroline Warly Solsberg ◽

...

Keyword(s):

Alzheimer’S Disease ◽

Alzheimer's Disease ◽

Single Cell ◽

Molecular Mechanisms ◽

Apoe Genotype ◽

Cell Types ◽

Brain Regions ◽

Single Cell Level ◽

Cell Level ◽

Single Nucleus

Abstract Alzheimer’s Disease (AD) is a complex neurodegenerative disease that gravely affects patients and imposes an immense burden on caregivers. Apolipoprotein E4 (APOE4) has been identified as the most common genetic risk factor for AD, yet the molecular mechanisms connecting APOE4 to AD are not well understood. Past transcriptomic analyses in AD have revealed APOE genotype-specific transcriptomic differences; however, these differences have not been explored at a single-cell level. Here, we leverage the first two single-nucleus RNA sequencing AD datasets from human brain samples, including nearly 55,000 cells from the prefrontal and entorhinal cortices. We observed more global transcriptomic changes in APOE4 positive AD cells and identified differences across APOE genotypes primarily in glial cell types. Our findings highlight the differential transcriptomic perturbations of APOE isoforms at a single-cell level in AD pathogenesis and have implications for precision medicine development in the diagnosis and treatment of AD.

Download Full-text

JIND: Joint Integration and Discrimination for Automated Single-Cell Annotation

10.1101/2020.10.06.327601 ◽

2020 ◽

Author(s):

Mohit Goyal ◽

Guillermo Serrano ◽

Ilan Shomorony ◽

Mikel Hernaez ◽

Idoia Ochoa

Keyword(s):

Single Cell ◽

Cell Types ◽

Marker Genes ◽

Specific Marker ◽

Rna Seq ◽

Batch Effects ◽

Cell Type ◽

Latent Space ◽

Cell Type Specific ◽

Low Dimensional

AbstractSingle-cell RNA-seq is a powerful tool in the study of the cellular composition of different tissues and organisms. A key step in the analysis pipeline is the annotation of cell-types based on the expression of specific marker genes. Since manual annotation is labor-intensive and does not scale to large datasets, several methods for automated cell-type annotation have been proposed based on supervised learning. However, these methods generally require feature extraction and batch alignment prior to classification, and their performance may become unreliable in the presence of cell-types with very similar transcriptomic profiles, such as differentiating cells. We propose JIND, a framework for automated cell-type identification based on neural networks that directly learns a low-dimensional representation (latent code) in which cell-types can be reliably determined. To account for batch effects, JIND performs a novel asymmetric alignment in which the transcriptomic profile of unseen cells is mapped onto the previously learned latent space, hence avoiding the need of retraining the model whenever a new dataset becomes available. JIND also learns cell-type-specific confidence thresholds to identify and reject cells that cannot be reliably classified. We show on datasets with and without batch effects that JIND classifies cells more accurately than previously proposed methods while rejecting only a small proportion of cells. Moreover, JIND batch alignment is parallelizable, being more than five or six times faster than Seurat integration. Availability: https://github.com/mohit1997/JIND.

Download Full-text

Enhancing droplet-based single-nucleus RNA-seq resolution using the semi-supervised machine learning classifier DIEM

10.1101/786285 ◽

2019 ◽

Cited By ~ 4

Author(s):

Marcus Alvarez ◽

Elior Rahmani ◽

Brandon Jew ◽

Kristina M. Garske ◽

Zong Miao ◽

...

Keyword(s):

Gene Expression ◽

Single Cell ◽

Cell Types ◽

Supervised Machine Learning ◽

Data Sets ◽

Rna Seq ◽

Novel Approach ◽

Single Nucleus ◽

Downstream Analysis

AbstractSingle-nucleus RNA sequencing (snRNA-seq) measures gene expression in individual nuclei instead of cells, allowing for unbiased cell type characterization in solid tissues. Contrary to single-cell RNA seq (scRNA-seq), we observe that snRNA-seq is commonly subject to contamination by high amounts of extranuclear background RNA, which can lead to identification of spurious cell types in downstream clustering analyses if overlooked. We present a novel approach to remove debris-contaminated droplets in snRNA-seq experiments, called Debris Identification using Expectation Maximization (DIEM). Our likelihood-based approach models the gene expression distribution of debris and cell types, which are estimated using EM. We evaluated DIEM using three snRNA-seq data sets: 1) human differentiating preadipocytes in vitro, 2) fresh mouse brain tissue, and 3) human frozen adipose tissue (AT) from six individuals. All three data sets showed various degrees of extranuclear RNA contamination. We observed that existing methods fail to account for contaminated droplets and led to spurious cell types. When compared to filtering using these state of the art methods, DIEM better removed droplets containing high levels of extranuclear RNA and led to higher quality clusters. Although DIEM was designed for snRNA-seq data, we also successfully applied DIEM to single-cell data. To conclude, our novel method DIEM removes debris-contaminated droplets from single-cell-based data fast and effectively, leading to cleaner downstream analysis. Our code is freely available for use at https://github.com/marcalva/diem.

Download Full-text

Single-nuclei RNA-seq on human retinal tissue provides improved transcriptome profiling

Nature Communications ◽

10.1038/s41467-019-12917-9 ◽

2019 ◽

Vol 10 (1) ◽

Cited By ~ 16

Author(s):

Qingnan Liang ◽

Rachayata Dharmat ◽

Leah Owen ◽

Akbar Shakoor ◽

Yumei Li ◽

...

Keyword(s):

Single Cell ◽

Transcriptome Profiling ◽

Cell Types ◽

Retinal Cell ◽

Peripheral Retina ◽

Marker Genes ◽

Rna Seq ◽

Cell Type ◽

Retinal Tissue ◽

The Individual

AbstractSingle-cell RNA-seq is a powerful tool in decoding the heterogeneity in complex tissues by generating transcriptomic profiles of the individual cell. Here, we report a single-nuclei RNA-seq (snRNA-seq) transcriptomic study on human retinal tissue, which is composed of multiple cell types with distinct functions. Six samples from three healthy donors are profiled and high-quality RNA-seq data is obtained for 5873 single nuclei. All major retinal cell types are observed and marker genes for each cell type are identified. The gene expression of the macular and peripheral retina is compared to each other at cell-type level. Furthermore, our dataset shows an improved power for prioritizing genes associated with human retinal diseases compared to both mouse single-cell RNA-seq and human bulk RNA-seq results. In conclusion, we demonstrate that obtaining single cell transcriptomes from human frozen tissues can provide insight missed by either human bulk RNA-seq or animal models.

Download Full-text

Integrated profiling of single cell epigenomic and transcriptomic landscape of Parkinson’s disease mouse brain

10.1101/2020.02.04.933259 ◽

2020 ◽

Author(s):

Jixing Zhong ◽

Gen Tang ◽

Jiacheng Zhu ◽

Xin Qiu ◽

Weiying Wu ◽

...

Keyword(s):

Parkinson’S Disease ◽

Parkinson's Disease ◽

Single Cell ◽

Early Stage ◽

Cell Types ◽

Cellular Heterogeneity ◽

Rna Seq ◽

Cell Level ◽

Distinct Cell ◽

Single Nucleus

AbstractParkinson’s disease (PD) is a neurodegenerative disease leading to the impairment of execution of movement. PD pathogenesis has been largely investigated, but either restricted in bulk level or at certain cell types, which failed to capture cellular heterogeneity and intrinsic interplays among distinct cell types. To overcome this, we applied single-nucleus RNA-seq and single cell ATAC-seq on cerebellum, midbrain and striatum of PD mouse and matched control. With 74,493 cells in total, we comprehensively depicted the dysfunctions under PD pathology covering proteostasis, neuroinflammation, calcium homeostasis and extracellular neurotransmitter homeostasis. Besides, by multi-omics approach, we identified putative biomarkers for early stage of PD, based on the relationships between transcriptomic and epigenetic profiles. We located certain cell types that primarily contribute to PD early pathology, narrowing the gap between genotypes and phenotypes. Taken together, our study provides a valuable resource to dissect the molecular mechanism of PD pathogenesis at single cell level, which could facilitate the development of novel methods regarding diagnosis, monitoring and practical therapies against PD at early stage.

Download Full-text

scAPAdb: a comprehensive database of alternative polyadenylation at single-cell resolution

Nucleic Acids Research ◽

10.1093/nar/gkab795 ◽

2021 ◽

Author(s):

Sheng Zhu ◽

Qiwei Lian ◽

Wenbin Ye ◽

Wei Qin ◽

Zhe Wu ◽

...

Keyword(s):

Single Cell ◽

Alternative Polyadenylation ◽

Cell Types ◽

Single Cell Level ◽

Cell Heterogeneity ◽

Rna Seq ◽

Cell Level ◽

Eukaryotic Gene ◽

User Friendly ◽

Different Cell Types

Abstract Alternative polyadenylation (APA) is a widespread regulatory mechanism of transcript diversification in eukaryotes, which is increasingly recognized as an important layer for eukaryotic gene expression. Recent studies based on single-cell RNA-seq (scRNA-seq) have revealed cell-to-cell heterogeneity in APA usage and APA dynamics across different cell types in various tissues, biological processes and diseases. However, currently available APA databases were all collected from bulk 3′-seq and/or RNA-seq data, and no existing database has provided APA information at single-cell resolution. Here, we present a user-friendly database called scAPAdb (http://www.bmibig.cn/scAPAdb), which provides a comprehensive and manually curated atlas of poly(A) sites, APA events and poly(A) signals at the single-cell level. Currently, scAPAdb collects APA information from > 360 scRNA-seq experiments, covering six species including human, mouse and several other plant species. scAPAdb also provides batch download of data, and users can query the database through a variety of keywords such as gene identifier, gene function and accession number. scAPAdb would be a valuable and extendable resource for the study of cell-to-cell heterogeneity in APA isoform usages and APA-mediated gene regulation at the single-cell level under diverse cell types, tissues and species.

Download Full-text

Meta-Analysis of cortical inhibitory interneurons markers landscape and their performances in scRNA-seq studies.

10.1101/2021.11.03.467049 ◽

2021 ◽

Author(s):

Lorenzo Martini ◽

Roberta Bardini ◽

Stefano Di Carlo

Keyword(s):

Single Cell ◽

Meta Analysis ◽

Cell Types ◽

Cellular Heterogeneity ◽

Marker Genes ◽

Inhibitory Interneurons ◽

Rna Seq ◽

Circuit Function ◽

The Brain

The mammalian cortex contains a great variety of neuronal cells. In particular, GABAergic interneurons, which play a major role in neuronal circuit function, exhibit an extraordinary diversity of cell types. In this regard, single-cell RNA-seq analysis is crucial to study cellular heterogeneity. To identify and analyze rare cell types, it is necessary to reliably label cells through known markers. In this way, all the related studies are dependent on the quality of the employed marker genes. Therefore, in this work, we investigate how a set of chosen inhibitory interneurons markers perform. The gene set consists of both immunohistochemistry-derived genes and single-cell RNA-seq taxonomy ones. We employed various human and mouse datasets of the brain cortex, consequently processed with the Monocle3 pipeline. We defined metrics based on the relations between unsupervised cluster results and the marker expression. Specifically, we calculated the specificity, the fraction of cells expressing, and some metrics derived from decision tree analysis like entropy gain and impurity reduction. The results highlighted the strong reliability of some markers but also the low quality of others. More interestingly, though, a correlation emerges between the general performances of the genes set and the experimental quality of the datasets. Therefore, the proposed method allows evaluating the quality of a dataset in relation to its reliability regarding the inhibitory interneurons cellular heterogeneity study.

Download Full-text

MarkerCount: A stable, count-based cell type identifier for single cell RNA-Seq experiments

10.21203/rs.3.rs-418249/v1 ◽

2021 ◽

Author(s):

Hanbyeol Kim ◽

Joongho Lee ◽

Keunsoo Kang ◽

Seokhyun Yoon

Keyword(s):

Gene Expression ◽

Single Cell ◽

Cell Types ◽

Batch Effect ◽

Expression Level ◽

Rna Seq ◽

Cell Type ◽

Stable Performance ◽

Downstream Analysis

Abstract Cell type identification is a key step to downstream analysis of single cell RNA-seq experiments. Indispensible information for this is gene expression, which is used to cluster cells, train the model and set rejection thresholds. Problem is they are subject to batch effect arising from different platforms and preprocessing. We present MarkerCount, which uses the number of markers expressed regardless of their expression level to initially identify cell types and, then, reassign cell type in cluster-basis. MarkerCount works both in reference and marker-based mode, where the latter utilizes only the existing lists of markers, while the former required pre-annotated dataset to train the model. The performance was evaluated and compared with the existing identifiers, both marker and reference-based, that can be customized with publicly available datasets and marker DB. The results show that MarkerCount provides a stable performance when comparing with other reference-based and marker-based cell type identifiers.

Download Full-text

LRcell: detecting the source of differential expression at the sub-cell type level from bulk RNA-seq data

10.1101/2021.08.10.455821 ◽

2021 ◽

Author(s):

Wenjing Ma ◽

Sumeet Sharma ◽

Peng Jin ◽

Shannon L Gourley ◽

Zhaohui Qin

Keyword(s):

Single Cell ◽

Cell Types ◽

Marker Genes ◽

Bioconductor Package ◽

Rna Seq ◽

Cell Type ◽

Reference Dataset ◽

Cell Type Composition ◽

Type Composition ◽

Differential Gene

The rapid proliferation of single-cell RNA-sequencing (scRNA-seq) datasets have revealed cell heterogeneity at unprecedented scales. Several deconvolution methods have been developed to decompose bulk experiments to reveal cell type contributions. However, these methods lack power in identifying the accurate cell type composition when having a considerable amount of sub-cell types in the reference dataset. Here, we present LRcell, a R Bioconductor package (http://bioconductor.org/packages/release/bioc/html/LRcell.html) aiming to identify specific sub-cell type(s) that drives the changes observed in a bulk RNA-seq differential gene expression experiment. In addition, LRcell provides pre-embedded marker genes computed from putative single-cell RNA-seq experiments as options to execute the analyses.

Download Full-text