scholarly journals Single-nucleus RNA-seq reveals dysregulation of striatal cell identity due to Huntington’s disease mutations

2020 ◽  
Author(s):  
Sonia Malaiya ◽  
Marcia Cortes-Gutierrez ◽  
Brian R. Herb ◽  
Sydney R. Coffey ◽  
Samuel R.W. Legg ◽  
...  

ABSTRACTHuntington’s disease (HD) is a dominantly inherited neurodegenerative disorder caused by a trinucleotide expansion in exon 1 of the huntingtin (Htt) gene. Cell death in HD occurs primarily in striatal medium spiny neurons (MSNs), but the involvement of specific MSN subtypes and of other striatal cell types remains poorly understood. To gain insight into cell type-specific disease processes, we studied the nuclear transcriptomes of 4,524 cells from the striatum of a genetically precise knock-in mouse model of the HD mutation, HttQ175/+, and from wildtype controls. We used 14-15-month-old mice, a time point roughly equivalent to an early stage of symptomatic human disease. Cell type distributions indicated selective loss of D2 MSNs and increased microglia in aged HttQ175/+ mice. Thousands of differentially expressed genes were distributed across most striatal cell types, including transcriptional changes in glial populations that are not apparent from RNA-seq of bulk tissue. Reconstruction of cell typespecific transcriptional networks revealed a striking pattern of bidirectional dysregulation for many cell type-specific genes. Typically, these genes were repressed in their primary cell type, yet de-repressed in other striatal cell types. Integration with existing epigenomic and transcriptomic data suggest that partial loss-of-function of the Polycomb Repressive Complex 2 (PRC2) may underlie many of these transcriptional changes, leading to deficits in the maintenance of cell identity across virtually all cell types in the adult striatum.

Author(s):  
Xuelin He ◽  
Li Liu ◽  
Baode Chen ◽  
Chao Wu

In vitro differentiation or expansion of stem and progenitor cells under chemical stimulation or genetic manipulation is used for understanding the molecular mechanisms of cell differentiation and self-renewal. However, concerns around the cell identity of in vitro–cultured cells exist. Bioinformatics methods, which rely heavily on signatures of cell types, have been developed to estimate cell types in bulk samples. The Tabula Muris Senis project provides an important basis for the comprehensive identification of signatures for different cell types. Here, we identified 46 cell type–specific (CTS) gene clusters for 83 mouse cell types. We conducted Gene Ontology term enrichment analysis on the gene clusters and revealed the specific functions of the relevant cell types. Next, we proposed a simple method, named CTSFinder, to identify different cell types between bulk RNA-Seq samples using the 46 CTS gene clusters. We applied CTSFinder on bulk RNA-Seq data from 17 organs and from developing mouse liver over different stages. We successfully identified the specific cell types between organs and captured the dynamics of different cell types during liver development. We applied CTSFinder with bulk RNA-Seq data from a growth factor–induced neural progenitor cell culture system and identified the dynamics of brain immune cells and nonimmune cells during the long-time cell culture. We also applied CTSFinder with bulk RNA-Seq data from reprogramming induced pluripotent stem cells and identified the stage when those cells were massively induced. Finally, we applied CTSFinder with bulk RNA-Seq data from in vivo and in vitro developing mouse retina and captured the dynamics of different cell types in the two development systems. The CTS gene clusters and CTSFinder method could thus serve as promising toolkits for assessing the cell identity of in vitro culture systems.


2020 ◽  
Author(s):  
Mohit Goyal ◽  
Guillermo Serrano ◽  
Ilan Shomorony ◽  
Mikel Hernaez ◽  
Idoia Ochoa

AbstractSingle-cell RNA-seq is a powerful tool in the study of the cellular composition of different tissues and organisms. A key step in the analysis pipeline is the annotation of cell-types based on the expression of specific marker genes. Since manual annotation is labor-intensive and does not scale to large datasets, several methods for automated cell-type annotation have been proposed based on supervised learning. However, these methods generally require feature extraction and batch alignment prior to classification, and their performance may become unreliable in the presence of cell-types with very similar transcriptomic profiles, such as differentiating cells. We propose JIND, a framework for automated cell-type identification based on neural networks that directly learns a low-dimensional representation (latent code) in which cell-types can be reliably determined. To account for batch effects, JIND performs a novel asymmetric alignment in which the transcriptomic profile of unseen cells is mapped onto the previously learned latent space, hence avoiding the need of retraining the model whenever a new dataset becomes available. JIND also learns cell-type-specific confidence thresholds to identify and reject cells that cannot be reliably classified. We show on datasets with and without batch effects that JIND classifies cells more accurately than previously proposed methods while rejecting only a small proportion of cells. Moreover, JIND batch alignment is parallelizable, being more than five or six times faster than Seurat integration. Availability: https://github.com/mohit1997/JIND.


2019 ◽  
Author(s):  
Matthew N. Bernstein ◽  
Zhongjie Ma ◽  
Michael Gleicher ◽  
Colin N. Dewey

SummaryCell type annotation is a fundamental task in the analysis of single-cell RNA-sequencing data. In this work, we present CellO, a machine learning-based tool for annotating human RNA-seq data with the Cell Ontology. CellO enables accurate and standardized cell type classification by considering the rich hierarchical structure of known cell types, a source of prior knowledge that is not utilized by existing methods. Furthemore, CellO comes pre-trained on a novel, comprehensive dataset of human, healthy, untreated primary samples in the Sequence Read Archive, which to the best of our knowledge, is the most diverse curated collection of primary cell data to date. CellO’s comprehensive training set enables it to run out-of-the-box on diverse cell types and achieves superior or competitive performance when compared to existing state-of-the-art methods. Lastly, CellO’s linear models are easily interpreted, thereby enabling exploration of cell type-specific expression signatures across the ontology. To this end, we also present the CellO Viewer: a web application for exploring CellO’s models across the ontology.HighlightWe present CellO, a tool for hierarchically classifying cell type from single-cell RNA-seq data against the graph-structured Cell OntologyCellO is pre-trained on a comprehensive dataset comprising nearly all bulk RNA-seq primary cell samples in the Sequence Read ArchiveCellO achieves superior or comparable performance with existing methods while featuring a more comprehensive pre-packaged training setCellO is built with easily interpretable models which we expose through a novel web application, the CellO Viewer, for exploring cell type-specific signatures across the Cell OntologyGraphical Abstract


2021 ◽  
Vol 39 (15_suppl) ◽  
pp. e19013-e19013
Author(s):  
Marianne T. Santaguida ◽  
Ryosuke Kita ◽  
Steven A. Schaffert ◽  
Erica K. Anderson ◽  
Kamran A Ali ◽  
...  

e19013 Background: Understanding the heterogeneity of AML is necessary for developing targeted drugs and diagnostics. A key measure of heterogeneity is the variance in response to treatments. Previously, we developed an ex vivo flow cytometry drug sensitivity assay (DSA) that predicted response to treatments in myelodysplastic syndrome. Unlike bulk cell viability measures of other drug sensitivity assays, our flow cytometry assay provides single cell resolution. The assay measures a drug’s effect on the viability or functional state of specific cell types. Here we present the development of this technology for AML, with additional measurements of DNA-Seq and RNA-Seq. Using the data from this assay, we aim to characterize the heterogeneity in AML drug sensitivity and the molecular mechanisms that drive it. Methods: As an initial feasibility analysis, we assayed 1 bone marrow and 3 peripheral blood AML patient samples. For the DSA, the samples were cultured with six AML standard of care (SOC) compounds across seven doses, in addition to two combinations. The cells were stained to detect multiple cell types including tumor blasts, and drug response was measured by flow cytometry. For the multi-omics, the cells were magnetically sorted to enrich for blasts and then assayed using a targeted 400 gene DNA-Seq panel and whole bulk transcriptome RNA-Seq. For comparison with BeatAML, Pearson correlations between gene expression and venetoclax sensitivity were investigated. Results: In our drug sensitivity assay, we measured dose response curves for the six SOC compounds, for each different cell type across each sample. The dose responses had cell type specific effects, including differences in drug response between CD11b+ blasts, CD11b- blasts, and other non-blast populations. Integrating with the DNA-Seq and RNA-Seq data, known associations between ex vivo drug response and gene expression were identified with additional cell type specificity. For example, BCL2A1 expression was negatively correlated with venetoclax sensitivity in CD11b- blasts but not in CD11b+ blasts. To further corroborate, among the top 1000 genes associated with venetoclax sensitivity in BeatAML, 93.7% had concordant directionality in effect. Conclusions: Here we describe the development of an integrated ex vivo drug sensitivity assay and multi-omics dataset. The data demonstrated that ex vivo responses to compounds differ between cell types, highlighting the importance of measuring drug response in specific cell types. In addition, we demonstrated that integrating these data will provide unique insights on molecular mechanisms that affect cell type specific drug response. As we continue to expand the number of patient samples evaluated with our multi-dimensional platform, this dataset will provide insights for novel drug target discovery, biomarker development, and, in the future, informing treatment decisions.


2018 ◽  
Author(s):  
Xuran Wang ◽  
Jihwan Park ◽  
Katalin Susztak ◽  
Nancy R. Zhang ◽  
Mingyao Li

AbstractWe present MuSiC, a method that utilizes cell-type specific gene expression from single-cell RNA sequencing (RNA-seq) data to characterize cell type compositions from bulk RNA-seq data in complex tissues. When applied to pancreatic islet and whole kidney expression data in human, mouse, and rats, MuSiC outperformed existing methods, especially for tissues with closely related cell types. MuSiC enables characterization of cellular heterogeneity of complex tissues for identification of disease mechanisms.


2021 ◽  
Vol 14 ◽  
Author(s):  
Ian A. Taukulis ◽  
Rafal T. Olszewski ◽  
Soumya Korrapati ◽  
Katharine A. Fernandez ◽  
Erich T. Boger ◽  
...  

The endocochlear potential (EP) generated by the stria vascularis (SV) is necessary for hair cell mechanotransduction in the mammalian cochlea. We sought to create a model of EP dysfunction for the purposes of transcriptional analysis and treatment testing. By administering a single dose of cisplatin, a commonly prescribed cancer treatment drug with ototoxic side effects, to the adult mouse, we acutely disrupt EP generation. By combining these data with single cell RNA-sequencing findings, we identify transcriptional changes induced by cisplatin exposure, and by extension transcriptional changes accompanying EP reduction, in the major cell types of the SV. We use these data to identify gene regulatory networks unique to cisplatin treated SV, as well as the differentially expressed and druggable gene targets within those networks. Our results reconstruct transcriptional responses that occur in gene expression on the cellular level while identifying possible targets for interventions not only in cisplatin ototoxicity but also in EP dysfunction.


2021 ◽  
Vol 14 (1) ◽  
Author(s):  
Yujuan Gui ◽  
Kamil Grzyb ◽  
Mélanie H. Thomas ◽  
Jochen Ohnmacht ◽  
Pierre Garcia ◽  
...  

Abstract Background Cell types in ventral midbrain are involved in diseases with variable genetic susceptibility, such as Parkinson’s disease and schizophrenia. Many genetic variants affect regulatory regions and alter gene expression in a cell-type-specific manner depending on the chromatin structure and accessibility. Results We report 20,658 single-nuclei chromatin accessibility profiles of ventral midbrain from two genetically and phenotypically distinct mouse strains. We distinguish ten cell types based on chromatin profiles and analysis of accessible regions controlling cell identity genes highlights cell-type-specific key transcription factors. Regulatory variation segregating the mouse strains manifests more on transcriptome than chromatin level. However, cell-type-level data reveals changes not captured at tissue level. To discover the scope and cell-type specificity of cis-acting variation in midbrain gene expression, we identify putative regulatory variants and show them to be enriched at differentially expressed loci. Finally, we find TCF7L2 to mediate trans-acting variation selectively in midbrain neurons. Conclusions Our data set provides an extensive resource to study gene regulation in mesencephalon and provides insights into control of cell identity in the midbrain and identifies cell-type-specific regulatory variation possibly underlying phenotypic and behavioural differences between mouse strains.


2018 ◽  
Author(s):  
Aziz Khan ◽  
Anthony Mathelier ◽  
Xuegong Zhang

AbstractBackgroundSuper-enhancers and stretch enhancers represent classes of transcriptional enhancers that have been shown to control the expression of cell identity genes and carry disease- and trait-associated variants. Specifically, super-enhancers are clusters of enhancers defined based on the binding occupancy of master transcription factors (TFs), chromatin regulators, or chromatin marks, while stretch enhancers are large chromatin-defined regulatory regions of at least 3,000 base pairs. Several studies have characterized these regulatory regions in numerous cell types and tissues to decipher their functional importance. However, the differences and similarities between these regulatory regions have not been fully assessed.ResultsWe integrated genomic, epigenomic, and transcriptomic data from ten human cell types to perform a comparative analysis of super and stretch enhancers with respect to their chromatin profiles, cell-type-specificity, and ability to control gene expression. We found that stretch enhancers are more abundant, more distal to transcription start sites, cover twice as much the genome and are significantly less conserved than super-enhancers. In contrast, super-enhancers are significantly more enriched for active chromatin marks and cohesin complex and transcriptionally active than stretch enhancers. Importantly, a vast majority of superenhancers (85%) overlap with only a small subset of stretch enhancers (13%), which are enriched for cell-type-specific biological functions, and control cell identity genes.ConclusionsThese results suggest that super-enhancers are transcriptionally more active and cell-type-specific than stretch enhancers, and importantly, most of the stretch enhancers that are distinct from superenhancers do not show an association with cell identity genes, are less active, and more likely to be poised enhancers.


2019 ◽  
Vol 31 (3) ◽  
pp. 496 ◽  
Author(s):  
Iside Scaravaggi ◽  
Nicole Borel ◽  
Rebekka Romer ◽  
Isabel Imboden ◽  
Susanne E. Ulbrich ◽  
...  

Previous endometrial gene expression studies during the time of conceptus migration did not provide final conclusions on the mechanisms of maternal recognition of pregnancy (MRP) in the mare. This called for a cell type-specific endometrial gene expression analysis in response to embryo signals to improve the understanding of gene expression regulation in the context of MRP. Laser capture microdissection was used to collect luminal epithelium (LE), glandular epithelium and stroma from endometrial biopsies from Day 12 of pregnancy and Day 12 of the oestrous cycle. RNA sequencing (RNA-Seq) showed greater expression differences between cell types than between pregnant and cyclic states; differences between the pregnant and cyclic states were mainly found in LE. Comparison with a previous RNA-Seq dataset for whole biopsy samples revealed the specific origin of gene expression differences. Furthermore, genes specifically differentially expressed (DE) in one cell type were found that were not detectable as DE in biopsies. Overall, this study revealed spatial information about endometrial gene expression during the phase of initial MRP. The conceptus induced changes in the expression of genes involved in blood vessel development, specific spatial regulation of the immune system, growth factors, regulation of prostaglandin synthesis, transport prostaglandin receptors, specifically prostaglandin F receptor (PTGFR) in the context of prevention of luteolysis.


Sign in / Sign up

Export Citation Format

Share Document