scholarly journals Molecular Profiling to Infer Neuronal Cell Identity: Lessons from small ganglia of the Crab Cancer borealis

2019 ◽  
Author(s):  
Adam J. Northcutt ◽  
Daniel R. Kick ◽  
Adriane G. Otopalik ◽  
Benjamin M. Goetz ◽  
Rayna M. Harris ◽  
...  

ABSTRACTUnderstanding circuit organization depends on identification of cell types. Recent advances in transcriptional profiling methods have enabled classification of cell types by their gene expression. While exceptionally powerful and high throughput, the ground-truth validation of these methods is difficult: if cell type is unknown, how does one assess whether a given analysis accurately captures neuronal identity? To shed light on the capabilities and limitations of solely using transcriptional profiling for cell type classification, we performed two forms of transcriptional profiling – RNA-seq and quantitative RT-PCR, in single, unambiguously identified neurons from two small crustacean networks: the stomatogastric and cardiac ganglia. We then combined our knowledge of cell type with unbiased clustering analyses and supervised machine learning to determine how accurately functionally-defined neuron types can be classified by expression profile alone. Our results demonstrate that expression profile is able to capture neuronal identity most accurately when combined with multimodal information that allows for post-hoc grouping so analysis can proceed from a supervised perspective. Solely unsupervised clustering can lead to misidentification and an inability to distinguish between two or more cell types. Therefore, our study supports the general utility of cell identification by transcriptional profiling, but adds a caution: it is difficult or impossible to know under what conditions transcriptional profiling alone is capable of assigning cell identity. Only by combining multiple modalities of information such as physiology, morphology or innervation target can neuronal identity be unambiguously determined.SIGNIFICANCE STATEMENTSingle cell transcriptional profiling has become a widespread tool in cell identification, particularly in the nervous system, based on the notion that genomic information determines cell identity. However, many cell type classification studies are unconstrained by other cellular attributes (e.g., morphology, physiology). Here, we systematically test how accurately transcriptional profiling can assign cell identity to well-studied anatomically- and functionally-identified neurons in two small neuronal networks. While these neurons clearly possess distinct patterns of gene expression across cell types, their expression profiles are not sufficient to unambiguously confirm their identity. We suggest that true cell identity can only be determined by combining gene expression data with other cellular attributes such as innervation pattern, morphology, or physiology.

2019 ◽  
Vol 116 (52) ◽  
pp. 26980-26990 ◽  
Author(s):  
Adam J. Northcutt ◽  
Daniel R. Kick ◽  
Adriane G. Otopalik ◽  
Benjamin M. Goetz ◽  
Rayna M. Harris ◽  
...  

Understanding circuit organization depends on identification of cell types. Recent advances in transcriptional profiling methods have enabled classification of cell types by their gene expression. While exceptionally powerful and high throughput, the ground-truth validation of these methods is difficult: If cell type is unknown, how does one assess whether a given analysis accurately captures neuronal identity? To shed light on the capabilities and limitations of solely using transcriptional profiling for cell-type classification, we performed 2 forms of transcriptional profiling—RNA-seq and quantitative RT-PCR, in single, unambiguously identified neurons from 2 small crustacean neuronal networks: The stomatogastric and cardiac ganglia. We then combined our knowledge of cell type with unbiased clustering analyses and supervised machine learning to determine how accurately functionally defined neuron types can be classified by expression profile alone. The results demonstrate that expression profile is able to capture neuronal identity most accurately when combined with multimodal information that allows for post hoc grouping, so analysis can proceed from a supervised perspective. Solely unsupervised clustering can lead to misidentification and an inability to distinguish between 2 or more cell types. Therefore, this study supports the general utility of cell identification by transcriptional profiling, but adds a caution: It is difficult or impossible to know under what conditions transcriptional profiling alone is capable of assigning cell identity. Only by combining multiple modalities of information such as physiology, morphology, or innervation target can neuronal identity be unambiguously determined.


2021 ◽  
Vol 14 (1) ◽  
Author(s):  
Yujuan Gui ◽  
Kamil Grzyb ◽  
Mélanie H. Thomas ◽  
Jochen Ohnmacht ◽  
Pierre Garcia ◽  
...  

Abstract Background Cell types in ventral midbrain are involved in diseases with variable genetic susceptibility, such as Parkinson’s disease and schizophrenia. Many genetic variants affect regulatory regions and alter gene expression in a cell-type-specific manner depending on the chromatin structure and accessibility. Results We report 20,658 single-nuclei chromatin accessibility profiles of ventral midbrain from two genetically and phenotypically distinct mouse strains. We distinguish ten cell types based on chromatin profiles and analysis of accessible regions controlling cell identity genes highlights cell-type-specific key transcription factors. Regulatory variation segregating the mouse strains manifests more on transcriptome than chromatin level. However, cell-type-level data reveals changes not captured at tissue level. To discover the scope and cell-type specificity of cis-acting variation in midbrain gene expression, we identify putative regulatory variants and show them to be enriched at differentially expressed loci. Finally, we find TCF7L2 to mediate trans-acting variation selectively in midbrain neurons. Conclusions Our data set provides an extensive resource to study gene regulation in mesencephalon and provides insights into control of cell identity in the midbrain and identifies cell-type-specific regulatory variation possibly underlying phenotypic and behavioural differences between mouse strains.


2020 ◽  
Author(s):  
Yujuan Gui ◽  
Kamil Grzyb ◽  
Mélanie H. Thomas ◽  
Jochen Ohnmacht ◽  
Pierre Garcia ◽  
...  

SUMMARYCell types in ventral midbrain are involved in diseases with variable genetic susceptibility such as Parkinson’s disease and schizophrenia. Many genetic variants affect regulatory regions and alter gene expression. We report 20 658 single nuclei chromatin accessibility profiles of ventral midbrain from two genetically and phenotypically distinct mouse strains. We distinguish ten cell types based on chromatin profiles and analysis of accessible regions controlling cell identity genes highlights cell type-specific key transcription factors. Regulatory variation segregating the mouse strains manifests more on transcriptome than chromatin level. However, cell type-level data reveals changes not captured at tissue level. To discover the scope and cell-type specificity of cis-acting variation in midbrain gene expression, we identify putative regulatory variants and show them to be enriched at differentially expressed loci. Finally, we find TCF7L2 to mediate trans-acting variation selectively in midbrain neurons. Our dataset provides an extensive resource to study gene regulation in mesencephalon.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
John A. Halsall ◽  
Simon Andrews ◽  
Felix Krueger ◽  
Charlotte E. Rutledge ◽  
Gabriella Ficz ◽  
...  

AbstractChromatin configuration influences gene expression in eukaryotes at multiple levels, from individual nucleosomes to chromatin domains several Mb long. Post-translational modifications (PTM) of core histones seem to be involved in chromatin structural transitions, but how remains unclear. To explore this, we used ChIP-seq and two cell types, HeLa and lymphoblastoid (LCL), to define how changes in chromatin packaging through the cell cycle influence the distributions of three transcription-associated histone modifications, H3K9ac, H3K4me3 and H3K27me3. We show that chromosome regions (bands) of 10–50 Mb, detectable by immunofluorescence microscopy of metaphase (M) chromosomes, are also present in G1 and G2. They comprise 1–5 Mb sub-bands that differ between HeLa and LCL but remain consistent through the cell cycle. The same sub-bands are defined by H3K9ac and H3K4me3, while H3K27me3 spreads more widely. We found little change between cell cycle phases, whether compared by 5 Kb rolling windows or when analysis was restricted to functional elements such as transcription start sites and topologically associating domains. Only a small number of genes showed cell-cycle related changes: at genes encoding proteins involved in mitosis, H3K9 became highly acetylated in G2M, possibly because of ongoing transcription. In conclusion, modified histone isoforms H3K9ac, H3K4me3 and H3K27me3 exhibit a characteristic genomic distribution at resolutions of 1 Mb and below that differs between HeLa and lymphoblastoid cells but remains remarkably consistent through the cell cycle. We suggest that this cell-type-specific chromosomal bar-code is part of a homeostatic mechanism by which cells retain their characteristic gene expression patterns, and hence their identity, through multiple mitoses.


1985 ◽  
Vol 5 (2) ◽  
pp. 419-421
Author(s):  
K M Zezulak ◽  
H Green

During the differentiation of preadipose 3T3 cells into adipose cells, the mRNAs for three proteins increase strikingly in abundance. To determine the degree of cell-type specificity in the expression of these mRNAs, we estimated their abundances in several nonadipose tissues of the mouse. None of these mRNAs was strictly confined to adipocytes, but the ensemble of three mRNAs was rather specific to adipocytes. Insofar as is revealed by these three markers, the distinctive phenotype of adipocytes is the result of the enhanced expression of a number of genes, none of which is completely silent in all other cell types.


eLife ◽  
2019 ◽  
Vol 8 ◽  
Author(s):  
Sinisa Hrvatin ◽  
Christopher P Tzeng ◽  
M Aurel Nagy ◽  
Hume Stroud ◽  
Charalampia Koutsioumpa ◽  
...  

Enhancers are the primary DNA regulatory elements that confer cell type specificity of gene expression. Recent studies characterizing individual enhancers have revealed their potential to direct heterologous gene expression in a highly cell-type-specific manner. However, it has not yet been possible to systematically identify and test the function of enhancers for each of the many cell types in an organism. We have developed PESCA, a scalable and generalizable method that leverages ATAC- and single-cell RNA-sequencing protocols, to characterize cell-type-specific enhancers that should enable genetic access and perturbation of gene function across mammalian cell types. Focusing on the highly heterogeneous mammalian cerebral cortex, we apply PESCA to find enhancers and generate viral reagents capable of accessing and manipulating a subset of somatostatin-expressing cortical interneurons with high specificity. This study demonstrates the utility of this platform for developing new cell-type-specific viral reagents, with significant implications for both basic and translational research.


2012 ◽  
Vol 93 (5) ◽  
pp. 1046-1058 ◽  
Author(s):  
James C. Towler ◽  
Bahram Ebrahimi ◽  
Brian Lane ◽  
Andrew J. Davison ◽  
Derrick J. Dargan

Broad cell tropism contributes to the pathogenesis of human cytomegalovirus (HCMV), but the extent to which cell type influences HCMV gene expression is unclear. A bespoke HCMV DNA microarray was used to monitor the transcriptome activity of the low passage Merlin strain of HCMV at 12, 24, 48 and 72 h post-infection, during a single round of replication in human fetal foreskin fibroblast cells (HFFF-2s), human retinal pigmented epithelial cells (RPE-1s) and human astrocytoma cells (U373MGs). In order to correlate transcriptome activity with concurrent biological responses, viral cytopathic effect, growth kinetics and genomic loads were examined in the three cell types. The temporal expression pattern of viral genes was broadly similar in HFFF-2s and RPE-1s, but dramatically different in U373MGs. Of the 165 known HCMV protein-coding genes, 41 and 48 were differentially regulated in RPE-1s and U373MGs, respectively, compared with HFFF-2s, and 22 of these were differentially regulated in both RPE-1s and U373MGs. In RPE-1s, all differentially regulated genes were downregulated, but, in U373MGs, some were down- and others upregulated. Differentially regulated genes were identified among the immediate-early, early, early late and true-late viral gene classes. Grouping of downregulated genes according to function at landmark stages of the replication cycle led to the identification of potential bottleneck stages (genome replication, virion assembly, and virion maturation and release) that may account for cell type-dependent viral growth kinetics. The possibility that cell type-specific differences in expressed cellular factors are responsible for modulation of viral gene expression is discussed.


2022 ◽  
Author(s):  
Takaho Tsuchiya ◽  
Hiroki Hori ◽  
Haruka Ozaki

Motivation: Cell-cell communications regulate internal cellular states of the cell, e.g., gene expression and cell functions, and play pivotal roles in normal development and disease states. Furthermore, single-cell RNA sequencing methods have revealed cell-to-cell expression variability of highly variable genes (HVGs), which is also crucial. Nevertheless, the regulation on cell-to-cell expression variability of HVGs via cell-cell communications is still unexplored. The recent advent of spatial transcriptome measurement methods has linked gene expression profiles to the spatial context of single cells, which has provided opportunities to reveal those regulations. The existing computational methods extract genes with expression levels that are influenced by neighboring cell types based on the spatial transcriptome data. However, limitations remain in the quantitativeness and interpretability: it neither focuses on HVGs, considers cooperation of neighboring cell types, nor quantifies the degree of regulation with each neighboring cell type. Results: Here, we propose CCPLS (Cell-Cell communications analysis by Partial Least Square regression modeling), which is a statistical framework for identifying cell-cell communications as the effects of multiple neighboring cell types on cell-to-cell expression variability of HVGs, based on the spatial transcriptome data. For each cell type, CCPLS performs PLS regression modeling and reports coefficients as the quantitative index of the cell-cell communications. Evaluation using simulated data showed our method accurately estimated effects of multiple neighboring cell types on HVGs. Furthermore, by applying CCPLS to the two real datasets, we demonstrate CCPLS can be used to extract biologically interpretable insights from the inferred cell-cell communications.


2019 ◽  
Author(s):  
Andre Macedo ◽  
Alisson M. Gontijo

The human body is made up of hundreds, perhaps thousands of cell types and states, most of which are currently inaccessible genetically. Genetic accessibility carries significant diagnostic and therapeutic potential by allowing the selective delivery of genetic messages or cures to cells. Research in model organisms has shown that single regulatory element (RE) activities are seldom cell type specific, limiting their usage in genetic systems designed to restrict gene expression posteriorly to their delivery to cells. Intersectional genetic approaches can increase the number of genetically accessible cells. A typical intersectional method acts like an AND logic gate by converting the input of two or more active REs into a single synthetic output, which becomes unique for that cell. Here, we systematically assessed the intersectional genetics landscape of human using a curated subset of cells from a large RE usage atlas obtained by Cap Analysis of Gene Expression Sequencing (CAGE-Seq) of thousands of primary and cancer cells (the FANTOM5 consortium atlas). We developed the heuristics and algorithms to retrieve and quality rank AND gate intersections intra- and inter-individually. We find that >90% of the 154 primary cell types surveyed can be distinguished from each other with as little as 3 to 4 active REs, with quantifiable safety and robustness. We call these minimal intersections of active REs with cell-type diagnostic potential “Versatile Entry Codes” (VEnCodes). We show that VEnCodes could be found for 100% of the 158 cancer cell types surveyed, and that most of these are highly robust to intra- and interindividual variation. Our tools for generating and quality-ranking VEnCodes can be adapted to other RE usage databases and to other intersectional methods using alternative Boolean logic operations. Our work demonstrate the potential of intersectional approaches for future gene delivery technologies in human.


2019 ◽  
Author(s):  
Matthew N. Bernstein ◽  
Zhongjie Ma ◽  
Michael Gleicher ◽  
Colin N. Dewey

SummaryCell type annotation is a fundamental task in the analysis of single-cell RNA-sequencing data. In this work, we present CellO, a machine learning-based tool for annotating human RNA-seq data with the Cell Ontology. CellO enables accurate and standardized cell type classification by considering the rich hierarchical structure of known cell types, a source of prior knowledge that is not utilized by existing methods. Furthemore, CellO comes pre-trained on a novel, comprehensive dataset of human, healthy, untreated primary samples in the Sequence Read Archive, which to the best of our knowledge, is the most diverse curated collection of primary cell data to date. CellO’s comprehensive training set enables it to run out-of-the-box on diverse cell types and achieves superior or competitive performance when compared to existing state-of-the-art methods. Lastly, CellO’s linear models are easily interpreted, thereby enabling exploration of cell type-specific expression signatures across the ontology. To this end, we also present the CellO Viewer: a web application for exploring CellO’s models across the ontology.HighlightWe present CellO, a tool for hierarchically classifying cell type from single-cell RNA-seq data against the graph-structured Cell OntologyCellO is pre-trained on a comprehensive dataset comprising nearly all bulk RNA-seq primary cell samples in the Sequence Read ArchiveCellO achieves superior or comparable performance with existing methods while featuring a more comprehensive pre-packaged training setCellO is built with easily interpretable models which we expose through a novel web application, the CellO Viewer, for exploring cell type-specific signatures across the Cell OntologyGraphical Abstract


Sign in / Sign up

Export Citation Format

Share Document