scholarly journals Inference of cell-type specific imprinted regulatory elements and genes during human neuronal differentiation

2021 ◽  
Author(s):  
Dan Liang ◽  
Nil Aygün ◽  
Nana Matoba ◽  
Folami Ideraabdullah ◽  
Michael I Love ◽  
...  

Genomic imprinting results in gene expression biased by parental chromosome of origin and occurs in genes with important roles during human brain development. However, the cell-type and temporal specificity of imprinting during human neurogenesis is generally unknown. By detecting within-donor allelic biases in chromatin accessibility and gene expression that are unrelated to cross-donor genotype, we inferred imprinting in both primary human neural progenitor cells (phNPCs) and their differentiated neuronal progeny from up to 85 donors. We identified 43/20 putatively imprinted regulatory elements (IREs) in neurons/progenitors, and 133/79 putatively imprinted genes in neurons/progenitors. Though 10 IREs and 42 genes were shared between neurons and progenitors, most imprinting was only detected within specific cell types. In addition to well-known imprinted genes and their promoters, we inferred novel IREs and imprinted genes. We found IREs overlapped with CpG islands more than non-imprinted regulatory elements. Consistent with DNA methylation-based regulation of imprinted expression, some putatively imprinted regulatory elements also overlapped with differentially methylated regions on the maternal germline. Finally, we identified a progenitor-specific putatively imprinted gene overlap with copy number variation that is associated with uniparental disomy-like phenotypes. Our results can therefore be useful in interpreting the function of variants identified in future parent-of-origin association studies.

eLife ◽  
2019 ◽  
Vol 8 ◽  
Author(s):  
Sinisa Hrvatin ◽  
Christopher P Tzeng ◽  
M Aurel Nagy ◽  
Hume Stroud ◽  
Charalampia Koutsioumpa ◽  
...  

Enhancers are the primary DNA regulatory elements that confer cell type specificity of gene expression. Recent studies characterizing individual enhancers have revealed their potential to direct heterologous gene expression in a highly cell-type-specific manner. However, it has not yet been possible to systematically identify and test the function of enhancers for each of the many cell types in an organism. We have developed PESCA, a scalable and generalizable method that leverages ATAC- and single-cell RNA-sequencing protocols, to characterize cell-type-specific enhancers that should enable genetic access and perturbation of gene function across mammalian cell types. Focusing on the highly heterogeneous mammalian cerebral cortex, we apply PESCA to find enhancers and generate viral reagents capable of accessing and manipulating a subset of somatostatin-expressing cortical interneurons with high specificity. This study demonstrates the utility of this platform for developing new cell-type-specific viral reagents, with significant implications for both basic and translational research.


2020 ◽  
Vol 29 (11) ◽  
pp. 1922-1932
Author(s):  
Priyanka Nandakumar ◽  
Dongwon Lee ◽  
Thomas J Hoffmann ◽  
Georg B Ehret ◽  
Dan Arking ◽  
...  

Abstract Hundreds of loci have been associated with blood pressure (BP) traits from many genome-wide association studies. We identified an enrichment of these loci in aorta and tibial artery expression quantitative trait loci in our previous work in ~100 000 Genetic Epidemiology Research on Aging study participants. In the present study, we sought to fine-map known loci and identify novel genes by determining putative regulatory regions for these and other tissues relevant to BP. We constructed maps of putative cis-regulatory elements (CREs) using publicly available open chromatin data for the heart, aorta and tibial arteries, and multiple kidney cell types. Variants within these regions may be evaluated quantitatively for their tissue- or cell-type-specific regulatory impact using deltaSVM functional scores, as described in our previous work. We aggregate variants within these putative CREs within 50 Kb of the start or end of ‘expressed’ genes in these tissues or cell types using public expression data and use deltaSVM scores as weights in the group-wise sequence kernel association test to identify candidates. We test for association with both BP traits and expression within these tissues or cell types of interest and identify the candidates MTHFR, C10orf32, CSK, NOV, ULK4, SDCCAG8, SCAMP5, RPP25, HDGFRP3, VPS37B and PPCDC. Additionally, we examined two known QT interval genes, SCN5A and NOS1AP, in the Atherosclerosis Risk in Communities Study, as a positive control, and observed the expected heart-specific effect. Thus, our method identifies variants and genes for further functional testing using tissue- or cell-type-specific putative regulatory information.


2019 ◽  
Author(s):  
Priyanka Nandakumar ◽  
Dongwon Lee ◽  
Thomas J. Hoffmann ◽  
Georg B. Ehret ◽  
Dan Arking ◽  
...  

AbstractHundreds of loci have been associated with blood pressure traits from many genome-wide association studies. We identified an enrichment of these loci in aorta and tibial artery expression quantitative trait loci in our previous work in ∼100,000 Genetic Epidemiology Research on Aging (GERA) study participants. In the present study, we subsequently focused on determining putative regulatory regions for these and other tissues of relevance to blood pressure, to both fine-map these loci by pinpointing genes and variants of functional interest within them, and to identify any novel genes.We constructed maps of putative cis-regulatory elements using publicly available open chromatin data for the heart, aorta and tibial arteries, and multiple kidney cell types. Sequence variants within these regions may be evaluated quantitatively for their tissue- or cell-type-specific regulatory impact using deltaSVM functional scores, as described in our previous work. In order to identify genes of interest, we aggregate these variants in these putative cis-regulatory elements within 50Kb of the start or end of genes considered as “expressed” in these tissues or cell types using publicly available gene expression data, and use the deltaSVM scores as weights in the well-known group-wise sequence kernel association test (SKAT). We test for association with both blood pressure traits as well as expression within these tissues or cell types of interest, and identify several genes, including MTHFR, C10orf32, CSK, NOV, ULK4, SDCCAG8, SCAMP5, RPP25, HDGFRP3, VPS37B, and PPCDC. Although our study centers on blood pressure traits, we additionally examined two known genes, SCN5A and NOS1AP involved in the cardiac trait QT interval, in the Atherosclerosis Risk in Communities Study (ARIC), as a positive control, and observed an expected heart-specific effect. Thus, our method may be used to identify variants and genes for further functional testing using tissue- or cell-type-specific putative regulatory information.Author SummarySequence change in genes (“variants”) are linked to the presence and severity of different traits or diseases. However, as genes may be expressed in different tissues and at different times and degrees, using this information is expected to more accurately identify genes of interest. Variants within the genes are essential, but also in the sequences (“regulatory elements”) that control the genes’ expression in different tissues or cell types. In this study, we aim to use this information about expression and variants potentially involved in gene expression regulation to better pinpoint genes and variants in regulatory elements of interest for blood pressure regulation. We do so by taking advantage of such data that are publicly available, and use methods to combine information about variants in aggregate within a gene’s putative regulatory elements in tissues thought to be relevant for blood pressure, and identify several genes, meant to enable experimental follow-up.


2021 ◽  
Author(s):  
Jinyue Liao ◽  
Hoi Ching Suen ◽  
Shitao Rao ◽  
Alfred Chun Shui Luk ◽  
Ruoyu Zhang ◽  
...  

AbstractSpermatogenesis depends on an orchestrated series of developing events in germ cells and full maturation of the somatic microenvironment. To date, the majority of efforts to study cellular heterogeneity in testis has been focused on single-cell gene expression rather than the chromatin landscape shaping gene expression. To advance our understanding of the regulatory programs underlying testicular cell types, we analyzed single-cell chromatin accessibility profiles in more than 25,000 cells from mouse developing testis. We showed that scATAC-Seq allowed us to deconvolve distinct cell populations and identify cis-regulatory elements (CREs) underlying cell type specification. We identified sets of transcription factors associated with cell type-specific accessibility, revealing novel regulators of cell fate specification and maintenance. Pseudotime reconstruction revealed detailed regulatory dynamics coordinating the sequential developmental progressions of germ cells and somatic cells. This high-resolution data also revealed putative stem cells within the Sertoli and Leydig cell populations. Further, we defined candidate target cell types and genes of several GWAS signals, including those associated with testosterone levels and coronary artery disease. Collectively, our data provide a blueprint of the ‘regulon’ of the mouse male germline and supporting somatic cells.


2021 ◽  
Author(s):  
Julien Bryois ◽  
Daniela Calini ◽  
Will Macnair ◽  
Lynette Foo ◽  
Eduard Urich ◽  
...  

Most expression quantitative trait loci (eQTL) studies to date have been performed in heterogeneous brain tissues as opposed to specific cell types. To investigate the genetics of gene expression in adult human cell types from the central nervous system (CNS), we performed an eQTL analysis using single nuclei RNA-seq from 196 individuals in eight CNS cell types. We identified 6108 eGenes, a substantial fraction (43%, 2620 out of 6108) of which show cell-type specific effects, with strongest effects in microglia. Integration of CNS cell-type eQTLs with GWAS revealed novel relationships between expression and disease risk for neuropsychiatric and neurodegenerative diseases. For most GWAS loci, a single gene colocalized in a single cell type providing new clues into disease etiology. Our findings demonstrate substantial contrast in genetic regulation of gene expression among CNS cell types and reveal genetic mechanisms by which disease risk genes influence neurological disorders.


2020 ◽  
Author(s):  
Yating Liu ◽  
Anthony D. Fischer ◽  
Celine L. St. Pierre ◽  
Juan F. Macias-Velasco ◽  
Heather A. Lawson ◽  
...  

AbstractThe alteration of gene expression due to variations in the sequences of transcriptional regulatory elements has been a focus of substantial inquiry in humans and model organisms. However, less is known about the extent to which natural variation contributes to post-transcriptional regulation. Allelic Expression Imbalance (AEI) is a classical approach for studying the association of specific haplotypes with relative changes in transcript abundance. Here, we piloted a new TRAP based approach to associate genetic variation with transcript occupancy on ribosomes in specific cell types, to determine if it will allow examination of Allelic Translation Imbalance (ATI), and Allelic Translation Efficiency Imbalance, using as a test case mouse astrocytes in vivo. We show that most changes of the mRNA levels on ribosomes were reflected in transcript abundance, though ∼1.5% of transcripts have variants that clearly alter loading onto ribosomes orthogonally to transcript levels. These variants were often in conserved residues and altered sequences known to regulate translation such as upstream ORFs, PolyA sites, and predicted miRNA binding sites. Such variants were also common in transcripts showing altered abundance, suggesting some genetic regulation of gene expression may function through post-transcriptional mechanisms. Overall, our work shows that naturally occurring genetic variants can impact ribosome occupancy in astrocytes in vivo and suggests that mechanisms may also play a role in genetic contributions to disease.


Author(s):  
Zhen Miao ◽  
Michael S. Balzer ◽  
Ziyuan Ma ◽  
Hongbo Liu ◽  
Junnan Wu ◽  
...  

AbstractDetermining the epigenetic program that generates unique cell types in the kidney is critical for understanding cell-type heterogeneity during tissue homeostasis and injury response.Here, we profiled open chromatin and gene expression in developing and adult mouse kidneys at single cell resolution. We show critical reliance of gene expression on distal regulatory elements (enhancers). We define key cell type-specific transcription factors and major gene-regulatory circuits for kidney cells. Dynamic chromatin and expression changes during nephron progenitor differentiation demonstrated that podocyte commitment occurs early and is associated with sustained Foxl1 expression. Renal tubule cells followed a more complex differentiation, where Hfn4a was associated with proximal and Tfap2b with distal fate. Mapping single nucleotide variants associated with human kidney disease identified critical cell types, developmental stages, genes, and regulatory mechanisms.We provide a global single cell resolution view of chromatin accessibility of kidney development. The dataset is available via interactive public websites.


Science ◽  
2020 ◽  
Vol 370 (6518) ◽  
pp. eaba7612 ◽  
Author(s):  
Silvia Domcke ◽  
Andrew J. Hill ◽  
Riza M. Daza ◽  
Junyue Cao ◽  
Diana R. O’Day ◽  
...  

The chromatin landscape underlying the specification of human cell types is of fundamental interest. We generated human cell atlases of chromatin accessibility and gene expression in fetal tissues. For chromatin accessibility, we devised a three-level combinatorial indexing assay and applied it to 53 samples representing 15 organs, profiling ~800,000 single cells. We leveraged cell types defined by gene expression to annotate these data and cataloged hundreds of thousands of candidate regulatory elements that exhibit cell type–specific chromatin accessibility. We investigated the properties of lineage-specific transcription factors (such as POU2F1 in neurons), organ-specific specializations of broadly distributed cell types (such as blood and endothelial), and cell type–specific enrichments of complex trait heritability. These data represent a rich resource for the exploration of in vivo human gene regulation in diverse tissues and cell types.


2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Kai Kang ◽  
Caizhi Huang ◽  
Yuanyuan Li ◽  
David M. Umbach ◽  
Leping Li

Abstract Background Biological tissues consist of heterogenous populations of cells. Because gene expression patterns from bulk tissue samples reflect the contributions from all cells in the tissue, understanding the contribution of individual cell types to the overall gene expression in the tissue is fundamentally important. We recently developed a computational method, CDSeq, that can simultaneously estimate both sample-specific cell-type proportions and cell-type-specific gene expression profiles using only bulk RNA-Seq counts from multiple samples. Here we present an R implementation of CDSeq (CDSeqR) with significant performance improvement over the original implementation in MATLAB and an added new function to aid cell type annotation. The R package would be of interest for the broader R community. Result We developed a novel strategy to substantially improve computational efficiency in both speed and memory usage. In addition, we designed and implemented a new function for annotating the CDSeq estimated cell types using single-cell RNA sequencing (scRNA-seq) data. This function allows users to readily interpret and visualize the CDSeq estimated cell types. In addition, this new function further allows the users to annotate CDSeq-estimated cell types using marker genes. We carried out additional validations of the CDSeqR software using synthetic, real cell mixtures, and real bulk RNA-seq data from the Cancer Genome Atlas (TCGA) and the Genotype-Tissue Expression (GTEx) project. Conclusions The existing bulk RNA-seq repositories, such as TCGA and GTEx, provide enormous resources for better understanding changes in transcriptomics and human diseases. They are also potentially useful for studying cell–cell interactions in the tissue microenvironment. Bulk level analyses neglect tissue heterogeneity, however, and hinder investigation of a cell-type-specific expression. The CDSeqR package may aid in silico dissection of bulk expression data, enabling researchers to recover cell-type-specific information.


2019 ◽  
Author(s):  
Alexander J. Cammack ◽  
Arnav Moudgil ◽  
Tomas Lagunas ◽  
Michael J. Vasek ◽  
Mark Shabsovich ◽  
...  

AbstractTranscription factors (TFs) play a central role in the regulation of gene expression, controlling everything from cell fate decisions to activity dependent gene expression. However, widely-used methods for TF profiling in vivo (e.g. ChIP-seq) yield only an aggregated picture of TF binding across all cell types present within the harvested tissue; thus, it is challenging or impossible to determine how the same TF might bind different portions of the genome in different cell types, or even to identify its binding events at all in rare cell types in a complex tissue such as the brain. Here we present a versatile methodology, FLEX Calling Cards, for the mapping of TF occupancy in specific cell types from heterogenous tissues. In this method, the TF of interest is fused to a hyperactive piggyBac transposase (hypPB), and this bipartite gene is delivered, along with donor transposons, to mouse tissue via a Cre-dependent adeno-associated virus (AAV). The fusion protein is expressed in Cre-expressing cells where it inserts transposon “Calling Cards” near to TF binding sites. These transposons permanently mark TF binding events and can be mapped using high-throughput sequencing. Alternatively, unfused hypPB interacts with and records the binding of the super enhancer (SE)-associated bromodomain protein, Brd4. To demonstrate the FLEX Calling Card method, we first show that donor transposon and transposase constructs can be efficiently delivered to the postnatal day 1 (P1) mouse brain with AAV and that insertion profiles report TF occupancy. Then, using a Cre-dependent hypPB virus, we show utility of this tool in defining cell type-specific TF profiles in multiple cell types of the brain. This approach will enable important cell type-specific studies of TF-mediated gene regulation in the brain and will provide valuable insights into brain development, homeostasis, and disease.


Sign in / Sign up

Export Citation Format

Share Document