scholarly journals Epiphany: predicting Hi-C contact maps from 1D epigenomic signals

2021 ◽  
Author(s):  
Rui Yang ◽  
Arnav Das ◽  
Vianne R. Gao ◽  
Alireza Karbalayghareh ◽  
William S. Noble ◽  
...  

AbstractRecent deep learning models that predict the Hi-C contact map from DNA sequence achieve promising accuracy but cannot generalize to new cell types and indeed do not capture cell-type-specific differences among training cell types. We propose Epiphany, a neural network to predict cell-type-specific Hi-C contact maps from five epigenomic tracks that are already available in hundreds of cell types and tissues: DNase I hypersensitive sites and ChIP-seq for CTCF, H3K27ac, H3K27me3, and H3K4me3. Epiphany uses 1D convolutional layers to learn local representations from the input tracks, a bidirectional long short-term memory (Bi-LSTM) layers to capture long term dependencies along the epigenome, as well as a generative adversarial network (GAN) architecture to encourage contact map realism. To improve the usability of predicted contact matrices, we trained and evaluated models using multiple normalization and matrix balancing techniques including KR, ICE, and HiC-DC+ Z-score and observed-over-expected count ratio. Epiphany is trained with a combination of MSE and adversarial (i.a., a GAN) loss to enhance its ability to produce realistic Hi-C contact maps for downstream analysis. Epiphany shows robust performance and generalization to held-out chromosomes within and across cell types and species, and its predicted contact matrices yield accurate TAD and significant interaction calls. At inference time, Epiphany can be used to study the contribution of specific epigenomic peaks to 3D architecture and to predict the structural changes caused by perturbations of epigenomic signals.

2020 ◽  
Author(s):  
Michael C. Dimmick ◽  
Leo J. Lee ◽  
Brendan J. Frey

AbstractMotivationHi-C data has enabled the genome-wide study of chromatin folding and architecture, and has led to important discoveries in the structure and function of chromatin conformation. Here, high resolution data plays a particularly important role as many chromatin substructures such as Topologically Associating Domains (TADs) and chromatin loops cannot be adequately studied with low resolution contact maps. However, the high sequencing costs associated with the generation of high resolution Hi-C data has become an experimental barrier. Data driven machine learning models, which allow low resolution Hi-C data to be computationally enhanced, offer a promising avenue to address this challenge.ResultsBy carefully examining the properties of Hi-C maps and integrating various recent advances in deep learning, we developed a Hi-C Super-Resolution (HiCSR) framework capable of accurately recovering the fine details, textures, and substructures found in high resolution contact maps. This was achieved using a novel loss function tailored to the Hi-C enhancement problem which optimizes for an adversarial loss from a Generative Adversarial Network (GAN), a feature reconstruction loss derived from the latent representation of a denoising autoencoder, and a pixel-wise loss. Not only can the resulting framework generate enhanced Hi-C maps more visually similar to the original high resolution maps, it also excels on a suite of reproducibility metrics produced by members of the ENCODE Consortium compared to existing approaches, including HiCPlus, HiCNN, hicGAN and DeepHiC. Finally, we demonstrate that HiCSR is capable of enhancing Hi-C data across sequencing depth, cell types, and species, recovering biologically significant contact domain boundaries.AvailabilityWe make our implementation available for download at: https://github.com/PSI-Lab/[email protected] informationAvailable Online


2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Luca Ducoli ◽  
Saumya Agrawal ◽  
Chung-Chau Hon ◽  
Jordan A. Ramilowski ◽  
Eliane Sibler ◽  
...  

Abstract Background The lymphatic and the blood vasculature are closely related systems that collaborate to ensure the organism’s physiological function. Despite their common developmental origin, they present distinct functional fates in adulthood that rely on robust lineage-specific regulatory programs. The recent technological boost in sequencing approaches unveiled long noncoding RNAs (lncRNAs) as prominent regulatory players of various gene expression levels in a cell-type-specific manner. Results To investigate the potential roles of lncRNAs in vascular biology, we performed antisense oligonucleotide (ASO) knockdowns of lncRNA candidates specifically expressed either in human lymphatic or blood vascular endothelial cells (LECs or BECs) followed by Cap Analysis of Gene Expression (CAGE-Seq). Here, we describe the quality control steps adopted in our analysis pipeline before determining the knockdown effects of three ASOs per lncRNA target on the LEC or BEC transcriptomes. In this regard, we especially observed that the choice of negative control ASOs can dramatically impact the conclusions drawn from the analysis depending on the cellular background. Conclusion In conclusion, the comparison of negative control ASO effects on the targeted cell type transcriptomes highlights the essential need to select a proper control set of multiple negative control ASO based on the investigated cell types.


Author(s):  
Emma Puighermanal ◽  
Emmanuel Valjent

Addictive drugs trigger persistent synaptic and structural changes in the neuronal reward circuits that are thought to underlie the development of drug-adaptive behavior. While transcriptional and epigenetic modifications are known to contribute to these circuit changes, accumulating evidence indicates that altered mRNA translation is also a key molecular mechanism. This chapter reviews recent studies demonstrating how addictive drugs alter protein synthesis and/or the translational machinery and how this leads to neuronal circuit remodeling and behavioral changes. Future work will determine precisely which neuronal circuits and cell types participate in these changes. The chapter summarizes current methodologies for identifying cell type-specific mRNAs whose translation is affected by drugs of abuse and gives recent examples of the mechanistic insights into addiction they provide.


2021 ◽  
Vol 12 (1) ◽  
Author(s):  
Houri Hintiryan ◽  
Ian Bowman ◽  
David L. Johnson ◽  
Laura Korobkova ◽  
Muye Zhu ◽  
...  

AbstractThe basolateral amygdalar complex (BLA) is implicated in behaviors ranging from fear acquisition to addiction. Optogenetic methods have enabled the association of circuit-specific functions to uniquely connected BLA cell types. Thus, a systematic and detailed connectivity profile of BLA projection neurons to inform granular, cell type-specific interrogations is warranted. Here, we apply machine-learning based computational and informatics analysis techniques to the results of circuit-tracing experiments to create a foundational, comprehensive BLA connectivity map. The analyses identify three distinct domains within the anterior BLA (BLAa) that house target-specific projection neurons with distinguishable morphological features. We identify brain-wide targets of projection neurons in the three BLAa domains, as well as in the posterior BLA, ventral BLA, posterior basomedial, and lateral amygdalar nuclei. Inputs to each nucleus also are identified via retrograde tracing. The data suggests that connectionally unique, domain-specific BLAa neurons are associated with distinct behavior networks.


Author(s):  
Hee-Dae Kim ◽  
Jing Wei ◽  
Tanessa Call ◽  
Nicole Teru Quintus ◽  
Alexander J. Summers ◽  
...  

AbstractDepression is the leading cause of disability and produces enormous health and economic burdens. Current treatment approaches for depression are largely ineffective and leave more than 50% of patients symptomatic, mainly because of non-selective and broad action of antidepressants. Thus, there is an urgent need to design and develop novel therapeutics to treat depression. Given the heterogeneity and complexity of the brain, identification of molecular mechanisms within specific cell-types responsible for producing depression-like behaviors will advance development of therapies. In the reward circuitry, the nucleus accumbens (NAc) is a key brain region of depression pathophysiology, possibly based on differential activity of D1- or D2- medium spiny neurons (MSNs). Here we report a circuit- and cell-type specific molecular target for depression, Shisa6, recently defined as an AMPAR component, which is increased only in D1-MSNs in the NAc of susceptible mice. Using the Ribotag approach, we dissected the transcriptional profile of D1- and D2-MSNs by RNA sequencing following a mouse model of depression, chronic social defeat stress (CSDS). Bioinformatic analyses identified cell-type specific genes that may contribute to the pathogenesis of depression, including Shisa6. We found selective optogenetic activation of the ventral tegmental area (VTA) to NAc circuit increases Shisa6 expression in D1-MSNs. Shisa6 is specifically located in excitatory synapses of D1-MSNs and increases excitability of neurons, which promotes anxiety- and depression-like behaviors in mice. Cell-type and circuit-specific action of Shisa6, which directly modulates excitatory synapses that convey aversive information, identifies the protein as a potential rapid-antidepressant target for aberrant circuit function in depression.


2021 ◽  
Vol 12 (1) ◽  
Author(s):  
Jiao Li ◽  
Jakob Seidlitz ◽  
John Suckling ◽  
Feiyang Fan ◽  
Gong-Jun Ji ◽  
...  

AbstractMajor depressive disorder (MDD) has been shown to be associated with structural abnormalities in a variety of spatially diverse brain regions. However, the correlation between brain structural changes in MDD and gene expression is unclear. Here, we examine the link between brain-wide gene expression and morphometric changes in individuals with MDD, using neuroimaging data from two independent cohorts and a publicly available transcriptomic dataset. Morphometric similarity network (MSN) analysis shows replicable cortical structural differences in individuals with MDD compared to control subjects. Using human brain gene expression data, we observe that the expression of MDD-associated genes spatially correlates with MSN differences. Analysis of cell type-specific signature genes suggests that microglia and neuronal specific transcriptional changes account for most of the observed correlation with MDD-specific MSN differences. Collectively, our findings link molecular and structural changes relevant for MDD.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
John A. Halsall ◽  
Simon Andrews ◽  
Felix Krueger ◽  
Charlotte E. Rutledge ◽  
Gabriella Ficz ◽  
...  

AbstractChromatin configuration influences gene expression in eukaryotes at multiple levels, from individual nucleosomes to chromatin domains several Mb long. Post-translational modifications (PTM) of core histones seem to be involved in chromatin structural transitions, but how remains unclear. To explore this, we used ChIP-seq and two cell types, HeLa and lymphoblastoid (LCL), to define how changes in chromatin packaging through the cell cycle influence the distributions of three transcription-associated histone modifications, H3K9ac, H3K4me3 and H3K27me3. We show that chromosome regions (bands) of 10–50 Mb, detectable by immunofluorescence microscopy of metaphase (M) chromosomes, are also present in G1 and G2. They comprise 1–5 Mb sub-bands that differ between HeLa and LCL but remain consistent through the cell cycle. The same sub-bands are defined by H3K9ac and H3K4me3, while H3K27me3 spreads more widely. We found little change between cell cycle phases, whether compared by 5 Kb rolling windows or when analysis was restricted to functional elements such as transcription start sites and topologically associating domains. Only a small number of genes showed cell-cycle related changes: at genes encoding proteins involved in mitosis, H3K9 became highly acetylated in G2M, possibly because of ongoing transcription. In conclusion, modified histone isoforms H3K9ac, H3K4me3 and H3K27me3 exhibit a characteristic genomic distribution at resolutions of 1 Mb and below that differs between HeLa and lymphoblastoid cells but remains remarkably consistent through the cell cycle. We suggest that this cell-type-specific chromosomal bar-code is part of a homeostatic mechanism by which cells retain their characteristic gene expression patterns, and hence their identity, through multiple mitoses.


2021 ◽  
Vol 19 (1) ◽  
Author(s):  
Jinting Guan ◽  
Yiping Lin ◽  
Yang Wang ◽  
Junchao Gao ◽  
Guoli Ji

Abstract Background Genome-wide association studies have identified genetic variants associated with the risk of brain-related diseases, such as neurological and psychiatric disorders, while the causal variants and the specific vulnerable cell types are often needed to be studied. Many disease-associated genes are expressed in multiple cell types of human brains, while the pathologic variants affect primarily specific cell types. We hypothesize a model in which what determines the manifestation of a disease in a cell type is the presence of disease module comprised of disease-associated genes, instead of individual genes. Therefore, it is essential to identify the presence/absence of disease gene modules in cells. Methods To characterize the cell type-specificity of brain-related diseases, we construct human brain cell type-specific gene interaction networks integrating human brain nucleus gene expression data with a referenced tissue-specific gene interaction network. Then from the cell type-specific gene interaction networks, we identify significant cell type-specific disease gene modules by performing statistical tests. Results Between neurons and glia cells, the constructed cell type-specific gene networks and their gene functions are distinct. Then we identify cell type-specific disease gene modules associated with autism spectrum disorder and find that different gene modules are formed and distinct gene functions may be dysregulated in different cells. We also study the similarity and dissimilarity in cell type-specific disease gene modules among autism spectrum disorder, schizophrenia and bipolar disorder. The functions of neurons-specific disease gene modules are associated with synapse for all three diseases, while those in glia cells are different. To facilitate the use of our method, we develop an R package, CtsDGM, for the identification of cell type-specific disease gene modules. Conclusions The results support our hypothesis that a disease manifests itself in a cell type through forming a statistically significant disease gene module. The identification of cell type-specific disease gene modules can promote the development of more targeted biomarkers and treatments for the disease. Our method can be applied for depicting the cell type heterogeneity of a given disease, and also for studying the similarity and dissimilarity between different disorders, providing new insights into the molecular mechanisms underlying the pathogenesis and progression of diseases.


1989 ◽  
Vol 92 (2) ◽  
pp. 231-239
Author(s):  
P.I. Francz ◽  
K. Bayreuther ◽  
H.P. Rodemann

Methods for the selective enrichment of various subpopulations of the human skin fibroblast cell line HH-8 have been developed. These methods permit the selection of homogeneous populations of the three mitotic fibroblast cell types MF I, II and III, and the four postmitotic cell types PMF IV, V, VI and VII. These seven cell types exhibit differentiation-dependent and cell-type-specific patterns of [35S]methionine-labelled polypeptides in total soluble cytoplasmic and nuclear proteins, also in membrane-bound proteins, and in secreted proteins. In the differentiation sequence MF II-MF III-PMF IV - PMF V - PMF VI 14 cell-type-specific marker proteins have been found in the cytoplasmic and nuclear fraction, also 24 cell-type-specific marker proteins have been found in the membrane-bound protein fraction, and 11 cell-type-specific marker proteins in the secreted protein fraction. Markers in spontaneously arising and experimentally selected or induced populations of a single fibroblast cell type were found to be identical.


eLife ◽  
2019 ◽  
Vol 8 ◽  
Author(s):  
Sinisa Hrvatin ◽  
Christopher P Tzeng ◽  
M Aurel Nagy ◽  
Hume Stroud ◽  
Charalampia Koutsioumpa ◽  
...  

Enhancers are the primary DNA regulatory elements that confer cell type specificity of gene expression. Recent studies characterizing individual enhancers have revealed their potential to direct heterologous gene expression in a highly cell-type-specific manner. However, it has not yet been possible to systematically identify and test the function of enhancers for each of the many cell types in an organism. We have developed PESCA, a scalable and generalizable method that leverages ATAC- and single-cell RNA-sequencing protocols, to characterize cell-type-specific enhancers that should enable genetic access and perturbation of gene function across mammalian cell types. Focusing on the highly heterogeneous mammalian cerebral cortex, we apply PESCA to find enhancers and generate viral reagents capable of accessing and manipulating a subset of somatostatin-expressing cortical interneurons with high specificity. This study demonstrates the utility of this platform for developing new cell-type-specific viral reagents, with significant implications for both basic and translational research.


Sign in / Sign up

Export Citation Format

Share Document