scholarly journals H3K9me3-heterochromatin loss at protein-coding genes enables developmental lineage specification

Science ◽  
2019 ◽  
Vol 363 (6424) ◽  
pp. 294-297 ◽  
Author(s):  
Dario Nicetto ◽  
Greg Donahue ◽  
Tanya Jain ◽  
Tao Peng ◽  
Simone Sidoli ◽  
...  

Gene silencing by chromatin compaction is integral to establishing and maintaining cell fates. Trimethylated histone 3 lysine 9 (H3K9me3)–marked heterochromatin is reduced in embryonic stem cells compared to differentiated cells. However, the establishment and dynamics of closed regions of chromatin at protein-coding genes, in embryologic development, remain elusive. We developed an antibody-independent method to isolate and map compacted heterochromatin from low–cell number samples. We discovered high levels of compacted heterochromatin, H3K9me3-decorated, at protein-coding genes in early, uncommitted cells at the germ-layer stage, undergoing profound rearrangements and reduction upon differentiation, concomitant with cell type–specific gene expression. Perturbation of the three H3K9me3-related methyltransferases revealed a pivotal role for H3K9me3 heterochromatin during lineage commitment at the onset of organogenesis and for lineage fidelity maintenance.

2019 ◽  
Author(s):  
Wei Fang ◽  
Yi Wen ◽  
Xiangyun Wei

AbstractTissue-specific or cell type-specific transcription of protein-coding genes is controlled by both trans-regulatory elements (TREs) and cis-regulatory elements (CREs). However, it is challenging to identify TREs and CREs, which are unknown for most genes. Here, we describe a protocol for identifying two types of transcription-activating CREs—core promoters and enhancers—of zebrafish photoreceptor type-specific genes. This protocol is composed of three phases: bioinformatic prediction, experimental validation, and characterization of the CREs. To better illustrate the principles and logic of this protocol, we exemplify it with the discovery of the core promoter and enhancer of the mpp5b apical polarity gene (also known as ponli), whose red, green, and blue (RGB) cone-specific transcription requires its enhancer, a member of the rainbow enhancer family. While exemplified with an RGB cone-specific gene, this protocol is general and can be used to identify the core promoters and enhancers of other protein-coding genes.


2021 ◽  
Vol 12 (1) ◽  
Author(s):  
Chet H. Loh ◽  
Siebe van Genesen ◽  
Matteo Perino ◽  
Magnus R. Bark ◽  
Gert Jan C. Veenstra

AbstractPolycomb Repressive Complex 2 (PRC2) is crucial for the coordinated expression of genes during early embryonic development, catalyzing histone H3 lysine 27 trimethylation. Two distinct PRC2 complexes, PRC2.1 and PRC2.2, contain respectively MTF2 and JARID2 in embryonic stem cells (ESCs). In this study, we explored their roles in lineage specification and commitment, using single-cell transcriptomics and mouse embryoid bodies derived from Mtf2 and Jarid2 null ESCs. We observe that the loss of Mtf2 results in enhanced and faster differentiation towards cell fates from all germ layers, while the Jarid2 null cells are predominantly directed towards early differentiating precursors, with reduced efficiency towards mesendodermal lineages. These effects are caused by derepression of developmental regulators that are poised for activation in pluripotent cells and gain H3K4me3 at their promoters in the absence of PRC2 repression. Upon lineage commitment, the differentiation trajectories are relatively similar to those of wild-type cells. Together, our results uncover a major role for MTF2-containing PRC2.1 in balancing poised lineage-specific gene activation, whereas the contribution of JARID2-containing PRC2 is more selective in nature compared to MTF2. These data explain how PRC2 imposes thresholds for lineage choice during the exit of pluripotency.


2019 ◽  
Vol 10 (1) ◽  
Author(s):  
Natalie M. Clark ◽  
Eli Buckner ◽  
Adam P. Fisher ◽  
Emily C. Nelson ◽  
Thomas T. Nguyen ◽  
...  

AbstractStem cells are responsible for generating all of the differentiated cells, tissues, and organs in a multicellular organism and, thus, play a crucial role in cell renewal, regeneration, and organization. A number of stem cell type-specific genes have a known role in stem cell maintenance, identity, and/or division. Yet, how genes expressed across different stem cell types, referred to here as stem-cell-ubiquitous genes, contribute to stem cell regulation is less understood. Here, we find that, in the Arabidopsis root, a stem-cell-ubiquitous gene, TESMIN-LIKE CXC2 (TCX2), controls stem cell division by regulating stem cell-type specific networks. Development of a mathematical model of TCX2 expression allows us to show that TCX2 orchestrates the coordinated division of different stem cell types. Our results highlight that genes expressed across different stem cell types ensure cross-communication among cells, allowing them to divide and develop harmonically together.


2020 ◽  
Vol 7 (1) ◽  
Author(s):  
Qingzhen Wei ◽  
Jinglei Wang ◽  
Wuhong Wang ◽  
Tianhua Hu ◽  
Haijiao Hu ◽  
...  

Abstract Eggplant (Solanum melongena L.) is an economically important vegetable crop in the Solanaceae family, with extensive diversity among landraces and close relatives. Here, we report a high-quality reference genome for the eggplant inbred line HQ-1315 (S. melongena-HQ) using a combination of Illumina, Nanopore and 10X genomics sequencing technologies and Hi-C technology for genome assembly. The assembled genome has a total size of ~1.17 Gb and 12 chromosomes, with a contig N50 of 5.26 Mb, consisting of 36,582 protein-coding genes. Repetitive sequences comprise 70.09% (811.14 Mb) of the eggplant genome, most of which are long terminal repeat (LTR) retrotransposons (65.80%), followed by long interspersed nuclear elements (LINEs, 1.54%) and DNA transposons (0.85%). The S. melongena-HQ eggplant genome carries a total of 563 accession-specific gene families containing 1009 genes. In total, 73 expanded gene families (892 genes) and 34 contraction gene families (114 genes) were functionally annotated. Comparative analysis of different eggplant genomes identified three types of variations, including single-nucleotide polymorphisms (SNPs), insertions/deletions (indels) and structural variants (SVs). Asymmetric SV accumulation was found in potential regulatory regions of protein-coding genes among the different eggplant genomes. Furthermore, we performed QTL-seq for eggplant fruit length using the S. melongena-HQ reference genome and detected a QTL interval of 71.29–78.26 Mb on chromosome E03. The gene Smechr0301963, which belongs to the SUN gene family, is predicted to be a key candidate gene for eggplant fruit length regulation. Moreover, we anchored a total of 210 linkage markers associated with 71 traits to the eggplant chromosomes and finally obtained 26 QTL hotspots. The eggplant HQ-1315 genome assembly can be accessed at http://eggplant-hq.cn. In conclusion, the eggplant genome presented herein provides a global view of genomic divergence at the whole-genome level and powerful tools for the identification of candidate genes for important traits in eggplant.


2020 ◽  
Author(s):  
Markus J. Sommer ◽  
Steven L. Salzberg

AbstractLow-cost, high-throughput sequencing has led to an enormous increase in the number of sequenced microbial genomes, with well over 100,000 genomes in public archives today. Automatic genome annotation tools are integral to understanding these organisms, yet older gene finding methods must be retrained on each new genome. We have developed a universal model of prokaryotic genes by fitting a temporal convolutional network to amino-acid sequences from a large, diverse set of microbial genomes. We incorporated the new model into a gene finding system, Balrog (Bacterial Annotation by Learned Representation Of Genes), which does not require genome-specific training and which matches or outperforms other state-of-the-art gene finding tools. Balrog is freely available under the MIT license at https://github.com/salzberg-lab/Balrog.Author summaryAnnotating the protein-coding genes in a newly sequenced prokaryotic genome is a critical part of describing their biological function. Relative to eukaryotic genomes, prokaryotic genomes are small and structurally simple, with 90% of their DNA typically devoted to protein-coding genes. Current computational gene finding tools are therefore able to achieve close to 99% sensitivity to known genes using species-specific gene models.Though highly sensitive at finding known genes, all current prokaryotic gene finders also predict large numbers of additional genes, which are labelled as “hypothetical protein” in GenBank and other annotation databases. Many hypothetical gene predictions likely represent true protein-coding sequence, but it is not known how many of them represent false positives. Additionally, all current gene finding tools must be trained specifically for each genome as a preliminary step in order to achieve high sensitivity. This requirement limits their ability to detect genes in fragmented sequences commonly seen in metagenomic samples.We took a data-driven approach to prokaryotic gene finding, relying on the large and diverse collection of already-sequenced genomes. By training a single, universal model of bacterial genes on protein sequences from many different species, we were able to match the sensitivity of current gene finders while reducing the overall number of gene predictions. Our model does not need to be refit on any new genome. Balrog (Bacterial Annotation by Learned Representation of Genes) represents a fundamentally different yet effective method for prokaryotic gene finding.


2019 ◽  
Author(s):  
Chen Xu ◽  
Bo Cao ◽  
Ying-dong Huo ◽  
Gang Niu ◽  
Michael Q Zhang ◽  
...  

AbstractLipid rafts are packed nanoscopic domains on plasma membrane and essential signalling platforms for transducing extracellular stimuli into cellular responses. Although depletion of raft component glycoshpingolipids causes abnormality particularly in ectoderm layer formation, it remains unclear whether rafts play a role in lineage determination, a critical but less-known stage in lineage commitment. Here, inducing mouse embryonic stem cell (mESC) differentiation with retinoic acid (RA), we observed lipid rafts increased since early stage, especially in ectoderm-like cells. Stochastic optical reconstruction microscopy characterized at super-resolution the distinct raft features in mESCs and the derived differentiated cells. Furthermore, RA-induced commitment of ectoderm-like cells was significantly diminished not only by genetic ablation of rafts but by applying inhibitor for glycosphingolipids or cholesterol at early differentiation stages. Meanwhile, raft inhibition delayed RA-induced pluripotency exit, an early step required for differentiation. Therefore, lipid rafts increase and facilitate ectoderm lineage specification as well as pluripotency exit during mESC differentiation.


Oncogene ◽  
2020 ◽  
Vol 39 (43) ◽  
pp. 6633-6646 ◽  
Author(s):  
Ye Chen ◽  
Liang Xu ◽  
Ruby Yu-Tong Lin ◽  
Markus Müschen ◽  
H. Phillip Koeffler

Abstract Transcription factors (TFs) coordinate the on-and-off states of gene expression typically in a combinatorial fashion. Studies from embryonic stem cells and other cell types have revealed that a clique of self-regulated core TFs control cell identity and cell state. These core TFs form interconnected feed-forward transcriptional loops to establish and reinforce the cell-type-specific gene-expression program; the ensemble of core TFs and their regulatory loops constitutes core transcriptional regulatory circuitry (CRC). Here, we summarize recent progress in computational reconstitution and biologic exploration of CRCs across various human malignancies, and consolidate the strategy and methodology for CRC discovery. We also discuss the genetic basis and therapeutic vulnerability of CRC, and highlight new frontiers and future efforts for the study of CRC in cancer. Knowledge of CRC in cancer is fundamental to understanding cancer-specific transcriptional addiction, and should provide important insight to both pathobiology and therapeutics.


2013 ◽  
Vol 33 (9) ◽  
pp. 1845-1858 ◽  
Author(s):  
Da-Hai Yu ◽  
Carol Ware ◽  
Robert A. Waterland ◽  
Jiexin Zhang ◽  
Miao-Hsueh Chen ◽  
...  

During development, a small but significant number of CpG islands (CGIs) become methylated. The timing of developmentally programmed CGI methylation and associated mechanisms of transcriptional regulation during cellular differentiation, however, remain poorly characterized. Here, we used genome-wide DNA methylation microarrays to identify epigenetic changes during human embryonic stem cell (hESC) differentiation. We discovered a group of CGIs associated with developmental genes that gain methylation after hESCs differentiate. Conversely, erasure of methylation was observed at the identified CGIs during subsequent reprogramming to induced pluripotent stem cells (iPSCs), further supporting a functional role for the CGI methylation. Both global gene expression profiling and quantitative reverse transcription-PCR (RT-PCR) validation indicated opposing effects of CGI methylation in transcriptional regulation during differentiation, with promoter CGI methylation repressing and 3′ CGI methylation activating transcription. By studying diverse human tissues and mouse models, we further confirmed that developmentally programmed 3′ CGI methylation confers tissue- and cell-type-specific gene activationin vivo. Importantly, luciferase reporter assays provided evidence that 3′ CGI methylation regulates transcriptional activation via a CTCF-dependent enhancer-blocking mechanism. These findings expand the classic view of mammalian CGI methylation as a mechanism for transcriptional silencing and indicate a functional role for 3′ CGI methylation in developmental gene regulation.


2020 ◽  
Vol 10 (1) ◽  
Author(s):  
Yan Sun ◽  
Qichao Yu ◽  
Lei Li ◽  
Zhanlong Mei ◽  
Biaofeng Zhou ◽  
...  

Abstract Recent studies show that non-coding RNAs (ncRNAs) can regulate the expression of protein-coding genes and play important roles in mammalian development. Previous studies have revealed that during C. elegans (Caenorhabditis elegans) embryo development, numerous genes in each cell are spatiotemporally regulated, causing the cell to differentiate into distinct cell types and tissues. We ask whether ncRNAs participate in the spatiotemporal regulation of genes in different types of cells and tissues during the embryogenesis of C. elegans. Here, by using marker-free full-length high-depth single-cell RNA sequencing (scRNA-seq) technique, we sequence the whole transcriptomes from 1031 embryonic cells of C. elegans and detect 20,431 protein-coding genes, including 22 cell-type-specific protein-coding markers, and 9843 ncRNAs including 11 cell-type-specific ncRNA markers. We induce a ncRNAs-based clustering strategy as a complementary strategy to the protein-coding gene-based clustering strategy for single-cell classification. We identify 94 ncRNAs that have never been reported to regulate gene expressions, are co-expressed with 1208 protein-coding genes in cell type specific and/or embryo time specific manners. Our findings suggest that these ncRNAs could potentially influence the spatiotemporal expression of the corresponding genes during the embryogenesis of C. elegans.


Sign in / Sign up

Export Citation Format

Share Document