scholarly journals Origin and evolution of developmental enhancers in the mammalian neocortex

2016 ◽  
Vol 113 (19) ◽  
pp. E2617-E2626 ◽  
Author(s):  
Deena Emera ◽  
Jun Yin ◽  
Steven K. Reilly ◽  
Jake Gockley ◽  
James P. Noonan

Morphological innovations such as the mammalian neocortex may involve the evolution of novel regulatory sequences. However, de novo birth of regulatory elements active during morphogenesis has not been extensively studied in mammals. Here, we use H3K27ac-defined regulatory elements active during human and mouse corticogenesis to identify enhancers that were likely active in the ancient mammalian forebrain. We infer the phylogenetic origins of these enhancers and find that ∼20% arose in the mammalian stem lineage, coincident with the emergence of the neocortex. Implementing a permutation strategy that controls for the nonrandom variation in the ages of background genomic sequences, we find that mammal-specific enhancers are overrepresented near genes involved in cell migration, cell signaling, and axon guidance. Mammal-specific enhancers are also overrepresented in modules of coexpressed genes in the cortex that are associated with these pathways, notably ephrin and semaphorin signaling. Our results also provide insight into the mechanisms of regulatory innovation in mammals. We find that most neocortical enhancers did not originate by en bloc exaptation of transposons. Young neocortical enhancers exhibit smaller H3K27ac footprints and weaker evolutionary constraint in eutherian mammals than older neocortical enhancers. Based on these observations, we present a model of the enhancer life cycle in which neocortical enhancers initially emerge from genomic background as short, weakly constrained “proto-enhancers.” Many proto-enhancers are likely lost, but some may serve as nucleation points for complex enhancers to evolve.

2010 ◽  
Vol 08 (02) ◽  
pp. 219-246 ◽  
Author(s):  
ARVIND RAO ◽  
DAVID J. STATES ◽  
ALFRED O. HERO ◽  
JAMES DOUGLAS ENGEL

Gene regulation in eukaryotes involves a complex interplay between the proximal promoter and distal genomic elements (such as enhancers) which work in concert to drive precise spatio-temporal gene expression. The experimental localization and characterization of gene regulatory elements is a very complex and resource-intensive process. The computational identification of regulatory regions that confer spatiotemporally specific tissue-restricted expression of a gene is thus an important challenge for computational biology. One of the most popular strategies for enhancer localization from DNA sequence is the use of conservation-based prefiltering and more recently, the use of canonical (transcription factor motifs) or de novo tissue-specific sequence motifs. However, there is an ongoing effort in the computational biology community to further improve the fidelity of enhancer predictions from sequence data by integrating other, complementary genomic modalities. In this work, we propose a framework that complements existing methodologies for prospective enhancer identification. The methods in this work are derived from two key insights: (i) that chromatin modification signatures can discriminate proximal and distally located regulatory regions and (ii) the notion of promoter-enhancer cross-talk (as assayed in 3C/5C experiments) might have implications in the search for regulatory sequences that co-operate with the promoter to yield tissue-restricted, gene-specific expression.


2018 ◽  
Author(s):  
Niklas Bruse ◽  
Simon J. van Heeringen

AbstractBackgroundTranscription factors (TFs) bind to specific DNA sequences, TF motifs, in cis-regulatory sequences and control the expression of the diverse transcriptional programs encoded in the genome. The concerted action of TFs within the chromatin context enables precise temporal and spatial expression patterns. To understand how TFs control gene expression it is essential to model TF binding. TF motif information can help to interpret the exact role of individual regulatory elements, for instance to predict the functional impact of non-coding variants.FindingsHere we present GimmeMotifs, a comprehensive computational framework for TF motif analysis. Compared to the previously published version, this release adds a whole range of new functionality and analysis methods. It now includes tools for de novo motif discovery, motif scanning and sequence analysis, motif clustering, calculation of performance metrics and visualization. Included with GimmeMotifs is a non-redundant database of clustered motifs. Compared to other motif databases, this collection of motifs shows competitive performance in discriminating bound from unbound sequences. Using our de novo motif discovery pipeline we find large differences in performance between de novo motif finders on ChIP-seq data. Using an ensemble method such as implemented in GimmeMotifs will generally result in improved motif identification compared to a single motif finder. Finally, we demonstrate maelstrom, a new ensemble method that enables comparative analysis of TF motifs between multiple high-throughput sequencing experiments, such as ChIP-seq or ATAC-seq. Using a collection of ~200 H3K27ac ChIP-seq data sets we identify TFs that play a role in hematopoietic differentiation and lineage commitment.ConclusionGimmeMotifs is a fully-featured and flexible framework for TF motif analysis. It contains both command-line tools as well as a Python API and is freely available at: https://github.com/vanheeringen-lab/gimmemotifs.


2018 ◽  
Author(s):  
Leslie A. Mitchell ◽  
Laura H. McCulloch ◽  
Sudarshan Pinglay ◽  
Henri Berger ◽  
Nazario Bosco ◽  
...  

AbstractDesign and large-scale synthesis of DNA has been applied to the functional study of viral and microbial genomes. New and expanded technology development is required to unlock the transformative potential of such bottom-up approaches to the study of larger mammalian genomes. Two major challenges include assembling and delivering long DNA sequences. Here we describe a pipeline for de novo DNA assembly and delivery that enables functional evaluation of mammalian genes on the length scale of 100 kb. The DNA assembly step is supported by an integrated robotic workcell. We assembled the 101 kb human HPRT1 gene in yeast, delivered it to mouse embryonic stem cells, and showed expression of the human protein from its full-length gene. This pipeline provides a framework for producing systematic, designer variants of any mammalian gene locus for functional evaluation in cells.Significance StatementMammalian genomes consist of a tiny proportion of relatively well-characterized coding regions and vast swaths of poorly characterized “dark matter” containing critical but much less well-defined regulatory sequences. Given the dominant role of noncoding DNA in common human diseases and traits, the interconnectivity of regulatory elements, and the importance of genomic context, de novo design, assembly, and delivery can enable large-scale manipulation of these elements on a locus scale. Here we outline a pipeline for de novo assembly, delivery and expression of mammalian genes replete with native regulatory sequences. We expect this pipeline will be useful for dissecting the function of non-coding sequence variation in mammalian genomes.


2014 ◽  
Author(s):  
Olgert Denas ◽  
Richard Sandstrom ◽  
Yong Cheng ◽  
Kathryn Beal ◽  
Javier Herrero ◽  
...  

Background: Because species-specific gene expression is driven by species-specific regulation, understanding the relationship between sequence and function of the regulatory regions in different species will help elucidate how differences among species arise. Despite active experimental and computational research, the relationships among sequence, conservation, and function are still poorly understood. Results: We compared transcription factor occupied segments (TFos) for 116 human and 35 mouse TFs in 546 human and 125 mouse cell types and tissues from the Human and the Mouse ENCODE projects. We based the map between human and mouse TFos on a one-to-one nucleotide cross-species mapper, bnMapper, that utilizes whole genome alignments (WGA). Our analysis shows that TFos are under evolutionary constraint, but a substantial portion (25.1% of mouse and 25.85% of human on average) of the TFos does not have a homologous sequence on the other species; this portion varies among cell types and TFs. Furthermore, 47.67% and 57.01% of the homologous TFos sequence shows binding activity on the other species for human and mouse respectively. However, 79.87% and 69.22% is repurposed such that it binds the same TF in different cells or different TFs in the same cells. Remarkably, within the set of TFos not showing conservation of occupancy, the corresponding genome regions in the other species are preferred locations of novel TFos. These events suggest that a substantial amount of functional regulatory sequences is exapted from other biochemically active genomic material. Despite substantial repurposing of TFos, we did not find substantial changes in their predicted target genes, suggesting that CRMs buffer evolutionary events allowing little or no change in the TF – target gene associations. Thus, the small portion of TFos with strictly conserved occupancy underestimates the degree of conservation of regulatory interactions. Conclusion: We mapped regulatory sequences from an extensive number of TFs and cell types between human and mouse. A comparative analysis of this correspondence unveiled the extent of the shared regulatory sequence across TFs and cell types under study. Importantly, a large part of the shared regulatory sequence repurposed on the other species. This sequence, fueled by turnover events, provides a strong case for exaptation in regulatory elements.


2016 ◽  
Vol 283 (1826) ◽  
pp. 20152917 ◽  
Author(s):  
Joseph N. Keating ◽  
Philip C. J. Donoghue

The assembly of the gnathostome bodyplan constitutes a formative episode in vertebrate evolutionary history, an interval in which the mineralized skeleton and its canonical suite of cell and tissue types originated. Fossil jawless fishes, assigned to the gnathostome stem-lineage, provide an unparalleled insight into the origin and evolution of the skeleton, hindered only by uncertainty over the phylogenetic position and evolutionary significance of key clades. Chief among these are the jawless anaspids, whose skeletal composition, a rich source of phylogenetic information, is poorly characterized. Here we survey the histology of representatives spanning anaspid diversity and infer their generalized skeletal architecture. The anaspid dermal skeleton is composed of odontodes comprising spheritic dentine and enameloid, overlying a basal layer of acellular parallel fibre bone containing an extensive shallow canal network. A recoded and revised phylogenetic analysis using equal and implied weights parsimony resolves anaspids as monophyletic, nested among stem-gnathostomes. Our results suggest the anaspid dermal skeleton is a degenerate derivative of a histologically more complex ancestral vertebrate skeleton, rather than reflecting primitive simplicity. Hypotheses that anaspids are ancestral skeletonizing lampreys, or a derived lineage of jawless vertebrates with paired fins, are rejected.


2021 ◽  
Vol 22 (12) ◽  
pp. 6450
Author(s):  
Anita Wiśniewska ◽  
Kamila Wojszko ◽  
Elżbieta Różańska ◽  
Klaudia Lenarczyk ◽  
Karol Kuczerski ◽  
...  

Transcription factors are proteins that directly bind to regulatory sequences of genes to modulate and adjust plants’ responses to different stimuli including biotic and abiotic stresses. Sedentary plant parasitic nematodes, such as beet cyst nematode, Heterodera schachtii, have developed molecular tools to reprogram plant cell metabolism via the sophisticated manipulation of genes expression, to allow root invasion and the induction of a sequence of structural and physiological changes in plant tissues, leading to the formation of permanent feeding sites composed of modified plant cells (commonly called a syncytium). Here, we report on the AtMYB59 gene encoding putative MYB transcription factor that is downregulated in syncytia, as confirmed by RT-PCR and a promoter pMyb59::GUS activity assays. The constitutive overexpression of AtMYB59 led to the reduction in A. thaliana susceptibility, as indicated by decreased numbers of developed females, and to the disturbed development of nematode-induced syncytia. In contrast, mutant lines with a silenced expression of AtMYB59 were more susceptible to this parasite. The involvement of ABA in the modulation of AtMYB59 gene transcription appears feasible by several ABA-responsive cis regulatory elements, which were identified in silico in the gene promoter sequence, and experimental assays showed the induction of AtMYB59 transcription after ABA treatment. Based on these results, we suggest that AtMYB59 plays an important role in the successful parasitism of H. schachtii on A. thaliana roots.


Genes ◽  
2021 ◽  
Vol 12 (2) ◽  
pp. 246
Author(s):  
Xiaomeng Chen ◽  
Rui Li ◽  
Yonglin Wang ◽  
Aining Li

An emerging poplar canker caused by the gram-negative bacterium, Lonsdalea populi, has led to high mortality of hybrid poplars Populus × euramericana in China and Europe. The molecular bases of pathogenicity and bark adaptation of L. populi have become a focus of recent research. This study revealed the whole genome sequence and identified putative virulence factors of L. populi. A high-quality L. populi genome sequence was assembled de novo, with a genome size of 3,859,707 bp, containing approximately 3434 genes and 107 RNAs (75 tRNA, 22 rRNA, and 10 ncRNA). The L. populi genome contained 380 virulence-associated genes, mainly encoding for adhesion, extracellular enzymes, secretory systems, and two-component transduction systems. The genome had 110 carbohydrate-active enzyme (CAZy)-coding genes and putative secreted proteins. The antibiotic-resistance database annotation listed that L. populi was resistant to penicillin, fluoroquinolone, and kasugamycin. Analysis of comparative genomics found that L. populi exhibited the highest homology with the L. britannica genome and L. populi encompassed 1905 specific genes, 1769 dispensable genes, and 1381 conserved genes, suggesting high evolutionary diversity and genomic plasticity. Moreover, the pan genome analysis revealed that the N-5-1 genome is an open genome. These findings provide important resources for understanding the molecular basis of the pathogenicity and biology of L. populi and the poplar-bacterium interaction.


2021 ◽  
Vol 22 (7) ◽  
pp. 3735
Author(s):  
Guillaume Velasco ◽  
Damien Ulveling ◽  
Sophie Rondeau ◽  
Pauline Marzin ◽  
Motoko Unoki ◽  
...  

DNA methylation (DNAme) profiling is used to establish specific biomarkers to improve the diagnosis of patients with inherited neurodevelopmental disorders and to guide mutation screening. In the specific case of mendelian disorders of the epigenetic machinery, it also provides the basis to infer mechanistic aspects with regard to DNAme determinants and interplay between histone and DNAme that apply to humans. Here, we present comparative methylomes from patients with mutations in the de novo DNA methyltransferases DNMT3A and DNMT3B, in their catalytic domain or their N-terminal parts involved in reading histone methylation, or in histone H3 lysine (K) methylases NSD1 or SETD2 (H3 K36) or KMT2D/MLL2 (H3 K4). We provide disease-specific DNAme signatures and document the distinct consequences of mutations in enzymes with very similar or intertwined functions, including at repeated sequences and imprinted loci. We found that KMT2D and SETD2 germline mutations have little impact on DNAme profiles. In contrast, the overlapping DNAme alterations downstream of NSD1 or DNMT3 mutations underlines functional links, more specifically between NSD1 and DNMT3B at heterochromatin regions or DNMT3A at regulatory elements. Together, these data indicate certain discrepancy with the mechanisms described in animal models or the existence of redundant or complementary functions unforeseen in humans.


Sign in / Sign up

Export Citation Format

Share Document