How to find genomic regions relevant for gene regulation

2021 ◽  
Vol 33 (2) ◽  
pp. 157-165
Author(s):  
Xuanzong Guo ◽  
Uwe Ohler ◽  
Ferah Yildirim

Abstract Genetic variants associated with human diseases are often located outside the protein coding regions of the genome. Identification and functional characterization of the regulatory elements in the non-coding genome is therefore of crucial importance for understanding the consequences of genetic variation and the mechanisms of disease. The past decade has seen rapid progress in high-throughput analysis and mapping of chromatin accessibility, looping, structure, and occupancy by transcription factors, as well as epigenetic modifications, all of which contribute to the proper execution of regulatory functions in the non-coding genome. Here, we review the current technologies for the definition and functional validation of non-coding regulatory regions in the genome.

2018 ◽  
Author(s):  
Jürgen Jänes ◽  
Yan Dong ◽  
Michael Schoof ◽  
Jacques Serizay ◽  
Alex Appert ◽  
...  

Abstract An essential step for understanding the transcriptional circuits that control development and physiology is the global identification and characterization of regulatory elements. Here we present the first map of regulatory elements across the development and ageing of an animal, identifying 42,245 elements accessible in at least one C. elegans stage. Based on nuclear transcription profiles, we define 15,714 protein-coding promoters and 19,231 putative enhancers, and find that both types of element can drive orientation-independent transcription. Additionally, hundreds of promoters produce transcripts antisense to protein coding genes, suggesting involvement in a widespread regulatory mechanism. We find that the accessibility of most elements is regulated during development and/or ageing and that patterns of accessibility change are linked to specific developmental or physiological processes. The map and characterization of regulatory elements across C. elegans life provides a platform for understanding how transcription controls development and ageing.


eLife ◽  
2018 ◽  
Vol 7 ◽  
Author(s):  
Jürgen Jänes ◽  
Yan Dong ◽  
Michael Schoof ◽  
Jacques Serizay ◽  
Alex Appert ◽  
...  

An essential step for understanding the transcriptional circuits that control development and physiology is the global identification and characterization of regulatory elements. Here, we present the first map of regulatory elements across the development and ageing of an animal, identifying 42,245 elements accessible in at least one Caenorhabditis elegans stage. Based on nuclear transcription profiles, we define 15,714 protein-coding promoters and 19,231 putative enhancers, and find that both types of element can drive orientation-independent transcription. Additionally, more than 1000 promoters produce transcripts antisense to protein coding genes, suggesting involvement in a widespread regulatory mechanism. We find that the accessibility of most elements changes during development and/or ageing and that patterns of accessibility change are linked to specific developmental or physiological processes. The map and characterization of regulatory elements across C. elegans life provides a platform for understanding how transcription controls development and ageing.


2020 ◽  
Vol 48 (W1) ◽  
pp. W193-W199 ◽  
Author(s):  
Nina Baumgarten ◽  
Dennis Hecker ◽  
Sivarajan Karunanithi ◽  
Florian Schmidt ◽  
Markus List ◽  
...  

Abstract A current challenge in genomics is to interpret non-coding regions and their role in transcriptional regulation of possibly distant target genes. Genome-wide association studies show that a large part of genomic variants are found in those non-coding regions, but their mechanisms of gene regulation are often unknown. An additional challenge is to reliably identify the target genes of the regulatory regions, which is an essential step in understanding their impact on gene expression. Here we present the EpiRegio web server, a resource of regulatory elements (REMs). REMs are genomic regions that exhibit variations in their chromatin accessibility profile associated with changes in expression of their target genes. EpiRegio incorporates both epigenomic and gene expression data for various human primary cell types and tissues, providing an integrated view of REMs in the genome. Our web server allows the analysis of genes and their associated REMs, including the REM’s activity and its estimated cell type-specific contribution to its target gene’s expression. Further, it is possible to explore genomic regions for their regulatory potential and to investigate overlapping REMs, thereby dissecting regions of high epigenomic complexity. EpiRegio allows programmatic access through a REST API and is freely available at https://epiregio.de/.


2021 ◽  
Author(s):  
Cesar Arenas-Mena ◽  
Sofija Miljovska ◽  
Sevinc Ercan ◽  
Tanvi Shashikant ◽  
Charles G. Danko ◽  
...  

The transcription of developmental regulatory genes is often controlled by multiple cis-regulatory elements. The identification and functional characterization of distal regulatory elements remains challenging, even in tractable model organisms like sea urchins. We evaluate the use of chromatin accessibility, transcription and RNA Polymerase II for their ability to predict enhancer activity of genomic regions in sea urchin embryos. ATAC-seq, PRO-seq, and Pol II ChIP-seq from early and late blastula embryos are manually contrasted with experimental cis-regulatory analyses available in sea urchin embryos, with particular attention to common developmental regulatory elements known to have enhancer and silencer functions differentially deployed among embryonic territories. Using the three functional genomic data types, machine learning models are trained and tested to classify and quantitatively predict the enhancer activity of several hundred genomic regions previously validated with reporter constructs in vivo. Overall, chromatin accessibility and transcription have substantial power for predicting enhancer activity. For promoter-overlapping cis-regulatory elements in particular, the distribution of Pol II is the best predictor of enhancer activity in blastula embryos. Furthermore, ATAC- and PRO-seq predictive value is stage dependent for the promoter-overlapping subset. This suggests that the sequence of regulatory mechanisms leading to transcriptional activation has distinct relevance at different levels of the developmental gene regulatory hierarchy deployed during embryogenesis.


Author(s):  
I. B. Trindade ◽  
G. Hernandez ◽  
E. Lebègue ◽  
F. Barrière ◽  
T. Cordeiro ◽  
...  

Abstract Iron is a fundamental element for virtually all forms of life. Despite its abundance, its bioavailability is limited, and thus microbes developed siderophores, small molecules that are synthesized inside the cell and then released outside for iron scavenging. Once inside the cell, iron removal does not occur spontaneously; instead, this process is mediated by siderophore-interacting proteins (SIP) and/or by ferric-siderophore reductases (FSR). In the past two decades, representatives of the SIP subfamily have been structurally and biochemically characterized; however, the same was not achieved for the FSR subfamily. Here, we initiate the structural and functional characterization of FhuF, the first and only FSR ever isolated. FhuF is a globular monomeric protein composed mainly of α-helices sheltering internal cavities in a fold resembling the “palm” domain found in siderophore biosynthetic enzymes. Paramagnetic NMR spectroscopy revealed that the core of the cluster has electronic properties in line with those of previously characterized 2Fe–2S ferredoxins and differences appear to be confined to the coordination of Fe(III) in the reduced protein. In particular, the two cysteines coordinating this iron appear to have substantially different bond strengths. Like the proteins from the SIP subfamily, FhuF binds both the iron-loaded and the apo forms of ferrichrome in the micromolar range, and cyclic voltammetry reveals a redox-Bohr effect, which broadens the range of ferric-siderophore substrates that are thermodynamically accessible for reduction. This study suggests that despite the structural differences between FSR and SIP proteins, mechanistic similarities exist between the two classes of proteins.


Circulation ◽  
2012 ◽  
Vol 125 (suppl_10) ◽  
Author(s):  
Christy L Avery ◽  
Praveen Sethupathy ◽  
Steven Buyske ◽  
Q. C He ◽  
Dan Y Lin ◽  
...  

The QT interval (QT) is a heritable trait and its prolongation is an established risk factor for ventricular tachyarrhythmia and sudden cardiac death. Most genetic studies of QT have examined populations of European ancestry, although the increased genetic diversity in populations of African descent provides opportunity for fine-mapping, which can help narrow association signals and identify candidates for functional characterization. We examined whether eleven previously identified QT loci comprising 6,681 variants on the Illumina Metabochip array were associated with QT in 7,516 African American participants from the Atherosclerosis Risk in Communities study and Women’s Health Initiative clinical trial. Among associated loci, we used conditional analyses and queried bioinformatics databases to identify and functionally categorize signals. We identified nine of the eleven QT loci in African American populations (P < 0.0045 under an additive genetic model adjusting for ancestry and demographic characteristics: NOS1AP, ATP1B1, SCN5A, SLC35F1, KCNH2, KCNQ1, LITAF, NDRG4, and RFFL). We also identified two independent secondary signals in NOS1AP and ATP1B1 (P < 7.4 × 10⁻⁶). Conditional analyses adjusting for published loci in European populations demonstrated that eight of these eleven SNPs (nine primary; two secondary) were independent of previously reported SNPs. We then performed the first bioinformatics-based functional characterization of QT loci using the eleven primary and secondary variants and SNPs in strong LD (r² > 0.5) among these African American participants. Only the SCN5A locus included a non-synonymous coding variant (rs1805124, H558R, r² = 0.7 with primary SNP rs9871385, P = 4.7 × 10⁻⁴). The remaining ten loci harbored variants located exclusively within non-coding regions. Specifically, three contained SNPs within candidate long-range regulatory elements in human cardiomyocytes, five were in or near annotated promoter regions, and the remaining two were in un-annotated, but highly conserved non-coding elements. Several of the QT risk alleles at these SNPs significantly alter the predicted binding affinity for transcription factors, such as TBX5 and AhR, which have been previously implicated in cardiac formation and function. In summary, the findings provide compelling evidence that the same genes influence variation in QT across global populations and that additional, independent signals exist in African Americans. Moreover, of those SNPs identified as strong candidates for functional evaluation, the majority implicate gene regulatory dysfunction in QT prolongation.
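The two significance thresholds quoted in this abstract can be checked with a little arithmetic. The sketch below assumes a Bonferroni correction at a family-wise alpha of 0.05, which the abstract does not state; under that assumption, dividing by the eleven loci gives a value that rounds to the reported 0.0045, and dividing by the 6,681 variants gives a value close to the reported 7.4 × 10⁻⁶. The function name and constants are illustrative only.

```python
# Assumption (not stated in the abstract): thresholds come from a
# Bonferroni correction at a family-wise alpha of 0.05.
ALPHA = 0.05
N_LOCI = 11        # previously identified QT loci (from the abstract)
N_VARIANTS = 6681  # Metabochip variants across those loci (from the abstract)


def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Return the per-test significance threshold alpha / n_tests."""
    if n_tests <= 0:
        raise ValueError("n_tests must be a positive integer")
    return alpha / n_tests


# Hypothetical mapping: locus-level tests use the primary threshold,
# variant-level tests for secondary signals use the stricter one.
primary_threshold = bonferroni_threshold(ALPHA, N_LOCI)
secondary_threshold = bonferroni_threshold(ALPHA, N_VARIANTS)

print(f"primary threshold:   {primary_threshold:.4f}")
print(f"secondary threshold: {secondary_threshold:.2e}")
```

If the authors used a different correction, the agreement with the reported cut-offs would be coincidental, so this is a plausibility check rather than a reconstruction of their method.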


2019 ◽  
Author(s):  
Wei Fang ◽  
Yi Wen ◽  
Xiangyun Wei

Abstract Tissue-specific or cell type-specific transcription of protein-coding genes is controlled by both trans-regulatory elements (TREs) and cis-regulatory elements (CREs). However, it is challenging to identify TREs and CREs, which are unknown for most genes. Here, we describe a protocol for identifying two types of transcription-activating CREs—core promoters and enhancers—of zebrafish photoreceptor type-specific genes. This protocol is composed of three phases: bioinformatic prediction, experimental validation, and characterization of the CREs. To better illustrate the principles and logic of this protocol, we exemplify it with the discovery of the core promoter and enhancer of the mpp5b apical polarity gene (also known as ponli), whose red, green, and blue (RGB) cone-specific transcription requires its enhancer, a member of the rainbow enhancer family. While exemplified with an RGB cone-specific gene, this protocol is general and can be used to identify the core promoters and enhancers of other protein-coding genes.


PeerJ ◽  
2021 ◽  
Vol 9 ◽  
pp. e11508
Author(s):  
Yubing Yong ◽  
Yue Zhang ◽  
Yingmin Lyu

Background. We have previously performed an analysis of the cold-responsive transcriptome in the mature leaves of tiger lily (Lilium lancifolium) by gene co-expression network identification. The results revealed that a ZFHD gene, annotated as encoding a zinc finger homeodomain protein, may play an essential regulatory role in the tiger lily response to cold stress. Methods. In this study, we further investigated the response of the ZFHD gene (termed LlZFHD4) to osmotic stresses, including cold, salt, and water stress, and to abscisic acid (ABA). Based on the transcriptome sequences, the coding region and 5′ promoter region of LlZFHD4 were cloned from mature tiger lily leaves. Stress response analysis was performed under continuous 4 °C, NaCl, PEG, and ABA treatments. Functional characterization of LlZFHD4 was conducted in transgenic Arabidopsis, tobacco, and yeast. Results. LlZFHD4 encodes a nuclear-localized protein consisting of 180 amino acids. The N-terminal region of LlZFHD4 has transcriptional activation activity in yeast. The 4 °C, NaCl, PEG, and ABA treatments induced the expression of LlZFHD4. Several stress- or hormone-responsive cis-acting regulatory elements (T-Box, BoxI, and ARF) and binding sites of transcription factors (MYC, DRE, and W-box) were found in the core promoter region (789 bp) of LlZFHD4. Also, the GUS gene driven by the LlZFHD4 promoter was up-regulated by cold, NaCl, water stress, and ABA in Arabidopsis. Overexpression of LlZFHD4 improved cold and drought tolerance in transgenic Arabidopsis; a higher survival rate and better osmotic adjustment capacity were observed in LlZFHD4 transgenic plants compared to wild type (WT) plants under 4 °C and PEG conditions. However, LlZFHD4 transgenic plants were less tolerant to salinity and hypersensitive to ABA compared to WT plants. The transcript levels of stress- and ABA-responsive genes were more strongly up-regulated in LlZFHD4 transgenic Arabidopsis than in WT. These results indicate that LlZFHD4 is involved in the ABA signaling pathway and plays a crucial role in regulating the response of tiger lily to cold, salt, and water stresses.


Gene Reports ◽  
2019 ◽  
Vol 16 ◽  
pp. 100402
Author(s):  
Swapnarani Nayak ◽  
Lipika Patnaik ◽  
Meenati Manjari Soren ◽  
V. Chakrapani ◽  
Shibani Dutta Mohapatra ◽  
...  

2019 ◽  
Vol 20 (12) ◽  
pp. 2883 ◽  
Author(s):  
Simon J. Baumgart ◽  
Ekaterina Nevedomskaya ◽  
Bernard Haendler

Recent advances in whole-genome and transcriptome sequencing of prostate cancer at different stages indicate that a large number of mutations found in tumors are present in non-protein coding regions of the genome and lead to dysregulated gene expression. Single nucleotide variations and small mutations affecting the recruitment of transcription factor complexes to DNA regulatory elements are observed in an increasing number of cases. Genomic rearrangements may position coding regions under the novel control of regulatory elements, as exemplified by the TMPRSS2-ERG fusion and the amplified enhancer identified upstream of the androgen receptor (AR) gene. Super-enhancers are increasingly found to play important roles in aberrant oncogenic transcription. Several players involved in these processes are currently being evaluated as drug targets and may represent new vulnerabilities that can be exploited for prostate cancer treatment. They include factors involved in enhancer and super-enhancer function such as bromodomain proteins and cyclin-dependent kinases. In addition, non-coding RNAs with an important gene regulatory role are being explored. The rapid progress made in understanding the influence of the non-coding part of the genome and of transcription dysregulation in prostate cancer could pave the way for the identification of novel treatment paradigms for the benefit of patients.

