scholarly journals Genome‐Wide Maps of Transcription Regulatory Elements and Transcription Enhancers in Development and Disease

2018 ◽  
pp. 439-455 ◽  
Author(s):  
Puja Agrawal ◽  
Katelyn E. Heimbruch ◽  
Sridhar Rao
Author(s):  
Yanrong Ji ◽  
Zhihan Zhou ◽  
Han Liu ◽  
Ramana V Davuluri

Abstract Motivation Deciphering the language of non-coding DNA is one of the fundamental problems in genome research. Gene regulatory code is highly complex due to the existence of polysemy and distant semantic relationship, which previous informatics methods often fail to capture especially in data-scarce scenarios. Results To address this challenge, we developed a novel pre-trained bidirectional encoder representation, named DNABERT, to capture global and transferrable understanding of genomic DNA sequences based on up and downstream nucleotide contexts. We compared DNABERT to the most widely used programs for genome-wide regulatory elements prediction and demonstrate its ease of use, accuracy and efficiency. We show that the single pre-trained transformers model can simultaneously achieve state-of-the-art performance on prediction of promoters, splice sites and transcription factor binding sites, after easy fine-tuning using small task-specific labeled data. Further, DNABERT enables direct visualization of nucleotide-level importance and semantic relationship within input sequences for better interpretability and accurate identification of conserved sequence motifs and functional genetic variant candidates. Finally, we demonstrate that pre-trained DNABERT with human genome can even be readily applied to other organisms with exceptional performance. We anticipate that the pre-trained DNABERT model can be fined tuned to many other sequence analyses tasks. Availability and implementation The source code, pretrained and finetuned model for DNABERT are available at GitHub (https://github.com/jerryji1993/DNABERT). Supplementary information Supplementary data are available at Bioinformatics online.


Lab Animal ◽  
2020 ◽  
Vol 50 (1) ◽  
pp. 17-17
Author(s):  
Alexandra Le Bras

Cancers ◽  
2021 ◽  
Vol 13 (11) ◽  
pp. 2547
Author(s):  
Keunsoo Kang ◽  
Yoonjung Choi ◽  
Hyeonjin Moon ◽  
Chaelin You ◽  
Minjin Seo ◽  
...  

RAD51 is a recombinase that plays a pivotal role in homologous recombination. Although the role of RAD51 in homologous recombination has been extensively studied, it is unclear whether RAD51 can be involved in gene regulation as a co-factor. In this study, we found evidence that RAD51 may contribute to the regulation of genes involved in the autophagy pathway with E-box proteins such as USF1, USF2, and/or MITF in GM12878, HepG2, K562, and MCF-7 cell lines. The canonical USF binding motif (CACGTG) was significantly identified at RAD51-bound cis-regulatory elements in all four cell lines. In addition, genome-wide USF1, USF2, and/or MITF-binding regions significantly coincided with the RAD51-associated cis-regulatory elements in the same cell line. Interestingly, the promoters of genes associated with the autophagy pathway, such as ATG3 and ATG5, were significantly occupied by RAD51 and regulated by RAD51 in HepG2 and MCF-7 cell lines. Taken together, these results unveiled a novel role of RAD51 and provided evidence that RAD51-associated cis-regulatory elements could possibly be involved in regulating autophagy-related genes with E-box binding proteins.


2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Ruifeng Cui ◽  
Xiaoge Wang ◽  
Waqar Afzal Malik ◽  
Xuke Lu ◽  
Xiugui Chen ◽  
...  

Abstract Background The Raffinose synthetase (RAFS) genes superfamily is critical for the synthesis of raffinose, which accumulates in plant leaves under abiotic stress. However, it remains unclear whether RAFS contributes to resistance to abiotic stress in plants, specifically in the Gossypium species. Results In this study, we identified 74 RAFS genes from G. hirsutum, G. barbadense, G. arboreum and G. raimondii by using a series of bioinformatic methods. Phylogenetic analysis showed that the RAFS gene family in the four Gossypium species could be divided into four major clades; the relatively uniform distribution of the gene number in each species ranged from 12 to 25 based on species ploidy, most likely resulting from an ancient whole-genome polyploidization. Gene motif analysis showed that the RAFS gene structure was relatively conservative. Promoter analysis for cis-regulatory elements showed that some RAFS genes might be regulated by gibberellins and abscisic acid, which might influence their expression levels. Moreover, we further examined the functions of RAFS under cold, heat, salt and drought stress conditions, based on the expression profile and co-expression network of RAFS genes in Gossypium species. Transcriptome analysis suggested that RAFS genes in clade III are highly expressed in organs such as seed, root, cotyledon, ovule and fiber, and under abiotic stress in particular, indicating the involvement of genes belonging to clade III in resistance to abiotic stress. Gene co-expressed network analysis showed that GhRFS2A-GhRFS6A, GhRFS6D, GhRFS7D and GhRFS8A-GhRFS11A were key genes, with high expression levels under salt, drought, cold and heat stress. Conclusion The findings may provide insights into the evolutionary relationships and expression patterns of RAFS genes in Gossypium species and a theoretical basis for the identification of stress resistance materials in cotton.


2021 ◽  
Vol 21 (1) ◽  
Author(s):  
Zihan Cheng ◽  
Xuemei Zhang ◽  
Wenjing Yao ◽  
Kai Zhao ◽  
Lin Liu ◽  
...  

Abstract Background The Late Embryogenesis-Abundant (LEA) gene families, which play significant roles in regulation of tolerance to abiotic stresses, widely exist in higher plants. Poplar is a tree species that has important ecological and economic values. But systematic studies on the gene family have not been reported yet in poplar. Results On the basis of genome-wide search, we identified 88 LEA genes from Populus trichocarpa and renamed them as PtrLEA. The PtrLEA genes have fewer introns, and their promoters contain more cis-regulatory elements related to abiotic stress tolerance. Our results from comparative genomics indicated that the PtrLEA genes are conserved and homologous to related genes in other species, such as Eucalyptus robusta, Solanum lycopersicum and Arabidopsis. Using RNA-Seq data collected from poplar under two conditions (with and without salt treatment), we detected 24, 22 and 19 differentially expressed genes (DEGs) in roots, stems and leaves, respectively. Then we performed spatiotemporal expression analysis of the four up-regulated DEGs shared by the tissues, constructed gene co-expression-based networks, and investigated gene function annotations. Conclusion Lines of evidence indicated that the PtrLEA genes play significant roles in poplar growth and development, as well as in responses to salt stress.


Genes ◽  
2021 ◽  
Vol 12 (7) ◽  
pp. 1007
Author(s):  
Divya Kattupalli ◽  
Asha Sreenivasan ◽  
Eppurathu Vasudevan Soniya

Black pepper (Piper nigrum L.) is a prominent spice that is an indispensable ingredient in cuisine and traditional medicine. Phytophthora capsici, the causative agent of footrot disease, causes a drastic constraint in P. nigrum cultivation and productivity. To counterattack various biotic and abiotic stresses, plants employ a broad array of mechanisms that includes the accumulation of pathogenesis-related (PR) proteins. Through a genome-wide survey, eleven PR-1 genes that belong to a CAP superfamily protein with a caveolin-binding motif (CBM) and a CAP-derived peptide (CAPE) were identified from P. nigrum. Despite the critical functional domains, PnPR-1 homologs differ in their signal peptide motifs and core amino acid composition in the functional protein domains. The conserved motifs of PnPR-1 proteins were identified using MEME. Most of the PnPR-1 proteins were basic in nature. Secondary and 3D structure analyses of the PnPR-1 proteins were also predicted, which may be linked to a functional role in P. nigrum. The GO and KEGG functional annotations predicted their function in the defense responses of plant-pathogen interactions. Furthermore, a transcriptome-assisted FPKM analysis revealed PnPR-1 genes mapped to the P. nigrum-P. capsici interaction pathway. An altered expression pattern was detected for PnPR-1 transcripts among which a significant upregulation was noted for basic PnPR-1 genes such as CL10113.C1 and Unigene17664. The drastic variation in the transcript levels of CL10113.C1 was further validated through qRT-PCR and it showed a significant upregulation in infected leaf samples compared with the control. A subsequent analysis revealed the structural details, phylogenetic relationships, conserved sequence motifs and critical cis-regulatory elements of PnPR-1 genes. This is the first genome-wide study that identified the role of PR-1 genes during P. nigrum-P. capsici interactions. The detailed in silico experimental analysis revealed the vital role of PnPR-1 genes in regulating the first layer of defense towards a P. capsici infection in Panniyur-1 plants.


Genes ◽  
2021 ◽  
Vol 12 (8) ◽  
pp. 1175
Author(s):  
Amarni L. Thomas ◽  
Judith Marsman ◽  
Jisha Antony ◽  
William Schierding ◽  
Justin M. O’Sullivan ◽  
...  

The RUNX1/AML1 gene encodes a developmental transcription factor that is an important regulator of haematopoiesis in vertebrates. Genetic disruptions to the RUNX1 gene are frequently associated with acute myeloid leukaemia. Gene regulatory elements (REs), such as enhancers located in non-coding DNA, are likely to be important for Runx1 transcription. Non-coding elements that modulate Runx1 expression have been investigated over several decades, but how and when these REs function remains poorly understood. Here we used bioinformatic methods and functional data to characterise the regulatory landscape of vertebrate Runx1. We identified REs that are conserved between human and mouse, many of which produce enhancer RNAs in diverse tissues. Genome-wide association studies detected single nucleotide polymorphisms in REs, some of which correlate with gene expression quantitative trait loci in tissues in which the RE is active. Our analyses also suggest that REs can be variant in haematological malignancies. In summary, our analysis identifies features of the RUNX1 regulatory landscape that are likely to be important for the regulation of this gene in normal and malignant haematopoiesis.


Author(s):  
Le Wang ◽  
Fei Sun ◽  
Zi Yi Wan ◽  
Baoqing Ye ◽  
Yanfei Wen ◽  
...  

Abstract Resolving the genomic basis underlying phenotypic variations is a question of great importance in evolutionary biology. However, understanding how genotypes determine the phenotypes is still challenging. Centuries of artificial selective breeding for beauty and aggression resulted in a plethora of colors, long fin varieties, and hyper-aggressive behavior in the air-breathing Siamese fighting fish (Betta splendens), supplying an excellent system for studying the genomic basis of phenotypic variations. Combining whole genome sequencing, QTL mapping, genome-wide association studies and genome editing, we investigated the genomic basis of huge morphological variation in fins and striking differences in coloration in the fighting fish. Results revealed that the double tail, elephant ear, albino and fin spot mutants each were determined by single major-effect loci. The elephant ear phenotype was likely related to differential expression of a potassium ion channel gene, kcnh8. The albinotic phenotype was likely linked to a cis-regulatory element acting on the mitfa gene and the double tail mutant was suggested to be caused by a deletion in a zic1/zic4 co-enhancer. Our data highlight that major loci and cis-regulatory elements play important roles in bringing about phenotypic innovations and establish Bettas as new powerful model to study the genomic basis of evolved changes.


2021 ◽  
Vol 22 (10) ◽  
pp. 5221
Author(s):  
Danqi Zeng ◽  
Jaime A. Teixeira da Silva ◽  
Mingze Zhang ◽  
Zhenming Yu ◽  
Can Si ◽  
...  

The APETALA2 (AP2) transcription factors (TFs) play crucial roles in regulating development in plants. However, a comprehensive analysis of the AP2 family members in a valuable Chinese herbal orchid, Dendrobium officinale, or in other orchids, is limited. In this study, the 14 DoAP2 TFs that were identified from the D. officinale genome and named DoAP2-1 to DoAP2-14 were divided into three clades: euAP2, euANT, and basalANT. The promoters of all DoAP2 genes contained cis-regulatory elements related to plant development and also responsive to plant hormones and stress. qRT-PCR analysis showed the abundant expression of DoAP2-2, DoAP2-5, DoAP2-7, DoAP2-8 and DoAP2-12 genes in protocorm-like bodies (PLBs), while DoAP2-3, DoAP2-4, DoAP2-6, DoAP2-9, DoAP2-10 and DoAP2-11 expression was strong in plantlets. In addition, the expression of some DoAP2 genes was down-regulated during flower development. These results suggest that DoAP2 genes may play roles in plant regeneration and flower development in D. officinale. Four DoAP2 genes (DoAP2-1 from euAP2, DoAP2-2 from euANT, and DoAP2-6 and DoAP2-11 from basal ANT) were selected for further analyses. The transcriptional activation of DoAP2-1, DoAP2-2, DoAP2-6 and DoAP2-11 proteins, which were localized in the nucleus of Arabidopsis thaliana mesophyll protoplasts, was further analyzed by a dual-luciferase reporter gene system in Nicotiana benthamiana leaves. Our data showed that pBD-DoAP2-1, pBD-DoAP2-2, pBD-DoAP2-6 and pBD-DoAP2-11 significantly repressed the expression of the LUC reporter compared with the negative control (pBD), suggesting that these DoAP2 proteins may act as transcriptional repressors in the nucleus of plant cells. Our findings on AP2 genes in D. officinale shed light on the function of AP2 genes in this orchid and other plant species.


2021 ◽  
Vol 21 (1) ◽  
Author(s):  
Zhixuan Du ◽  
Qitao Su ◽  
Zheng Wu ◽  
Zhou Huang ◽  
Jianzhong Bao ◽  
...  

AbstractMultidrug and toxic compound extrusion (MATE) proteins are involved in many physiological functions of plant growth and development. Although an increasing number of MATE proteins have been identified, the understanding of MATE proteins is still very limited in rice. In this study, 46 MATE proteins were identified from the rice (Oryza sativa) genome by homology searches and domain prediction. The rice MATE family was divided into four subfamilies based on the phylogenetic tree. Tandem repeats and fragment replication contribute to the expansion of the rice MATE gene family. Gene structure and cis-regulatory elements reveal the potential functions of MATE genes. Analysis of gene expression showed that most of MATE genes were constitutively expressed and the expression patterns of genes in different tissues were analyzed using RNA-seq. Furthermore, qRT-PCR-based analysis showed differential expression patterns in response to salt and drought stress. The analysis results of this study provide comprehensive information on the MATE gene family in rice and will aid in understanding the functional divergence of MATE genes.


Sign in / Sign up

Export Citation Format

Share Document