transcription start sites
Recently Published Documents


Author(s):  
Tetsuya Niihori ◽  
Reo Tanoshima ◽  
Yoji Sasahara ◽  
Atsushi Sato ◽  
Masahiro Irie ◽  
...  

MECOM encodes the transcriptional regulators, EVI1 and MDS1-EVI1, from two distinct transcription start sites. EVI1 plays important roles in hematopoiesis and stem cell self-renewal. Recently, our group and others revealed that individuals with MECOM variants present diverse hematological and skeletal defects, including radioulnar synostosis (RUS). In the present study, we analyzed two families suspected of having MECOM-associated syndrome. In family 1, a MECOM splicing variant (c.2285+1G>A) was identified in an individual with bone marrow failure (TRS4) without RUS and her mother, who had mild leukocytopenia, thrombocytopenia, and bilateral RUS. A copy-neutral loss of heterozygosity decreasing the variant allele frequency was observed in the bone marrow of TRS4 and the peripheral blood leukocytes of her mother. However, TRS4 remained transfusion-dependent. In family 2, a MECOM variant (c.2208-4A>G), which was predicted to cause a cryptic acceptor site that results in a 3-base insertion (an insertion of Ser) in the mRNA, was identified in the proband, who had bone marrow failure; this variant was also observed in her brother and father, both of whom have skeletal malformations but no cytopenia. RT-PCR using leukocytes revealed a transcript with a 3-bp insertion in the proband, her brother, and the father, suggesting that the transcript variant with a 3-bp insertion is independent of blood phenotype. Collectively, these results suggest the presence of intrafamilial clinical heterogeneity in both families with MECOM splicing variants. Somatic genetic events may complicate the understanding of clinical variability among family members.


2022 ◽  
Vol 12 (1) ◽  
Author(s):  
Chao Wang ◽  
Chun Liang

Abstract. The dysregulation of transposable elements (TEs) has been explored in a variety of cancers. However, TE activities in osteosarcoma (OS) have not been extensively studied yet. By integrative analysis of RNA-seq, whole-genome sequencing (WGS), and methylation data, we showed aberrant TE activities associated with dysregulations of TEs in OS tumors. Specifically, expression levels of LINE-1 and Alu of different evolutionary ages, as well as subfamilies of SVA and HERV-K, were significantly up-regulated in OS tumors, accompanied by enhanced DNA repair responses. We verified the characteristics of LINE-1 mediated TE insertions, including target site duplication (TSD) length (centered around 15 bp) and preferential insertions into intergenic and AT-rich regions as well as intronic regions of longer genes. By filtering polymorphic TE insertions reported in the 1000 Genomes Project (1KGP), we found that, besides 148 tumor-specific somatic TE insertions, most OS patient-specific TE insertions (3175 out of 3326) are germline insertions, which are associated with genes involved in neuronal processes or with transcription factors important for cancer development. In addition to 68 TE-affected cancer genes, we found recurrent germline TE insertions in 72 non-cancer genes with high frequencies among patients. We also found that the ±500 bp flanking regions of transcription start sites (TSS) of LINE-1 (young) and Alu showed lower methylation levels in OS tumor samples than in controls. Interestingly, by incorporating patient clinical data and focusing on TE activities in OS tumors, our data analysis suggested that a higher number of TE insertions in OS tumors is associated with a longer event-free survival time.
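The ±500 bp flanking-region comparison mentioned in this abstract can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function names `tss_window` and `mean_methylation`, the 1-based inclusive coordinates, and the input format (pairs of CpG position and beta value) are all assumptions made for illustration.

```python
from statistics import mean


def tss_window(tss, flank=500, chrom_length=None):
    """Return the inclusive, 1-based (start, end) window of +/- flank bp around a TSS.

    The window is clamped to the chromosome start and, if chrom_length is
    given, to the chromosome end. Coordinates are an illustrative assumption.
    """
    start = max(1, tss - flank)
    end = tss + flank
    if chrom_length is not None:
        end = min(end, chrom_length)
    return start, end


def mean_methylation(cpg_sites, tss, flank=500):
    """Average the methylation (beta) values of CpG sites inside the TSS window.

    cpg_sites is an iterable of (position, beta) pairs (hypothetical format).
    Returns None when no CpG site falls inside the window.
    """
    start, end = tss_window(tss, flank)
    values = [beta for pos, beta in cpg_sites if start <= pos <= end]
    return mean(values) if values else None
```

Computing this value per TSS in tumor and control samples would give the two distributions that such a comparison contrasts; the statistical test used to compare them is not specified here.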


BMC Genomics ◽  
2022 ◽  
Vol 23 (1) ◽  
Author(s):  
Yanding Zhao ◽  
Yadong Dong ◽  
Wei Hong ◽  
Chongming Jiang ◽  
Kevin Yao ◽  
...  

Abstract. Chromatin accessibility is essential for transcriptional activation of genomic regions. It is well established that transcription factors (TFs) and histone modifications (HMs) play critical roles in chromatin accessibility regulation. However, there is a lack of studies that quantify these relationships. Here we constructed a two-layer model to predict chromatin accessibility by integrating DNA sequence, TF binding, and HM signals. By applying the model to two human cell lines (GM12878 and HepG2), we found that DNA sequences had limited power for accessibility prediction, while both TF binding and HM signals predicted chromatin accessibility with high accuracy. According to the HM model, HM features determined chromatin accessibility in a manner shared between cell lines, with the predictive power attributable to five core HM types. Results from the TF model indicated that chromatin accessibility was determined by a subset of informative TFs including both cell line-specific and generic TFs. The combined model of both TF and HM signals did not further improve the prediction accuracy, indicating that they provide redundant information in terms of chromatin accessibility prediction. The TF and HM models can also distinguish the chromatin accessibility of proximal versus distal transcription start sites with high accuracy.


2022 ◽  
Author(s):  
Edward J Banigan ◽  
Wen Tang ◽  
Aafke A van den Berg ◽  
Roman R Stocsits ◽  
Gordana Wutz ◽  
...  

Cohesin organizes mammalian interphase chromosomes by reeling chromatin fibers into dynamic loops (Banigan and Mirny, 2020; Davidson et al., 2019; Kim et al., 2019; Yatskevich et al., 2019). "Loop extrusion" is obstructed when cohesin encounters a properly oriented CTCF protein (Busslinger et al., 2017; de Wit et al., 2015; Fudenberg et al., 2016; Nora et al., 2017; Sanborn et al., 2015; Wutz et al., 2017), and recent work indicates that other factors, such as the replicative helicase MCM (Dequeker et al., 2020), can also act as barriers to loop extrusion. It has been proposed that transcription relocalizes (Busslinger et al., 2017; Glynn et al., 2004; Lengronne et al., 2004) or interferes with cohesin (Heinz et al., 2018; Jeppsson et al., 2020; Valton et al., 2021; S. Zhang et al., 2021), and that active transcription start sites function as cohesin loading sites (Busslinger et al., 2017; Kagey et al., 2010; Zhu et al., 2021; Zuin et al., 2014), but how these effects, and transcription in general, shape chromatin is unknown. To determine whether transcription can modulate loop extrusion, we studied cells in which the primary extrusion barriers could be removed by CTCF depletion and cohesin's residence time and abundance on chromatin could be increased by Wapl knockout. We found evidence that transcription directly interacts with loop extrusion through a novel "moving barrier" mechanism, but not by loading cohesin at active promoters. Hi-C experiments showed intricate, cohesin-dependent genomic contact patterns near actively transcribed genes, and in CTCF-Wapl double knockout (DKO) cells (Busslinger et al., 2017), genomic contacts were enriched between sites of transcription-driven cohesin localization ("cohesin islands"). Similar patterns also emerged in polymer simulations in which transcribing RNA polymerases (RNAPs) acted as "moving barriers" by impeding, slowing, or pushing loop-extruding cohesins. 
The model predicts that cohesin does not load preferentially at promoters and instead accumulates at TSSs due to the barrier function of RNAPs. We tested this prediction by new ChIP-seq experiments, which revealed that the "cohesin loader" Nipbl (Ciosk et al., 2000) co-localizes with cohesin, but, unlike in previous reports (Busslinger et al., 2017; Kagey et al., 2010; Zhu et al., 2021; Zuin et al., 2014), Nipbl did not accumulate at active promoters. We propose that RNAP acts as a new type of barrier to loop extrusion that, unlike CTCF, is not stationary in its precise genomic position, but is itself dynamically translocating and relocalizes cohesin along DNA. In this way, loop extrusion could enable translocating RNAPs to maintain contacts with distal regulatory elements, allowing transcriptional activity to shape genomic functional organization.


2022 ◽  
Vol 5 (4) ◽  
pp. e202101234
Author(s):  
Sonal Dahale ◽  
Jorge Ruiz-Orera ◽  
Jan Silhavy ◽  
Norbert Hübner ◽  
Sebastiaan van Heesch ◽  
...  

The role of alternative promoter usage in tissue-specific gene expression has been well established; however, its role in complex diseases is poorly understood. We performed cap analysis of gene expression (CAGE) sequencing from the left ventricle of a rat model of hypertension, the spontaneously hypertensive rat (SHR), and a normotensive strain, Brown Norway, to understand the role of alternative promoter usage in complex disease. We identified 26,560 CAGE-defined transcription start sites in the rat left ventricle, including 1,970 novel cardiac transcription start sites. We identified 28 genes with alternative promoter usage between SHR and Brown Norway, which could lead to protein isoforms differing at the amino terminus between the two strains, and 475 promoter switching events altering the length of the 5′ UTR. We found that the shift in Insr promoter usage was significantly associated with insulin levels and blood pressure within a panel of HXB/BXH recombinant inbred rat strains, suggesting that hyperinsulinemia due to insulin resistance might lead to hypertension in SHR. Our study provides preliminary evidence of alternative promoter usage in complex diseases.
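One simple way to picture a shift in promoter usage between two strains is to compare each promoter's share of a gene's CAGE tags. The Python sketch below assumes per-promoter tag counts for a single gene and uses hypothetical names (`promoter_usage`, `usage_shift`); it is not the statistical method used in the study, which would also need replicates and a significance test.

```python
def promoter_usage(tag_counts):
    """Convert per-promoter CAGE tag counts for one gene into usage fractions.

    tag_counts maps a promoter identifier to its tag count (hypothetical format).
    """
    total = sum(tag_counts.values())
    if total == 0:
        # No expression detected: report zero usage for every promoter.
        return {promoter: 0.0 for promoter in tag_counts}
    return {promoter: count / total for promoter, count in tag_counts.items()}


def usage_shift(counts_a, counts_b):
    """Return, per promoter, the usage fraction in sample B minus that in sample A."""
    usage_a = promoter_usage(counts_a)
    usage_b = promoter_usage(counts_b)
    promoters = set(usage_a) | set(usage_b)
    return {p: usage_b.get(p, 0.0) - usage_a.get(p, 0.0) for p in promoters}
```

For example, with invented counts `{"P1": 30, "P2": 10}` in one strain and `{"P1": 10, "P2": 30}` in the other, promoter P1 falls from 75% to 25% of the gene's tags, a shift of -0.5.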


2022 ◽  
Author(s):  
Christopher Sebastian Jürges ◽  
Manivel Lodha ◽  
Vu Thuy Khanh Le-Trilling ◽  
Pranjali Bhandare ◽  
Elmar Wolf ◽  
...  

For decades, human cytomegalovirus (HCMV) was thought to express ≈200 viral proteins during lytic infection. In recent years, systems biology approaches uncovered hundreds of additional viral gene products and suggested thousands of viral sites of transcription initiation. Despite all available data, the molecular mechanisms of HCMV gene regulation remain poorly understood. Here, we provide a unifying model of productive HCMV gene expression employing transcription start site profiling combined with metabolic RNA labeling as well as integrative computational analysis of previously published big data. This approach defined the expression of >2,600 high confidence viral transcripts and explained the complex kinetics of viral protein expression by cumulative effects of translation of incoming virion-associated RNA, multiple transcription start sites with distinct kinetics per viral open reading frame, and differences in viral protein stability. Most importantly, we identify pervasive transcription of transient RNAs as a common feature of this large DNA virus with its human host.


2022 ◽  
Vol 23 (1) ◽  
Author(s):  
Shuai Wang ◽  
Hairong Xie ◽  
Fei Mao ◽  
Haiyan Wang ◽  
Shu Wang ◽  
...  

Abstract. Background: Direct analogs of chemically modified bases that carry important epigenetic information, such as 5-methylcytosine (m5C)/5-methyldeoxycytosine (5mC), 5-hydroxymethylcytosine (hm5C)/5-hydroxymethyldeoxycytosine (5hmC), and N6-methyladenosine (m6A)/N6-methyldeoxyadenosine (6mA), are detected in both RNA and DNA, respectively. The modified base N4-acetylcytosine (ac4C) is well studied in RNAs, but its presence and epigenetic roles in cellular DNA have not been explored. Results: Here, we demonstrate the existence of N4-acetyldeoxycytosine (4acC) in genomic DNA of Arabidopsis with multiple detection methods. Genome-wide profiling of 4acC modification reveals that 4acC peaks are mostly distributed in euchromatin regions and present in nearly half of the expressed protein-coding genes in Arabidopsis. 4acC is mainly located around transcription start sites and positively correlates with gene expression levels. Imbalance of 5mC does not directly affect 4acC modification. We also characterize the associations of 4acC with 5mC and histone modifications that cooperatively regulate gene expression. Moreover, 4acC is also detected in genomic DNA of rice, maize, mouse, and human by mass spectrometry. Conclusions: Our findings reveal 4acC as a hitherto unknown DNA modification in higher eukaryotes. We identify potential interactions of this mark with other epigenetic marks in gene expression regulation.


2021 ◽  
pp. gr.275723.121
Author(s):  
Jill E Moore ◽  
Xiao-Ou Zhang ◽  
Shaimae I Elhajjajy ◽  
Kaili Fan ◽  
Henry E Pratt ◽  
...  

Accurate transcription start site (TSS) annotations are essential for understanding transcriptional regulation and its role in human disease. Gene collections such as GENCODE contain annotations for tens of thousands of TSSs, but not all of these annotations are experimentally validated, nor do they contain information on cell type-specific usage. Therefore, we sought to generate a collection of experimentally validated TSSs by integrating RNA Annotation and Mapping of Promoters for the Analysis of Gene Expression (RAMPAGE) data from 115 cell and tissue types, which resulted in a collection of approximately 50 thousand representative RAMPAGE peaks. These peaks were primarily proximal to GENCODE-annotated TSSs and were concordant with other transcription assays. Because RAMPAGE uses paired-end reads, we were then able to connect peaks to transcripts by analyzing the genomic positions of the 3' ends of read mates. Using this paired-end information, we classified the vast majority (37 thousand) of our RAMPAGE peaks as verified TSSs, updating TSS annotations for 20% of GENCODE genes. We also found that these updated TSS annotations were supported by epigenomic and other transcriptomic datasets. To demonstrate the utility of this RAMPAGE rPeak collection, we intersected it with the NHGRI/EBI genome-wide association studies (GWAS) catalog and identified new candidate GWAS genes. Overall, our work demonstrates the importance of integrating experimental data to further refine TSS annotations and provides a valuable resource for the biological community.
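Checking whether a peak lies proximal to an annotated TSS reduces to a nearest-neighbor lookup on sorted coordinates. The sketch below is an illustration only: the function names, the single-chromosome and strand-agnostic simplification, and the 500 bp default cutoff are assumptions, not values or methods taken from the paper.

```python
from bisect import bisect_left


def nearest_tss_distance(peak_pos, sorted_tss):
    """Return the distance in bp from a peak to the closest annotated TSS.

    sorted_tss must be an ascending list of TSS positions on one chromosome.
    Returns None when the list is empty.
    """
    if not sorted_tss:
        return None
    i = bisect_left(sorted_tss, peak_pos)
    candidates = []
    if i < len(sorted_tss):
        candidates.append(sorted_tss[i])      # first TSS at or after the peak
    if i > 0:
        candidates.append(sorted_tss[i - 1])  # last TSS before the peak
    return min(abs(peak_pos - tss) for tss in candidates)


def is_proximal(peak_pos, sorted_tss, max_distance=500):
    """True if the peak lies within max_distance bp of an annotated TSS.

    The 500 bp default is an arbitrary illustrative threshold.
    """
    distance = nearest_tss_distance(peak_pos, sorted_tss)
    return distance is not None and distance <= max_distance
```

Binary search keeps each lookup logarithmic in the number of annotated TSSs, which matters when tens of thousands of peaks are compared against a full annotation. A strand-aware version would keep separate sorted lists for each strand.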


2021 ◽  
Author(s):  
Vivek Kumar Raxwal ◽  
Somya Singh ◽  
Manu Agarwal ◽  
Karel Riha

New genes continuously emerge from non-coding DNA or by diverging from existing genes, but most of them are rapidly lost and only a few become fixed within the population. We hypothesized that young genes are subject to transcriptional and post-transcriptional regulation to limit their expression and minimize their exposure to purifying selection. We found that young genes in rice have relatively low expression levels, which can be attributed to distal enhancers, and closed chromatin conformation at their transcription start sites (TSS). The chromatin in TSS regions can be re-modeled in response to abiotic stress, indicating conditional expression of young genes. Furthermore, transcripts of young genes in Arabidopsis tend to be targeted by nonsense-mediated RNA decay, presenting another layer of regulation limiting their expression. Together, these data suggest that transcriptional and post-transcriptional mechanisms contribute to the conditional expression of young genes, which may alleviate purging selection while providing an opportunity for phenotypic exposure and functionalization.


Genes ◽  
2021 ◽  
Vol 12 (12) ◽  
pp. 2006
Author(s):  
Ida Maria Westin ◽  
Andreas Viberg ◽  
Berit Byström ◽  
Irina Golovleva

Fuchs’ endothelial corneal dystrophy (FECD) is a bilateral disease of the cornea caused by gradual loss of corneal endothelial cells. Late-onset FECD is strongly associated with the CTG18.1 trinucleotide repeat expansion in the Transcription Factor 4 gene (TCF4), which forms RNA nuclear foci in corneal endothelial cells. To date, 46 RefSeq transcripts of TCF4 are annotated by the National Center for Biotechnology Information (NCBI); however, the effect of the CTG18.1 expansion on expression of alternative TCF4 transcripts is not completely understood. To investigate this, we used droplet digital PCR for quantification of TCF4 transcripts spanning the CTG18.1 tract and transcripts with transcription start sites immediately downstream of CTG18.1. TCF4 expression was analysed in corneal endothelium and in whole blood of FECD patients with and without CTG18.1 expansion, in non-FECD controls without CTG18.1 expansion, and in five additional control tissues. Subtle changes in transcription levels in groups of TCF4 transcripts were detected. In corneal endothelium, we found a lower fraction of transcripts spanning the CTG18.1 tract compared to all other tissues investigated.

