scholarly journals LABRAT reveals association of alternative polyadenylation with transcript localization, RNA binding protein expression, transcription speed, and cancer survival

2020 ◽  
Author(s):  
Raeann Goering ◽  
Krysta L. Engel ◽  
Austin E. Gillen ◽  
Nova Fong ◽  
David L. Bentley ◽  
...  

ABSTRACTThe sequence content of the 3′ UTRs of many mRNA transcripts is regulated through alternative polyadenylation (APA). The study of this process using RNAseq data, though, has been historically challenging. To combat this problem, we developed LABRAT, an APA quantification method. LABRAT takes advantage of newly developed transcriptome quantification techniques to accurately determine relative APA site usage and how it varies across conditions. Using LABRAT, we found consistent relationships between gene-distal APA and subcellular RNA localization in multiple cell types. We also observed connections between transcription speed and APA site choice as well as tumor-specific transcriptome-wide shifts in APA in hundreds of patient-derived tumor samples that were associated with patient prognosis. We investigated the effects of APA on transcript expression and found a weak overall relationship, although many individual genes showed strong correlations between APA and expression. We interrogated the roles of 191 RNA-binding proteins in the regulation of APA, finding that dozens promote broad, directional shifts in relative APA isoform abundance both in vitro and in patient-derived samples. Finally, we find that APA site shifts in the two classes of APA, tandem UTRs and alternative last exons, are strongly correlated across many contexts, suggesting that they are coregulated.

BMC Genomics ◽  
2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Raeann Goering ◽  
Krysta L. Engel ◽  
Austin E. Gillen ◽  
Nova Fong ◽  
David L. Bentley ◽  
...  

Abstract Background The sequence content of the 3′ UTRs of many mRNA transcripts is regulated through alternative polyadenylation (APA). The study of this process using RNAseq data, though, has been historically challenging. Results To combat this problem, we developed LABRAT, an APA isoform quantification method. LABRAT takes advantage of newly developed transcriptome quantification techniques to accurately determine relative APA site usage and how it varies across conditions. Using LABRAT, we found consistent relationships between gene-distal APA and subcellular RNA localization in multiple cell types. We also observed connections between transcription speed and APA site choice as well as tumor-specific transcriptome-wide shifts in APA isoform abundance in hundreds of patient-derived tumor samples that were associated with patient prognosis. We investigated the effects of APA on transcript expression and found a weak overall relationship, although many individual genes showed strong correlations between relative APA isoform abundance and overall gene expression. We interrogated the roles of 191 RNA-binding proteins in the regulation of APA isoforms, finding that dozens promote broad, directional shifts in relative APA isoform abundance both in vitro and in patient-derived samples. Finally, we find that APA site shifts in the two classes of APA, tandem UTRs and alternative last exons, are strongly correlated across many contexts, suggesting that they are coregulated. Conclusions We conclude that LABRAT has the ability to accurately quantify APA isoform ratios from RNAseq data across a variety of sample types. Further, LABRAT is able to derive biologically meaningful insights that connect APA isoform regulation to cellular and molecular phenotypes.


2020 ◽  
Vol 29 (R1) ◽  
pp. R89-R99
Author(s):  
Deivid Carvalho Rodrigues ◽  
Marat Mufteev ◽  
James Ellis

Abstract The methyl-CpG-binding protein 2 (MECP2) is a critical global regulator of gene expression. Mutations in MECP2 cause neurodevelopmental disorders including Rett syndrome (RTT). MECP2 exon 2 is spliced into two alternative messenger ribonucleic acid (mRNA) isoforms encoding MECP2-E1 or MECP2-E2 protein isoforms that differ in their N-termini. MECP2-E2, isolated first, was used to define the general roles of MECP2 in methyl-deoxyribonucleic acid (DNA) binding, targeting of transcriptional regulatory complexes, and its disease-causing impact in RTT. It was later found that MECP2-E1 is the most abundant isoform in the brain and its exon 1 is also mutated in RTT. MECP2 transcripts undergo alternative polyadenylation generating mRNAs with four possible 3′untranslated region (UTR) lengths ranging from 130 to 8600 nt. Together, the exon and 3′UTR isoforms display remarkable abundance disparity across cell types and tissues during development. These findings indicate discrete means of regulation and suggest that protein isoforms perform non-overlapping roles. Multiple regulatory programs have been explored to explain these disparities. DNA methylation patterns of the MECP2 promoter and first intron impact MECP2-E1 and E2 isoform levels. Networks of microRNAs and RNA-binding proteins also post-transcriptionally regulate the stability and translation efficiency of MECP2 3′UTR isoforms. Finally, distinctions in biophysical properties in the N-termini between MECP2-E1 and E2 lead to variable protein stabilities and DNA binding dynamics. This review describes the steps taken from the discovery of MECP2, the description of its key functions, and its association with RTT, to the emergence of evidence revealing how MECP2 isoforms are differentially regulated at the transcriptional, post-transcriptional and post-translational levels.


Biomedicines ◽  
2021 ◽  
Vol 9 (6) ◽  
pp. 630
Author(s):  
Huili Lyu ◽  
Cody M. Elkins ◽  
Jessica L. Pierce ◽  
C. Henrique Serezani ◽  
Daniel S. Perrien

Excess inflammation and canonical BMP receptor (BMPR) signaling are coinciding hallmarks of the early stages of injury-induced endochondral heterotopic ossification (EHO), especially in the rare genetic disease fibrodysplasia ossificans progressiva (FOP). Multiple inflammatory signaling pathways can synergistically enhance BMP-induced Smad1/5/8 activity in multiple cell types, suggesting the importance of pathway crosstalk in EHO and FOP. Toll-like receptors (TLRs) and IL-1 receptors mediate many of the earliest injury-induced inflammatory signals largely via MyD88-dependent pathways. Thus, the hypothesis that MyD88-dependent signaling is required for EHO was tested in vitro and in vivo using global or Pdgfrα-conditional deletion of MyD88 in FOP mice. As expected, IL-1β or LPS synergistically increased Activin A (ActA)-induced phosphorylation of Smad 1/5 in fibroadipoprogenitors (FAPs) expressing Alk2R206H. However, conditional deletion of MyD88 in Pdgfrα-positive cells of FOP mice did not significantly alter the amount of muscle injury-induced EHO. Even more surprisingly, injury-induced EHO was not significantly affected by global deletion of MyD88. These studies demonstrate that MyD88-dependent signaling is dispensable for injury-induced EHO in FOP mice.


Author(s):  
Yi Zhang ◽  
Lian Liu ◽  
Qiongzi Qiu ◽  
Qing Zhou ◽  
Jinwang Ding ◽  
...  

AbstractOccurring in over 60% of human genes, alternative polyadenylation (APA) results in numerous transcripts with differing 3’ends, thus greatly expanding the diversity of mRNAs and of proteins derived from a single gene. As a key molecular mechanism, APA is involved in various gene regulation steps including mRNA maturation, mRNA stability, cellular RNA decay, and protein diversification. APA is frequently dysregulated in cancers leading to changes in oncogenes and tumor suppressor gene expressions. Recent studies have revealed various APA regulatory mechanisms that promote the development and progression of a number of human diseases, including cancer. Here, we provide an overview of four types of APA and their impacts on gene regulation. We focus particularly on the interaction of APA with microRNAs, RNA binding proteins and other related factors, the core pre-mRNA 3’end processing complex, and 3’UTR length change. We also describe next-generation sequencing methods and computational tools for use in poly(A) signal detection and APA repositories and databases. Finally, we summarize the current understanding of APA in cancer and provide our vision for future APA related research.


2021 ◽  
Vol 12 (1) ◽  
Author(s):  
Vikram Agarwal ◽  
Sereno Lopez-Darwin ◽  
David R. Kelley ◽  
Jay Shendure

Abstract3′ untranslated regions (3′ UTRs) post-transcriptionally regulate mRNA stability, localization, and translation rate. While 3′-UTR isoforms have been globally quantified in limited cell types using bulk measurements, their differential usage among cell types during mammalian development remains poorly characterized. In this study, we examine a dataset comprising ~2 million nuclei spanning E9.5–E13.5 of mouse embryonic development to quantify transcriptome-wide changes in alternative polyadenylation (APA). We observe a global lengthening of 3′ UTRs across embryonic stages in all cell types, although we detect shorter 3′ UTRs in hematopoietic lineages and longer 3′ UTRs in neuronal cell types within each stage. An analysis of RNA-binding protein (RBP) dynamics identifies ELAV-like family members, which are concomitantly induced in neuronal lineages and developmental stages experiencing 3′-UTR lengthening, as putative regulators of APA. By measuring 3′-UTR isoforms in an expansive single cell dataset, our work provides a transcriptome-wide and organism-wide map of the dynamic landscape of alternative polyadenylation during mammalian organogenesis.


1991 ◽  
Vol 11 (2) ◽  
pp. 894-905
Author(s):  
R A Voelker ◽  
W Gibson ◽  
J P Graves ◽  
J F Sterling ◽  
M T Eisenberg

The nucleotide sequence of the Drosophila melanogaster suppressor of sable [su(s)] gene has been determined. Comparison of genomic and cDNA sequences indicates that an approximately 7,860-nucleotide primary transcript is processed into an approximately 5-kb message, expressed during all stages of the life cycle, that contains an open reading frame capable of encoding a 1,322-amino-acid protein of approximately 150 kDa. The putative protein contains an RNA recognition motif-like region and a highly charged arginine-, lysine-, serine-, aspartic or glutamic acid-rich region that is similar to a region contained in several RNA-processing proteins. In vitro translation of in vitro-transcribed RNA from a complete cDNA yields a product whose size agrees with the size predicted by the open reading frame. Antisera against su(s) fusion proteins recognize the in vitro-translated protein and detect a protein of identical size in the nuclear fractions from tissue culture cells and embryos. The protein is also present in smaller amounts in cytoplasmic fractions of embryos. That the su(s) protein has regions similar in structure to RNA-processing protein is consistent with its known role in affecting the transcript levels of those alleles that it suppresses.


2003 ◽  
Vol 23 (19) ◽  
pp. 7055-7067 ◽  
Author(s):  
Shelly A. Waggoner ◽  
Stephen A. Liebhaber

ABSTRACT Posttranscriptional controls in higher eukaryotes are central to cell differentiation and developmental programs. These controls reflect sequence-specific interactions of mRNAs with one or more RNA binding proteins. The α-globin poly(C) binding proteins (αCPs) comprise a highly abundant subset of K homology (KH) domain RNA binding proteins and have a characteristic preference for binding single-stranded C-rich motifs. αCPs have been implicated in translation control and stabilization of multiple cellular and viral mRNAs. To explore the full contribution of αCPs to cell function, we have identified a set of mRNAs that associate in vivo with the major αCP2 isoforms. One hundred sixty mRNA species were consistently identified in three independent analyses of αCP2-RNP complexes immunopurified from a human hematopoietic cell line (K562). These mRNAs could be grouped into subsets encoding cytoskeletal components, transcription factors, proto-oncogenes, and cell signaling factors. Two mRNAs were linked to ceroid lipofuscinosis, indicating a potential role for αCP2 in this infantile neurodegenerative disease. Surprisingly, αCP2 mRNA itself was represented in αCP2-RNP complexes, suggesting autoregulatory control of αCP2 expression. In vitro analyses of representative target mRNAs confirmed direct binding of αCP2 within their 3′ untranslated regions. These data expand the list of mRNAs that associate with αCP2 in vivo and establish a foundation for modeling its role in coordinating pathways of posttranscriptional gene regulation.


2019 ◽  
Author(s):  
Isabelle Leticia Zaboroski Silva ◽  
Anny Waloski Robert ◽  
Guillermo Cabrera Cabo ◽  
Lucia Spangenberg ◽  
Marco Augusto Stimamiglio ◽  
...  

AbstractPosttranscriptional regulation plays a fundamental role in the biology of embryonic stem cells (ESCs). Many studies have demonstrated that multiple mRNAs are coregulated by one or more RNA binding proteins (RBPs) that orchestrate the expression of these molecules. A family of RBPs, known as PUF (Pumilio-FBF), is highly conserved among species and has been associated with the undifferentiated and differentiated states of different cell lines. In humans, two homologs of the PUF family have been found: Pumilio 1 (PUM1) and Pumilio 2 (PUM2). To understand the role of these proteins in human ESCs (hESCs), we first demonstrated the influence of the silencing of PUM1 and PUM2 on pluripotency genes. OCT4 and NANOG mRNA levels decreased significantly with the knockdown of Pumilio, suggesting that PUMILIO proteins play a role in the maintenance of pluripotency in hESCs. Furthermore, we observed that the hESCs silenced for PUM1 and 2 exhibited an improvement in efficiency of in vitro cardiomyogenic differentiation. Using in silico analysis, we identified mRNA targets of PUM1 and PUM2 expressed during cardiomyogenesis. With the reduction of PUM1 and 2, these target mRNAs would be active and could be involved in the progression of cardiomyogenesis.


2018 ◽  
Author(s):  
Alina Munteanu ◽  
Neelanjan Mukherjee ◽  
Uwe Ohler

AbstractMotivationRNA-binding proteins (RBPs) regulate every aspect of RNA metabolism and function. There are hundreds of RBPs encoded in the eukaryotic genomes, and each recognize its RNA targets through a specific mixture of RNA sequence and structure properties. For most RBPs, however, only a primary sequence motif has been determined, while the structure of the binding sites is uncharacterized.ResultsWe developed SSMART, an RNA motif finder that simultaneously models the primary sequence and the structural properties of the RNA targets sites. The sequence-structure motifs are represented as consensus strings over a degenerate alphabet, extending the IUPAC codes for nucleotides to account for secondary structure preferences. Evaluation on synthetic data showed that SSMART is able to recover both sequence and structure motifs implanted into 3‘UTR-like sequences, for various degrees of structured/unstructured binding sites. In addition, we successfully used SSMART on high-throughput in vivo and in vitro data, showing that we not only recover the known sequence motif, but also gain insight into the structural preferences of the RBP.AvailabilitySSMART is freely available at https://ohlerlab.mdc-berlin.de/software/SSMART_137/[email protected]


Sign in / Sign up

Export Citation Format

Share Document