Detection of pseudouridine modifications and type I/II hypermodifications in human mRNAs using direct, long-read sequencing

2021 ◽  
Author(s):  
Sepideh Tavakoli ◽  
Mohammad Nabizadehmashhadtoroghi ◽  
Amr Makhamreh ◽  
Howard Gamper ◽  
Neda Rezapour ◽  
...  

Enzyme-mediated chemical modifications to mRNAs have the potential to fine-tune gene expression in response to environmental stimuli. Notably, pseudouridine-modified mRNAs are more resistant to RNase-mediated degradation, more responsive to cellular stress, and have the potential to modulate immunogenicity and enhance translation in vivo. However, the precise biological functions of pseudouridine modification on mRNAs remain unclear due to the lack of sensitive and accurate tools for mapping. We developed a semi-quantitative method for mapping pseudouridylated sites with high confidence directly on mammalian mRNA transcripts via direct RNA, long-read nanopore sequencing. By analysis of a modification-free transcriptome, we demonstrate that the depth of coverage and intrinsic errors associated with specific k-mer sequences are critical parameters for accurate base-calling. We adjust these parameters for high-confidence U-to-C base-calling errors that occur at pseudouridylated sites, which are benchmarked against sites that were identified previously by biochemical methods. We also uncovered new pseudouridylated sites, many of which fall on genes that encode RNA binding proteins and on uridine-rich k-mers. Sites identified by U-to-C base-calling error were verified using 1000-mer synthetic RNA controls bearing a single pseudouridine in the center position, demonstrating that (1) the U-to-C base-calling error occurs at the site of pseudouridylation, and (2) the base-calling error systematically under-calls the pseudouridylated sites. High-occupancy sites with >40% U-to-C base-calling error are classified as sites of type I hypermodification, whereas genes with more than one site of pseudouridylation are classified as having type II hypermodification, which is confirmed by single-molecule analysis. We report the discovery of mRNAs with up to 7 unique sites of pseudouridine modification.
Here we establish an innovative pipeline for direct identification, quantification, and detection of pseudouridine modifications and type I/II hypermodifications on native RNA molecules using long-read sequencing without resorting to RNA amplification, chemical reactions on RNA, enzyme-based replication, or DNA sequencing steps.
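
The classification rule stated in this abstract can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' pipeline: the function names, the input layout (per-site counts of reads called U and reads called C at sites already accepted as pseudouridylated), and the minimum-coverage cutoff are all hypothetical. Only the >40% threshold for type I and the more-than-one-site rule for type II come from the abstract.

```python
from collections import defaultdict

TYPE_I_THRESHOLD = 0.40  # from the abstract: >40% U-to-C base-calling error
MIN_COVERAGE = 10        # hypothetical cutoff; the abstract gives no value


def u_to_c_fraction(u_reads, c_reads):
    """Fraction of reads at a site that were base-called C instead of U."""
    total = u_reads + c_reads
    return c_reads / total if total else 0.0


def classify(sites, min_coverage=MIN_COVERAGE):
    """Classify sites given as (gene, position, u_reads, c_reads) tuples.

    Returns (type_i_sites, type_ii_genes):
      type_i_sites  - (gene, position) pairs with U-to-C fraction > 40%
      type_ii_genes - genes carrying more than one retained site
    """
    type_i_sites = []
    positions_by_gene = defaultdict(set)
    for gene, position, u_reads, c_reads in sites:
        if u_reads + c_reads < min_coverage:
            continue  # skip sites with too few reads to trust the fraction
        positions_by_gene[gene].add(position)
        if u_to_c_fraction(u_reads, c_reads) > TYPE_I_THRESHOLD:
            type_i_sites.append((gene, position))
    type_ii_genes = sorted(
        gene for gene, positions in positions_by_gene.items() if len(positions) > 1
    )
    return type_i_sites, type_ii_genes
```

Because the abstract reports that the base-calling error systematically under-calls pseudouridylation, the fraction computed here is best read as a lower bound on occupancy rather than an absolute measurement.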

Cells ◽  
2020 ◽  
Vol 9 (6) ◽  
pp. 1473
Author(s):  
Mohamed Zaiou

Circular RNAs (circRNAs) are genome transcripts that are produced from back-splicing of specific regions of pre-mRNA. These single-stranded RNA molecules are widely expressed across diverse phyla and many of them are stable and evolutionarily conserved between species. Growing evidence suggests that many circRNAs function as master regulators of gene expression by influencing both transcription and translation processes. Mechanistically, circRNAs are predicted to act as endogenous microRNA (miRNA) sponges, interact with functional RNA-binding proteins (RBPs), and associate with elements of the transcriptional machinery in the nucleus. Evidence is mounting that dysregulation of circRNAs is closely related to the occurrence of a range of diseases including cancer and metabolic diseases. Indeed, there are several reports implicating circRNAs in cardiovascular diseases (CVD), diabetes, hypertension, and atherosclerosis. However, there is very little research addressing the potential role of these RNA transcripts in the occurrence and development of obesity. Emerging data from in vitro and in vivo studies suggest that circRNAs are novel players in adipogenesis, white adipose browning, obesity, obesity-induced inflammation, and insulin resistance. This study explores the current state of knowledge on circRNAs regulating molecular processes associated with adipogenesis and obesity, highlights some of the challenges encountered while studying circRNAs, and suggests some perspectives for future research directions in this exciting field of study.


2019 ◽  
Author(s):  
Andreas U. Müller ◽  
Marc Leibundgut ◽  
Nenad Ban ◽  
Eilika Weber-Ban

Abstract In mycobacteria, transcriptional activator PafBC is responsible for upregulating the majority of genes induced by DNA damage. Understanding the mechanism of PafBC activation is impeded by a lack of structural information on this transcription factor that contains a widespread, but poorly understood WYL domain frequently encountered in bacterial transcription factors. Here, we determined the crystal structure of Arthrobacter aurescens PafBC. The protein consists of two modules, each harboring an N-terminal helix-turn-helix DNA binding domain followed by a central WYL and a C-terminal extension (WCX) domain. The WYL domains exhibit Sm-folds, while the WCX domains adopt ferredoxin-like folds, both characteristic for RNA binding proteins. Our results suggest a mechanism of regulation in which WYL domain-containing transcription factors may be activated by binding RNA molecules. Using an in vivo mutational screen in Mycobacterium smegmatis, we identify potential co-activator binding sites on PafBC.


2019 ◽  
Author(s):  
Raeann Goering ◽  
Laura I. Hudish ◽  
Bryan B. Guzman ◽  
Nisha Raj ◽  
Gary J. Bassell ◽  
...  

Abstract The sorting of RNA molecules to distinct subcellular locations facilitates the activity of spatially restricted processes through local protein synthesis. This process affects thousands of transcripts yet precisely how these RNAs are trafficked to their destinations remains generally unclear. Here we have analyzed subcellular transcriptomes of FMRP-null mouse neuronal cells to identify transcripts that depend on FMRP for efficient transport to neurites. We found that these FMRP RNA localization targets contain a large enrichment of G-quadruplex sequences, particularly in their 3′ UTRs, suggesting that FMRP recognizes these sequences to promote the localization of transcripts that contain them. Fractionation of neurons derived from human Fragile X Syndrome patients revealed a high degree of conservation in the identity of FMRP localization targets between human and mouse as well as an enrichment of G-quadruplex sequences in human FMRP RNA localization targets. Using high-throughput RNA/protein interaction assays and single-molecule RNA FISH, we identified the RGG domain of FMRP as important for both interaction with G-quadruplex RNA sequences and the neuronal transport of G-quadruplex-containing transcripts. Finally, we used ribosome footprinting to identify translational regulatory targets of FMRP. The translational regulatory targets were not enriched for G-quadruplex sequences and were largely distinct from the RNA localization targets of FMRP, indicating that the two functions can be biochemically separated and are mediated through different target recognition mechanisms. These results establish a molecular mechanism underlying FMRP-mediated neuronal RNA localization and provide a framework for the elucidation of similar mechanisms governed by other RNA-binding proteins.


Author(s):  
Xianzhi Lin ◽  
Marcos A. S. Fonseca ◽  
Rosario I. Corona ◽  
Kate Lawrenson

Abstract RNA molecules function as messengers or noncoding adaptor molecules, structural components, and regulators of genome organization and gene expression. Their roles and regulation are mediated by other molecules they interact with, especially RNA binding proteins (RBPs). Here we report RNA proximity labeling (RPL), an RNA-centric method based on fusion of an endonuclease-deficient Type VI CRISPR-Cas protein (dCas13b) and engineered ascorbate peroxidase (APEX2) to discover in vivo target RNA proximal proteins (RPPs) through proximity-based biotinylation. U1 RPPs enriched by proximity-based biotinylation included both U1 snRNA canonical and noncanonical functions-related proteins. In addition, profiling of poly(A) tail proximal proteins uncovered expected categories of RBPs for poly(A) tails and also provided novel evidence for poly(A)+ RNA 5’-3’ proximity and expanded subcellular localizations. Our results suggest that RPL is a rapid approach for identifying both interacting and neighboring proteins associated with target RNA molecules in their native cellular contexts.


2021 ◽  
Vol 4 (1) ◽  
pp. 22
Author(s):  
Mrinmoyee Majumder ◽  
Viswanathan Palanisamy

Control of gene expression is critical in shaping the genotype and phenotype of pro- and eukaryotic organisms. The gene expression regulatory pathways solely rely on protein–protein and protein–nucleic acid interactions, which determine the fate of the nucleic acids. RNA–protein interactions play a significant role in co- and post-transcriptional regulation to control gene expression. RNA-binding proteins (RBPs) are a diverse group of macromolecules that bind to RNA and play an essential role in RNA biology by regulating pre-mRNA processing, maturation, nuclear transport, stability, and translation. Hence, studies aimed at investigating RNA–protein interactions are essential to advance our knowledge of gene expression patterns associated with health and disease. Here we discuss the long-established and current technologies that are widely used to study RNA–protein interactions in vivo. We also present the advantages and disadvantages of each method discussed in the review.


2021 ◽  
Vol 7 (1) ◽  
pp. 11 ◽  
Author(s):  
André P. Gerber

RNA–protein interactions frame post-transcriptional regulatory networks and modulate transcription and epigenetics. While the technological advances in RNA sequencing have significantly expanded the repertoire of RNAs, recently developed biochemical approaches combined with sensitive mass-spectrometry have revealed hundreds of previously unrecognized and potentially novel RNA-binding proteins. Nevertheless, a major challenge remains to understand how the thousands of RNA molecules and their interacting proteins assemble and control the fate of each individual RNA in a cell. Here, I review recent methodological advances to approach this problem through systematic identification of proteins that interact with particular RNAs in living cells. Thereby, a specific focus is given to in vivo approaches that involve crosslinking of RNA–protein interactions through ultraviolet irradiation or treatment of cells with chemicals, followed by capture of the RNA under study with antisense-oligonucleotides and identification of bound proteins with mass-spectrometry. Several recent studies defining interactomes of long non-coding RNAs, viral RNAs, as well as mRNAs are highlighted, and short reference is given to recent in-cell protein labeling techniques. These recent experimental improvements could open the door for broader applications and to study the remodeling of RNA–protein complexes upon different environmental cues and in disease.


Author(s):  
Jiaying Zhu ◽  
Changhao Li ◽  
Xu Peng ◽  
Xiuren Zhang

Abstract The majority of the genome is transcribed to RNA in living organisms. RNA transcripts can form astonishing arrays of secondary and tertiary structures via Watson-Crick, Hoogsteen or wobble base pairing. In vivo, RNA folding is not a simple thermodynamic event of minimizing free energy. Instead, the process is constrained by transcription, RNA binding proteins (RBPs), steric factors and micro-environment. RNA secondary structure (RSS) plays myriad roles in numerous biological processes, such as RNA processing, stability, transportation and translation in prokaryotes and eukaryotes. Emerging evidence has also implicated RSS in RNA trafficking, liquid-liquid phase separation and plant responses to environmental variations such as temperature and salinity. At the molecular level, RSS is correlated with regulating splicing, polyadenylation, protein synthesis, and miRNA biogenesis and functions. In this review, we summarize newly reported methods for probing RSS in vivo and the functions and mechanisms of RSS in plant physiology.


Author(s):  
Brian M Forde ◽  
Andrew Henderson ◽  
Elliott G Playford ◽  
David Looke ◽  
Belinda C Henderson ◽  
...  

Abstract Background Diphtheria is a potentially fatal respiratory disease caused by toxigenic Corynebacterium diphtheriae. Although resistance to erythromycin has been recognized, β-lactam resistance in toxigenic diphtheria has not been described. Here, we report a case of fatal respiratory diphtheria caused by toxigenic C. diphtheriae resistant to penicillin and all other β-lactam antibiotics, and describe a novel mechanism of inducible carbapenem resistance associated with the acquisition of a mobile resistance element. Methods Long-read whole-genome sequencing was performed using Pacific Biosciences Single Molecule Real-Time sequencing to determine the genome sequence of C. diphtheriae BQ11 and the mechanism of β-lactam resistance. To investigate the phenotypic inducibility of meropenem resistance, short-read sequencing was performed using an Illumina NextSeq500 sequencer on the strain both with and without exposure to meropenem. Results BQ11 demonstrated high-level resistance to penicillin (benzylpenicillin minimum inhibitory concentration [MIC] ≥ 256 μg/mL), β-lactam/β-lactamase inhibitors and cephalosporins (amoxicillin/clavulanic acid MIC ≥ 256 μg/mL; ceftriaxone MIC ≥ 8 μg/mL). Genomic analysis of BQ11 identified acquisition of a novel transposon carrying the penicillin-binding protein (PBP) Pbp2c, responsible for resistance to penicillin and cephalosporins. When strain BQ11 was exposed to meropenem, selective pressure drove amplification of the transposon in a tandem array and led to a corresponding change from a low-level to a high-level meropenem-resistant phenotype. Conclusions We have identified a novel mechanism of inducible antibiotic resistance whereby isolates that appear to be carbapenem susceptible on initial testing can develop in vivo resistance to carbapenems with repeated exposure. This phenomenon could have significant implications for the treatment of C. diphtheriae infection, and may lead to clinical failure.


2004 ◽  
Vol 379 (2) ◽  
pp. 283-289 ◽  
Author(s):  
Marie-Chloé BOULANGER ◽  
Tina Branscombe MIRANDA ◽  
Steven CLARKE ◽  
Marco di FRUSCIO ◽  
Beat SUTER ◽  
...  

The role of arginine methylation in Drosophila melanogaster is unknown. We identified a family of nine PRMTs (protein arginine methyltransferases) by sequence homology with mammalian arginine methyltransferases, which we have named DART1 to DART9 (Drosophila arginine methyltransferases 1–9). In keeping with the mammalian PRMT nomenclature, DART1, DART4, DART5 and DART7 are the putative homologues of PRMT1, PRMT4, PRMT5 and PRMT7. Other DART family members have a closer resemblance to PRMT1, but do not have identifiable homologues. All nine genes are expressed in Drosophila at various developmental stages. DART1 and DART4 have arginine methyltransferase activity towards substrates, including histones and RNA-binding proteins. Amino acid analysis of the methylated arginine residues confirmed that both DART1 and DART4 catalyse the formation of asymmetrical dimethylated arginine residues and they are type I arginine methyltransferases. The presence of PRMTs in D. melanogaster suggests that flies are a suitable genetic system to study arginine methylation.

