scholarly journals Genome-wide Identification of Zero Nucleotide Recursive Splicing inDrosophila

2014 ◽  
Author(s):  
Michael O Duff ◽  
Sara Olson ◽  
Xintao Wei ◽  
Ahmad Osman ◽  
Alex Plocik ◽  
...  

Recursive splicing is a process in which large introns are removed in multiple steps by resplicing at ratchet points - 5? splice sites recreated after splicing. Recursive splicing was first identified in the Drosophila Ultrabithorax (Ubx) gene and only three additional Drosophila genes have since been experimentally shown to undergo recursive splicing. Here, we identify 196 zero nucleotide exon ratchet points in 130 introns of 115 Drosophila genes from total RNA sequencing data generated from developmental time points, dissected tissues, and cultured cells. Recursive splicing events were identified by splice junctions that map to annotated 5? splice sites and unannotated intronic 3? splice sites, the presence of the sequence AG/GT at the 3? splice site, and a 5? to 3? gradient of decreasing RNA-Seq read density indicative of co-transcriptional splicing. The sequential nature of recursive splicing was confirmed by identification of lariat introns generated by splicing to and from the ratchet points. We also show that recursive splicing is a constitutive process, and that the sequence and function of ratchet points are evolutionarily conserved. Together these results indicate that recursive splicing is commonly used in Drosophila and provides insight into the mechanisms by which some introns are removed.


2017 ◽  
Author(s):  
Christopher Wilks ◽  
Phani Gaddipati ◽  
Abhinav Nellore ◽  
Ben Langmead

AbstractAs more and larger genomics studies appear, there is a growing need for comprehensive and queryable cross-study summaries. Snaptron is a search engine for summarized RNA sequencing data with a query planner that leverages R-tree, B-tree and inverted indexing strategies to rapidly execute queries over 146 million exon-exon splice junctions from over 70,000 human RNA-seq samples. Queries can be tailored by constraining which junctions and samples to consider. Snaptron can also rank and score junctions according to tissue specificity or other criteria. Further, Snaptron can rank and score samples according to the relative frequency of different splicing patterns. We outline biological questions that can be explored with Snaptron queries, including a study of novel exons in annotated genes, of exonization of repetitive element loci, and of a recently discovered alternative transcription start site for the ALK gene. Web app and documentation are at http://snaptron.cs.jhu.edu. Source code is at https://github.com/ChristopherWilks/snaptron under the MIT license.



BMC Genomics ◽  
2018 ◽  
Vol 19 (1) ◽  
Author(s):  
Boas Pucker ◽  
Samuel F. Brockington


2020 ◽  
Vol 21 (5) ◽  
pp. 1754 ◽  
Author(s):  
Enrico Gaffo ◽  
Michele Bortolomeazzi ◽  
Andrea Bisognin ◽  
Piero Di Battista ◽  
Federica Lovisa ◽  
...  

MicroRNA-offset RNAs (moRNAs) are microRNA-like small RNAs generated by microRNA precursors. To date, little is known about moRNAs and bioinformatics tools to inspect their expression are still missing. We developed miR&moRe2, the first bioinformatics method to consistently characterize microRNAs, moRNAs, and their isoforms from small RNA sequencing data. To illustrate miR&moRe2 discovery power, we applied it to several published datasets. MoRNAs identified by miR&moRe2 were in agreement with previous research findings. Moreover, we observed that moRNAs and new microRNAs predicted by miR&moRe2 were downregulated upon the silencing of the microRNA-biogenesis pathway. Further, in a sizeable dataset of human blood cell populations, tens of novel miRNAs and moRNAs were discovered, some of them with significantly varied expression levels among the cell types. Results demonstrate that miR&moRe2 is a valid tool for a comprehensive study of small RNAs generated from microRNA precursors and could help to investigate their biogenesis and function.



2016 ◽  
Vol 2016 ◽  
pp. 1-9 ◽  
Author(s):  
Martin Převorovský ◽  
Martina Hálová ◽  
Kateřina Abrhámová ◽  
Jiří Libus ◽  
Petr Folk

Pre-mRNA splicing represents an important regulatory layer of eukaryotic gene expression. In the simple budding yeast Saccharomyces cerevisiae, about one-third of all mRNA molecules undergo splicing, and splicing efficiency is tightly regulated, for example, during meiotic differentiation. S. cerevisiae features a streamlined, evolutionarily highly conserved splicing machinery and serves as a favourite model for studies of various aspects of splicing. RNA-seq represents a robust, versatile, and affordable technique for transcriptome interrogation, which can also be used to study splicing efficiency. However, convenient bioinformatics tools for the analysis of splicing efficiency from yeast RNA-seq data are lacking. We present a complete workflow for the calculation of genome-wide splicing efficiency in S. cerevisiae using strand-specific RNA-seq data. Our pipeline takes sequencing reads in the FASTQ format and provides splicing efficiency values for the 5′ and 3′ splice junctions of each intron. The pipeline is based on up-to-date open-source software tools and requires very limited input from the user. We provide all relevant scripts in a ready-to-use form. We demonstrate the functionality of the workflow using RNA-seq datasets from three spliceosome mutants. The workflow should prove useful for studies of yeast splicing mutants or of regulated splicing, for example, under specific growth conditions.



2017 ◽  
Author(s):  
Abolfazl Doostparast Torshizi ◽  
Chris Armoskus ◽  
Hanwen Zhang ◽  
Marc P. Forrest ◽  
Siwei Zhang ◽  
...  

AbstractTissue-specific reverse engineering of transcriptional networks has uncovered master regulators (MRs) of cellular networks in various cancers, yet the application of this method to neuropsychiatric disorders is largely unexplored. Here, using RNA-Seq data on postmortem dorsolateral prefrontal cortex (DLPFC) from schizophrenia (SCZ) patients and control subjects, we deconvolved the transcriptional network to identify MRs that mediate expression of a large body of target genes. Together with an independent RNA-Seq data on cultured cells derived from olfactory neuroepithelium, we identified TCF4, a leading SCZ risk locus implicated by genome-wide association studies, as one of the top candidate MRs that may be potentially dysregulated in SCZ. We validated the dysregulated TCF4-related transcriptional network through examining the transcription factor binding footprints inferred from human induced pluripotent stem cell (hiPSC)-derived neuronal ATAC-Seq data, as well as direct binding sites obtained from ChIP-seq data in SH-SY5Y cells. The predicted TCF4 transcriptional targets were enriched for genes showing transcriptomic changes upon knockdown of TCF4 in hiPSC-derived neural progenitor cells (NPC) and glutamatergic neurons (Glut_N), based on observations from three separate cell lines. The altered TCF4 gene network perturbations in NPC, as compared to that in Glut_N, was more similar to the expression differences in the TCF4 gene network observed in the DLPFC of individuals with SCZ. Moreover, TCF4-associated gene expression changes in NPC were more enriched than Glut_N for pathways involved in neuronal activity, genome-wide significant SCZ risk genes, and SCZ-associated de novo mutations. Our results suggest that TCF4 may potentially serve as a MR of a gene network that confers susceptibility to SCZ at early stage of neurodevelopment, highlighting the importance of network dysregulation involving core genes and many hundreds of peripheral genes in conferring susceptibility to neuropsychiatric diseases.



BMC Genomics ◽  
2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Junliang Yin ◽  
Jiahui Yan ◽  
Lu Hou ◽  
Liling Jiang ◽  
Wenrong Xian ◽  
...  

Abstract Background As a popular and valuable technique, grafting is widely used to protect against soil-borne diseases and nematodes in vegetable production. Growing evidences have revealed that long intergenic ncRNAs (lincRNAs) are strictly regulated and play essential roles in plants development and stress responses. Nevertheless, genome-wide identification and function deciphering of pepper lincRNAs, especially for their roles in improving grafting pepper resistance to Phytophthora capsici is largely unknown. Results In this study, RNA-seq data of grafting and control pepper plants with or without P. capsici inoculation were used to identify lincRNAs. In total, 2,388 reliable lincRNAs were identified. They were relatively longer and contained few exons than protein-coding genes. Similar to coding genes, lincRNAs had higher densities in euchromatin regions; and longer chromosome transcribed more lincRNAs. Expression pattern profiling suggested that lincRNAs commonly had lower expression than mRNAs. Totally, 607 differentially expressed lincRNAs (DE-lincRANs) were identified, of which 172 were found between P. capsici resistance grafting pepper sample GR and susceptible sample LDS. The neighboring genes of DE-lincRNAs and miRNAs competitively sponged by DE-lincRNAs were identified. Subsequently, the expression level of DE-lincRNAs was further confirmed by qRT-PCR and regulation patterns between DE-lincRNAs and neighboring mRNAs were also validated. Function annotation revealed that DE-lincRNAs increased the resistance of grafting prepper to P. capsici by modulating the expression of disease-defense related genes through cis-regulating and/or lincRNA-miRNA-mRNA interaction networks. Conclusions This study identified pepper lincRNAs and suggested their potential roles in increasing the resistance level of grafting pepper to P. capsici.



2015 ◽  
Author(s):  
Stefan Canzar ◽  
Sandro Andreotti ◽  
David Weese ◽  
Knut Reinert ◽  
Gunnar W. Klau

We present CIDANE, a novel framework for genome-based transcript reconstruction and quantification from RNA-seq reads. CIDANE assembles transcripts with significantly higher sensitivity and precision than existing tools, while competing in speed with the fastest methods. In addition to reconstructing transcripts ab initio, the algorithm also allows to make use of the growing annotation of known splice sites, transcription start and end sites, or full-length transcripts, which are available for most model organisms. CIDANE supports the integrated analysis of RNA-seq and additional gene-boundary data and recovers splice junctions that are invisible to other methods. CIDANE is available at http://ccb.jhu.edu/software/cidane/.



2021 ◽  
Author(s):  
Hsin-Yen Larry Wu ◽  
Polly Yingshan Hsu

Abstract Background: Ribo-seq has revolutionized the study of genome-wide mRNA translation. High-quality Ribo-seq data display strong 3-nucleotide (nt) periodicity, which corresponds to translating ribosomes deciphering three nts at a time. While 3-nt periodicity has been widely used to study novel translation events such as upstream ORFs in 5’ untranslated regions and small ORFs in presumed non-coding RNAs, tools that allow the visualization of these events remain underdeveloped.Results: RiboPlotR is a visualization package written in R that presents both RNA-seq coverage and Ribo-seq reads in genomic coordinates for all annotated transcript isoforms of a gene. Specifically, for individual isoform models, RiboPlotR plots Ribo-seq data related to splice junctions and presents the reads for all three reading frames in three different colors. Moreover, RiboPlotR shows Ribo-seq reads in upstream ORFs, 5' and 3' untranslated regions and introns, which is critical for observing new translation events and identifying potential regulatory mechanisms.Conclusions: RiboPlotR is freely available (https://github.com/hsinyenwu/RiboPlotR and https://sourceforge.net/projects/riboplotr/) and allows the visualization of translated features identified in Ribo-seq data.



2020 ◽  
Author(s):  
Shuhei Noguchi ◽  
Hideya Kawaji ◽  
Takeya Kasukawa

AbstractBackgroundGenome mapping is an essential step in data processing for transcriptome analysis, and many previous studies have evaluated various methods and strategies for mapping RNA-seq data. Cap Analysis of Gene Expression (CAGE) is a sequencing-based protocol particularly designed to capture the 5□-ends of transcripts for quantitatively measuring the expression levels of transcription start sites genome-wide. Because CAGE analysis can also predict the activities of promoters and enhancers, this protocol has been an essential tool in studies of transcriptional regulation. Typically, the same mapping software is used to align both RNA-seq data and CAGE reads to a reference genome, but which mapping software and options are most appropriate for mapping the 5□-end sequence reads obtained through CAGE has not previously been evaluated systematically.ResultsHere we assessed various strategies for aligning CAGE reads, particularly ∼50-bp sequences, with the human genome by using the HISAT2, LAST, and STAR programs both with and without a reference transcriptome. One of the major inconsistencies among the tested strategies involves alignments to pseudogenes and parent genes: some of the strategies prioritized alignments with pseudogenes even when the read could be aligned with coding genes with fewer mismatches. Another inconsistency concerned the detection of exon-exon junctions. These preferences depended on the program applied and whether a reference transcriptome was included. Overall, the choice of strategy yielded different mapping results for approximately 2% of all promoters.ConclusionsAlthough the various alignment strategies produced very similar results overall, we noted several important and measurable differences. In particular, using the reference transcriptome in STAR yielded alignments with the fewest mismatches. In addition, the inconsistencies among the strategies were especially noticeable regarding alignments to pseudogenes and novel splice junctions. Our results indicate that the choice of alignment strategy is important because it might affect the biological interpretation of the data.



Blood ◽  
2017 ◽  
Vol 130 (Suppl_1) ◽  
pp. 931-931
Author(s):  
Marilyn Parra ◽  
Benjamin W Booth ◽  
Gene W Yeo ◽  
Richard Weiszmann ◽  
Susan E Celniker ◽  
...  

Abstract Proper expression of the MDS-disease gene, SF3B1, ensures appropriate pre-mRNA splicing in erythroid progenitors and during terminal erythropoiesis. We previously showed that the SF3B1 gene is post-transcriptionally regulated in a differentiation stage-specific manner by intron retention (IR), such that ~50% of its transcripts in mature erythroblasts retain intron 4. Based on new mechanistic studies, we propose a model in which mostly unannotated and noncoding exons within intron 4 function as splicing decoys; i.e., they promote retention of intron 4 by interacting with, and blocking splice sites of, the adjacent exons 4 and 5. A total of six putative decoy exons were revealed via RT-PCR and RNA-seq analysis of RNA from erythroblasts treated with inhibitors of nonsense-mediated decay. That decoy exons have IR-promoting activity is suggested by several criteria. First, the frequency of interaction between constitutive exons 4 and 5 and putative decoy exons within intron 4, measured by the abundance of splice junctions in RNA-seq read data, is temporally correlated with levels of intron 4 retention during terminal erythropoiesis. Both IR and decoy splice junctions were low in early stage erythroblasts and much higher in mature erythroblasts. Second, selected decoy exons exhibited IR-promoting activity in the context of minigene splicing reporters expressing the exon 3-6 region of SF3B1 in transfected K562 cells. The wild type minigene reproduced the intron-specific retention phenotype, since it was fully spliced at introns 3 and 5 but exhibited substantial retention of intron 4, whereas deletion of decoy exon 4e, or mutation of its splice sites, substantially decreased IR. Third, RBP (RNA binding protein) cross-linking data from K562 cells show that 3' splice site factors including U2AF1 and U2AF2 can bind specifically to 3' splice sites of intron 4's decoy exons. Finally, several experiments showed that IR-promoting activity of decoy exons is a more general phenomenon that likely governs IR in other erythroid genes. We observed not only that SF3B1 intron 4 decoy exons could promote IR in heterologous contexts, but also that predicted decoy exons from other erythroblast transcripts could promote IR in the SF3B1 minigene. Apart from this experimental data, comparative genomics revealed that the SF3B1 decoy exons are extremely conserved among vertebrate genomes, with two of the exons being essentially identical from fish to humans. Together this data supports the hypothesis that a subset of up-regulated IR events in late erythroblasts are controlled by decoy exons that block productive splicing at the flanking exons. We propose that regulated IR is an important post-transcriptional mechanism for adjusting cellular splicing capacity during terminal erythropoiesis by regulating expression of key splicing factors such as SF3B1. Disclosures No relevant conflicts of interest to declare.



Sign in / Sign up

Export Citation Format

Share Document