scholarly journals Splice-site Strength Estimation: A simple yet powerful approach to analyse RNA splicing

Author(s):  
Craig Dent ◽  
Shilpi Singh ◽  
Shikhar Mishra ◽  
Nawar Shamaya ◽  
Kok Ping Loo ◽  
...  

RNA splicing, and variations in this process referred to as alternative splicing, are critical aspects of gene regulation in eukaryotes. From environmental responses in plants to being a primary link between genetic variation and disease in humans, splicing differences confer extensive phenotypic changes across diverse organisms1–3. Current approaches for analysing splicing rely on quantifying variant transcripts (i.e., isoforms) or splicing events (i.e., intron retention, exon skipping etc)4, 5. However, regulation of splicing occurs at the level of selection of individual splice sites, which results in variation in the abundance of isoforms and/or splicing events. Here, we present a simple approach to quantify the strength of individual splice sites, which determines their selection in a splicing reaction. Splice-site strength, as a quantitative phenotype, allows us to analyse splicing precisely in unprecedented ways. We demonstrate the power of this approach in defining the genomic determinants of the strength of individual splice-sites through GWAS. Our pilot-GWAS with more than thousand splice sites hints that cis-sequence divergence and competition between splice-sites and are among the primary determinants of variation in splicing among natural accessions of Arabidopsis thaliana. This approach allows deciphering the principles of splicing, which in turn has implications that range from agriculture to medicine.

2021 ◽  
Vol 3 (2) ◽  
Author(s):  
Craig I Dent ◽  
Shilpi Singh ◽  
Sourav Mukherjee ◽  
Shikhar Mishra ◽  
Rucha D Sarwade ◽  
...  

Abstract RNA splicing, and variations in this process referred to as alternative splicing, are critical aspects of gene regulation in eukaryotes. From environmental responses in plants to being a primary link between genetic variation and disease in humans, splicing differences confer extensive phenotypic changes across diverse organisms (1–3). Regulation of splicing occurs through differential selection of splice sites in a splicing reaction, which results in variation in the abundance of isoforms and/or splicing events. However, genomic determinants that influence splice-site selection remain largely unknown. While traditional approaches for analyzing splicing rely on quantifying variant transcripts (i.e. isoforms) or splicing events (i.e. intron retention, exon skipping etc.) (4), recent approaches focus on analyzing complex/mutually exclusive splicing patterns (5–8). However, none of these approaches explicitly measure individual splice-site usage, which can provide valuable information about splice-site choice and its regulation. Here, we present a simple approach to quantify the empirical usage of individual splice sites reflecting their strength, which determines their selection in a splicing reaction. Splice-site strength/usage, as a quantitative phenotype, allows us to directly link genetic variation with usage of individual splice-sites. We demonstrate the power of this approach in defining the genomic determinants of splice-site choice through GWAS. Our pilot analysis with more than a thousand splice sites hints that sequence divergence in cis rather than trans is associated with variations in splicing among accessions of Arabidopsis thaliana. This approach allows deciphering principles of splicing and has broad implications from agriculture to medicine.


1993 ◽  
Vol 13 (5) ◽  
pp. 2677-2687 ◽  
Author(s):  
D A Sterner ◽  
S M Berget

Very small vertebrate exons are problematic for RNA splicing because of the proximity of their 3' and 5' splice sites. In this study, we investigated the recognition of a constitutive 7-nucleotide mini-exon from the troponin I gene that resides quite close to the adjacent upstream exon. The mini-exon failed to be included in spliced RNA when placed in a heterologous gene unless accompanied by the upstream exon. The requirement for the upstream exon disappeared when the mini-exon was internally expanded, suggesting that the splice sites bordering the mini-exon are compatible with those of other constitutive vertebrate exons and that the small size of the exon impaired inclusion. Mutation of the 5' splice site of the natural upstream exon did not result in either exon skipping or activation of a cryptic 5' splice site, the normal vertebrate phenotypes for such mutants. Instead, a spliced RNA accumulated that still contained the upstream intron. In vitro, the mini-exon failed to assemble into spliceosome complexes unless either internally expanded or accompanied by the upstream exon. Thus, impaired usage of the mini-exon in vivo was accompanied by impaired recognition in vitro, and recognition of the mini-exon was facilitated by the presence of the upstream exon in vivo and in vitro. Cumulatively, the atypical in vivo and in vitro properties of the troponin exons suggest a mechanism for the recognition of this mini-exon in which initial recognition of an exon-intron-exon unit is followed by subsequent recognition of the intron.


2017 ◽  
Author(s):  
Yuichi Shiraishi ◽  
Keisuke Kataoka ◽  
Kenichi Chiba ◽  
Ai Okada ◽  
Yasunori Kogure ◽  
...  

AbstractAlthough many driver mutations are thought to promote carcinogenesis via abnormal splicing, the landscape of these splicing-associated variants (SAVs) remains unknown due to the complexity of splicing abnormalities. Here we developed a statistical framework to identify SAVs disrupting or newly creating splice site motifs and applied it to sequencing data from 8,976 samples across 31 cancer types. We constructed a catalog of 14,438 SAVs, approximately 50% of which consist of SAVs disrupting non-canonical splice sites (including the 3rd and 5th intronic bases of donor sites) or newly creating splice sites. Smoking-related signature substantially contributes to SAV generation. As many as 14.7% of samples harbor at least one SAVs in cancer-related genes, particularly in tumor suppressors. Importantly, in addition to previously reported intron retention, exon skipping or alternative splice site usage more frequently affected these genes. Our findings delineate a comprehensive portrait of SAVs, providing a basis for cancer precision medicine.


1993 ◽  
Vol 13 (5) ◽  
pp. 2677-2687
Author(s):  
D A Sterner ◽  
S M Berget

Very small vertebrate exons are problematic for RNA splicing because of the proximity of their 3' and 5' splice sites. In this study, we investigated the recognition of a constitutive 7-nucleotide mini-exon from the troponin I gene that resides quite close to the adjacent upstream exon. The mini-exon failed to be included in spliced RNA when placed in a heterologous gene unless accompanied by the upstream exon. The requirement for the upstream exon disappeared when the mini-exon was internally expanded, suggesting that the splice sites bordering the mini-exon are compatible with those of other constitutive vertebrate exons and that the small size of the exon impaired inclusion. Mutation of the 5' splice site of the natural upstream exon did not result in either exon skipping or activation of a cryptic 5' splice site, the normal vertebrate phenotypes for such mutants. Instead, a spliced RNA accumulated that still contained the upstream intron. In vitro, the mini-exon failed to assemble into spliceosome complexes unless either internally expanded or accompanied by the upstream exon. Thus, impaired usage of the mini-exon in vivo was accompanied by impaired recognition in vitro, and recognition of the mini-exon was facilitated by the presence of the upstream exon in vivo and in vitro. Cumulatively, the atypical in vivo and in vitro properties of the troponin exons suggest a mechanism for the recognition of this mini-exon in which initial recognition of an exon-intron-exon unit is followed by subsequent recognition of the intron.


2000 ◽  
Vol 20 (21) ◽  
pp. 7955-7970 ◽  
Author(s):  
Charles M. Romfo ◽  
Consuelo J. Alvarez ◽  
Willem J. van Heeckeren ◽  
Christopher J. Webb ◽  
Jo Ann Wise

ABSTRACT Schizosaccharomyces pombe pre-mRNAs are generally multi-intronic and share certain features with pre-mRNAs fromDrosophila melanogaster, in which initial splice site pairing can occur via either exon or intron definition. Here, we present three lines of evidence suggesting that, despite these similarities, fission yeast splicing is most likely restricted to intron definition. First, mutating either or both splice sites flanking an internal exon in the S. pombe cdc2 gene produced almost exclusively intron retention, in contrast to the exon skipping observed in vertebrates. Second, we were unable to induce skipping of the internal microexon in fission yeast cgs2, whereas the default splicing pathway excludes extremely small exons in mammals. Because nearly quantitative removal of the downstream intron incgs2 could be achieved by expanding the microexon, we propose that its retention is due to steric occlusion. Third, several cryptic 5′ junctions in the second intron of fission yeastcdc2 are located within the intron, in contrast to their generally exonic locations in metazoa. The effects of expanding and contracting this intron are as predicted by intron definition; in fact, even highly deviant 5′ junctions can compete effectively with the standard 5′ splice site if they are closer to the 3′ splicing signals. Taken together, our data suggest that pairing of splice sites inS. pombe most likely occurs exclusively across introns in a manner that favors excision of the smallest segment possible.


2019 ◽  
Vol 21 (Supplement_6) ◽  
pp. vi41-vi41
Author(s):  
Tobias Bonifert ◽  
Sonali Arora ◽  
Matthew Fitzgibbon ◽  
Patrick Paddison

Abstract Chemical or genetic perturbation of the spliceosome function may represent a therapeutic opportunity for Glioblastoma multiforme (GBM) and other cancers. We and others have shown that oncogenic MYC activity causes spliceosome vulnerability and leads to deficits in 3’ splice site recognition upon spliceosome perturbation. Several spliceosome inhibitors in pre-clinical development and in clinical trials target the U2 snRNP subcomplex SF3B, which is essential for 3’ splice site recognition. We showed that knockdown of one of its subunits, PHF5A, causes GBM-specific changes in splicing, including increases in exon skipping, intron retention and activation of alternative 3’ and 5’ splice sites. However, in depth analysis of these events revealed they are disproportionately enriched for “minor” introns. This class of ~800 introns is specifically excised by an alternative splicing pathway which involves the U12-dependent spliceosome. Remarkably, we provide evidence that inhibition of PHF5A in GBM stem-like cells causes increased retention of most minor introns, as well as activation of cryptic and alternative U2 snRNP-dependent splicing sites. We further demonstrate that components of the U12 snRNP complex are differentially required for GSC-specific viability. Our results suggest that regulation of the splicing of U12-type introns may be critically important for GBM tumor cell viability.


1984 ◽  
Vol 4 (5) ◽  
pp. 966-972
Author(s):  
C Montell ◽  
E F Fisher ◽  
M H Caruthers ◽  
A J Berk

The primary transcript from adenovirus 2 early region 1B (E1B) is processed by differential RNA splicing into two overlapping mRNAs, 13S and 22S. The 22S mRNA is the major E1B mRNA during the early phase of infection, whereas the 13S mRNA predominates during the late phase. In previous work, it has been shown that this shift in proportions of the E1B mRNAs is influenced by increased cytoplasmic stability of the 13S mRNA at late times in infection. Two observations presented here demonstrate that the increase in proportion of the 13S mRNA at late times is also regulated by a change in the specificity of RNA splicing. First, the relative concentrations of the 13S to 22S nuclear RNAs were not constant throughout infection but increased at late times. Secondly, studies with the mutant, adenovirus 2 pm2250 , provided evidence that there was an increased propensity to utilize a 5' splice in the region of the 13S 5' splice site at late times in infection. Adenovirus 2 pm2250 has a G----C transversion in the first base of E1B 13S mRNA intron preventing splicing of the 13S mRNA but not of the 22S mRNA. During the early phase of a pm2250 infection, the E1B primary transcripts were processed into the 22S mRNA only. However, during the late phase, when the 13S mRNA normally predominates, E1B primary transcripts were also processed by RNA splicing at two formerly unused or cryptic 5' splice sites. Both cryptic splice sites were located much closer to the disrupted 13S 5' splice site than to the 22S 5' splice site. Thus, the temporal increase in proportion of the 13S mRNA to the 22S mRNA is regulated by two processes, an increase in cytoplasmic stability of the 13S mRNA and an increased propensity to utilize the 13S 5' splice site during the late phase of infection. Adenovirus 2 pm2250 was not defective for productive infection of HeLa cells or for transformation of rat cells.


Blood ◽  
2018 ◽  
Vol 132 (Supplement 1) ◽  
pp. 2316-2316
Author(s):  
Jihyun Song ◽  
Seonggyun Han ◽  
Ricardo Amaru ◽  
Teddy Quispe ◽  
Dongwook Kim ◽  
...  

Abstract Evolutionary adaptations to high altitude in Tibetans, Ethiopians, and Andean populations of South America have shown that Tibetans and Ethiopians have normal hemoglobin %, while most of Aymara and Quechua of the Andean highlands are polycythemic. Whole genome sequencing (WGS) in Quechua identified enriched SENP1 and ANP32D genes correlating with polycythemia (Zhou et al, Am J Hum Genet. 2013 Sep 5; 93(3): 452-462) but these genes were neither enriched nor segregated with polycythemia in Aymara. Instead, we identified that genes enriched in Aymara are related to regulation of cardiovascular development in high-altitude adapted Andeans, BRINP3, NOS2, and TBX5 (Crawford et al, Am J Hum Genet. 2017 Nov 2;101(5):752-767). To further search for Aymara propensity to polycythemia, we analyzed transcriptomes from Aymara and Europeans living in La Paz, Bolivia (3,639-4,150m) from limited amount of peripheral blood reticulocytes, platelets and granulocytes, but only granulocyte RNA was adequate for unbiased whole transcriptome analyses. In Aymaras, 2,585 genes were upregulated and 365 genes were downregulated (Adjp<0.05, fold difference <-2.0, and >2.0). Many of these modulated genes are involved in inflammatory pathways including B-cell activation (FDR=0.005) and NF-κB signaling pathway (FDR=0.011). We then analyzed differential exon usage in the transcriptome and identified 2,475 genes with alternative splicing events, comprising 1,568 exon skipping, 485 intron retention, 175 alternative 3' splice sites, 144 alternative 5' splice sites, and 902 mutually exclusive exons. These alternative spliced genes were also overrepresented in inflammatory pathways (TNF receptor, IL-1 and IL-23 mediated signaling, and NF-κB signaling). Notably we detected the previously unreported NFKB1 alternate transcripts skipping exon 4 or 5, which lead to the out-of-framed NFKB1 mRNA, generating the truncated nonfunctional NF-κB protein (Figure). Inflammation is a potent suppressor of erythropoiesis and the NF-κB is transcriptional regulator of plethora of inflammatory genes. Further, NF-κB also interacts with erythropoiesis-regulators, hypoxia-inducible factors (HIFs). By the integrative analysis of the Aymara transcriptome and WGS, we identified 46 NFKB1 splicing quantitative trait loci (sQTLs). Among these 46 sQTLs, five single nucleotide polymorphisms (SNP) were in high linkage disequilibrium, and two (rs230511 and rs230504) were more enriched in Aymara (allele frequency: 0.878) (Figure) and within a genomic region where Andeans are genetically differentiated from lowland Native Americans (peak FST = 0.37, peak PBSn1 = 0.31). These sQTLs rs230511 and rs230504 were corelated with two functionally important exon skipping (exon 4 and 5) in NFKB1 as described above. Furthermore, these two SNPs were correlated with higher hemoglobin levels and lower leukocytes; the wild-type NFKB1 transcript inversely correlated with hemoglobin%. We report Aymara have differentially expressed and alternatively spliced transcripts of genes modulating inflammation, particularly NFKB1. This Aymara enriched NFKB1 haplotype variant stands out as a major cause of Aymara adaptation to high altitude, as this truncated nonfunctional NF-κB variant peptide correlates with higher hemoglobin, lower leukocytes and suppresses inflammation. These data indicate that NFKB1 SNPs enriched in Aymara are associated with alternative spliced NFKB1 transcripts which contribute to polycythemia in Aymara. Further evaluation of NF-κB and HIFs' transcriptional activity and their correlation with inflammatory makers, hepcidin and erythroferrone in Aymara and Europeans living at the same high altitude is under way. JS and SH contributed equally to this work. YL and JTP act as equivalent co-senior authors. Figure. Figure. Disclosures No relevant conflicts of interest to declare.


1990 ◽  
Vol 10 (1) ◽  
pp. 84-94 ◽  
Author(s):  
B L Robberson ◽  
G J Cote ◽  
S M Berget

Interactions at the 3' end of the intron initiate spliceosome assembly and splice site selection in vertebrate pre-mRNAs. Multiple factors, including U1 small nuclear ribonucleoproteins (snRNPs), are involved in initial recognition at the 3' end of the intron. Experiments were designed to test the possibility that U1 snRNP interaction at the 3' end of the intron during early assembly functions to recognize and define the downstream exon and its resident 5' splice site. Splicing precursor RNAs constructed to have elongated second exons lacking 5' splice sites were deficient in spliceosome assembly and splicing activity in vitro. Similar substrates including a 5' splice site at the end of exon 2 assembled and spliced normally as long as the second exon was less than 300 nucleotides long. U2 snRNPs were required for protection of the 5' splice site terminating exon 2, suggesting direct communication during early assembly between factors binding the 3' and 5' splice sites bordering an exon. We suggest that exons are recognized and defined as units during early assembly by binding of factors to the 3' end of the intron, followed by a search for a downstream 5' splice site. In this view, only the presence of both a 3' and a 5' splice site in the correct orientation and within 300 nucleotides of one another will stable exon complexes be formed. Concerted recognition of exons may help explain the 300-nucleotide-length maximum of vertebrate internal exons, the mechanism whereby the splicing machinery ignores cryptic sites within introns, the mechanism whereby exon skipping is normally avoided, and the phenotypes of 5' splice site mutations that inhibit splicing of neighboring introns.


1989 ◽  
Vol 9 (5) ◽  
pp. 2007-2017
Author(s):  
J C Noble ◽  
H Ge ◽  
M Chaudhuri ◽  
J L Manley

To study the interaction of splicing factors with the simian virus 40 early-region pre-RNA, which can be alternatively spliced to produce large T and small t mRNAs, we used an in vitro RNase protection assay that defines the 5' boundaries of factor-RNA interactions. Protection products reflecting factor interactions with the large T and small t 5' splice sites and with the multiple lariat branch site region were characterized. All protection products were detected very early in the splicing reaction, before the appearance of spliced RNAs. However, protection of the large T 5' splice site was detected well before small t 5' splice site and branch site protection products, which appeared simultaneously. Oligonucleotide-targeted degradation of small nuclear RNAs (snRNAs) revealed that protection of the branch site region, which occurred at multiple sites, required intact U2 snRNA and was enhanced by U1 snRNA, while protection of the large T and small t 5' splice sites required both U1 and U2 snRNAs. Analysis of several pre-RNAs containing mutations in the branch site region suggests that factor interactions involving the multiple copies of the branch site consensus determine the selection of branch points, which is an important factor in the selection of alternative splicing pathways.


Sign in / Sign up

Export Citation Format

Share Document