coding sequence
Recently Published Documents


TOTAL DOCUMENTS

1059
(FIVE YEARS 134)

H-INDEX

81
(FIVE YEARS 6)

2021 ◽  
Author(s):  
Yvette Luyten ◽  
Deanna Hausman ◽  
Juliana C. Young ◽  
Lindsey A. Doyle ◽  
Natalia C. Ubilla-Rodriguez ◽  
...  

Bacteriophage exclusion (BREX) phage restriction systems are found in a wide range of bacteria. Various BREX systems encode unique combinations of proteins that usually include a site-specific methyltransferase; none appear to contain a nuclease. Here we describe the identification and characterization of a Type I BREX system from Acinetobacter and the effect of deleting each BREX ORF on growth, methylation and phage restriction. The analysis identified a previously uncharacterized gene at the 5-prime end of the BREX operon that is dispensable for methylation but involved in restriction. Biochemical and crystallographic analyses of this factor, which we term BrxR (BREX Regulator), demonstrate that it forms a homodimer and specifically binds a pseudo-palindromic DNA target site upstream of its transcription start site. Precise deletion of the BrxR gene causes cell toxicity, reduces phage restriction, and significantly increases the expression of BrxC. In contrast, the introduction of a premature stop codon into the BrxR gene has little effect, implying that the BrxR coding sequence and BrxR protein have independent functional roles in BREX regulation. We speculate that the BrxR coding sequence is involved in cis regulation of BREX activity and that the BrxR protein may play an additional regulatory role, perhaps during horizontal transfer of the system.


2021 ◽  
Vol 7 (4) ◽  
pp. 77
Author(s):  
Christopher Klapproth ◽  
Rituparno Sen ◽  
Peter F. Stadler ◽  
Sven Findeiß ◽  
Jörg Fallmann

Long non-coding RNAs (lncRNAs) are widely recognized as important regulators of gene expression. Their molecular functions range from miRNA sponging to chromatin-associated mechanisms, leading to effects in disease progression and establishing them as diagnostic and therapeutic targets. Still, only a few representatives of this diverse class of RNAs are well studied, while the vast majority is poorly described beyond the existence of their transcripts. In this review we survey common in silico approaches for lncRNA annotation. We focus on the well-established sets of features used for classification and discuss their specific advantages and weaknesses. While the available tools perform very well for the task of distinguishing coding sequence from other RNAs, we find that current methods are not well suited to distinguish lncRNAs or parts thereof from other non-protein-coding input sequences. We conclude that the distinction of lncRNAs from intronic sequences and untranslated regions of coding mRNAs remains a pressing research gap.


2021 ◽  
Author(s):  
Phillip C. Burke ◽  
Heungwon Park ◽  
Arvind Rasi Subramaniam

AbstractStability of eukaryotic mRNAs is associated with their codon, amino acid, and GC content. Yet, coding sequence motifs that predictably alter mRNA stability in human cells remain poorly defined. Here, we develop a massively parallel assay to measure mRNA effects of thousands of synthetic and endogenous coding sequence motifs in human cells. We identify several families of simple dipeptide repeats whose translation triggers acute mRNA instability. Rather than individual amino acids, specific combinations of bulky and positively charged amino acids are critical for the destabilizing effects of dipeptide repeats. Remarkably, dipeptide sequences that form extended β strands in silico and in vitro drive ribosome stalling and mRNA instability in vivo. The resulting nascent peptide code underlies ribosome stalling and mRNA-destabilizing effects of hundreds of endogenous peptide sequences in the human proteome. Our work reveals an intrinsic role for the ribosome as a selectivity filter against the synthesis of bulky and aggregation-prone peptides.


2021 ◽  
Author(s):  
Muqing Ma ◽  
Ting Li ◽  
David J. Lemon ◽  
Eduardo A. Caro ◽  
Linnea Judith Ritchie ◽  
...  

Organisms frequently encounter environments with nutrient shortages and their survival depends on changes in physiology and the ability to conserve resources. In bacteria, many physiological changes associated with starvation have been identified, but the underlying genetic components and regulatory networks that direct these physiological changes are often poorly defined. Here, we aimed to better define the gene regulatory networks that mediate the starvation response in Myxococcus xanthus, a bacterium that copes with starvation by producing fruiting bodies filled with dormant and stress-resistant spores. We focused on the direct promoter/gene targets of Nla28, a transcriptional activator/enhancer binding protein (EBP) that is important for early rounds of gene expression following starvation. Using expression profiling to identify genes that are downregulated in nla28 mutant cells and bioinformatics to identify the putative promoters of these genes, 12 potential promoter targets (37 genes) of Nla28 were identified. The results of in vitro promoter binding assays, coupled with in vitro and in vivo mutational analyses, suggested that the 12 promoters are in vivo targets of Nla28 and that Nla28 dimers use tandem, imperfect repeats of an 8-bp sequence for binding. Interestingly, nine of the Nla28 target promoters are intragenic, located in the protein coding sequence of an upstream gene or in the protein coding sequence of one gene within an operon (internal promoters). Based on mutational analyses, we concluded that the 12 Nla28 target loci contain at least one gene important for production of stress-resistant spores following starvation. Most of these loci contain genes predicted to be involved in regulatory or defense-related functions. Using the consensus Nla28 binding sequence, followed by bioinformatics and expression profiling, 58 additional promoters and 102 genes were tagged as potential Nla28 targets. Among these putative Nla28 targets, functions such as regulatory, metabolic and cell envelope biogenesis were commonly assigned to genes.


Blood ◽  
2021 ◽  
Vol 138 (Supplement 1) ◽  
pp. 1500-1500
Author(s):  
Fiorenza Irushani Vanderwert ◽  
Benedetta Sordi ◽  
Francesco Mannelli ◽  
Boaz Palterer ◽  
Francesca Gesullo ◽  
...  

Abstract Introduction Hereditary alpha-Tryptasemia (HαT) is a group of genetically defined traits that share increased copy number of TPSAB1 gene encoding for both the α- and β-alleles (Lyons et al 2018). Increased copy number (CN) of the α-tryptase coding sequence in TPSAB1 on one or both alleles represents the genetic base of HαT (Lyons et al 2016). HαT is characterized by mild elevation in serum tryptase levels and a variety of mast cell (MC) activation symptoms, including recurrent anaphylaxis. Prevalence in the general population is up to 5%, that increases up to 20% in systemic mastocytosis (SM); it has been suggested that HαT might be a germline variant predisposing to SM development. In SM, HαT correlated with higher incidence of mediator-related symptoms (Greiner G et al, Blood 2021). Due to its complexity, the assay for TPSAB1 CN is performed in few centers and the actual prevalence of HαT in selected subsets is still to be elucidated. To this end, we screened for HαT 2 groups of subjects, the first with MC activation symptoms and no evidence of SM, the second with diagnosis of SM according to WHO-2016. Methods Droplet digital PCR (ddPCR) was used to measure CN variation (CNV) in TPSAB1 by adapting standard CNV ddPCR protocol to genotype for both TPSAB1 and TPSB2 (Fig.1A). The high homology between α and β encoding isoforms and the presence of paralogous genes in a single locus makes TPSAB1 CNV detection very challenging. ddPCR was performed on genomic DNA with/without BamHI, using the PrimePCR ddPCR Copy Number reference AP3B1 (BioRad). Accuracy and precision of the ddPCR protocol was assessed by analyzing 10 samples in triplicate in 3 separate experiments. Data robustness and repeatability can be appreciated in Fig 1B. Results We studied 41 subjects with mediator-related symptoms and augmented basal serum tryptase (BST) (cohort 1) and 150 patients with ascertained diagnosis of mastocytosis (cohort 2). The BST threshold established for cohort 1 was equal or higher than 11 mcg/L. Median age was 64.7 yr, males 54%; median BST levels was 15.3 (range 12.3-21 mcg/L); 29% of the pts had history of anaphylaxis. In cohort 2, 134/150 (89.3%) pts had a diagnosis of SM, whereas 13/150 (8,6%) were Cutaneous Mastocytosis (CM). Among SM patients, 113(84.3%) presented with non-advanced SM variants. Advanced forms including aggressive SM (ASM) and SM with an associated hematological neoplasm (SM-AHN) were diagnosed in 6 (4.5%) and 8 (6%) respectively. In 3 pts, SM subtype was not available. Median age was 49 yr, males 55%; 41.7% of the pts had history of anaphylaxis. HαT was documented in 27 (65.9%) subjects in cohort 1, and 14 (9.3%) in cohort 2. In cohort 1, 3 α-tryptase (3α) copy number was observed more frequently (59.2% of HαT+ pts); conversely 3α and 2α-tryptase copy number were observed at a similar rate (42,8%) in cohort 2. HαT+ pts in cohort 1 presented significantly higher BST (17.1 vs 12.05 mgc/L, P<0.001), as previously reported (Greiner G et al, Blood 2021); however, occurrence of mediator related symptoms was comparable to HαT wt, 72% vs 71.4% , respectively; likewise for anaphylaxis (28% in HαT+ vs 33%). In cohort 2, BST levels were similar in HαT+ and HαT wt pts (24.6 and 24.3 mcg/L), as were anaphylaxis episodes (50% and 41%, respectively). A trend for lower MC burden in HαT+ as assessed by flow cytometry was demonstrated (% of bone marrow MC: 0.01% in HαT+ vs 0.07%) whereas no meaningful differences emerged regarding the symptom burden. In addition, a lower prevalence of KIT 816V mutation was observed in HαT+ (71.4% vs 89.5%; p=0.073). Conclusions In our study HαT+ was observed in around 10% of patients with SM, a prevalence lower than previously reported (Greiner et al, Blood 2021) and remarkably lower than in a selected cohort of subjects with raised BTL and history of mediator-released symptoms (66%). ddPCR represents a suitable method to investigate the presence of CNV in the α-tryptase coding sequence. Genetic testing for HαT+ should be considered in the diagnostic workout of patients presenting with anaphylaxis or MC mediator-related symptoms and no suspicion/evidence of SM. The clinical correlates of HαT in SM remain to be fully ascertained. Supported by IMH no.GR-2016-02362631 and AIRC, Mynerva project no21267 Figure 1 Figure 1. Disclosures Elena: CELGENE: Other: funding for meeting participation; PFIZER: Membership on an entity's Board of Directors or advisory committees; NOVARTIS: Membership on an entity's Board of Directors or advisory committees; GILEAD: Membership on an entity's Board of Directors or advisory committees. Vannucchi: Novartis: Honoraria, Membership on an entity's Board of Directors or advisory committees; Incyte: Honoraria, Membership on an entity's Board of Directors or advisory committees; BMS: Honoraria, Membership on an entity's Board of Directors or advisory committees; AbbVie: Membership on an entity's Board of Directors or advisory committees.


Cells ◽  
2021 ◽  
Vol 10 (11) ◽  
pp. 2910
Author(s):  
Ewa A. Grzybowska ◽  
Maciej Wakula

Protein binding to the non-coding regions of mRNAs is relatively well characterized and its functionality has been described in many examples. New results obtained by high-throughput methods indicate that binding to the coding sequence (CDS) by RNA-binding proteins is also quite common, but the functions thereof are more obscure. As described in this review, CDS binding has a role in the regulation of mRNA stability, but it has also a more intriguing role in the regulation of translational efficiency. Global approaches, which suggest the significance of CDS binding along with specific examples of CDS-binding RBPs and their modes of action, are outlined here, pointing to the existence of a relatively less-known regulatory network controlling mRNA stability and translation on yet another level.


2021 ◽  
Author(s):  
Alejandro Casco ◽  
Akansha Gupta ◽  
Mitchell Hayes ◽  
Reza Djavadian ◽  
Makoto Ohashi ◽  
...  

Herpesviruses employ extensive bidirectional transcription of overlapping genes to overcome length constraints on their gene product repertoire. As a consequence, many lytic transcripts cannot be measured individually by RT-qPCR or conventional RNA-seq analysis. Bruce et al. (Pathogens 2017, 6, 11; doi:10.3390/pathogens6010011) proposed an approximation method using Unique CoDing Sequences (UCDS) to estimate lytic gene abundance from KSHV RNA-seq data. Although UCDS has been widely employed, its accuracy, to our knowledge, has never been rigorously validated for any herpesvirus. In this study, we use CAGE-seq as a gold-standard to determine the accuracy of UCDS for estimating EBV lytic gene expression levels from RNA-seq data. We also introduce the Unique TranScript (UTS) method that, like UCDS, estimates transcript abundance from changes in mean RNA-seq read-depth. UTS is distinguished by its use of empirically determined 5’ and 3’ transcript ends, rather than coding sequence annotations. Compared to conventional read assignment, both UCDS and UTS improved quantitation accuracy of overlapping genes, with UTS giving the most accurate results. The UTS method discards fewer reads and may be advantageous for experiments with less sequencing depth. UTS is compatible with any aligner and, unlike isoform-aware alignment methods, can be implemented on a laptop computer. Our findings demonstrate that accuracy achieved by complex and expensive techniques such as CAGE-seq can be approximated using conventional short-read RNA-seq data when read assignment methods address transcript overlap. Although our study focuses on EBV transcription, the UTS method should be applicable across all herpesviruses and other genomes with extensively overlapping transcriptomes. IMPORTANCE Many viruses employ extensively overlapping transcript structures. This complexity makes it difficult to quantify gene expression using conventional methods including RNA-seq. Although high-throughput techniques that overcome these limitations exist, they are complex, expensive, and scarce in herpesvirus literature relative to short-read RNA-seq. Here, using Epstein-Barr virus (EBV) as a model, we demonstrate that conventional RNA-seq analysis methods fail to accurately quantify abundance of many overlapping transcripts. We further show that the previously described Unique CoDing Sequence (UCDS) and our Unique TranScript (UTS) methods greatly improve the accuracy of EBV lytic gene measurements obtained from RNA-seq data. The UTS method has the advantages of discarding fewer reads and being implementable on a laptop computer. Although this study focuses on EBV, the UCDS and UTS methods should be applicable across herpesviruses and for other viruses that make extensive use of overlapping transcription.


Author(s):  
William J. Raab ◽  
Arabella Mazzocchi ◽  
Paolo Radice ◽  
Debashis Sahoo ◽  
Chiara Castelli ◽  
...  

2021 ◽  
Author(s):  
David Stern ◽  
Clair Han

Many genomes contain rapidly evolving and highly divergent genes whose homology to genes of known function often cannot be determined from sequence similarity alone. However, coding sequence-independent features of genes, such as intron-exon boundaries, often evolve more slowly than coding sequences and can provide complementary evidence for homology. We found that a linear logistic regression classifier using only structural features of rapidly evolving bicycle aphid effector genes identified many putative bicycle homologs in aphids, phylloxerids, and scale insects, whereas sequence similarity search methods yielded few homologs in most aphids and no homologs in phylloxerids and scale insects. Subsequent examination of sequence features and intron locations supported homology assignments. Differential expression studies of newly-identified bicycle homologs, together with prior proteomic studies, support the hypothesis that BICYCLE proteins act as plant effector proteins in many aphid species and perhaps also in phylloxerids and scale insects.


2021 ◽  
Vol 7 (3) ◽  
pp. 33-39
Author(s):  
Marina V. Kubekina ◽  
Yulia Yu. Silaeva ◽  
Alexandra V. Bruter ◽  
Diana S. Korshunova ◽  
Leonid A. Ilchuk ◽  
...  

Introduction: PolG-alpha is a nuclear-encoded enzyme which provides replication and repair of mitochondrial DNA. D257A mutation of PolG-alpha leads to change in the N-terminal ”proofreading” domain, which deprives the enzyme of 3′-5′ exonuclease activity, resulting in accumulation of mutations in the mitochondrial genome. Materials and methods: Murine zygotes were microinjected with transgene construction carrying mutant murine Polg coding sequence and GFP coding sequence by a loxP-flanked STOP-cassette. Two Cre-activator strains, CMV-Cre (systemic activation) and Tie2-Cre (endothelial activation), were used for activation of the transgene. To confirm the insertion and Cre-dependent activation of the transgene, genotyping and qPCR copy number measurement of mutant Polg were performed, and GFP fluorescence was assessed. Results: Two primary transgenic animals were used as the founders for two lines with copy numbers of transgene ~7 and ~5. After systemic activation, the number of the transgene copies decreases to ~1.0 while endothelial specific activation does not affect the number of transgene copies in tail tissue. Discussion: A murine model with spatial control of mutant Polg expression has been developed. To our knowledge, this is the first transgenic model of tissue-specific mitochondrial dysfunction. Conclusion: Transgenic mice Cre-dependent expressing mutant polymerase-gamma are a novel test-system for studying mitochondrial biology and efficacy of mitoprotective drugs.


Sign in / Sign up

Export Citation Format

Share Document