scholarly journals TBDB – A database of structurally annotated T-box riboswitch:tRNA pairs

2020 ◽  
Author(s):  
Jorge A. Marchand ◽  
Merrick D. Pierson Smela ◽  
Thomas H. H. Jordan ◽  
Kamesh Narasimhan ◽  
George M. Church

AbstractT-box riboswitches constitute a large family of tRNA-binding leader sequences that play a central role in gene regulation in many gram-positive bacteria. Accurate inference of the tRNA binding to T-boxes is critical to predict their cis-regulatory activity. However, there is no central repository of information on the tRNA binding specificities of T-box riboswitches and de novo prediction of binding specificities requires advance knowledge of computational tools to annotate riboswitch secondary structure features. Here we present T-box annotation Database (TBDB,https://tbdb.io), an open-access database with a collection of 23,497 T-box sequences, spanning the major phyla of 3,621 bacterial species. Among structural predictions, the TBDB also identifies specifier sequences, cognate tRNA binding partners, and downstream regulatory target. To our knowledge, the TBDB presents the largest collection of feature, sequence, and structural annotations carried out on this important family of regulatory RNA.

2020 ◽  
Vol 49 (D1) ◽  
pp. D229-D235
Author(s):  
Jorge A Marchand ◽  
Merrick D Pierson Smela ◽  
Thomas H H Jordan ◽  
Kamesh Narasimhan ◽  
George M Church

Abstract T-box riboswitches constitute a large family of tRNA-binding leader sequences that play a central role in gene regulation in many gram-positive bacteria. Accurate inference of the tRNA binding to T-box riboswitches is critical to predict their cis-regulatory activity. However, there is no central repository of information on the tRNA binding specificities of T-box riboswitches, and de novo prediction of binding specificities requires advanced knowledge of computational tools to annotate riboswitch secondary structure features. Here, we present the T-box Riboswitch Annotation Database (TBDB, https://tbdb.io), an open-access database with a collection of 23,535 T-box riboswitch sequences, spanning the major phyla of 3,632 bacterial species. Among structural predictions, the TBDB also identifies specifier sequences, cognate tRNA binding partners, and downstream regulatory targets. To our knowledge, the TBDB presents the largest collection of feature, sequence, and structural annotations carried out on this important family of regulatory RNA.


2015 ◽  
Vol 112 (4) ◽  
pp. 1113-1118 ◽  
Author(s):  
Anna V. Sherwood ◽  
Frank J. Grundy ◽  
Tina M. Henkin

The T box riboswitch regulates many amino acid-related genes in Gram-positive bacteria. T box riboswitch-mediated gene regulation was shown previously to occur at the level of transcription attenuation via structural rearrangements in the 5′ untranslated (leader) region of the mRNA in response to binding of a specific uncharged tRNA. In this study, a novel group of isoleucyl-tRNA synthetase gene (ileS) T box leader sequences found in organisms of the phylum Actinobacteria was investigated. The Stem I domains of these RNAs lack several highly conserved elements that are essential for interaction with the tRNA ligand in other T box RNAs. Many of these RNAs were predicted to regulate gene expression at the level of translation initiation through tRNA-dependent stabilization of a helix that sequesters a sequence complementary to the Shine–Dalgarno (SD) sequence, thus freeing the SD sequence for ribosome binding and translation initiation. We demonstrated specific binding to the cognate tRNAIle and tRNAIle-dependent structural rearrangements consistent with regulation at the level of translation initiation, providing the first biochemical demonstration, to our knowledge, of translational regulation in a T box riboswitch.


eLife ◽  
2018 ◽  
Vol 7 ◽  
Author(s):  
Jiacheng Zhang ◽  
Bhaskar Chetnani ◽  
Eric D Cormack ◽  
Dulce Alonso ◽  
Wei Liu ◽  
...  

T-box riboswitches are cis-regulatory RNA elements that regulate the expression of proteins involved in amino acid biosynthesis and transport by binding to specific tRNAs and sensing their aminoacylation state. While the T-box modular structural elements that recognize different parts of a tRNA have been identified, the kinetic trajectory describing how these interactions are established temporally remains unclear. Using smFRET, we demonstrate that tRNA binds to the riboswitch in two steps, first anticodon recognition followed by the sensing of the 3’ NCCA end, with the second step accompanied by a T-box riboswitch conformational change. Studies on site-specific mutants highlight that specific T-box structural elements drive the two-step binding process in a modular fashion. Our results set up a kinetic framework describing tRNA binding by T-box riboswitches, and suggest such binding mechanism is kinetically beneficial for efficient, co-transcriptional recognition of the cognate tRNA ligand.


Microbiome ◽  
2021 ◽  
Vol 9 (1) ◽  
Author(s):  
Hannes Petruschke ◽  
Christian Schori ◽  
Sebastian Canzler ◽  
Sarah Riesbeck ◽  
Anja Poehlein ◽  
...  

Abstract Background The intestinal microbiota plays a crucial role in protecting the host from pathogenic microbes, modulating immunity and regulating metabolic processes. We studied the simplified human intestinal microbiota (SIHUMIx) consisting of eight bacterial species with a particular focus on the discovery of novel small proteins with less than 100 amino acids (= sProteins), some of which may contribute to shape the simplified human intestinal microbiota. Although sProteins carry out a wide range of important functions, they are still often missed in genome annotations, and little is known about their structure and function in individual microbes and especially in microbial communities. Results We created a multi-species integrated proteogenomics search database (iPtgxDB) to enable a comprehensive identification of novel sProteins. Six of the eight SIHUMIx species, for which no complete genomes were available, were sequenced and de novo assembled. Several proteomics approaches including two earlier optimized sProtein enrichment strategies were applied to specifically increase the chances for novel sProtein discovery. The search of tandem mass spectrometry (MS/MS) data against the multi-species iPtgxDB enabled the identification of 31 novel sProteins, of which the expression of 30 was supported by metatranscriptomics data. Using synthetic peptides, we were able to validate the expression of 25 novel sProteins. The comparison of sProtein expression in each single strain versus a multi-species community cultivation showed that six of these sProteins were only identified in the SIHUMIx community indicating a potentially important role of sProteins in the organization of microbial communities. Two of these novel sProteins have a potential antimicrobial function. Metabolic modelling revealed that a third sProtein is located in a genomic region encoding several enzymes relevant for the community metabolism within SIHUMIx. Conclusions We outline an integrated experimental and bioinformatics workflow for the discovery of novel sProteins in a simplified intestinal model system that can be generically applied to other microbial communities. The further analysis of novel sProteins uniquely expressed in the SIHUMIx multi-species community is expected to enable new insights into the role of sProteins on the functionality of bacterial communities such as those of the human intestinal tract.


Author(s):  
Carolin Wiechers ◽  
Mangge Zou ◽  
Eric Galvez ◽  
Michael Beckstette ◽  
Maria Ebel ◽  
...  

AbstractIntestinal Foxp3+ regulatory T cell (Treg) subsets are crucial players in tolerance to microbiota-derived and food-borne antigens, and compelling evidence suggests that the intestinal microbiota modulates their generation, functional specialization, and maintenance. Selected bacterial species and microbiota-derived metabolites, such as short-chain fatty acids (SCFAs), have been reported to promote Treg homeostasis in the intestinal lamina propria. Furthermore, gut-draining mesenteric lymph nodes (mLNs) are particularly efficient sites for the generation of peripherally induced Tregs (pTregs). Despite this knowledge, the direct role of the microbiota and their metabolites in the early stages of pTreg induction within mLNs is not fully elucidated. Here, using an adoptive transfer-based pTreg induction system, we demonstrate that neither transfer of a dysbiotic microbiota nor dietary SCFA supplementation modulated the pTreg induction capacity of mLNs. Even mice housed under germ-free (GF) conditions displayed equivalent pTreg induction within mLNs. Further molecular characterization of these de novo induced pTregs from mLNs by dissection of their transcriptomes and accessible chromatin regions revealed that the microbiota indeed has a limited impact and does not contribute to the initialization of the Treg-specific epigenetic landscape. Overall, our data suggest that the microbiota is dispensable for the early stages of pTreg induction within mLNs.


2019 ◽  
Vol 476 (5) ◽  
pp. 809-826
Author(s):  
Karthik V. Rajasekar ◽  
Shuangxi Ji ◽  
Rachel J. Coulthard ◽  
Jon P. Ride ◽  
Gillian L. Reynolds ◽  
...  

Abstract SPH (self-incompatibility protein homologue) proteins are a large family of small, disulfide-bonded, secreted proteins, initially found in the self-incompatibility response in the field poppy (Papaver rhoeas), but now known to be widely distributed in plants, many containing multiple members of this protein family. Using the Origami strain of Escherichia coli, we expressed one member of this family, SPH15 from Arabidopsis thaliana, as a folded thioredoxin fusion protein and purified it from the cytosol. The fusion protein was cleaved and characterised by analytical ultracentrifugation, circular dichroism and nuclear magnetic resonance (NMR) spectroscopy. This showed that SPH15 is monomeric and temperature stable, with a β-sandwich structure. The four strands in each sheet have the same topology as the unrelated proteins: human transthyretin, bacterial TssJ and pneumolysin, with no discernible sequence similarity. The NMR-derived structure was compared with a de novo model, made using a new deep learning algorithm based on co-evolution/correlated mutations, DeepCDPred, validating the method. The DeepCDPred de novo method and homology modelling to SPH15 were then both used to derive models of the 3D structure of the three known PrsS proteins from P. rhoeas, which have only 15–18% sequence homology to SPH15. The DeepCDPred method gave models with lower discreet optimised protein energy scores than the homology models. Three loops at one end of the poppy structures are postulated to interact with their respective pollen receptors to instigate programmed cell death in pollen tubes.


2020 ◽  
Author(s):  
Lei Li ◽  
Yanjie Chao

ABSTRACTSmall proteins shorter than 50 amino acids have been long overlooked. A number of small proteins have been identified in several model bacteria using experimental approaches and assigned important functions in diverse cellular processes. The recent development of ribosome profiling technologies has allowed a genome-wide identification of small proteins and small ORFs (smORFs), but our incomplete understanding of small proteins hinders de novo computational prediction of smORFs in non-model bacterial species. Here, we have identified several sequence features for smORFs by a systematic analysis of all the known small proteins in E. coli, among which the translation initiation rate is the strongest determinant. By integrating these features into a support vector machine learning model, we have developed a novel sPepFinder algorithm that can predict conserved smORFs in bacterial genomes with a high accuracy of 92.8%. De novo prediction in E. coli has revealed several novel smORFs with evidence of translation supported by ribosome profiling. Further application of sPepFinder in 549 bacterial species has led to the identification of > 100,000 novel smORFs, many of which are conserved at the amino acid and nucleotide levels under purifying selection. Overall, we have established sPepFinder as a valuable tool to identify novel smORFs in both model and non-model bacterial organisms, and provided a large resource of small proteins for functional characterizations.


2020 ◽  
Author(s):  
Jonathan R. Belyeu ◽  
Harrison Brand ◽  
Harold Wang ◽  
Xuefang Zhao ◽  
Brent S. Pedersen ◽  
...  

AbstractEach human genome includes de novo mutations that arose during gametogenesis. While these germline mutations represent a fundamental source of new genetic diversity, they can also create deleterious alleles that impact fitness. The germline mutation rate for single nucleotide variants and factors that significantly influence this rate, such as parental age, are now well established. However, far less is known about the frequency, distribution, and features that impact de novo structural mutations. We report a large, family-based study of germline mutations, excluding aneuploidy, that affect genome structure among 572 genomes from 33 families in a multigenerational CEPH-Utah cohort and 2,363 cases of non-familial autism spectrum disorder (ASD), 1,938 unaffected siblings, and both parents (9,599 genomes in total). We find that de novo structural mutations detected by alignment-based, short-read WGS occurred at an overall rate of at least 0.160 events per genome in unaffected individuals and was significantly higher (0.206 per genome) in ASD cases. In both probands and unaffected samples, nearly 73% of de novo structural mutations arose in paternal gametes, and predict most de novo structural mutations to be caused by mutational mechanisms that do not require sequence homology. After multiple testing correction we did not observe a statistically significant correlation between parental age and the rate of de novo structural variation in offspring. These results highlight that a spectrum of mutational mechanisms contribute to germline structural mutations, and that these mechanisms likely have markedly different rates and selective pressures than those leading to point mutations.


2014 ◽  
Vol 176 (14) ◽  
pp. 357-357 ◽  
Author(s):  
M. Rheinwald ◽  
K. Hartmann ◽  
M. Hähner ◽  
G. Wolf ◽  
R. K. Straubinger ◽  
...  

The aim of this study was to investigate the prevalence of bacterial species isolated from bronchoalveolar lavage fluid (BALF) samples taken from dogs with respiratory signs and to determine their antibiotic susceptibility. Clinical cases were included in the study if they showed signs of respiratory disease and data relating to bacterial culture and susceptibility of BALF samples were available. The medical records of 493 privately owned dogs that were presented between January 1989 and December 2011 were evaluated retrospectively. In 35 per cent of samples, no bacteria were cultured. Bacteria isolated from culture-positive samples included Streptococcus species (31 per cent of positive cultures), Enterobacteriaceae (30 per cent, including Escherichia coli (15 per cent)), Staphylococcus species (19 per cent), Pasteurella species (16 per cent) and Pseudomonas species (14 per cent). Bordetella bronchiseptica as a primary respiratory pathogen was isolated in 8 per cent of cases. Enrofloxacin showed the best susceptibility pattern; 86 per cent of all isolates and 87 per cent of Gram-negative bacteria were susceptible to this antibiotic. Amoxicillin/clavulanic acid yielded the best susceptibility pattern in Gram-positive bacteria (92 per cent). Therefore, these antibiotics can be recommended for empirical or first-line treatment in dogs with bacterial lower respiratory tract infections.


Microbiology ◽  
2003 ◽  
Vol 149 (8) ◽  
pp. 2243-2250 ◽  
Author(s):  
Shin-ichi Miyata ◽  
Kenro Oshima ◽  
Shigeyuki Kakizawa ◽  
Hisashi Nishigawa ◽  
Hee-Young Jung ◽  
...  

Thymidylate kinase (TMK) catalyses the phosphorylation of dTMP to form dTDP in both the de novo and salvage pathways of dTTP synthesis in both prokaryotes and eukaryotes. Two homologues of bacterial thymidylate kinase genes were identified in a genomic library of the onion yellows (OY) phytoplasma, a plant pathogen that inhabits both plant phloem and the organs of insects. Southern blotting analysis suggested that the OY genome contained one copy of the tmk-b gene and multiple copies of the tmk-a gene. Sequencing of PCR products generated by amplification of tmk-a enabled identification of three other copies of tmk-a, although the ORF in each of these was interrupted by point mutations. The proteins, TMK-a and TMK-b, encoded by the two intact genes contained conserved motifs for catalytic activity. Both proteins were overexpressed as fusion proteins with a polyhistidine tag in Escherichia coli and purified, and TMK-b was shown to have thymidylate kinase activity. This is believed to be the first report of the catalytic activity of a phytoplasmal protein, and the OY phytoplasma is the first bacterial species to be found to have two intact homologues of tmk in its genome.


Sign in / Sign up

Export Citation Format

Share Document