scholarly journals Approximate search for known gene clusters in new genomes using PQ-trees

2021 ◽  
Vol 16 (1) ◽  
Author(s):  
Galia R. Zimerman ◽  
Dina Svetlitsky ◽  
Meirav Zehavi ◽  
Michal Ziv-Ukelson

AbstractGene clusters are groups of genes that are co-locally conserved across various genomes, not necessarily in the same order. Their discovery and analysis is valuable in tasks such as gene annotation and prediction of gene interactions, and in the study of genome organization and evolution. The discovery of conserved gene clusters in a given set of genomes is a well studied problem, but with the rapid sequencing of prokaryotic genomes a new problem is inspired. Namely, given an already known gene cluster that was discovered and studied in one genomic dataset, to identify all the instances of the gene cluster in a given new genomic sequence. Thus, we define a new problem in comparative genomics, denoted PQ-Tree Search that takes as input a PQ-tree T representing the known gene orders of a gene cluster of interest, a gene-to-gene substitution scoring function h, integer arguments $$d_T$$ d T and $$d_S$$ d S , and a new sequence of genes S. The objective is to identify in S approximate new instances of the gene cluster; These instances could vary from the known gene orders by genome rearrangements that are constrained by T, by gene substitutions that are governed by h, and by gene deletions and insertions that are bounded from above by $$d_T$$ d T and $$d_S$$ d S , respectively. We prove that PQ-Tree Search is -hard and propose a parameterized algorithm that solves the optimization variant of PQ-Tree Search in $$O^*(2^{\gamma })$$ O ∗ ( 2 γ ) time, where $$\gamma$$ γ is the maximum degree of a node in T and $$O^*$$ O ∗ is used to hide factors polynomial in the input size. The algorithm is implemented as a search tool, denoted PQFinder, and applied to search for instances of chromosomal gene clusters in plasmids, within a dataset of 1,487 prokaryotic genomes. We report on 29 chromosomal gene clusters that are rearranged in plasmids, where the rearrangements are guided by the corresponding PQ-trees. One of these results, coding for a heavy metal efflux pump, is further analysed to exemplify how PQFinder can be harnessed to reveal interesting new structural variants of known gene clusters.

mBio ◽  
2020 ◽  
Vol 11 (2) ◽  
Author(s):  
Luchao Lv ◽  
Miao Wan ◽  
Chengzhen Wang ◽  
Xun Gao ◽  
Qiwen Yang ◽  
...  

ABSTRACT Transporters belonging to the chromosomally encoded resistance-nodulation-division (RND) superfamily mediate multidrug resistance in Gram-negative bacteria. However, the cotransfer of large gene clusters encoding RND-type pumps from the chromosome to a plasmid appears infrequent, and no plasmid-mediated RND efflux pump gene cluster has yet been found to confer resistance to tigecycline. Here, we identified a novel RND efflux pump gene cluster, designated tmexCD1-toprJ1, on plasmids from five pandrug-resistant Klebsiella pneumoniae isolates of animal origin. TMexCD1-TOprJ1 increased (by 4- to 32-fold) the MICs of tetracyclines (including tigecycline and eravacycline), quinolones, cephalosporins, and aminoglycosides for K. pneumoniae, Escherichia coli, and Salmonella. TMexCD1-TOprJ1 is closely related (64.5% to 77.8% amino acid identity) to the MexCD-OprJ efflux pump encoded on the chromosome of Pseudomonas aeruginosa. In an IncFIA plasmid, pHNAH8I, the tmexCD1-toprJ1 gene cluster lies adjacent to two genes encoding site-specific integrases, which may have been responsible for its acquisition. Expression of TMexCD1-TOprJ1 in E. coli resulted in increased tigecycline efflux and in K. pneumoniae negated the efficacy of tigecycline in an in vivo infection model. Expression of TMexCD1-TOprJ1 reduced the growth of E. coli and Salmonella but not K. pneumoniae. tmexCD1-toprJ1-positive Enterobacteriaceae isolates were rare in humans (0.08%) but more common in chicken fecal (14.3%) and retail meat (3.4%) samples. Plasmid-borne tmexCD1-toprJ1-like gene clusters were identified in sequences in GenBank from Enterobacteriaceae and Pseudomonas strains from multiple continents. The possibility of further global dissemination of the tmexCD1-toprJ1 gene cluster and its analogues in Enterobacteriaceae via plasmids may be an important consideration for public health planning. IMPORTANCE In an era of increasing concerns about antimicrobial resistance, tigecycline is likely to have a critically important role in the treatment of carbapenem-resistant Enterobacteriaceae, the most problematic pathogens in human clinical settings—especially carbapenem-resistant K. pneumoniae. Here, we identified a new plasmid-borne RND-type tigecycline resistance determinant, TMexCD1-TOprJ1, which is widespread among K. pneumoniae isolates from food animals. tmexCD1-toprJ1 appears to have originated from the chromosome of a Pseudomonas species and may have been transferred onto plasmids by adjacent site-specific integrases. Although tmexCD1-toprJ1 still appears to be rare in human clinical isolates, considering the transferability of the tmexCD1-toprJ1 gene cluster and the broad substrate spectrum of TMexCD1-TOprJ1, further dissemination of this mobile tigecycline resistance determinant is possible. Therefore, from a “One Health” perspective, measures are urgently needed to monitor and control its further spread. The current low prevalence in human clinical isolates provides a precious time window to design and implement measures to tackle this.


Author(s):  
Cheng-Zhen Wang ◽  
Xun Gao ◽  
Qi-Wen Yang ◽  
Lu-Chao Lv ◽  
Miao Wan ◽  
...  

We recently identified a novel plasmid-mediated RND-type efflux pump gene cluster, tmexCD1-toprJ1 in Klebsiella pneumoniae, that conferred resistance to multiple antimicrobials, including tigecycline. While homologs of tmexCD1-toprJ1 were found encoded in many other bacterial species in GenBank, their function and transfer mechanism remain unknown. This study identified another mobile gene cluster, tmexCD2-toprJ2, co-occurring on both plasmid (pHNNC189-2) and chromosome of a clinical Raoultella ornithinolytica strain NC189 producing KPC-2, NDM-1, and RmtC. tmexCD2-toprJ2 shares high similarity at nucleotide level to tmexCD1-toprJ1 with 98.02%, 96.75%, and 99.93% identity, respectively. Phylogenetic analysis revealed that tmexCD2-toprJ2 may have originated from chromosome of a Pseudomonas species. Expression of tmexCD2-toprJ2 in Escherichia coli strain resulted in an 8-fold increase of tigecycline MIC and decreased susceptibility to other antimicrobials. Genetic context analyses demonstrated that tmexCD2-toprJ2, together with the adjacent hypothetical site-specific integrase genes, was possibly captured and mobilized by a XerD-like tyrosine recombinase system, forming a putative transposition unit (xerD-like-int-thf2-ybjD-umuD-ΔumuC1-int1-int2-hp1-hp2-tnfxB2-ISBvi2-tmexCD2-toprJ2-ΔumuC1), which was inserted into umuC-like genes in both the NC189 plasmid pHNNC189-2 and chromosome. As tmexCD1-toprJ1 and tmexCD2-toprJ2 could confer multidrug resistance, the spread of these gene clusters, associated with the new recombinase system, calls for more attention.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Woo Cheol Lee ◽  
Sungjae Choi ◽  
Ahjin Jang ◽  
Kkabi Son ◽  
Yangmee Kim

AbstractSome Gram-negative bacteria harbor lipids with aryl polyene (APE) moieties. Biosynthesis gene clusters (BGCs) for APE biosynthesis exhibit striking similarities with fatty acid synthase (FAS) genes. Despite their broad distribution among pathogenic and symbiotic bacteria, the detailed roles of the metabolic products of APE gene clusters are unclear. Here, we determined the crystal structures of the β-ketoacyl-acyl carrier protein (ACP) reductase ApeQ produced by an APE gene cluster from clinically isolated virulent Acinetobacter baumannii in two states (bound and unbound to NADPH). An in vitro visible absorption spectrum assay of the APE polyene moiety revealed that the β-ketoacyl-ACP reductase FabG from the A. baumannii FAS gene cluster cannot be substituted for ApeQ in APE biosynthesis. Comparison with the FabG structure exhibited distinct surface electrostatic potential profiles for ApeQ, suggesting a positively charged arginine patch as the cognate ACP-binding site. Binding modeling for the aryl group predicted that Leu185 (Phe183 in FabG) in ApeQ is responsible for 4-benzoyl moiety recognition. Isothermal titration and arginine patch mutagenesis experiments corroborated these results. These structure–function insights of a unique reductase in the APE BGC in comparison with FAS provide new directions for elucidating host–pathogen interaction mechanisms and novel antibiotics discovery.


Toxins ◽  
2019 ◽  
Vol 12 (1) ◽  
pp. 19 ◽  
Author(s):  
Maria B. Nowakowska ◽  
François P. Douillard ◽  
Miia Lindström

The botulinum neurotoxin (BoNT) has been extensively researched over the years in regard to its structure, mode of action, and applications. Nevertheless, the biological roles of four proteins encoded from a number of BoNT gene clusters, i.e., OrfX1-3 and P47, are unknown. Here, we investigated the diversity of orfX-p47 gene clusters using in silico analytical tools. We show that the orfX-p47 cluster was not only present in the genomes of BoNT-producing bacteria but also in a substantially wider range of bacterial species across the bacterial phylogenetic tree. Remarkably, the orfX-p47 cluster was consistently located in proximity to genes coding for various toxins, suggesting that OrfX1-3 and P47 may have a conserved function related to toxinogenesis and/or pathogenesis, regardless of the toxin produced by the bacterium. Our work also led to the identification of a putative novel BoNT-like toxin gene cluster in a Bacillus isolate. This gene cluster shares striking similarities to the BoNT cluster, encoding a bont/ntnh-like gene and orfX-p47, but also differs from it markedly, displaying additional genes putatively encoding the components of a polymorphic ABC toxin complex. These findings provide novel insights into the biological roles of OrfX1, OrfX2, OrfX3, and P47 in toxinogenesis and pathogenesis of BoNT-producing and non-producing bacteria.


2021 ◽  
Vol 85 (3) ◽  
pp. 714-721
Author(s):  
Risa Takao ◽  
Katsuyuki Sakai ◽  
Hiroyuki Koshino ◽  
Hiroyuki Osada ◽  
Shunji Takahashi

ABSTRACT Recent advances in genome sequencing have revealed a variety of secondary metabolite biosynthetic gene clusters in actinomycetes. Understanding the biosynthetic mechanism controlling secondary metabolite production is important for utilizing these gene clusters. In this study, we focused on the kinanthraquinone biosynthetic gene cluster, which has not been identified yet in Streptomyces sp. SN-593. Based on chemical structure, 5 type II polyketide synthase gene clusters were listed from the genome sequence of Streptomyces sp. SN-593. Among them, a candidate gene cluster was selected by comparing the gene organization with grincamycin, which is synthesized through an intermediate similar to kinanthraquinone. We initially utilized a BAC library for subcloning the kiq gene cluster, performed heterologous expression in Streptomyces lividans TK23, and identified the production of kinanthraquinone and kinanthraquinone B. We also found that heterologous expression of kiqA, which belongs to the DNA-binding response regulator OmpR family, dramatically enhanced the production of kinanthraquinones.


Life ◽  
2021 ◽  
Vol 11 (8) ◽  
pp. 758
Author(s):  
Xiaohe Jin ◽  
Yunlong Zhang ◽  
Ran Zhang ◽  
Kathy-Uyen Nguyen ◽  
Jonathan S. Lindsey ◽  
...  

Tolyporphins A–R are unusual tetrapyrrole macrocycles produced by the non-axenic filamentous cyanobacterium HT-58-2. A putative biosynthetic gene cluster for biosynthesis of tolyporphins (here termed BGC-1) was previously identified in the genome of HT-58-2. Here, homology searching of BGC-1 in HT-58-2 led to identification of similar BGCs in seven other filamentous cyanobacteria, including strains Nostoc sp. 106C, Nostoc sp. RF31YmG, Nostoc sp. FACHB-892, Brasilonema octagenarum UFV-OR1, Brasilonema octagenarum UFV-E1, Brasilonema sennae CENA114 and Oculatella sp. LEGE 06141, suggesting their potential for tolyporphins production. A similar gene cluster (BGC-2) also was identified unexpectedly in HT-58-2. Tolyporphins BGCs were not identified in unicellular cyanobacteria. Phylogenetic analysis based on 16S rRNA and a common component of the BGCs, TolD, points to a close evolutionary history between each strain and their respective tolyporphins BGC. Though identified with putative tolyporphins BGCs, examination of pigments extracted from three cyanobacteria has not revealed the presence of tolyporphins. Overall, the identification of BGCs and potential producers of tolyporphins presents a collection of candidate cyanobacteria for genetic and biochemical analysis pertaining to these unusual tetrapyrrole macrocycles.


2014 ◽  
Vol 58 (10) ◽  
pp. 6185-6196 ◽  
Author(s):  
Marius Spohn ◽  
Norbert Kirchner ◽  
Andreas Kulik ◽  
Angelika Jochim ◽  
Felix Wolf ◽  
...  

ABSTRACTThe emergence of antibiotic-resistant pathogenic bacteria within the last decades is one reason for the urgent need for new antibacterial agents. A strategy to discover new anti-infective compounds is the evaluation of the genetic capacity of secondary metabolite producers and the activation of cryptic gene clusters (genome mining). One genus known for its potential to synthesize medically important products isAmycolatopsis. However,Amycolatopsis japonicumdoes not produce an antibiotic under standard laboratory conditions. In contrast to mostAmycolatopsisstrains,A. japonicumis genetically tractable with different methods. In order to activate a possible silent glycopeptide cluster, we introduced a gene encoding the transcriptional activator of balhimycin biosynthesis, thebbrgene fromAmycolatopsis balhimycina(bbrAba), intoA. japonicum. This resulted in the production of an antibiotically active compound. Following whole-genome sequencing ofA. japonicum, 29 cryptic gene clusters were identified by genome mining. One of these gene clusters is a putative glycopeptide biosynthesis gene cluster. Using bioinformatic tools, ristomycin (syn. ristocetin), a type III glycopeptide, which has antibacterial activity and which is used for the diagnosis of von Willebrand disease and Bernard-Soulier syndrome, was deduced as a possible product of the gene cluster. Chemical analyses by high-performance liquid chromatography and mass spectrometry (HPLC-MS), tandem mass spectrometry (MS/MS), and nuclear magnetic resonance (NMR) spectroscopy confirmed thein silicoprediction that the recombinantA. japonicum/pRM4-bbrAbasynthesizes ristomycin A.


2007 ◽  
Vol 73 (8) ◽  
pp. 2673-2681 ◽  
Author(s):  
Arno Wegkamp ◽  
Wietske van Oorschot ◽  
Willem M. de Vos ◽  
Eddy J. Smid

ABSTRACT The pab genes for para-aminobenzoic acid (pABA) biosynthesis in Lactococcus lactis were identified and characterized. In L. lactis NZ9000, only two of the three genes needed for pABA production were initially found. No gene coding for 4-amino-4-deoxychorismate lyase (pabC) was initially annotated, but detailed analysis revealed that pabC was fused with the 3′ end of the gene coding for chorismate synthetase component II (pabB). Therefore, we hypothesize that all three enzyme activities needed for pABA production are present in L. lactis, allowing for the production of pABA. Indeed, the overexpression of the pABA gene cluster in L. lactis resulted in elevated pABA pools, demonstrating that the genes are involved in the biosynthesis of pABA. Moreover, a pABA knockout (KO) strain lacking pabA and pabB C was constructed and shown to be unable to produce folate when cultivated in the absence of pABA. This KO strain was unable to grow in chemically defined medium lacking glycine, serine, nucleobases/nucleosides, and pABA. The addition of the purine guanine, adenine, xanthine, or inosine restored growth but not the production of folate. This suggests that, in the presence of purines, folate is not essential for the growth of L. lactis. It also shows that folate is not strictly required for the pyrimidine biosynthesis pathway. L. lactis strain NZ7024, overexpressing both the folate and pABA gene clusters, was found to produce 2.7 mg of folate/liter per optical density unit at 600 nm when the strain was grown on chemically defined medium without pABA. This is in sharp contrast to L. lactis strains overexpressing only one of the two gene clusters. Therefore, we conclude that elevated folate levels can be obtained only by the overexpression of folate combined with the overexpression of the pABA biosynthesis gene cluster, suggesting the need for a balanced carbon flux through the folate and pABA biosynthesis pathway in the wild-type strain.


2001 ◽  
Vol 183 (5) ◽  
pp. 1810-1812 ◽  
Author(s):  
Sergei Stolyar ◽  
Marion Franke ◽  
Mary E. Lidstrom

ABSTRACT The expression of the two gene clusters encoding the particulate methane monooxygenase (pMMO) in Methylococcus capsulatusBath was assessed by analysis of transcripts and by use of chromosomal gene fusions. The results suggest that the two clusters are functionally redundant but that relative expression alters depending on the copper levels available for growth.


Author(s):  
Rebecca Devine ◽  
Hannah McDonald ◽  
Zhiwei Qin ◽  
Corinne Arnold ◽  
Katie Noble ◽  
...  

AbstractThe formicamycins are promising antibiotics with potent activity against Gram-positive pathogens including VRE and MRSA and display a high barrier to selection of resistant isolates. They were first identified in Streptomyces formicae KY5, which produces the formicamycins at low levels on solid agar but not in liquid culture, thus hindering further investigation of these promising antibacterial compounds. We hypothesised that by understanding the organisation and regulation of the for biosynthetic gene cluster, we could rationally refactor the cluster to increase production levels. Here we report that the for biosynthetic gene cluster consists of 24 genes expressed on nine transcripts. Seven of these transcripts, including those containing all the major biosynthetic genes, are repressed by the MarR-regulator ForJ which also controls the expression of the ForGF two-component system that initiates biosynthesis. A third cluster-situated regulator, ForZ, autoregulates and controls production of the putative MFS transporter ForAA. Consistent with these findings, deletion of forJ increased formicamycin biosynthesis 5-fold, while over-expression of forGF in the ΔforJ background increased production 10-fold compared to the wild-type. De-repression by deleting forJ also switched on biosynthesis in liquid-culture and induced the production of two novel formicamycin congeners. By combining mutations in regulatory and biosynthetic genes, six new biosynthetic precursors with antibacterial activity were also isolated. This work demonstrates the power of synthetic biology for the rational redesign of antibiotic biosynthetic gene clusters both to engineer strains suitable for fermentation in large scale bioreactors and to generate new molecules.ImportanceAntimicrobial resistance is a growing threat as existing antibiotics become increasingly ineffective against drug resistant pathogens. Here we determine the transcriptional organisation and regulation of the gene cluster encoding biosynthesis of the formicamycins, promising new antibiotics with activity against drug resistant bacteria. By exploiting this knowledge, we construct stable mutant strains which over-produce these molecules in both liquid and solid culture whilst also making some new compound variants. This will facilitate large scale purification of these molecules for further study including in vivo experiments and the elucidation of their mechanism of action. Our work demonstrates that understanding the regulation of natural product biosynthetic pathways can enable rational improvement of the producing strains.


Sign in / Sign up

Export Citation Format

Share Document