scholarly journals Cluster-specific gene markers enhance Shigella and Enteroinvasive Escherichia coli in silico serotyping

2021 ◽  
Author(s):  
Xiaomei Zhang ◽  
Michael Payne ◽  
Thanh Nguyen ◽  
Sandeep Kaur ◽  
Ruiting Lan

AbstractShigella and enteroinvasive Escherichia coli (EIEC) cause human bacillary dysentery with similar invasion mechanisms and share similar physiological, biochemical and genetic characteristics. The ability to differentiate Shigella and EIEC from each other is important for clinical diagnostic and epidemiologic investigations. The existing genetic signatures may not discriminate between Shigella and EIEC. However, phylogenetically, Shigella and EIEC strains are composed of multiple clusters and are different forms of E. coli. In this study, we identified 10 Shigella clusters, 7 EIEC clusters and 53 sporadic types of EIEC by examining over 17,000 publicly available Shigella/EIEC genomes. We compared Shigella and EIEC accessory genomes to identify the cluster-specific gene markers or marker sets for the 17 clusters and 53 sporadic types. The gene markers showed 99.63% accuracy and more than 97.02% specificity. In addition, we developed a freely available in silico serotyping pipeline named Shigella EIEC Cluster Enhanced Serotype Finder (ShigEiFinder) by incorporating the cluster-specific gene markers and established Shigella/EIEC serotype specific O antigen genes and modification genes into typing. ShigEiFinder can process either paired end Illumina sequencing reads or assembled genomes and almost perfectly differentiated Shigella from EIEC with 99.70% and 99.81% cluster assignment accuracy for the assembled genomes and mapped reads respectively. ShigEiFinder was able to serotype over 59 Shigella serotypes and 22 EIEC serotypes and provided a high specificity with 99.40% for assembled genomes and 99.38% for mapped reads for serotyping. The cluster markers and our new serotyping tool, ShigEiFinder (https://github.com/LanLab/ShigEiFinder), will be useful for epidemiologic and diagnostic investigations.Impact statementThe differentiation of Shigella strains from enteroinvasive E. coli (EIEC) is important for clinical diagnosis and public health epidemiologic investigations. The similarities between Shigella and EIEC strains make this differentiation very difficult as both share common ancestries within E. coli. However, Shigella and EIEC are phylogenetically separated into multiple clusters, making high resolution separation using cluster specific genomic markers possible. In this study, we identified 17 Shigella or EIEC clusters including five that were newly identified through examination of over 17,000 publicly available Shigella and EIEC genomes. We further identified an individual or a set of cluster-specific gene markers for each cluster using comparative genomic analysis. These markers can then be used to classify isolates into clusters and were used to develop an in silico pipeline, ShigEiFinder (https://github.com/LanLab/ShigEiFinder) for accurate differentiation, cluster typing and serotyping of Shigella and EIEC from Illumina sequencing reads or assembled genomes. This study will have broad application from understanding the evolution of Shigella/EIEC to diagnosis and epidemiology.Data summarySequencing data have been deposited at the National Center for Biotechnology Information under BioProject number PRJNA692536.RepositoriesRaw sequence data are available from NCBI under the BioProject number PRJNA692536.

2021 ◽  
Vol 7 (12) ◽  
Author(s):  
Xiaomei Zhang ◽  
Michael Payne ◽  
Thanh Nguyen ◽  
Sandeep Kaur ◽  
Ruiting Lan

Shigella and enteroinvasive Escherichia coli (EIEC) cause human bacillary dysentery with similar invasion mechanisms and share similar physiological, biochemical and genetic characteristics. Differentiation of Shigella from EIEC is important for clinical diagnostic and epidemiological investigations. However, phylogenetically, Shigella and EIEC strains are composed of multiple clusters and are different forms of E. coli , making it difficult to find genetic markers to discriminate between Shigella and EIEC. In this study, we identified 10 Shigella clusters, seven EIEC clusters and 53 sporadic types of EIEC by examining over 17000 publicly available Shigella and EIEC genomes. We compared Shigella and EIEC accessory genomes to identify cluster-specific gene markers for the 17 clusters and 53 sporadic types. The cluster-specific gene markers showed 99.64% accuracy and more than 97.02% specificity. In addition, we developed a freely available in silico serotyping pipeline named Shigella EIEC Cluster Enhanced Serotype Finder (ShigEiFinder) by incorporating the cluster-specific gene markers and established Shigella and EIEC serotype-specific O antigen genes and modification genes into typing. ShigEiFinder can process either paired-end Illumina sequencing reads or assembled genomes and almost perfectly differentiated Shigella from EIEC with 99.70 and 99.74% cluster assignment accuracy for the assembled genomes and read mapping respectively. ShigEiFinder was able to serotype over 59 Shigella serotypes and 22 EIEC serotypes and provided a high specificity of 99.40% for assembled genomes and 99.38% for read mapping for serotyping. The cluster-specific gene markers and our new serotyping tool, ShigEiFinder (installable package: https://github.com/LanLab/ShigEiFinder, online tool: https://mgtdb.unsw.edu.au/ShigEiFinder/), will be useful for epidemiological and diagnostic investigations.


Author(s):  
Xiaomei Zhang ◽  
Michael Payne ◽  
Sandeep Kaur ◽  
Ruiting Lan

Shiga toxin-producing Escherichia coli (STEC) have more than 470 serotypes. The well-known STEC O157:H7 serotype is a leading cause of STEC infections in humans. However, the incidence of non-O157:H7 STEC serotypes associated with foodborne outbreaks and human infections has increased in recent years. Current detection and serotyping assays are focusing on O157 and top six (“Big six”) non-O157 STEC serogroups. In this study, we performed phylogenetic analysis of nearly 41,000 publicly available STEC genomes representing 460 different STEC serotypes and identified 19 major and 229 minor STEC clusters. STEC cluster-specific gene markers were then identified through comparative genomic analysis. We further identified serotype-specific gene markers for the top 10 most frequent non-O157:H7 STEC serotypes. The cluster or serotype specific gene markers had 99.54% accuracy and more than 97.25% specificity when tested using 38,534 STEC and 14,216 non-STEC E. coli genomes, respectively. In addition, we developed a freely available in silico serotyping pipeline named STECFinder that combined these robust gene markers with established E. coli serotype specific O and H antigen genes and stx genes for accurate identification, cluster determination and serotyping of STEC. STECFinder can assign 99.85% and 99.83% of 38,534 STEC isolates to STEC clusters using assembled genomes and Illumina reads respectively and can simultaneously predict stx subtypes and STEC serotypes. Using shotgun metagenomic sequencing reads of STEC spiked food samples from a published study, we demonstrated that STECFinder can detect the spiked STEC serotypes, accurately. The cluster/serotype-specific gene markers could also be adapted for culture independent typing, facilitating rapid STEC typing. STECFinder is available as an installable package (https://github.com/LanLab/STECFinder) and will be useful for in silico STEC cluster identification and serotyping using genome data.


2020 ◽  
Vol 17 ◽  
Author(s):  
Soudabe Kavousipour ◽  
Mahadi Barazesh ◽  
Shiva Mohammadi ◽  
Meghdad Abdollahpour- Alitappeh ◽  
Shirzad Fallahi ◽  
...  

Background:: Escherichia coli host has been the workhorse for the production of heterologous proteins due to simplicity of use, low cost, availability of various expression vectors, and widespread knowledge on its genetic characteristics, but without a suitable signal sequence, this host cannot be used for production secretory proteins. Humulin is a form of insulin used to treat hyperglycemia caused by types 1 and 2 diabetes. To improve expression and make a straightforward production of Humulin protein, we chose a series of signal peptides. Objective:: aim our study to predict the most excellent signal peptides to express secretory Humulin in E. coli organisms. Method:: Therefore, to forecast the most excellent signal peptides for expression of Humulin in Escherichia coli, 47 signal sequences from bacteria organisms were elected and the most imperative elements of them were studied. Hence, signal peptide probability along with physicochemical features was evaluated by signal 4.1, and Portparam, PROSO II servers respectively. Later, the in-silico cloning in a known pET28a plasmid system also estimated the possibility of best signal peptide+ Humulin expression in E.Coli. Results:: The outcomes demonstrated among 47 signal peptides only 2 signal peptides can be suggested as suitable signal peptides. Conclusion:: Ultimately protein yebF precursor (YEBF_ECOLI) and protein yebF precursor (YEBF_YERP3) were suggested severally; as the most excellent signal peptides to express Humulin (With D scores 0.812 and 0.623 respectively). Although verification of these results want experimental analysis.


2005 ◽  
Vol 71 (12) ◽  
pp. 7880-7887 ◽  
Author(s):  
Sang Jun Lee ◽  
Dong-Yup Lee ◽  
Tae Yong Kim ◽  
Byung Hun Kim ◽  
Jinwon Lee ◽  
...  

ABSTRACT Comparative analysis of the genomes of mixed-acid-fermenting Escherichia coli and succinic acid-overproducing Mannheimia succiniciproducens was carried out to identify candidate genes to be manipulated for overproducing succinic acid in E. coli. This resulted in the identification of five genes or operons, including ptsG, pykF, sdhA, mqo, and aceBA, which may drive metabolic fluxes away from succinic acid formation in the central metabolic pathway of E. coli. However, combinatorial disruption of these rationally selected genes did not allow enhanced succinic acid production in E. coli. Therefore, in silico metabolic analysis based on linear programming was carried out to evaluate the correlation between the maximum biomass and succinic acid production for various combinatorial knockout strains. This in silico analysis predicted that disrupting the genes for three pyruvate forming enzymes, ptsG, pykF, and pykA, allows enhanced succinic acid production. Indeed, this triple mutation increased the succinic acid production by more than sevenfold and the ratio of succinic acid to fermentation products by ninefold. It could be concluded that reducing the metabolic flux to pyruvate is crucial to achieve efficient succinic acid production in E. coli. These results suggest that the comparative genome analysis combined with in silico metabolic analysis can be an efficient way of developing strategies for strain improvement.


Microbiology ◽  
2021 ◽  
Vol 167 (3) ◽  
Author(s):  
Sathi Mallick ◽  
Shanti Kiran ◽  
Tapas Kumar Maiti ◽  
Anindya S. Ghosh

Escherichia coli low-molecular-mass (LMM) Penicillin-binding proteins (PBPs) help in hydrolysing the peptidoglycan fragments from their cell wall and recycling them back into the growing peptidoglycan matrix, in addition to their reported involvement in biofilm formation. Biofilms are external slime layers of extra-polymeric substances that sessile bacterial cells secrete to form a habitable niche for themselves. Here, we hypothesize the involvement of Escherichia coli LMM PBPs in regulating the nature of exopolysaccharides (EPS) prevailing in its extra-polymeric substances during biofilm formation. Therefore, this study includes the assessment of physiological characteristics of E. coli CS109 LMM PBP deletion mutants to address biofilm formation abilities, viability and surface adhesion. Finally, EPS from parent CS109 and its ΔPBP4 and ΔPBP5 mutants were purified and analysed for sugars present. Deletions of LMM PBP reduced biofilm formation, bacterial adhesion and their viability in biofilms. Deletions also diminished EPS production by ΔPBP4 and ΔPBP5 mutants, purification of which suggested an increased overall negative charge compared with their parent. Also, EPS analyses from both mutants revealed the appearance of an unusual sugar, xylose, that was absent in CS109. Accordingly, the reason for reduced biofilm formation in LMM PBP mutants may be speculated as the subsequent production of xylitol and a hindrance in the standard flow of the pentose phosphate pathway.


Microbiology ◽  
2021 ◽  
Vol 167 (10) ◽  
Author(s):  
James P. R. Connolly ◽  
Natasha C. A. Turner ◽  
Jennifer C. Hallam ◽  
Patricia T. Rimbi ◽  
Tom Flett ◽  
...  

Appropriate interpretation of environmental signals facilitates niche specificity in pathogenic bacteria. However, the responses of niche-specific pathogens to common host signals are poorly understood. d-Serine (d-ser) is a toxic metabolite present in highly variable concentrations at different colonization sites within the human host that we previously found is capable of inducing changes in gene expression. In this study, we made the striking observation that the global transcriptional response of three Escherichia coli pathotypes – enterohaemorrhagic E. coli (EHEC), uropathogenic E. coli (UPEC) and neonatal meningitis-associated E. coli (NMEC) – to d-ser was highly distinct. In fact, we identified no single differentially expressed gene common to all three strains. We observed the induction of ribosome-associated genes in extraintestinal pathogens UPEC and NMEC only, and the induction of purine metabolism genes in gut-restricted EHEC, and UPEC indicating distinct transcriptional responses to a common signal. UPEC and NMEC encode dsdCXA – a genetic locus required for detoxification and hence normal growth in the presence of d-ser. Specific transcriptional responses were induced in strains accumulating d-ser (WT EHEC and UPEC/NMEC mutants lacking the d-ser-responsive transcriptional activator DsdC), corroborating the notion that d-ser is an unfavourable metabolite if not metabolized. Importantly, many of the UPEC-associated transcriptome alterations correlate with published data on the urinary transcriptome, supporting the hypothesis that d-ser sensing forms a key part of urinary niche adaptation in this pathotype. Collectively, our results demonstrate distinct pleiotropic responses to a common metabolite in diverse E. coli pathotypes, with important implications for niche selectivity.


2019 ◽  
Vol 10 (1) ◽  
Author(s):  
Anand V. Sastry ◽  
Ye Gao ◽  
Richard Szubin ◽  
Ying Hefner ◽  
Sibei Xu ◽  
...  

AbstractUnderlying cellular responses is a transcriptional regulatory network (TRN) that modulates gene expression. A useful description of the TRN would decompose the transcriptome into targeted effects of individual transcriptional regulators. Here, we apply unsupervised machine learning to a diverse compendium of over 250 high-quality Escherichia coli RNA-seq datasets to identify 92 statistically independent signals that modulate the expression of specific gene sets. We show that 61 of these transcriptomic signals represent the effects of currently characterized transcriptional regulators. Condition-specific activation of signals is validated by exposure of E. coli to new environmental conditions. The resulting decomposition of the transcriptome provides: a mechanistic, systems-level, network-based explanation of responses to environmental and genetic perturbations; a guide to gene and regulator function discovery; and a basis for characterizing transcriptomic differences in multiple strains. Taken together, our results show that signal summation describes the composition of a model prokaryotic transcriptome.


mBio ◽  
2014 ◽  
Vol 5 (4) ◽  
Author(s):  
Piotr Bielecki ◽  
Uthayakumar Muthukumarasamy ◽  
Denitsa Eckweiler ◽  
Agata Bielecka ◽  
Sarah Pohl ◽  
...  

ABSTRACTmRNA profiling of pathogens during the course of human infections gives detailed information on the expression levels of relevant genes that drive pathogenicity and adaptation and at the same time allows for the delineation of phylogenetic relatedness of pathogens that cause specific diseases. In this study, we used mRNA sequencing to acquire information on the expression ofEscherichia colipathogenicity genes during urinary tract infections (UTI) in humans and to assign the UTI-associatedE. coliisolates to different phylogenetic groups. Whereas thein vivogene expression profiles of the majority of genes were conserved among 21E. colistrains in the urine of elderly patients suffering from an acute UTI, the specific gene expression profiles of the flexible genomes was diverse and reflected phylogenetic relationships. Furthermore, genes transcribedin vivorelative to laboratory media included well-described virulence factors, small regulatory RNAs, as well as genes not previously linked to bacterial virulence. Knowledge on relevant transcriptional responses that drive pathogenicity and adaptation of isolates to the human host might lead to the introduction of a virulence typing strategy into clinical microbiology, potentially facilitating management and prevention of the disease.IMPORTANCEUrinary tract infections (UTI) are very common; at least half of all women experience UTI, most of which are caused by pathogenicEscherichia colistrains. In this study, we applied massive parallel cDNA sequencing (RNA-seq) to provide unbiased, deep, and accurate insight into the nature and the dimension of the uropathogenicE. coligene expression profile during an acute UTI within the human host. This work was undertaken to identify key players in physiological adaptation processes and, hence, potential targets for new infection prevention and therapy interventions specifically aimed at sabotaging bacterial adaptation to the human host.


2019 ◽  
Vol 63 (5) ◽  
Author(s):  
Jun Li ◽  
Haihong Hao ◽  
Menghong Dai ◽  
Heying Zhang ◽  
Jianan Ning ◽  
...  

ABSTRACT This study aimed to investigate the genetic characteristics, antibiotic resistance patterns, and novel mechanisms involved in fluoroquinolone (FQ) resistance in commensal Escherichia coli isolates. The E. coli isolates were recovered from a previous clinical study and subjected to antimicrobial susceptibility testing and molecular typing. Known mechanisms of FQ resistance (target site mutations, plasmid-mediated quinolone resistance [PMQR] genes, relative expression levels of efflux pumps and porins) were detected using DNA sequencing of PCR products and real-time quantitative PCR. Whole-genome shotgun sequencing was performed on 11 representative strains to screen for single nucleotide polymorphisms (SNPs). The function of a key SNP (A1541G) was investigated by site-directed mutagenesis and allelic exchange. The results showed that long-term enrofloxacin treatment selected multidrug-resistant (MDR) E. coli isolates in the chicken gut and that these E. coli isolates had diverse genetic backgrounds. Multiple genetic alterations, including double mutations on GyrA (S83L and D87N), a single mutation on ParC (S80I) and ParE (S458E), activation of efflux pumps, and the presence of the QnrS1 protein, contributed to the high-level FQ resistance (enrofloxacin MIC [MICENR] ≥ 128 μg/ml), while the relatively low-level FQ resistance (MICENR = 8 or 16 μg/ml) was commonly mediated by decreased expression of the porin OmpF, besides enhancement of the efflux pumps. No significant relationship was observed between resistance mechanisms and virulence genes. Introduction of the A1541G mutation on aegA was able to increase FQ susceptibility by 2-fold. This study contributes to a better understanding of the development of MDR and the differences underlying the mechanisms of high-level and low-level FQ resistance in E. coli.


Animals ◽  
2020 ◽  
Vol 10 (8) ◽  
pp. 1295
Author(s):  
Isabel Carvalho ◽  
María Teresa Tejedor-Junco ◽  
Margarita González-Martín ◽  
Juan Alberto Corbera ◽  
Vanessa Silva ◽  
...  

Objective: This work aimed to determine the carriage rate of ESBL-producing Escherichia coli as well as their genetic characteristics in camels from the Canary Islands, Spain. Methods: Fecal samples were recovered from 58 healthy camels from Gran Canaria (n = 32) and Fuerteventura Islands (n = 26) during July 2019. They were seeded on MacConkey (MC) agar no supplemented and supplemented (MC + CTX) with cefotaxime (2 µg/mL). Antimicrobial susceptibility was determined by disk diffusion test (CLSI, 2018). The presence of blaCTX-M, blaSHV, blaTEM,blaCMY-2 and blaOXA-1/48 genes was tested by PCR/sequencing. Furthermore, the mcr-1 (colistin resistance), tetA/tetB (tetracycline resistance), int1 (integrase of class 1 integrons) and stx1,2 genes were analyzed. Phylogenetic groups and sequence types were determined by specific-PCR/sequencing for selected isolates. Results: E. coli was obtained from all the 58 camels in MC media (100%) and in five of them in MC + CTX media (8.6%). Furthermore, 63.8% of E. coli isolates recovered from MC agar were susceptible to all the antibiotics tested. The five E. coli isolates recovered from MC + CTX media were characterized and two of them were ESBL-producers (3.4%). Both ESBL-producer isolates carried the blaCTX-M-15 gene and belonged to the lineages ST3018 (phylogroup A) and ST69 (phylogroup B1). The 3 ESBL-negative isolates recovered from MC-CTX plates were ascribed to phylogroup-B1. Conclusions: Camels can be a source of ESBL-producer bacteria, containing the widespread blaCTX-M-15 gene associated with the lineages ST3018 and ST69.


Sign in / Sign up

Export Citation Format

Share Document