Large-scale comparative genomics of Salmonella enterica to refine the organization of the global Salmonella population structure

2021 ◽  
Author(s):  
Chao Chun Liu ◽  
William W.L. Hsiao

1.AbstractSince the introduction of the White-Kauffmann-Le Minor (WKL) scheme for Salmonella serotyping, the nomenclature remains the most widely used for reporting the disease prevalence of Salmonella enterica across the globe. With the advent of whole genome sequencing (WGS), traditional serotyping has been increasingly replaced by in-silico methods that couple the detection of genetic variations in antigenic determinants with sequence-based typing. However, despite the integration of genomic-based typing by in-silico serotyping tools such as SeqSero2 and SISTR, in-silico serotyping in certain contexts remains ambiguous and insufficiently informative due to polyphyletic serovars. Furthermore, in spite of the widespread acknowledgement of polyphyly from genomic studies, the serotyping nomenclature remains unaltered. To prompt refinements to the Salmonella typing nomenclature for disease reporting, we herein performed a systematic characterization of putative polyphyletic serovars and the global Salmonella population structure by comparing 180,098 Salmonella genomes (representing 723 predicted serovars) from GenomeTrakr and PubMLST databases. We identified a range of core genome MLST typing thresholds that result in stable population structure, potentially suitable as the foundation of a genomic-based typing nomenclature for longitudinal surveillance. From the genomic comparisons of hundreds of predicted serovars, we demonstrated that in-silico serotyping classifications do not consistently reflect the population divergence observed at the genomic level. The organization of Salmonella subpopulations based on antigenic determinants can be confounded by homologous recombination and niche adaptation, resulting in shared classification of highly divergent genomes and misleading distinction between highly similar genomes. In consideration of the pivotal role of Salmonella serotyping, a compendium of putative polyphyletic serovars was compiled and made publicly available to provide additional context for future interpretations of in-silico serotyping results in disease surveillance settings. To refine the typing nomenclatures used in Salmonella surveillance reports, we foresee an improved typing scheme to be a hybrid that integrates both genomic and antigenic information such that the resolution from WGS is leveraged to improve the precision of subpopulation classifications while preserving the common names defined by the WKL scheme. Lastly, we stress the importance of controlled vocabulary integration for typing information in open data settings in order for the global Salmonella population dynamics to be fully trackable.2.Impact StatementSalmonella enterica (S. enterica) is a major foodborne pathogen responsible for an annual incidence rate of more than 90 million cases of foodborne illnesses worldwide. To surveil the high order Salmonella lineages, compare disease prevalence across jurisdictions worldwide, and inform risk assessments, in-silico serotyping has been established as the gold standard for typing the bacteria. However, despite previous Salmonella genomic studies reporting discordance between phylogenomic clades and serovars, refinements have yet been made to the serotyping scheme. Here, we analyzed over 180,000 Salmonella genomes representing 723 predicted serovars to subdivide the population into evolutionarily stable clusters in order to propose a stable organization of the Salmonella population structure that can form the basis of a genomic-based typing scheme for the pathogen. We described numerous instances in which genomes between serotypes are more similar than genomes within a serotype to reflect the inconsistencies of subpopulation classifications based on antigenic determinants. Moreover, we found inconsistencies between predicted serovars and reported serovars which highlighted potential errors in existing in-silico serotyping tools and the need to implement controlled vocabularies for reporting Salmonella subtypes in public databases. The findings of our study aim to motivate the future development of a standardized genomic-based typing nomenclature that more accurately captures the natural populations of S. enterica.3.Data SummaryThe assembly accession numbers of the genomes analyzed in this study (n = 204,952) and the associated metadata (e.g. sampling location, collection date, FTP address for retrieval) are documented in Table S1. The GenomeTrakr genomes were retrieved from the National Center for Biological Information GenBank database. The PubMLST genomes were retrieved using the BIGSdb API.

BMC Genomics ◽  
2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Melina Campos ◽  
Luisa D. P. Rona ◽  
Katie Willis ◽  
George K. Christophides ◽  
Robert M. MacCallum

Abstract Background Whole genome re-sequencing provides powerful data for population genomic studies, allowing robust inferences of population structure, gene flow and evolutionary history. For the major malaria vector in Africa, Anopheles gambiae, other genetic aspects such as selection and adaptation are also important. In the present study, we explore population genetic variation from genome-wide sequencing of 765 An. gambiae and An. coluzzii specimens collected from across Africa. We used t-SNE, a recently popularized dimensionality reduction method, to create a 2D-map of An. gambiae and An. coluzzii genes that reflect their population structure similarities. Results The map allows intuitive navigation among genes distributed throughout the so-called “mainland” and numerous surrounding “island-like” gene clusters. These gene clusters of various sizes correspond predominantly to low recombination genomic regions such as inversions and centromeres, and also to recent selective sweeps. Because this mosquito species complex has been studied extensively, we were able to support our interpretations with previously published findings. Several novel observations and hypotheses are also made, including selective sweeps and a multi-locus selection event in Guinea-Bissau, a known intense hybridization zone between An. gambiae and An. coluzzii. Conclusions Our results present a rich dataset that could be utilized in functional investigations aiming to shed light onto An. gambiae s.l genome evolution and eventual speciation. In addition, the methodology presented here can be used to further characterize other species not so well studied as An. gambiae, shortening the time required to progress from field sampling to the identification of genes and genomic regions under unique evolutionary processes.


Animals ◽  
2021 ◽  
Vol 11 (1) ◽  
pp. 206
Author(s):  
Md Bashir Uddin ◽  
S.M. Bayejed Hossain ◽  
Mahmudul Hasan ◽  
Mohammad Nurul Alam ◽  
Mita Debnath ◽  
...  

Colistin (polymyxin E) is widely used in animal and human medicine and is increasingly used as one of the last-resort antibiotics against Gram-negative bacilli. Due to the increased use of colistin in treating infections caused by multidrug-resistant Gram-negative bacteria, resistance to this antibiotic ought to be monitored. The study was undertaken to elucidate the molecular mechanisms, genetic relationships and phenotype correlations of colistin-resistant isolates. Here, we report the detection of the mcr-1 gene in chicken-associated Salmonella isolates in Bangladesh and its in-silico functional analysis. Out of 100 samples, 82 Salmonella spp. were isolated from chicken specimens (liver, intestine). Phenotypic disc diffusion and minimum inhibitory concentration (MIC) assay using different antimicrobial agents were performed. Salmonella isolates were characterized using PCR methods targeting genus-specific invA and mcr-1 genes with validation for the functional analysis. The majority of the tested Salmonella isolates were found resistant to colistin (92.68%), ciprofloxacin (73.17%), tigecycline (62.20%) and trimethoprim/sulfamethoxazole (60.98%). When screened using PCR, five out of ten Salmonella isolates were found to carry the mcr-1 gene. One isolate was confirmed for Salmonella enterica subsp. enterica serovar Enteritidis, and other four isolates were confirmed for Salmonella enterica subsp. enterica serovar Typhimurium. Sequencing and phylogenetic analysis revealed a divergent evolutionary relationship between the catalytic domain of Neisseria meningitidis lipooligosaccharide phosphoethanolamine transferase A (LptA) and MCR proteins, rendering them resistant to colistin. Three-dimensional homology structural analysis of MCR-1 proteins and molecular docking interactions suggested that MCR-1 and LptA share a similar substrate binding cavity, which could be validated for the functional analysis. The comprehensive molecular and in-silico analyses of the colistin resistance mcr-1 gene of Salmonella spp. of chicken origin in the present study highlight the importance of continued monitoring and surveillance for antimicrobial resistance among pathogens in food chain animals.


2021 ◽  
Vol 59 (1) ◽  
Author(s):  
Jie-Yin Chen ◽  
Steven J. Klosterman ◽  
Xiao-Ping Hu ◽  
Xiao-Feng Dai ◽  
Krishna V. Subbarao

The genomics era has ushered in exciting possibilities to examine the genetic bases that undergird the characteristic features of Verticillium dahliae and other plant pathogens. In this review, we provide historical perspectives on some of the salient biological characteristics of V. dahliae, including its morphology, microsclerotia formation, host range, disease symptoms, vascular niche, reproduction, and population structure. The kaleidoscopic population structure of this pathogen is summarized, including different races of the pathogen, defoliating and nondefoliating phenotypes, vegetative compatibility groupings, and clonal populations. Where possible, we place the characteristic differences in the context of comparative and functional genomics analyses that have offered insights into population divergence within V. dahliae and the related species. Current challenges are highlighted along with some suggested future population genomics studies that will contribute to advancing our understanding of the population divergence in V. dahliae. Expected final online publication date for the Annual Review of Phytopathology, Volume 59 is August 2021. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.


2010 ◽  
Vol 192 (24) ◽  
pp. 6465-6476 ◽  
Author(s):  
Vartul Sangal ◽  
Heather Harbottle ◽  
Camila J. Mazzoni ◽  
Reiner Helmuth ◽  
Beatriz Guerra ◽  
...  

ABSTRACT Salmonellosis caused by Salmonella enterica serovar Newport is a major global public health concern, particularly because S. Newport isolates that are resistant to multiple drugs (MDR), including third-generation cephalosporins (MDR-AmpC phenotype), have been commonly isolated from food animals. We analyzed 384 S. Newport isolates from various sources by a multilocus sequence typing (MLST) scheme to study the evolution and population structure of the serovar. These were compared to the population structure of S. enterica serovars Enteritidis, Kentucky, Paratyphi B, and Typhimurium. Our S. Newport collection fell into three lineages, Newport-I, Newport-II, and Newport-III, each of which contained multiple sequence types (STs). Newport-I has only a few STs, unlike Newport-II or Newport-III, and has possibly emerged recently. Newport-I is more prevalent among humans in Europe than in North America, whereas Newport-II is preferentially associated with animals. Two STs of Newport-II encompassed all MDR-AmpC isolates, suggesting recent global spread after the acquisition of the bla CMY-2 gene. In contrast, most Newport-III isolates were from humans in North America and were pansusceptible to antibiotics. Newport was intermediate in population structure to the other serovars, which varied from a single monophyletic lineage in S. Enteritidis or S. Typhimurium to four discrete lineages within S. Paratyphi B. Both mutation and homologous recombination are responsible for diversification within each of these lineages, but the relative frequencies differed with the lineage. We conclude that serovars of S. enterica provide a variety of different population structures.


2020 ◽  
Vol 4 (1) ◽  
Author(s):  
Stalis Norma Ethica ◽  
Hayatun Fuad ◽  
Nur Hidayah ◽  
Sri Sinto Dewi ◽  
Aditya Rahman Ernanto ◽  
...  

Detection of Salmonella bacteria based on their virulence genes is among essential steps in the eradication of clinical infection by bacteria. In this study, two pair of primers, PhoPF-PhoPR: 5’- CCGCGCAGGAAAAACTCAAA-3’ and 5’-ATCTGTTCCAGCATCACCGG -3’ as well as PhoQF-PhoQR: 5’-AGAGATGATGCGCGTACTGG-3’ and 5’- CAGACGCCCCATGAGAACAT-3’, had been successfully designed using Primer3Plus to detect the presence of phoP and phoQ genes in Salmonella spp. Using genomic DNA of 44 genomic data of Salmonella spp. as templates, PhoPF-PhoPR could produce 520-bp amplicon, while PhoQF-PhoQR could result in 598-bp amplicon. Results of in silico PCR showed that both pairs of primers PhoPF-PhoPR and PhoQF-PhoQR could detect only Salmonella enterica species, and no Salmonella bongori species could be detected based on phoP and phoQ sequences. Both pairs of PhoPF-PhoPR and PhoQF-PhoQR primers were also able to detect the virulence genes in most of the studied subspecies of Salmonella enterica available in silico database unless Arizona subspecies. As conclusion, based on this in silico study, phoP and phoQ genes appeared to be biomarkers for Salmonella enterica species. Both pairs of primers designed in this study has potential to be used as detection tool to differentiate species Salmonella enterica from Salmonella bongori, and also to distinguish S.enterica subsp. enterica from subsp. Arizonae.Keywords: Gene detection, bacterial virulence, phoP, phoQ, Salmonella spp.


2020 ◽  
Author(s):  
A.Y. Martinez-Sanguiné ◽  
B. D’Alessandro ◽  
M. Langleib ◽  
G. M. Traglia ◽  
A. Mónaco ◽  
...  

The Enteritidis and Dublin serovars of Salmonella enterica are phylogenetically closely related yet they differ significantly in host-range and virulence. S. Enteritidis is a broad-host range serovar that commonly causes self-limited gastroenteritis in humans, whereas S. Dublin is a cattle-adapted serovar that can infect humans often resulting in invasive extra-intestinal disease. The mechanism underlying the higher invasiveness of S. Dublin remains undetermined. In this work, we quantitatively compared the proteomes of clinical isolates of each serovar grown under gut mimicking conditions. Compared to S. Enteritidis, the S. Dublin proteome was enriched in proteins linked to response to several stress conditions, such as those encountered during host infection, as well as to virulence. The S. Enteritidis proteome contained several proteins related to central anaerobic metabolism pathways that were undetected in S. Dublin. In contrast to what has been observed in other extra-intestinal serovars, most of the coding genes for these pathways are not degraded in S. Dublin. Thus, we provide evidence that S. Dublin may have much more affected metabolic functions than previously reported based on genomic studies. Single and double null mutants in stress response proteins Dps, YciF and YgaU demonstrate their relevance to S. Dublin invasiveness in a murine model of invasive salmonellosis. All in all, this work provides a basis for understanding inter-serovar differences in invasiveness and niche adaptation, underscoring the relevance of using proteomic approaches to complement genomic studies.


PLoS Genetics ◽  
2020 ◽  
Vol 16 (6) ◽  
pp. e1008850 ◽  
Author(s):  
Matt Bawn ◽  
Nabil-Fareed Alikhan ◽  
Gaëtan Thilliez ◽  
Mark Kirkwood ◽  
Nicole E. Wheeler ◽  
...  

2019 ◽  
Vol 11 (8) ◽  
pp. 2220-2231 ◽  
Author(s):  
Gavin J Fenske ◽  
Anil Thachil ◽  
Patrick L McDonough ◽  
Amy Glaser ◽  
Joy Scaria

Abstract Salmonella enterica serotype Dublin (S. Dublin) is a bovine-adapted serotype that can cause serious systemic infections in humans. Despite the increasing prevalence of human infections and the negative impact on agricultural processes, little is known about the population structure of the serotype. To this end, we compiled a manually curated data set comprising of 880 S. Dublin genomes. Core genome phylogeny and ancestral state reconstruction revealed that region-specific clades dominate the global population structure of S. Dublin. Strains of S. Dublin in the UK are genomically distinct from US, Brazilian, and African strains. The geographical partitioning impacts the composition of the core genome as well as the ancillary genome. Antibiotic resistance genes are almost exclusively found in US genomes and are mediated by an IncA/C2 plasmid. Phage content and the S. Dublin virulence plasmid were strongly conserved in the serotype. Comparison of S. Dublin to a closely related serotype, S. enterica serotype Enteritidis, revealed that S. Dublin contains 82 serotype specific genes that are not found in S. Enteritidis. Said genes encode metabolic functions involved in the uptake and catabolism of carbohydrates and virulence genes associated with type VI secretion systems and fimbria assembly respectively.


2001 ◽  
Vol 356 (1411) ◽  
pp. 1027-1034 ◽  
Author(s):  
John Wain ◽  
Deborah House ◽  
Derek Pickard ◽  
Gordon Dougan ◽  
Gad Frankel

In this review we summarize recent genomic studies that shed light on the mechanism through which pathogenic Escherichia coli and Salmonella enterica have evolved. We show how acquisition of DNA at specific sites on the chromosome has contributed to increased genetic variation and virulence of these two genera of the Enterobacteriaceae.


Sign in / Sign up

Export Citation Format

Share Document