scholarly journals Peer Review #1 of "The single-species metagenome: subtyping Staphylococcus aureus core genome sequences from shotgun metagenomic data (v0.1)"

Author(s):  
J Sahl
2015 ◽  
Author(s):  
Sandeep J. Joseph ◽  
Ben Li ◽  
Robert A. Petit ◽  
Zhaohui S. Qin ◽  
Lyndsey A. Darrow ◽  
...  

AbstractMetagenome shotgun sequence projects offer the potential for large scale biogeographic analysis of microbial species. In this project we developed a method for detecting 33 common subtypes of the pathogenic bacterium Staphylococcus aureus. We used a binomial mixture model implemented in the binstrain software and the coverage counts at > 100,000 known S. aureus SNP (single nucleotide polymorphism) sites derived from prior comparative genomic analysis to estimate the proportion of each subtype in metagenome samples. Using this pipeline we were able to obtain > 87% sensitivity and > 94% specificity when testing on low genome coverage samples of diverse S. aureus strains (0.025X). We found that 321 and 149 metagenome samples from the Human Microbiome Project and metaSUB analysis of the New York City subway, respectively, contained S. aureus at genome coverage > 0.025. In both projects, CC8 and CC30 were the most common S. aureus subtypes encountered. We found evidence that the subtype composition at different body sites of the same individual were more similar than random sampling and more limited evidence that certain body sites were enriched for particular subtypes. One surprising finding was the apparent high frequency of CC398, a lineage associated with livestock, in samples from the tongue dorsum. Epidemiologic analysis of the HMP subject population suggested that high BMI (body mass index) and health insurance are risk factors for S. aureus but there was limited power to find factors linked to carriage of even the most common subtype. In the NYC subway data, we found a small signal of geographic distance affecting subtype clustering but other unknown factors influence taxonomic distribution of the species around the city. We argue that pathogen detection in metagenome samples requires the use of subtypes based on whole species population genomic analysis rather than using ad hoc collections of reference strains.


Genes ◽  
2021 ◽  
Vol 12 (1) ◽  
pp. 111 ◽  
Author(s):  
J. Peter W. Young ◽  
Sara Moeskjær ◽  
Alexey Afonin ◽  
Praveen Rahi ◽  
Marta Maluk ◽  
...  

Bacteria currently included in Rhizobium leguminosarum are too diverse to be considered a single species, so we can refer to this as a species complex (the Rlc). We have found 429 publicly available genome sequences that fall within the Rlc and these show that the Rlc is a distinct entity, well separated from other species in the genus. Its sister taxon is R. anhuiense. We constructed a phylogeny based on concatenated sequences of 120 universal (core) genes, and calculated pairwise average nucleotide identity (ANI) between all genomes. From these analyses, we concluded that the Rlc includes 18 distinct genospecies, plus 7 unique strains that are not placed in these genospecies. Each genospecies is separated by a distinct gap in ANI values, usually at approximately 96% ANI, implying that it is a ‘natural’ unit. Five of the genospecies include the type strains of named species: R. laguerreae, R. sophorae, R. ruizarguesonis, “R. indicum” and R. leguminosarum itself. The 16S ribosomal RNA sequence is remarkably diverse within the Rlc, but does not distinguish the genospecies. Partial sequences of housekeeping genes, which have frequently been used to characterize isolate collections, can mostly be assigned unambiguously to a genospecies, but alleles within a genospecies do not always form a clade, so single genes are not a reliable guide to the true phylogeny of the strains. We conclude that access to a large number of genome sequences is a powerful tool for characterizing the diversity of bacteria, and that taxonomic conclusions should be based on all available genome sequences, not just those of type strains.


2019 ◽  
Vol 8 (17) ◽  
Author(s):  
Gary Xie ◽  
Qiuying Cheng ◽  
Hajnalka Daligault ◽  
Karen Davenport ◽  
Cheryl Gleasner ◽  
...  

Here, we report the genome sequences of a Staphylococcus aureus clinical isolate, strain SMA0034-04 (UGA22), which contains one chromosome and one plasmid. We also reveal that isolate SMA0034-04 (UGA22) contains loci in the genome that encode multiple exotoxins.


2020 ◽  
Vol 9 (41) ◽  
Author(s):  
Raphael N. Sieber ◽  
Søren Overballe-Petersen ◽  
Hülya Kaya ◽  
Anders R. Larsen ◽  
Andreas Petersen

ABSTRACT Methicillin-resistant Staphylococcus aureus (MRSA) sequence type 630 (ST630) and spa type t4549 is an emerging lineage in Nordic countries, and some representatives carry the CRISPR-Cas system. Here, the complete genome sequences of two isolates from this lineage are presented, comprising chromosomes of 2,918,239 and 2,877,083 nucleotides, respectively, and a 2,473-nucleotide plasmid carrying erm(C).


Sign in / Sign up

Export Citation Format

Share Document