Expanding the utility of sequence comparisons using data from whole genomes

Abstract

Whole genome comparisons based on Average Nucleotide Identity (ANI) and the Genome-to-Genome Distance Calculator have risen to prominence for rapidly classifying taxa from whole genome sequences. Some implementations have even been proposed as a new standard in species classification and have become a common technique in papers describing newly sequenced genomes. However, attempts to apply whole genome divergence data to the delineation of higher taxonomic units and to phylogenetic inference have had difficulty matching the results produced by more complex phylogenetic methods. We present a novel method for generating reliable and statistically supported phylogenies using established ANI techniques. For the test cases to which we applied the developed approach, we obtained accurate results up to at least the family level. The method uses non-parametric bootstrapping to gauge the reliability of inferred groups. It offers the opportunity to make use of whole-genome comparison data that are already being generated to quickly produce accurate phylogenies. Additionally, the developed ANI methodology can assist classification of higher order taxonomic groups.

Significance Statement

The average nucleotide identity (ANI) measure and its iterations have come to dominate in silico species delimitation in the past decade. Yet the problem of gene content has not been fully resolved, and attempts to resolve it rely on two metrics, which at times makes interpretation difficult. We provide a new single ANI-based metric created from the combination of genomic content and genomic identity measures. Our results show that this method can handle comparisons of genomes with divergent content or identity. Additionally, the metric can be used to create distance-based phylogenetic trees that are comparable to those from other tree-building methods, while also providing a tentative metric for categorizing organisms into higher level taxonomic classifications.


Relationship of Bacillus amyloliquefaciens clades associated with strains DSM 7T and FZB42T: a proposal for Bacillus amyloliquefaciens subsp. amyloliquefaciens subsp. nov. and Bacillus amyloliquefaciens subsp. plantarum subsp. nov. based on complete genome sequence comparisons

International Journal of Systematic and Evolutionary Microbiology, 2011, Vol 61 (8), pp. 1786-1801. DOI: 10.1099/ijs.0.023267-0. Cited by ~155.

Author(s): Rainer Borriss, Xiao-Hua Chen, Christian Rueckert, Jochen Blom, Anke Becker, ...

Keyword(s): 16S rRNA, DNA Sequences, Bacillus Amyloliquefaciens, Type Strain, Marker Gene, Genome Comparison, Phenotypic Traits, Whole Genome, Gene Sequences, Sequence Comparisons

The whole-genome-sequenced rhizobacterium Bacillus amyloliquefaciens FZB42T (Chen et al., 2007) and other plant-associated strains of the genus Bacillus described as belonging to the species Bacillus amyloliquefaciens or Bacillus subtilis are used commercially to promote the growth and improve the health of crop plants. Previous investigations revealed that a group of strains represented a distinct ecotype related to B. amyloliquefaciens; however, the exact taxonomic position of this group remains elusive (Reva et al., 2004). In the present study, we demonstrated the ability of a group of Bacillus strains closely related to strain FZB42T to colonize Arabidopsis roots. On the basis of their phenotypic traits, the strains were similar to Bacillus amyloliquefaciens DSM 7T but differed considerably from this type strain in the DNA sequences of genes encoding 16S rRNA, gyrase subunit A (gyrA) and histidine kinase (cheA). Phylogenetic analysis performed with partial 16S rRNA, gyrA and cheA gene sequences revealed that the plant-associated strains of the genus Bacillus, including strain FZB42T, formed a lineage that could be distinguished from the cluster of strains closely related to B. amyloliquefaciens DSM 7T. DNA–DNA hybridizations (DDH) performed with genomic DNA from strains DSM 7T and FZB42T yielded relatedness values of 63.7–71.2 %. Several methods of genomic analysis, such as direct whole-genome comparison, digital DDH and microarray-based comparative genomic hybridization (M-CGH), were used as complementary tests. The group of plant-associated strains could be distinguished from strain DSM 7T and the type strain of B. subtilis by differences in the potential to synthesize non-ribosomal lipopeptides and polyketides. Based on the differences found in the marker gene sequences and the whole genomes of these strains, we propose two novel subspecies, designated B. amyloliquefaciens subsp. plantarum subsp. nov., with the type strain FZB42T (= DSM 23117T = BGSC 10A6T), and B. amyloliquefaciens subsp. amyloliquefaciens subsp. nov., with the type strain DSM 7T (= ATCC 23350T = Fukumoto Strain FT), for plant-associated and non-plant-associated representatives, respectively. This is in agreement with results of DDH and M-CGH tests and the MALDI-TOF MS of cellular components, all of which suggested that the ecovars represent two different subspecies.


Analyzing the Phylogenetic Trees with Tree-building Methods

Indian Journal of Applied Research, 2011, Vol 1 (7), pp. 83-85. DOI: 10.15373/2249555x/apr2012/25.

Author(s): Jasmine Jasmine, Pankaj Bhambri, Dr. O.P. Gupta

Keyword(s): Phylogenetic Trees, Tree Building, Building Methods


Faculty Opinions recommendation of Whole genome comparison of Campylobacter jejuni human isolates using a low-cost microarray reveals extensive genetic diversity.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature, 2001. DOI: 10.3410/f.1002156.24405.

Author(s): Dominique Caugant

Keyword(s): Genetic Diversity, Campylobacter Jejuni, Low Cost, Genome Comparison, Whole Genome, Whole Genome Comparison


Faculty Opinions recommendation of Geobiological analysis using whole genome-based tree building applied to the Bacteria, Archaea, and Eukarya.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature, 2003. DOI: 10.3410/f.1014848.195156.

Author(s): Douglas Erwin

Keyword(s): Whole Genome, Tree Building


Whole-Genome Phylogenomic Heterogeneity of Neisseria gonorrhoeae Isolates with Decreased Cephalosporin Susceptibility Collected in Canada between 1989 and 2013

Journal of Clinical Microbiology, 2014, Vol 53 (1), pp. 191-200. DOI: 10.1128/jcm.02589-14. Cited by ~78.

Author(s): Walter Demczuk, Tarah Lynch, Irene Martin, Gary Van Domselaar, Morag Graham, ...

Keyword(s): Neisseria Gonorrhoeae, Large Scale, Epidemiological Studies, 23S rRNA, Genome Comparison, Whole Genome, Content Type, Genomic Epidemiology, High Level, Sequence Types

A large-scale, whole-genome comparison of Canadian Neisseria gonorrhoeae isolates with high-level cephalosporin MICs was used to demonstrate a genomic epidemiology approach to investigate strain relatedness and dynamics. Although current typing methods have been very successful in tracing short-chain transmission of gonorrheal disease, investigating the temporal evolutionary relationships and geographical dissemination of highly clonal lineages requires enhanced resolution only available through whole-genome sequencing (WGS). Phylogenomic cluster analysis grouped 169 Canadian strains into 12 distinct clades. While some N. gonorrhoeae multiantigen sequence types (NG-MAST) agreed with specific phylogenomic clades or subclades, other sequence types (ST) and closely related groups of ST were widely distributed among clades. Decreased susceptibility to extended-spectrum cephalosporins (ESC-DS) emerged among a group of diverse strains in Canada during the 1990s with a variety of nonmosaic penA alleles, followed in 2000/2001 with the penA mosaic X allele and then in 2007 with ST1407 strains with the penA mosaic XXXIV allele. Five genetically distinct ESC-DS lineages were associated with penA mosaic X, XXXV, and XXXIV alleles and nonmosaic XII and XIII alleles. ESC-DS with coresistance to azithromycin was observed in 5 strains with 23S rRNA C2599T or A2143G mutations. As the costs associated with WGS decline and analysis tools are streamlined, WGS can provide a more thorough understanding of strain dynamics, facilitate epidemiological studies to better resolve social networks, and improve surveillance to optimize treatment for gonorrheal infections.
