percent sequence identity Latest Research Papers

DNA barcode data accurately assign higher spider taxa

PeerJ ◽

10.7717/peerj.2201 ◽

2016 ◽

Vol 4 ◽

pp. e2201 ◽

Cited By ~ 12

Author(s):

Jonathan A. Coddington ◽

Ingi Agnarsson ◽

Ren-Chung Cheng ◽

Klemen Čandek ◽

Amy Driskell ◽

...

Keyword(s):

Dna Sequences ◽

Dna Barcode ◽

Accurate Method ◽

Reference Database ◽

Sequencing Errors ◽

Sequence Identity ◽

Percent Sequence Identity ◽

Near Term ◽

Accuracy Of Results ◽

Global Inventory

The use of unique DNA sequences as a method for taxonomic identification is no longer fundamentally controversial, even though debate continues on the best markers, methods, and technology to use. Although both existing databanks such as GenBank and BOLD, as well as reference taxonomies, are imperfect, in best case scenarios “barcodes” (whether single or multiple, organelle or nuclear, loci) clearly are an increasingly fast and inexpensive method of identification, especially as compared to manual identification of unknowns by increasingly rare expert taxonomists. Because most species on Earth are undescribed, a complete reference database at the species level is impractical in the near term. The question therefore arises whether unidentified species can, using DNA barcodes, be accurately assigned to more inclusive groups such as genera and families—taxonomic ranks of putatively monophyletic groups for which the global inventory is more complete and stable. We used a carefully chosen test library of CO1 sequences from 49 families, 313 genera, and 816 species of spiders to assess the accuracy of genus and family-level assignment. We used BLAST queries of each sequence against the entire library and got the top ten hits. The percent sequence identity was reported from these hits (PIdent, range 75–100%). Accurate assignment of higher taxa (PIdent above which errors totaled less than 5%) occurred for genera at PIdent values >95 and families at PIdent values ≥ 91, suggesting these as heuristic thresholds for accurate generic and familial identifications in spiders. Accuracy of identification increases with numbers of species/genus and genera/family in the library; above five genera per family and fifteen species per genus all higher taxon assignments were correct. We propose that using percent sequence identity between conventional barcode sequences may be a feasible and reasonably accurate method to identify animals to family/genus. However, the quality of the underlying database impacts accuracy of results; many outliers in our dataset could be attributed to taxonomic and/or sequencing errors in BOLD and GenBank. It seems that an accurate and complete reference library of families and genera of lifecouldprovide accurate higher level taxonomic identifications cheaply and accessibly, within years rather than decades.

DNA barcode data accurately identify higher taxa

10.7287/peerj.preprints.1633v1 ◽

2016 ◽

Author(s):

Jonathan A Coddington ◽

Ingi Agnarsson ◽

Ren-Chung Cheng ◽

Klemen Čandek ◽

Amy Driskell ◽

...

Keyword(s):

Dna Sequences ◽

Dna Barcode ◽

Accurate Method ◽

Reference Database ◽

Accurate Identification ◽

Sequencing Errors ◽

Sequence Identity ◽

Percent Sequence Identity ◽

Near Term ◽

Accuracy Of Results

The use of unique DNA sequences as a method for taxonomic identification is no longer fundamentally controversial, even though debate continues on the best markers, methods, and technology to use. Although both existing databanks such as GenBank and BOLD, as well as reference taxonomies, are imperfect, in best case scenarios “barcodes” (whether single or multiple, organelle or nuclear, loci) clearly are an increasingly fast and inexpensive method of identification, especially as compared to manual identification of unknowns by increasingly rare expert taxonomists. Because most species on Earth are undescribed, a complete reference database at the species level is impractical in the near term. The question therefore arises whether unidentified species can, using DNA barcodes, be accurately assigned to more inclusive groups such as genera and families—taxonomic ranks of putatively monophyletic groups for which the global inventory is more complete and stable. We used a carefully chosen test library of CO1 sequences from 49 families, 313 genera, and 816 species of spiders to assess the accuracy of genus and family-level identifications. We used BLAST queries of each sequence against the entire library and got the top ten hits resulting in 8160 hits. The percent sequence identity was reported from these hits (PIdent, range 75-100%). Accurate identification (PIdent above which errors totaled less than 5%) occurred for genera at PIdent values > 95 and families at PIdent values ≥ 91, suggesting these as heuristic thresholds for generic and familial identifications in spiders. Accuracy of identification increases with numbers of species/genus and genera/family in the library; above five genera per family and fifteen species per genus all identifications were correct. We propose that using percent sequence identity between conventional barcode sequences may be a feasible and reasonably accurate method to identify animals to family/genus. However, the quality of the underlying database impacts accuracy of results; many outliers in our dataset could be attributed to taxonomic and/or sequencing errors in BOLD and GenBank. It seems that an accurate and complete reference library of families and genera of life could provide accurate higher level taxonomic identifications cheaply and accessibly, within years rather than decades.

DNA barcode data accurately identify higher taxa

10.7287/peerj.preprints.1633 ◽

2016 ◽

Author(s):

Jonathan A Coddington ◽

Ingi Agnarsson ◽

Ren-Chung Cheng ◽

Klemen Čandek ◽

Amy Driskell ◽

...

Keyword(s):

Dna Sequences ◽

Dna Barcode ◽

Accurate Method ◽

Reference Database ◽

Accurate Identification ◽

Sequencing Errors ◽

Sequence Identity ◽

Percent Sequence Identity ◽

Near Term ◽

Accuracy Of Results

The use of unique DNA sequences as a method for taxonomic identification is no longer fundamentally controversial, even though debate continues on the best markers, methods, and technology to use. Although both existing databanks such as GenBank and BOLD, as well as reference taxonomies, are imperfect, in best case scenarios “barcodes” (whether single or multiple, organelle or nuclear, loci) clearly are an increasingly fast and inexpensive method of identification, especially as compared to manual identification of unknowns by increasingly rare expert taxonomists. Because most species on Earth are undescribed, a complete reference database at the species level is impractical in the near term. The question therefore arises whether unidentified species can, using DNA barcodes, be accurately assigned to more inclusive groups such as genera and families—taxonomic ranks of putatively monophyletic groups for which the global inventory is more complete and stable. We used a carefully chosen test library of CO1 sequences from 49 families, 313 genera, and 816 species of spiders to assess the accuracy of genus and family-level identifications. We used BLAST queries of each sequence against the entire library and got the top ten hits resulting in 8160 hits. The percent sequence identity was reported from these hits (PIdent, range 75-100%). Accurate identification (PIdent above which errors totaled less than 5%) occurred for genera at PIdent values > 95 and families at PIdent values ≥ 91, suggesting these as heuristic thresholds for generic and familial identifications in spiders. Accuracy of identification increases with numbers of species/genus and genera/family in the library; above five genera per family and fifteen species per genus all identifications were correct. We propose that using percent sequence identity between conventional barcode sequences may be a feasible and reasonably accurate method to identify animals to family/genus. However, the quality of the underlying database impacts accuracy of results; many outliers in our dataset could be attributed to taxonomic and/or sequencing errors in BOLD and GenBank. It seems that an accurate and complete reference library of families and genera of life could provide accurate higher level taxonomic identifications cheaply and accessibly, within years rather than decades.

Independent Co-Option of a Tailed Bacteriophage into a Killing Complex in Pseudomonas

mBio ◽

10.1128/mbio.00452-15 ◽

2015 ◽

Vol 6 (4) ◽

Cited By ~ 36

Author(s):

Kevin L. Hockett ◽

Tanya Renner ◽

David A. Baltrus

Keyword(s):

Pseudomonas Syringae ◽

Genomic Region ◽

Content Type ◽

Bacterial Flagellum ◽

Evolutionary Innovation ◽

Genomic Location ◽

Type Three Secretion System ◽

Evolutionary Event ◽

Percent Sequence Identity ◽

Ecological Importance

ABSTRACTCompetition between microbes is widespread in nature, especially among those that are closely related. To combat competitors, bacteria have evolved numerous protein-based systems (bacteriocins) that kill strains closely related to the producer. In characterizing the bacteriocin complement and killing spectra for the model strainPseudomonas syringaeB728a, we discovered that its activity was not linked to any predicted bacteriocin but is derived from a prophage. Instead of encoding an active prophage, this region encodes a bacteriophage-derived bacteriocin, termed an R-type syringacin. This R-type syringacin is striking in its convergence with the well-studied R-type pyocin ofP. aeruginosain both genomic location and molecular function. Genomic alignment, amino acid percent sequence identity, and phylogenetic inference all support a scenario where the R-type syringacin has been co-opted independently of the R-type pyocin. Moreover, the presence of this region is conserved among several otherPseudomonasspecies and thus is likely important for intermicrobial interactions throughout this important genus.IMPORTANCEEvolutionary innovation is often achieved through modification of complexes or processes for alternate purposes, termed co-option. Notable examples include the co-option of a structure functioning in locomotion (bacterial flagellum) to one functioning in protein secretion (type three secretion system). Similar co-options can occur independently in distinct lineages. We discovered a genomic region in the plant pathogenPseudomonas syringaethat consists of a fragment of a bacteriophage genome. The fragment encodes only the tail of the bacteriophage, which is lethal toward strains of this species. This structure is similar to a previously described structure produced by the related speciesPseudomonas aeruginosa. The two structures, however, are not derived from the same evolutionary event. Thus, they represent independent bacteriophage co-options. The co-opted bacteriophage fromP. syringaeis found in the genomes of many otherPseudomonasspecies, suggesting ecological importance across this genus.

Independent Co-option of a Tailed Bacteriophage into a Killing Complex in Pseudomonas

10.1101/011486 ◽

2014 ◽

Cited By ~ 1

Author(s):

Kevin L Hockett ◽

Tanya Renner ◽

David A Baltrus

Keyword(s):

Amino Acid ◽

Phylogenetic Inference ◽

Molecular Function ◽

Sequence Identity ◽

Pseudomonas Species ◽

Percent Sequence Identity ◽

Model Strain ◽

Chromosomal Synteny

Competition between microbes is widespread in nature, especially among those that are closely related. To combat competitors, bacteria have evolved numerous protein-based systems (bacteriocins) that kill strains closely related to the producer. In characterizing the bacteriocin complement and killing spectra for the model strain P. syringae B728a, we discovered its activity was not linked to any predicted bacteriocin, but is derived from a prophage. Instead of encoding an active prophage, this region encodes a bacteriophage-derived bacteriocin, termed an R-type syringacin. The R-type syringacin is striking in its convergence with the well-studied R-type pyocin of P. aeruginosa in both chromosomal synteny and molecular function. Genomic alignment, amino acid percent sequence identity, and phylogenetic inference all support a scenario where the R-type syringacin has been co-opted independently of the R-type pyocin. Moreover, the presence of this region is conserved among several other Pseudomonas species, and thus is likely important for intermicrobial interactions throughout this important genus.

Identification of an rsh Gene from a Novosphingobium sp. Necessary for Quorum-Sensing Signal Accumulation

Journal of Bacteriology ◽

10.1128/jb.01692-08 ◽

2009 ◽

Vol 191 (8) ◽

pp. 2551-2560 ◽

Cited By ~ 17

Author(s):

Han Ming Gan ◽

Larry Buckley ◽

Ernő Szegedi ◽

André O. Hudson ◽

Michael A. Savka

Keyword(s):

Quorum Sensing ◽

Stringent Response ◽

Cell Aggregation ◽

Homoserine Lactone ◽

Nutritional Deficiencies ◽

Plant Tumor ◽

Amino Acid Requirements ◽

Signal Production ◽

Percent Sequence Identity ◽

Multiple Amino Acid

ABSTRACT The stringent response is a mechanism by which bacteria adapt to environmental stresses and nutritional deficiencies through the synthesis and hydrolysis of (p)ppGpp by RelA/SpoT enzymes. Alphaproteobacteria and plants contain a single Rsh enzyme (named for RelA/SpoT homolog) that is bifunctional. Here we report the identification of a new species of bacteria belonging to the genus Novosphingobium and characterization of an rsh mutation in this plant tumor-associated isolate. Isolate Rr 2-17, from a grapevine crown gall tumor, is a member of the Novosphingobium genus that produces the N-acyl-homoserine lactone (AHL) quorum-sensing (QS) signals. A Tn5 mutant, Hx 699, deficient in AHL production was found to have an insertion in an rsh gene. The Rsh protein showed significant percent sequence identity to Rsh proteins of alphaproteobacteria. The Novosphingobium sp. rsh gene (rsh Nsp) complemented the multiple amino acid requirements of the Escherichia coli relA spoT double mutant by restoring the growth on selection media. Besides QS signal production, the rsh mutation also affects soluble polysaccharide production and cell aggregation. Genetic complementation of the Hx 699 mutant with the rsh Nsp gene restored these phenotypes. This is the first discovery of a functional rsh gene in a member of the Novosphingobium genus.