scholarly journals coil: an R package for cytochrome c oxidase I (COI) DNA barcode data cleaning, translation, and error evaluation

Genome ◽  
2020 ◽  
Vol 63 (6) ◽  
pp. 291-305 ◽  
Author(s):  
Cameron M. Nugent ◽  
Tyler A. Elliott ◽  
Sujeevan Ratnasingham ◽  
Sarah J. Adamowicz

Biological conclusions based on DNA barcoding and metabarcoding analyses can be strongly influenced by the methods utilized for data generation and curation, leading to varying levels of success in the separation of biological variation from experimental error. The 5′ region of cytochrome c oxidase subunit I (COI-5P) is the most common barcode gene for animals, with conserved structure and function that allows for biologically informed error identification. Here, we present coil ( https://CRAN.R-project.org/package=coil ), an R package for the pre-processing and frameshift error assessment of COI-5P animal barcode and metabarcode sequence data. The package contains functions for placement of barcodes into a common reading frame, accurate translation of sequences to amino acids, and highlighting insertion and deletion errors. The analysis of 10 000 barcode sequences of varying quality demonstrated how coil can place barcode sequences in reading frame and distinguish sequences containing indel errors from error-free sequences with greater than 97.5% accuracy. Package limitations were tested through the analysis of COI-5P sequences from the plant and fungal kingdoms as well as the analysis of potential contaminants: nuclear mitochondrial pseudogenes and Wolbachia COI-5P sequences. Results demonstrated that coil is a strong technical error identification method but is not reliable for detecting all biological contaminants.

Author(s):  
Cameron M. Nugent ◽  
Tyler A. Elliott ◽  
Sujeevan Ratnasingham ◽  
Sarah J. Adamowicz

AbstractBiological conclusions based on DNA barcoding and metabarcoding analyses can be strongly influenced by the methods utilized for data generation and curation, leading to varying levels of success in the separation of biological variation from experimental error. The five-prime region of cytochrome c oxidase subunit I (COI-5P) is the most common barcode gene for animals, with conserved structure and function that allows for biologically informed error identification. Here, we present coil (https://CRAN.R-project.org/package=coil), an R package for the pre-processing and error assessment of COI-5P animal barcode and metabarcode sequence data. The package contains functions for placement of barcodes into a common reading frame, accurate translation of sequences to amino acids, and highlighting insertion and deletion errors. The analysis of 10,000 barcode sequences of varying quality demonstrated how coil can place barcode sequences in reading frame and distinguish sequences containing indel errors from error-free sequences with greater than 97.5% accuracy. Package limitations were tested through the analysis of COI-5P sequences from the plant and fungal kingdoms as well as the analysis of potential contaminants: nuclear mitochondrial pseudogenes and Wolbachia COI-5P sequences. Results demonstrated that coil is a strong technical error identification method but is not reliable for detecting all biological contaminants.


Zootaxa ◽  
2022 ◽  
Vol 5091 (4) ◽  
pp. 546-558
Author(s):  
ZHAOYANG CHEN ◽  
FENGXIANG LIU ◽  
DAIQIN LI ◽  
XIN XU

This paper reports four new species of the primitively segmented spider genus Songthela from Chongqing Municipality, China, based on morphological characters of both males and females: S. jinyun sp. nov., S. longbao sp. nov., S. serriformis sp. nov. and S. wangerbao sp. nov. We also provide the GenBank accession codes of mitochondrial DNA barcode gene, cytochrome c oxidase subunit I (COI), for the holotype of four new species for future identification.  


2021 ◽  
Vol Vol 66 (1) (January (1)) ◽  
pp. 1-5
Author(s):  
Jerome Goddard ◽  
Gerald Baker ◽  
Petra Jericke ◽  
Lawrence Birchman ◽  
Ethan Woodward ◽  
...  

Ultrastructural and molecular data are provided from a single adult female pentastomid opportunistically collected from a road-killed rattlesnake in Russell, KS. Ultrastructural data consisted of light and SEM microscopy of the pentastomid and its eggs, while molecular data consisted of partial 18S and 28S ribosomal sequences and a partial cytochrome c oxidase subunit 1 sequence from the same specimen used for SEM. Ultrastructural and molecular data support generic identification of the pentastomid as Porocephalus sp. These molecular data were also used with previously published pentastomid sequence data for a concatenated phylogenetic analysis, which support the current, morphology-based taxonomic placement of the genus.


Zootaxa ◽  
2020 ◽  
Vol 4803 (3) ◽  
pp. 561-575
Author(s):  
ADRIAN ARDILA-CAMACHO ◽  
CALEB CALIFRE MARTINS ◽  
JORGE ARI NORIEGA

Isostenosmylus Krüger, 1913 is the richest genus of Osmylidae of the Neotropical region, with 17 described species so far, which are distributed mainly in the Andean region and in the South of Brazil and Paraguay. A new remarkable Colombian species of Isostenosmylus—I. ammirabilis sp. nov.—is herein described and illustrated. DNA barcode of mitochondrial gene cytochrome c oxidase subunit I (COI) for this species is also provided. Taxonomic keys for the genus are updated. 


Zootaxa ◽  
2008 ◽  
Vol 1691 (1) ◽  
pp. 67 ◽  
Author(s):  
M. ALEX SMITH

The 5' end (Folmer or Barcode region) of cytochrome c oxidase 1 (CO1) has been proposed as the gene region of choice for a standardized animal DNA barcode (Hebert et al. 2003). Concerns have been raised regarding the decision to utilize this particular mitochondrial gene region as a barcode. Nevertheless, widely divergent taxonomic groups have reported success using CO1 for both species identification and discovery. The utility of CO1 for barcoding amphibians was raised early on (Vences, et al. 2005) and concerns for this group were reported widely (Waugh 2007)—although some considered that the reporting of the concerns outstripped the data that had been analyzed at that point (Smith et al. 2008). Indeed, our analysis of CO1 for a small group of Holarctic amphibians was neither more difficult to generate nor to analyze than for other groups where we have utilized the technique.


Sign in / Sign up

Export Citation Format

Share Document