scholarly journals Combining the 5.8S and ITS2 gene regions to improve classification of fungi

2019 ◽  
Author(s):  
Felix Heeger ◽  
Christian Wurzbacher ◽  
Elizabeth C. Bourne ◽  
Camila J. Mazzoni ◽  
Michael T. Monaghan

SummaryThe internal transcribed spacer (ITS) is used in DNA metabarcoding of fungi. One disadvantage of its high variability may be a failure to classify OTUs when no similar reference sequence exists. We tested whether the 5.8S region, often sequenced with ITS2 but discarded before analysis, could provide OUT classifications when ITS fails.We usedin silicoevaluation to compare classification success of 5.8S and ITS from the UNITE database when reference sequences of the same species, genus, or family were removed. We then developed an automated pipeline for a combined 5.8S - ITS2 analysis and applied it to mixed environmental samples containing many lineages that are underrepresented in databases.ITS was clearlysuperior for species-level classifications with a complete reference database, but 5.8S outperformed ITS at higher level classifications with an incomplete database. Our combined 5.8S-ITS2 pipeline classified 3x more fungal OTUs compared to ITS2 alone, particularly within Chytridiomycota (10x) and Rozellamycota (3x).Missing reference sequences led to the failure of ITS to classify many fungal OTUs at all, and to a significant underestimation of environmental fungal diversity. Using 5.8S to complement ITS classification will likely provide better estimates of diversity in lineages for which database coverage is poor.

PeerJ ◽  
2016 ◽  
Vol 4 ◽  
pp. e1966 ◽  
Author(s):  
Vasco Elbrecht ◽  
Pierre Taberlet ◽  
Tony Dejean ◽  
Alice Valentini ◽  
Philippe Usseglio-Polatera ◽  
...  

Cytochrome c oxidase I (COI) is a powerful marker for DNA barcoding of animals, with good taxonomic resolution and a large reference database. However, when used for DNA metabarcoding, estimation of taxa abundances and species detection are limited due to primer bias caused by highly variable primer binding sites across the COI gene. Therefore, we explored the ability of the 16S ribosomal DNA gene as an alternative metabarcoding marker for species level assessments. Ten bulk samples, each containing equal amounts of tissue from 52 freshwater invertebrate taxa, were sequenced with the Illumina NextSeq 500 system. The 16S primers amplified three more insect species than the Folmer COI primers and amplified more equally, probably due to decreased primer bias. Estimation of biomass might be less biased with 16S than with COI, although variation in read abundances of two orders of magnitudes is still observed. According to these results, the marker choice depends on the scientific question. If the goal is to obtain a taxonomic identification at the species level, then COI is more appropriate due to established reference databases and known taxonomic resolution of this marker, knowing that a greater proportion of insects will be missed using COI Folmer primers. If the goal is to obtain a more comprehensive survey the 16S marker, which requires building a local reference database, or optimised degenerated COI primers could be more appropriate.


2020 ◽  
Vol 145 ◽  
pp. 01020
Author(s):  
Chen Yun-xia ◽  
Xue Xiao-ming ◽  
Huang Ya-lin ◽  
Zhou Yong-wu ◽  
Hou Sen-lin ◽  
...  

In this study, the second-generation high-throughput sequencing and DNA barcoding were combined to manually prepare multi-species mixed samples, and the mitochondrial gene CO І was used as a barcode to simultaneously identify the animal species in the mixed samples and identify endangered species. The results showed that under the family and genus level, the simultaneous detection rate of the species in the mixed samples was as high as 100%, and the species identification rate was as high as 89% at the species level, and with high sensitivity, as little as 1% of the trace species could be detected. However, nearly 30% of non-target classification annotations appeared at the species level. It can be concluded that the mini CO I barcoding can be applied to the simultaneous identification of animal species in mixed biological samples, and the species identification rate is high. Non-target classification match existing at the species level can be further improved by increasing the length of the barcoding, improving the sequencing technology, reference database and so on. In this study, DNA metabarcoding technology was used to evaluate the feasibility of identification of endangered animals in multi-species mixed biological samples with CO І, in order to lay a preliminary foundation for the advancement of DNA metabarcoding method in the field of wildlife forensic identification.


2016 ◽  
Author(s):  
Vasco Elbrecht ◽  
Pierre Taberlet ◽  
Tony Dejean ◽  
Alice Valentini ◽  
Philippe Usseglio-polatera ◽  
...  

Cytochrome c oxidase I (COI) is a powerful marker for DNA barcoding of animals, with good taxonomic resolution and a large reference database. However, when used for DNA metabarcoding, estimation of taxa abundances and species detection are limited due to primer bias caused by highly variable primer binding sites across the COI gene. Therefore, we explored the ability of the 16S ribosomal DNA gene as an alternative metabarcoding marker for species level assessments. Ten bulk samples, each containing equal amounts of tissue from 52 freshwater invertebrate taxa, were sequenced with the Illumina NextSeq 500 system. In comparison to COI, the 16S marker amplified more insect species and amplified more equally, probably due to decreased primer bias. Rough estimation of biomass might thus be less biased with 16S than with COI. According to these results, the marker choice depends on the scientific question. If the goal is to obtain a taxonomic identification at the species level, then COI is more appropriate due to established reference databases and known taxonomic resolution of this marker, knowing that a greater proportion of species will be missed using COI Folmer primers. If the goal is to obtain a more comprehensive survey in a context where it is possible to build a local reference database, the 16S marker could be more appropriate.


2016 ◽  
Author(s):  
Vasco Elbrecht ◽  
Pierre Taberlet ◽  
Tony Dejean ◽  
Alice Valentini ◽  
Philippe Usseglio-polatera ◽  
...  

Cytochrome c oxidase I (COI) is a powerful marker for DNA barcoding of animals, with good taxonomic resolution and a large reference database. However, when used for DNA metabarcoding, estimation of taxa abundances and species detection are limited due to primer bias caused by highly variable primer binding sites across the COI gene. Therefore, we explored the ability of the 16S ribosomal DNA gene as an alternative metabarcoding marker for species level assessments. Ten bulk samples, each containing equal amounts of tissue from 52 freshwater invertebrate taxa, were sequenced with the Illumina NextSeq 500 system. In comparison to COI, the 16S marker amplified more insect species and amplified more equally, probably due to decreased primer bias. Rough estimation of biomass might thus be less biased with 16S than with COI. According to these results, the marker choice depends on the scientific question. If the goal is to obtain a taxonomic identification at the species level, then COI is more appropriate due to established reference databases and known taxonomic resolution of this marker, knowing that a greater proportion of species will be missed using COI Folmer primers. If the goal is to obtain a more comprehensive survey in a context where it is possible to build a local reference database, the 16S marker could be more appropriate.


2021 ◽  
Vol 4 ◽  
Author(s):  
Izabela Mendes ◽  
Heron Hilário ◽  
Daniel Teixeira ◽  
Daniel Cardoso de Carvalho

Species richness is a metric of biodiversity usually used in fish community assessment for monitoring programs. This metric is often obtained using traditional fisheries methods that rely on capture of target organisms, resulting in underestimation of fish species. DNA metabarcoding has been recognized as a powerful noninvasive alternative tool for fish biomonitoring and management. Despite the increasing popularity of this method for the assessment of aquatic megadiverse ecosystems, its implementation for studying the highly diverse Neotropical ichthyofauna still presents some challenges. One of them is to devise what primer set could reliably amplify the DNA of all fish species from a megadiverse river basin and have enough resolution to identify them. In order to identify and overcome these drawbacks, we have investigated the efficiency of the metabarcoding approach on Neotropical fishes using a mock sample containing genomic DNA of 18 fish species from the Jequitinhonha River basin, Eastern Brazil. We compared three primer sets targeting the 12S rRNA gene: two universal and widely used markers for fish metabarcoding [MiFish (~170bp) and Teleo_1 (~60bp)], and NeoFish (~190bp), recently developed by our research group specifically for the identification of Neotropical fishes (Milan et al., 2020). Two samples amplified using three primers were sequenced in a single multiplexed Illumina MiniSeq run, using normalized and non-normalized pools. Bioinformatic analyses were performed using a DADA2/Phyloseq based pipeline to perform filtering steps and to assign Amplicon Sequence Variants (ASVs). We used a custom 12S reference sequence database that included 190 specimens representing 101 species and 70 genera from the Jequitinhonha and São Francisco river basins. A total of 187 ASVs were recovered: 79, 66 and 42 for NeoFish, MiFish and Teleo_1, respectively. ASVs of unexpected species were identified for both pools (Fig. 1), though each of these ASVs had an abundance of less than 50 copies. In addition, species of the Hoplias and Prochilodus genera could not be identified at the species level, due to identical sequences within each genus, possibly because of the insufficient variation within the 12S region recovered by these primers’ amplicons. Unexpectedly, although a single individual of each species was placed in the pools, more than one ASV was identified for some species, likely caused by PCR biases. Overall, all primer sets displayed similar taxonomic resolution for the DNA pools and recovered all species, except for NeoFish, which could not detect Steindachneridion amblyurum due to an incompatibility in the 3’ of the NeoFish forward primer and Teleo_1, which could not identify Steindachnerina elegans. These results highlight the need of reliable databases in order to enable the full assignment of ASVs and OTUs to species level, and the importance of calibrating the DNA metabarcoding approach with mock samples to identify weaknesses and pivotal steps prior to the application on large scale DNA based biodiversity evaluation, that can help with the complex task of conserving the megadiverse Neotropical ichthyofauna.


2021 ◽  
Vol 168 (6) ◽  
Author(s):  
Ann Bucklin ◽  
Katja T. C. A. Peijnenburg ◽  
Ksenia N. Kosobokova ◽  
Todd D. O’Brien ◽  
Leocadio Blanco-Bercial ◽  
...  

AbstractCharacterization of species diversity of zooplankton is key to understanding, assessing, and predicting the function and future of pelagic ecosystems throughout the global ocean. The marine zooplankton assemblage, including only metazoans, is highly diverse and taxonomically complex, with an estimated ~28,000 species of 41 major taxonomic groups. This review provides a comprehensive summary of DNA sequences for the barcode region of mitochondrial cytochrome oxidase I (COI) for identified specimens. The foundation of this summary is the MetaZooGene Barcode Atlas and Database (MZGdb), a new open-access data and metadata portal that is linked to NCBI GenBank and BOLD data repositories. The MZGdb provides enhanced quality control and tools for assembling COI reference sequence databases that are specific to selected taxonomic groups and/or ocean regions, with associated metadata (e.g., collection georeferencing, verification of species identification, molecular protocols), and tools for statistical analysis, mapping, and visualization. To date, over 150,000 COI sequences for ~ 5600 described species of marine metazoan plankton (including holo- and meroplankton) are available via the MZGdb portal. This review uses the MZGdb as a resource for summaries of COI barcode data and metadata for important taxonomic groups of marine zooplankton and selected regions, including the North Atlantic, Arctic, North Pacific, and Southern Oceans. The MZGdb is designed to provide a foundation for analysis of species diversity of marine zooplankton based on DNA barcoding and metabarcoding for assessment of marine ecosystems and rapid detection of the impacts of climate change.


2020 ◽  
Vol 0 (0) ◽  
Author(s):  
Matteo Focardi ◽  
Emanuele Spadaro

AbstractBuilding upon the recent results in [M. Focardi and E. Spadaro, On the measure and the structure of the free boundary of the lower-dimensional obstacle problem, Arch. Ration. Mech. Anal. 230 2018, 1, 125–184] we provide a thorough description of the free boundary for the solutions to the fractional obstacle problem in {\mathbb{R}^{n+1}} with obstacle function φ (suitably smooth and decaying fast at infinity) up to sets of null {{\mathcal{H}}^{n-1}} measure. In particular, if φ is analytic, the problem reduces to the zero obstacle case dealt with in [M. Focardi and E. Spadaro, On the measure and the structure of the free boundary of the lower-dimensional obstacle problem, Arch. Ration. Mech. Anal. 230 2018, 1, 125–184] and therefore we retrieve the same results:(i)local finiteness of the {(n-1)}-dimensional Minkowski content of the free boundary (and thus of its Hausdorff measure),(ii){{\mathcal{H}}^{n-1}}-rectifiability of the free boundary,(iii)classification of the frequencies and of the blowups up to a set of Hausdorff dimension at most {(n-2)} in the free boundary.Instead, if {\varphi\in C^{k+1}(\mathbb{R}^{n})}, {k\geq 2}, similar results hold only for distinguished subsets of points in the free boundary where the order of contact of the solution with the obstacle function φ is less than {k+1}.


2015 ◽  
Vol 87 (1) ◽  
pp. 15-27 ◽  
Author(s):  
José D. Ferreira ◽  
Martín Zamorano ◽  
Ana Maria Ribeiro

The genus Panochthus represents the last lineage of "Panochthini" recorded in the Pleistocene. This genus has a wide latitudinal distribution in South America, and in Brazil it occurs in the southern and northeastern regions. In this paper we describe new material (isolated osteoderms and caudal tube fragments) assigned to Panochthus from the state of Rio Grande do Sul (southern Brazil) and discuss some taxonomic issues related to Panochthus tuberculatus and Panochthus greslebini based on this material . The occurrence of P. greslebini is the first for outside the Brazilian Intertropical Region. In addition, we describe new diagnostic features to differentiate the osteoderms of P. greslebini and P. tuberculatus. Unfortunately, it was not possible to identify some osteoderms at the species level. Interestingly, they showed four distinct morphotypes characterized by their external morphology, and thus were attributed to Panochthus sp. Lastly, we conclude that in addition to P.tuberculatus registered to southern Brazil, there is another species of the genus, assignable to P. cf. P. greslebini. Our analysis reinforce the reliability of caudal tube characters for the classification of species of Panochthus.


Author(s):  
Nicole Foster ◽  
Kor-jent Dijk ◽  
Ed Biffin ◽  
Jennifer Young ◽  
Vicki Thomson ◽  
...  

A proliferation in environmental DNA (eDNA) research has increased the reliance on reference sequence databases to assign unknown DNA sequences to known taxa. Without comprehensive reference databases, DNA extracted from environmental samples cannot be correctly assigned to taxa, limiting the use of this genetic information to identify organisms in unknown sample mixtures. For animals, standard metabarcoding practices involve amplification of the mitochondrial Cytochrome-c oxidase subunit 1 (CO1) region, which is a universally amplifyable region across majority of animal taxa. This region, however, does not work well as a DNA barcode for plants and fungi, and there is no similar universal single barcode locus that has the same species resolution. Therefore, generating reference sequences has been more difficult and several loci have been suggested to be used in parallel to get to species identification. For this reason, we developed a multi-gene targeted capture approach to generate reference DNA sequences for plant taxa across 20 target chloroplast gene regions in a single assay. We successfully compiled a reference database for 93 temperate coastal plants including seagrasses, mangroves, and saltmarshes/samphire’s. We demonstrate the importance of a comprehensive reference database to prevent species going undetected in eDNA studies. We also investigate how using multiple chloroplast gene regions impacts the ability to discriminate between taxa.


2019 ◽  
Author(s):  
Mathias Kuhring ◽  
Joerg Doellinger ◽  
Andreas Nitsche ◽  
Thilo Muth ◽  
Bernhard Y. Renard

AbstractUntargeted accurate strain-level classification of a priori unidentified organisms using tandem mass spectrometry is a challenging task. Reference databases often lack taxonomic depth, limiting peptide assignments to the species level. However, the extension with detailed strain information increases runtime and decreases statistical power. In addition, larger databases contain a higher number of similar proteomes.We present TaxIt, an iterative workflow to address the increasing search space required for MS/MS-based strain-level classification of samples with unknown taxonomic origin. TaxIt first applies reference sequence data for initial identification of species candidates, followed by automated acquisition of relevant strain sequences for low level classification. Furthermore, proteome similarities resulting in ambiguous taxonomic assignments are addressed with an abundance weighting strategy to improve candidate confidence.We apply our iterative workflow on several samples of bacterial and viral origin. In comparison to non-iterative approaches using unique peptides or advanced abundance correction, TaxIt identifies microbial strains correctly in all examples presented (with one tie), thereby demonstrating the potential for untargeted and deeper taxonomic classification. TaxIt makes extensive use of public, unrestricted and continuously growing sequence resources such as the NCBI databases and is available under open-source license at https://gitlab.com/rki_bioinformatics.


Sign in / Sign up

Export Citation Format

Share Document