Evaluating the impacts of metabarcoding primer selection on DNA characterization of diet in an aerial insectivore, the Purple Martin

Author(s):  
Anna M Forsman ◽  
Brandon D Hoenig ◽  
Stephanie A Gaspar ◽  
Jason D Fischer ◽  
Joe Siegrist ◽  
...  

Abstract DNA metabarcoding is a molecular technique frequently used to characterize diet composition of insectivorous birds. However, results are sensitive to methodological decisions made during sample processing, with primer selection being one of the most critical. The most frequently used DNA metabarcoding primer set for avian insectivores is ZBJ. However, recent studies have found that ZBJ produces significant biases in prey classification that likely influence our understanding of foraging ecology. A new primer set, ANML, has shown promise for characterizing insectivorous bat diets with fewer taxonomic biases than ZBJ, but ANML has not yet been used to study insectivorous birds. Here, we evaluate the ANML primer set for use in metabarcoding of avian insectivore diets through comparison with the more commonly used ZBJ primer set. Fecal samples were collected from both adult and nestling Purple Martins (Progne subis subis) at 2 sites in the USA and 1 site in Canada to maximize variation in diet composition and to determine if primer selection impacts our understanding of diet variation among sites. In total, we detected 71 arthropod prey species, 39 families, and 10 orders. Of these, 40 species were uniquely detected by ANML, whereas only 11 were uniquely detected by ZBJ. We were able to classify 54.8% of exact sequence variants from ANML libraries to species compared to 33.3% from ZBJ libraries. We found that ANML outperformed ZBJ for PCR efficacy, taxonomic coverage, and specificity of classification, but that using both primer sets together produced the most comprehensive characterizations of diet composition. Significant variation in both alpha- and beta-diversity between sites was found using each primer set separately and in combination. To our knowledge, this is the first published metabarcoding study using ANML primers to describe avian diet, and also the first to directly compare results returned by ANML and ZBJ primer sets.

2021 ◽  
Vol 4 ◽  
Author(s):  
Daniel Teixeira ◽  
Heron Hilário ◽  
Gustavo Rosa ◽  
Guilherme Santos ◽  
Gilmar Santos ◽  
...  

The study of ichthyoplankton composition, abundance and distribution is paramount to understand the reproductive dynamics of local fish assemblages. The analysis of these parameters allows the identification of spawning sites, nursery areas and migration routes. However, due to the lack of characters in early life stages, the morphological identification of ichthyoplankton is often impractical and many studies identify only fish larvae. Additionally, its accuracy shows great variation between taxonomists and laboratories according to their experience and specialty. DNA barcoding emerged as an alternative to provide assertive identification of fish eggs and larvae, but it becomes too expensive and laborious when the study demands the processing of huge amounts of organisms. DNA metabarcoding can overcome these limitations as a rapid, cost-effective, broad and accurate taxonomy tool, allowing the identification of multiple individuals simultaneously. Here, we present the identification of a sample containing 68 fish eggs and another containing 293 fish larvae from a single site in the São Francisco River Basin, Eastern Brazil, through DNA metabarcoding. We used a low-cost saline DNA extraction followed by PCR amplification with three primer sets targeting the 12S rRNA gene: MiFish (~170bp), Teleo_1 (~60bp), and NeoFish (~190bp). The latter was recently developed by our research group specifically for the identification of Neotropical fishes. All the amplified samples were sequenced in a single multiplexed Illumina MiniSeq run. We performed the filtering steps and assigned Amplicon Sequence Variants (ASVs) using a DADA2/Phyloseq based pipeline and a custom 12S reference sequence database including 101 species and 70 genera from the Jequitinhonha and São Francisco basins. The species Cyphocharax gilbert, Leporinus taeniatus, Megaleporinus elongatus, Prochilodus argenteus, P. costatus and Psalidodon fasciatus were detected by all three primer sets in the larva pool, while Pterygoplichthys etentaculatus was detected solely by NeoFish (Fig. 1). Within the egg pool, all three markers detected the species Characidium zebra, Curimatella lepidura, M. elongatus, Pimelodus fur and P. costatus, but Brycon orthotaenia was detected only by NeoFish, P. maculatus only by Teleo, and P. pohli by MiFish and Teleo (Fig. 1). The consistency in species detection among all three markers underpins the credibility of this method to accurately describe the sample composition. Considering that most of species were exclusive to the larvae or egg pool, our experiment highlights the importance of including the identification of fish eggs in reproduction studies, as it can provide additional information about which species are spawning in an area. Furthermore, the application of DNA metabarcoding to the study of ichthyoplankton can help decision makers create more informed guidelines for conservation of economically and ecologically important fish species.


2020 ◽  
Vol 47 (6) ◽  
pp. 509
Author(s):  
Melanie J. Young ◽  
Ludovic Dutoit ◽  
Fiona Robertson ◽  
Yolanda van Heezik ◽  
Philip J. Seddon ◽  
...  

Abstract Context. Diet variability is a significant driver of seabird decline; however, data on seabird diet composition and trends have been affected by changes in precision and resolution owing to the evolution of different sampling methods over time. We investigated the effectiveness of applying a passive molecular diet method using faeces obtained from the endangered yellow-eyed penguin. Aims. To assess the feasibility of applying DNA metabarcoding methods to yellow-eyed penguin faeces to evaluate diet, and to compare the reliability of diet results derived from adults and chicks, and from latrine versus fresh faecal samples. Methods. We collected 313 faecal samples from yellow-eyed penguins resident on the Otago coast of New Zealand from October 2016 to August 2017. We used polymerase chain reaction (PCR) with mitochondrial 16S cephalopod and chordate primers to amplify prey DNA present in the faecal samples, and tested the completeness of our assembled reference databases based on previous diet research. Amplified prey DNA sequences were then assigned to taxa from our reference databases by using QIIME2. Key results. Mitochondrial 16S chordate PCR primers were effective at identifying 29 fish taxa, with 98.3% of amplified sequences being identified to species or genus level in 193 samples (61.7% collected). There was no significant difference in the number, occurrence or proportion of ray-finned fish prey DNA sequences derived from fresh samples or latrines. Mitochondrial 16S cephalopod PCR primers classified 1.98% of amplified DNA sequences as targets, with 96.5% of these target sequences being identified to species or genus level in 48 samples (15.3% collected), and five taxa identified. Conclusions. We recommend the collection of latrine samples to enable long-term monitoring of the diet of yellow-eyed penguins, which will optimise the trade-off between wildlife disturbance and dietary resolution. Further refinement is needed to identify cephalopod dietary components for yellow-eyed penguins, because our cephalopod primers were not as specific as those used for ray-finned fishes, amplifying a large number (>98%) of non-cephalopod species. Implications. DNA metabarcoding offers a robust and comprehensive alternative to other, more intrusive, seabird diet-assessment methods, but still requires parallel studies to provide critical information on prey size, true diet composition and diet quality.


2021 ◽  
Author(s):  
Sven Weber ◽  
Lukas Brink ◽  
Manuel Wörner ◽  
Sven Künzel ◽  
Michael Veith ◽  
...  

AbstractMolecular gut content analysis is a popular tool to study food web interactions and was recently also suggested as an alternative source for DNA based biomonitoring. However, the overabundant consumer’s DNA often outcompetes that of its diet during PCR. Blocking approaches are an efficient means to reduce consumer amplification while retaining broad specificity for dietary taxa. We here designed an assay to monitor the eukaryotic diet of mussels and test their utility as biological eDNA filters to monitor planktonic communities. We designed several rDNA primer sets with a broad taxonomic suitability for eukaryotes, which suppress the amplification of mussels. The primers were tested using mussel DNA extracts and the results were compared to eDNA water samples collected next to the mussel colonies. Taxonomic recovery, as well as patterns of alpha and beta diversity, were compared between mussels and water samples. In addition, we analyzed time series samples of mussel samples from different German rivers. Our primer sets efficiently block the amplification of various mussel genera. The recovered DNA reflects a broad dietary preference across the eukaryotic tree of life and considerable taxonomic overlap with filtered water samples. We also recover various taxa of possible commensals and parasites, associated with the mussels. Our protocol will enable large scale dietary analysis in mussels, facilitate aquatic food web analysis, elucidate the ecological impact of invasive bivalves and the rapid survey of mussel aquacultures for pathogens. Moreover, we show that mussels could serve as an interesting complementary DNA source for biomonitoring.


2018 ◽  
Author(s):  
Vasco Elbrecht ◽  
Ecaterina Edith Vamos ◽  
Dirk Steinke ◽  
Florian Leese

Background. DNA metabarcoding is used to generate species composition data for entire communities. However, sequencing errors in high throughput sequencing instruments are fairly common, usually requiring reads to be clustered into operational taxonomic units (OTU), losing information on intraspecific diversity in the process. While COI haplotype information is limited in resolution, it is nevertheless useful in a phylogeographic context, helping to formulate hypothesis on taxon dispersal. Methods. This study combines sequence denoising strategies, normally applied in microbial research, with additional abundance-based filtering to extract haplotypes from freshwater macroinvertebrate metabarcoding data sets. This novel approach was added to the R package "JAMP" and can be applied to Cytochrome c oxidase subunit I (COI) amplicon datasets. We tested our haplotyping method by sequencing i) a single-species mock community composed of 31 individuals with different haplotypes spanning three orders of magnitude in biomass and ii) 18 monitoring samples each amplified with four different primer sets and two PCR replicates. Results. We detected all 15 haplotypes of the single specimens in the mock community with relaxed filtering and denoising settings. However, up to 480 additional unexpected haplotypes remained in both replicates. Rigorous filtering removes most unexpected haplotypes, but also can discard expected haplotypes mainly from the small specimens. In the monitoring samples, the different primer sets detected 177 - 200 OTUs, each containing an average of 2.40 to 3.30 haplotypes per OTU. Population structures were consistent between replicates, and similar between primer pairs, depending on the primer length. A closer look at abundant taxa in the data set revealed various population genetic patterns, e.g. Taeniopteryx nebulosa and Hydropsyche pellucidula with a difference in north-south haplotype distribution, while Oulimnius tuberculatus and Asellus aquaticus display no clear population pattern but differ in genetic diversity. Discussion. We developed a strategy to infer intraspecific genetic diversity from bulk invertebrate monitoring samples using metabarcoding data. It needs to be stressed that at this point metabarcoding-informed haplotyping is not capable of capture the full diversity present in such samples, due to variation in specimen size, primer bias and loss of sequence variants with low abundance. Nevertheless, for a high number of species intraspecific diversity was recovered, identifying potentially isolated populations and potential taxa for further more detailed phylogeographic investigation. While we are currently lacking large-scale metabarcoding data sets to fully take advantage of our new approach, metabarcoding-informed haplotyping holds great promise for biomonitoring efforts that not only seek information about biological diversity but also underlying genetic diversity.


2018 ◽  
Author(s):  
Vasco Elbrecht ◽  
Ecaterina Edith Vamos ◽  
Dirk Steinke ◽  
Florian Leese

Background. DNA metabarcoding is used to generate species composition data for entire communities. However, sequencing errors in high throughput sequencing instruments are fairly common, usually requiring reads to be clustered into operational taxonomic units (OTU), losing information on intraspecific diversity in the process. While COI haplotype information is limited in resolution, it is nevertheless useful in a phylogeographic context, helping to formulate hypothesis on taxon dispersal. Methods. This study combines sequence denoising strategies, normally applied in microbial research, with additional abundance-based filtering to extract haplotypes from freshwater macroinvertebrate metabarcoding data sets. This novel approach was added to the R package "JAMP" and can be applied to Cytochrome c oxidase subunit I (COI) amplicon datasets. We tested our haplotyping method by sequencing i) a single-species mock community composed of 31 individuals with different haplotypes spanning three orders of magnitude in biomass and ii) 18 monitoring samples each amplified with four different primer sets and two PCR replicates. Results. We detected all 15 haplotypes of the single specimens in the mock community with relaxed filtering and denoising settings. However, up to 480 additional unexpected haplotypes remained in both replicates. Rigorous filtering removes most unexpected haplotypes, but also can discard expected haplotypes mainly from the small specimens. In the monitoring samples, the different primer sets detected 177 - 200 OTUs, each containing an average of 2.40 to 3.30 haplotypes per OTU. Population structures were consistent between replicates, and similar between primer pairs, depending on the primer length. A closer look at abundant taxa in the data set revealed various population genetic patterns, e.g. Taeniopteryx nebulosa and Hydropsyche pellucidula with a difference in north-south haplotype distribution, while Oulimnius tuberculatus and Asellus aquaticus display no clear population pattern but differ in genetic diversity. Discussion. We developed a strategy to infer intraspecific genetic diversity from bulk invertebrate monitoring samples using metabarcoding data. It needs to be stressed that at this point metabarcoding-informed haplotyping is not capable of capture the full diversity present in such samples, due to variation in specimen size, primer bias and loss of sequence variants with low abundance. Nevertheless, for a high number of species intraspecific diversity was recovered, identifying potentially isolated populations and potential taxa for further more detailed phylogeographic investigation. While we are currently lacking large-scale metabarcoding data sets to fully take advantage of our new approach, metabarcoding-informed haplotyping holds great promise for biomonitoring efforts that not only seek information about biological diversity but also underlying genetic diversity.


2018 ◽  
Author(s):  
Vasco Elbrecht ◽  
Dirk Steinke

The viability of DNA metabarcoding for assessment of freshwater macrozoobenthos has been demonstrated over the past years. It matured to a stage where it can be applied to monitoring at a large scale, keeping pace with increased high throughput sequencing (HTS) capacity. However, workflows and sample tagging need to be optimized to accommodate for hundreds of samples within a single sequencing run. We here conceptualize a streamlined metabarcoding workflow, in which samples are processed in 96-well plates. Each sample is replicated starting with tissue extraction. Negative and positive controls are included to ensure data reliability. With our newly developed fusion primer sets for the BF2+BR2 primer pair up to three 96-well plates (288 wells) can be uniquely tagged for a single Illumina sequencing run. By including Illumina indices tagging can be extended to thousands of samples. We hope that our metabarcoding workflow will be used as a practical guide for future large-scale biodiversity assessments involving freshwater invertebrates. However, we also want to point out that this is just one approach, and that we hope this article will stimulate discussion and publication of alternatives and extensions.


2018 ◽  
Author(s):  
Vasco Elbrecht ◽  
Dirk Steinke

The viability of DNA metabarcoding for assessment of freshwater macrozoobenthos has been demonstrated over the past years. It matured to a stage where it can be applied to monitoring at a large scale, keeping pace with increased high throughput sequencing (HTS) capacity. However, workflows and sample tagging need to be optimized to accommodate for hundreds of samples within a single sequencing run. We here conceptualize a streamlined metabarcoding workflow, in which samples are processed in 96-well plates. Each sample is replicated starting with tissue extraction. Negative and positive controls are included to ensure data reliability. With our newly developed fusion primer sets for the BF2+BR2 primer pair up to three 96-well plates (288 wells) can be uniquely tagged for a single Illumina sequencing run. By including Illumina indices tagging can be extended to thousands of samples. We hope that our metabarcoding workflow will be used as a practical guide for future large-scale biodiversity assessments involving freshwater invertebrates. However, we also want to point out that this is just one approach, and that we hope this article will stimulate discussion and publication of alternatives and extensions.


2020 ◽  
Author(s):  
Marie Hoffmann ◽  
Michael T. Monaghan ◽  
Knut Reinert

AbstractMotivationDNA metabarcoding is a commonly applied technique used to infer the species composition of environmental samples. These samples can comprise hundreds of organisms that can be closely or very distantly related in the taxonomic tree of life. DNA metabarcoding combines polymerase chain reaction (PCR) and next-generation sequencing (NGS), whereby a short, homologous sequence of DNA is amplified and sequenced from all members of the community. Sequences are then taxonomically identified based on their match to a reference database. Ideally, each species of interest would have a unique DNA barcode. This short, variable sequence needs to be flanked by relatively conserved regions that can be used as primer binding sites. Appropriate PCR primer pairs would match to a broad evolutionary range of taxa, such that we only need a few to achieve high taxonomic coverage. At the same time however, the DNA barcodes between primer pairs should be different to allow us to distinguish between species to improve resolution. This poses an interesting optimization problem. More specifically: Given a set of references ℛ = {R1, R2, …, Rm}, the problem is to find a primer set P balancing both: high taxonomic coverage and high resolution. This goal can be captured by filtering for frequent primers and ranking by coverage or variation, i.e. the number of unique barcodes. Here we present the software PriSeT, an offline primer discovery tool that is capable of processing large libraries and is robust against mislabeled or low quality references. It tackles the computationally expensive steps with linear runtime filters and efficient encodings.ResultsWe first evaluated PriSeT on references (mostly 18S rRNA genes) from 19 clades covering eukaryotic organisms that are typical for freshwater plankton samples. PriSeT recovered several published primer sets as well as additional, more chemically suitable primer sets. For these new sets, we compared frequency, taxon coverage, and amplicon variation with published primer sets. For 11 clades we found de novo primer pairs that cover more taxa than the published ones, and for six clades de novo primers resulted in greater sequence (i.e., DNA barcode) variation. We also applied PriSeT to 19 SARS-CoV-2 genomes and computed 114 new primer pairs with the additional constraint that the sequences have no co-occurrences in other taxa. These primer sets would be suitable for empirical testing.Availabilityhttps://github.com/mariehoffmann/[email protected]


Sign in / Sign up

Export Citation Format

Share Document