Using PacBio Long-Read High-Throughput Microbial Gene Amplicon Sequencing To Evaluate Infant Formula Safety

Yi Zheng; Xiaoxia Xi; Haiyan Xu; Qiangchuan Hou; Yanfei Bian; Zhongjie Yu; Lai-Yu Kwok; Wenyi Zhang; Zhihong Sun; Heping Zhang

doi:10.1021/acs.jafc.6b01817

Enabling high-accuracy long-read amplicon sequences using unique molecular identifiers with Nanopore or PacBio sequencing

doi:10.1101/645903; 2019; Cited By ~ 25

Author(s): Søren M. Karst; Ryan M. Ziels; Rasmus H. Kirkegaard; Emil A. Sørensen; Daniel McDonald; ...

Keyword(s): High Throughput; Single Molecule; Amplicon Sequencing; High Accuracy; PacBio Sequencing; Consensus Sequences; Oxford Nanopore; Long Read; Genomic Regions; Oxford Nanopore Technologies

Abstract: High-throughput amplicon sequencing of large genomic regions remains challenging for short-read technologies. Here, we report a high-throughput amplicon sequencing approach combining unique molecular identifiers (UMIs) with Oxford Nanopore Technologies or Pacific Biosciences CCS sequencing, yielding high-accuracy single-molecule consensus sequences of large genomic regions. Our approach generates amplicon and genomic sequences of >10,000 bp in length with a mean error rate of 0.0049-0.0006% and a chimera rate of <0.022%.
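
The UMI idea in this abstract can be illustrated with a toy sketch: reads that carry the same UMI are assumed to derive from one template molecule, so a per-position majority vote across them suppresses random sequencing errors. This is not the authors' pipeline. The function name `umi_consensus`, the `min_reads` threshold, and the example reads are hypothetical, and the sketch assumes the reads in each UMI bin are already aligned to equal length, which raw long reads are not.

```python
from collections import Counter, defaultdict


def umi_consensus(reads, min_reads=3):
    """Group (umi, sequence) pairs by UMI and majority-vote each column.

    Assumption: sequences sharing a UMI are pre-aligned and equal in length.
    """
    bins = defaultdict(list)
    for umi, seq in reads:
        bins[umi].append(seq)

    consensus = {}
    for umi, seqs in bins.items():
        if len(seqs) < min_reads:
            continue  # too few copies to out-vote random errors
        length = len(seqs[0])
        if any(len(s) != length for s in seqs):
            raise ValueError(f"reads for UMI {umi} are not of equal length")
        # zip(*seqs) yields one tuple of bases per alignment column
        consensus[umi] = "".join(
            Counter(column).most_common(1)[0][0] for column in zip(*seqs)
        )
    return consensus


# Hypothetical toy input: three copies of one molecule, one with an error,
# plus a singleton UMI that is discarded.
reads = [
    ("AACG", "ACGTACGT"),
    ("AACG", "ACGTACCT"),  # single error at position 6
    ("AACG", "ACGTACGT"),
    ("TTGC", "GGGTACGT"),
]
result = umi_consensus(reads)
```

In practice the consensus step would use alignment and polishing tools rather than a column vote; the sketch only shows why more reads per UMI lower the consensus error rate.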

Simultaneous multiplexed amplicon sequencing and transcriptome profiling in single cells; High-throughput targeted long-read single cell sequencing reveals the clonal and transcriptional landscape of lymphocytes

doi:10.1242/prelights.5740; 2018

Author(s): Samantha Seah

Keyword(s): Single Cell; High Throughput; Single Cells; Transcriptome Profiling; Amplicon Sequencing; Single Cell Sequencing; Long Read; Transcriptional Landscape

Ultra-accurate microbial amplicon sequencing with synthetic long reads

Microbiome; doi:10.1186/s40168-021-01072-3; 2021; Vol 9 (1)

Author(s): Benjamin J. Callahan; Dmitry Grinevich; Siddhartha Thakur; Michael A. Balamotis; Tuval Ben Yehezkel

Keyword(s): Microbial Community; 16S rRNA; Amplicon Sequencing; Species Level; Full Length; 16S rRNA Genes; rRNA Genes; Strain Identification; Long Reads; Long Read

Abstract

Background: Out of the many pathogenic bacterial species that are known, only a fraction are readily identifiable directly from a complex microbial community using standard next-generation DNA sequencing. Long-read sequencing offers the potential to identify a wider range of species and to differentiate between strains within a species, but attaining sufficient accuracy in complex metagenomes remains a challenge.

Methods: Here, we describe and analytically validate LoopSeq, a commercially available synthetic long-read (SLR) sequencing technology that generates highly accurate long reads from standard short reads.

Results: LoopSeq reads are sufficiently long and accurate to identify microbial genes and species directly from complex samples. LoopSeq perfectly recovered the full diversity of 16S rRNA genes from known strains in a synthetic microbial community. Full-length LoopSeq reads had a per-base error rate of 0.005%, which exceeds the accuracy reported for other long-read sequencing technologies. 18S-ITS and genomic sequencing of fungal and bacterial isolates confirmed that LoopSeq sequencing maintains that accuracy for reads up to 6 kb in length. LoopSeq full-length 16S rRNA reads could accurately classify organisms down to the species level in rinsate from retail meat samples, and could differentiate strains within species identified by the CDC as potential foodborne pathogens.

Conclusions: The order-of-magnitude improvement in length and accuracy over standard Illumina amplicon sequencing achieved with LoopSeq enables accurate species-level and strain identification from complex- to low-biomass microbiome samples. The ability to generate accurate and long microbiome sequencing reads using standard short-read sequencers will accelerate the building of quality microbial sequence databases and removes a significant hurdle on the path to precision microbial genomics.
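
To put the reported per-base error rate of 0.005% in per-read terms, the following sketch computes the expected number of errors in a read and the chance that a read is entirely error-free. It assumes errors are independent and uniformly distributed along the read, which is a simplification. The 1,500 bp length is an assumed round figure for a full-length 16S rRNA read, while 6,000 bp corresponds to the "up to 6 kb" stated in the abstract.

```python
def expected_errors(read_length, per_base_error):
    """Expected number of erroneous bases in one read."""
    return read_length * per_base_error


def prob_error_free(read_length, per_base_error):
    """Probability that every base is correct, assuming independent errors."""
    return (1.0 - per_base_error) ** read_length


ERROR_RATE = 0.005 / 100  # 0.005% per base, as reported in the abstract

for length in (1500, 6000):  # 1500 bp is an assumed full-length 16S size
    print(
        f"{length} bp: "
        f"{expected_errors(length, ERROR_RATE):.3f} expected errors, "
        f"{prob_error_free(length, ERROR_RATE):.1%} of reads error-free"
    )
```

Under these assumptions a 1,500 bp read carries about 0.075 expected errors, so the large majority of full-length 16S reads would be exact, which is what makes single-nucleotide strain discrimination plausible.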

Connecting structure to function with the recovery of over 1000 high-quality metagenome-assembled genomes from activated sludge using long-read sequencing

Nature Communications; doi:10.1038/s41467-021-22203-2; 2021; Vol 12 (1); Cited By ~ 2

Author(s): Caitlin M. Singleton; Francesca Petriglieri; Jannie M. Kristensen; Rasmus H. Kirkegaard; Thomas Y. Michaelsen; ...

Keyword(s): 16S rRNA; Wastewater Treatment Plants; In Situ Hybridisation; Amplicon Sequencing; rRNA Genes; Fluorescence In Situ Hybridisation; Sequencing Data; High Quality; 16S rRNA Amplicon Sequencing; Long Read

Abstract: Microorganisms play crucial roles in water recycling, pollution removal and resource recovery in the wastewater industry. The structure of these microbial communities is increasingly understood based on 16S rRNA amplicon sequencing data. However, such data cannot be linked to functional potential in the absence of high-quality metagenome-assembled genomes (MAGs) for nearly all species. Here, we use long-read and short-read sequencing to recover 1083 high-quality MAGs, including 57 closed circular genomes, from 23 Danish full-scale wastewater treatment plants. The MAGs account for ~30% of the community based on relative abundance, and meet the stringent MIMAG high-quality draft requirements including full-length rRNA genes. We use the information provided by these MAGs in combination with >13 years of 16S rRNA amplicon sequencing data, as well as Raman microspectroscopy and fluorescence in situ hybridisation, to uncover abundant undescribed lineages belonging to important functional groups.

Extremely Halophilic Biohydrogen Producing Microbial Communities from High-Salinity Soil and Salt Evaporation Pond

Fuels; doi:10.3390/fuels2020014; 2021; Vol 2 (2); pp. 241-252

Author(s): Dyah Asri Handayani Taroepratjeka; Tsuyoshi Imai; Prapaipid Chairattanamanokorn; Alissara Reungsang

Keyword(s): Microbial Communities; High Throughput; High Throughput Sequencing; High Salinity; Amplicon Sequencing; Spatial Proximity; Lignocellulosic Waste; Evaporation Pond; Operational Taxonomic Units; Determining Factor

Because extreme halophiles withstand extreme salt concentrations, they can reduce the sterilization and water costs of biohydrogen production from pretreated lignocellulosic waste. This study identifies the dominant hydrogen-producing genera and species among the acclimatized, extremely halotolerant microbial communities taken from two salt-damaged soil locations in Khon Kaen and one location in a salt evaporation pond in Samut Sakhon, Thailand. The V3–V4 regions of the communities' 16S rRNA genes were analyzed using high-throughput amplicon sequencing. A total of 345 operational taxonomic units were obtained, and the sequencing confirmed that Firmicutes was the dominant phylum in all three communities. Halanaerobium fermentans and Halanaerobacter lacunarum were the dominant hydrogen-producing species. Spatial proximity was not found to be a determining factor for similarities between these extremely halophilic microbial communities. Studying these communities can inform strategies to increase biohydrogen molar yield.

Microdiversity and phylogeographic diversification of bacterioplankton in pelagic freshwater systems revealed through long-read amplicon sequencing

Microbiome; doi:10.1186/s40168-020-00974-y; 2021; Vol 9 (1)

Author(s): Yusuke Okazaki; Shohei Fujinaga; Michaela M. Salcher; Cristiana Callieri; Atsushi Tanaka; ...

Keyword(s): 16S rRNA; Regional Scale; Scale Up; Amplicon Sequencing; Freshwater Ecosystems; 16S rRNA Genes; rRNA Genes; rRNA Gene; Metagenomic Sequencing; Long Read

Abstract

Background: Freshwater ecosystems are inhabited by members of cosmopolitan bacterioplankton lineages despite the disconnected nature of these habitats. The lineages are delineated based on >97% 16S rRNA gene sequence similarity, but their intra-lineage microdiversity and phylogeography, which are key to understanding the eco-evolutional processes behind their ubiquity, remain unresolved. Here, we applied long-read amplicon sequencing targeting nearly full-length 16S rRNA genes and the adjacent ribosomal internal transcribed spacer sequences to reveal the intra-lineage diversities of pelagic bacterioplankton assemblages in 11 deep freshwater lakes in Japan and Europe.

Results: Our single nucleotide-resolved analysis, which was validated using shotgun metagenomic sequencing, uncovered 7–101 amplicon sequence variants for each of the 11 predominant bacterial lineages and demonstrated sympatric, allopatric, and temporal microdiversities that could not be resolved through conventional approaches. Clusters of samples with similar intra-lineage population compositions were identified, which consistently supported genetic isolation between Japan and Europe. At a regional scale (up to hundreds of kilometers), dispersal between lakes was unlikely to be a limiting factor, and environmental factors or genetic drift were potential determinants of population composition. The extent of microdiversification varied among lineages, suggesting that highly diversified lineages (e.g., Iluma-A2 and acI-A1) achieve their ubiquity by containing a consortium of genotypes specific to each habitat, while less diversified lineages (e.g., CL500-11) may be ubiquitous due to a small number of widespread genotypes. The lowest extent of intra-lineage diversification was observed among the dominant hypolimnion-specific lineage (CL500-11), suggesting that their dispersal among lakes is not limited despite the hypolimnion being a more isolated habitat than the epilimnion.

Conclusions: Our novel approach complemented the limited resolution of short-read amplicon sequencing and limited sensitivity of the metagenome assembly-based approach, and highlighted the complex ecological processes underlying the ubiquity of freshwater bacterioplankton lineages. To fully exploit the performance of the method, its relatively low read throughput is the major bottleneck to be overcome in the future.
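
The contrast between a >97% similarity threshold and single-nucleotide resolution can be made concrete with a small sketch. It computes percent identity between two pre-aligned sequences and shows that two sequences can fall within the same >97% lineage while still being distinct amplicon sequence variants. The function name `percent_identity` and the 100 bp toy sequences are hypothetical; real analyses align sequences first and use dedicated denoising tools.

```python
def percent_identity(a, b):
    """Percent of matching positions between two pre-aligned sequences.

    Columns where both sequences have a gap ('-') are ignored.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    matches = 0
    for x, y in zip(a, b):
        if x == "-" and y == "-":
            continue
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * matches / compared


# Hypothetical 100 bp reference and a variant with two substitutions.
reference = "ACGT" * 25
variant = reference[:10] + "T" + reference[11:50] + "A" + reference[51:]

identity = percent_identity(reference, variant)
same_lineage = identity > 97.0          # grouped together at the 97% threshold
same_variant = reference == variant     # exact match required for one ASV
```

Here the two sequences share a lineage under the threshold but count as separate variants, which is the kind of microdiversity the threshold-based grouping hides.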

Long-Read Amplicon Sequencing of Nitric Oxide Dismutase (nod) Genes Reveal Diverse Oxygenic Denitrifiers in Agricultural Soils and Lake Sediments

Microbial Ecology; doi:10.1007/s00248-020-01482-0; 2020; Vol 80 (1); pp. 243-247; Cited By ~ 4

Author(s): Baoli Zhu; Zhe Wang; Dheeraj Kanaparthi; Susanne Kublik; Tida Ge; ...

Keyword(s): Nitric Oxide; Lake Sediments; Agricultural Soils; Amplicon Sequencing; Nod Genes; Long Read

Different Amplicon Targets for Sequencing-Based Studies of Fungal Diversity

Applied and Environmental Microbiology; doi:10.1128/aem.00905-17; 2017; Vol 83 (17); Cited By ~ 40

Author(s): Francesca De Filippis; Manolo Laiola; Giuseppe Blaiotta; Danilo Ercolini

Keyword(s): Microbial Ecology; High Throughput; Biological Samples; Fungal Diversity; High Throughput Sequencing; Amplicon Sequencing; Fungal Species; Fungal Communities; ITS2 Region; Mock Community

ABSTRACT: Target-gene amplicon sequencing is the most exploited high-throughput sequencing application in microbial ecology. The targets are taxonomically relevant genes, with 16S rRNA being the gold standard for bacteria. As for fungi, the most commonly used target is the internal transcribed spacer (ITS). However, the uneven ITS length among species may promote preferential amplification and sequencing and incorrect estimation of their abundance. Therefore, the use of different targets is desirable. We evaluated the use of three different target amplicons for the characterization of fungal diversity. After an in silico primer evaluation, we compared three amplicons (the ITS1-ITS2 region [ITS1-2], 18S ribosomal small subunit RNA, and the D1/D2 domain of the 26S ribosomal large subunit RNA), using biological samples and a mock community of common fungal species. All three targets allowed for accurate identification of the species present. Nevertheless, high heterogeneity in ITS1-2 length was found, and this caused an overestimation of the abundance of species with a shorter ITS, while both 18S and 26S amplicons allowed for more reliable quantification. We demonstrated that ITS1-2 amplicon sequencing, although widely used, may lead to an incorrect evaluation of fungal communities, and efforts should be made to promote the use of different targets in sequencing-based microbial ecology studies.

IMPORTANCE: Amplicon-sequencing approaches for fungi may rely on different targets affecting the diversity and abundance of the fungal species. An increasing number of studies will address fungal diversity by high-throughput amplicon sequencing. The description of the communities must be accurate and reliable in order to draw useful insights and to address both ecological and biological questions. By analyzing a mock community and several biological samples, we demonstrate that using different amplicon targets may change the results of fungal microbiota analysis, and we highlight how a careful choice of the target is fundamental for a thorough description of the fungal communities.

1621 Comparisons of microbial populations found in the rumen and in a dual-flow continuous culture fermentation system using high-throughput 16S amplicon sequencing

Journal of Animal Science; doi:10.2527/jam2016-1621; 2016; Vol 94 (suppl_5); pp. 789-789

Author(s): I. J. Salfer; H. E. Larson; M. D. Stern

Keyword(s): Continuous Culture; High Throughput; Amplicon Sequencing; Microbial Populations; Fermentation System; 16S Amplicon Sequencing

High-throughput functional analysis of natural variants in yeast

doi:10.1101/2021.02.26.433108; 2021

Author(s): Chiann-Ling C Yeh; Andreas Tsouris; Joseph Schacherer; Maitreya J. Dunham

Keyword(s): High Throughput; Site Directed Mutagenesis; Loss Of Function; Evolutionary Pattern; Intermediate Phenotypes; Sulfate Limitation; Long Read; Natural Variants; Natural Isolates; Error Correction Algorithm

How natural variation affects phenotype is difficult to determine given our incomplete ability to deduce the functional impact of the polymorphisms detected in a population. Although current computational and experimental tools can predict and measure allele function, there has previously been no assay that does so in a high-throughput manner while also representing haplotypes derived from wild populations. Here, we present such an assay that measures the fitness of hundreds of natural alleles of a given gene without site-directed mutagenesis or DNA synthesis. With a large collection of diverse Saccharomyces cerevisiae natural isolates, we piloted this technique using the gene SUL1, which encodes a high-affinity sulfate permease that, at increased copy number, can improve the fitness of cells grown in sulfate-limited media. We cloned and barcoded all alleles from a collection of over 1000 natural isolates en masse and matched barcodes with their respective variants using PacBio long-read sequencing and a novel error-correction algorithm. We then transformed the reference S288C strain with this library and used barcode sequencing to track growth ability in sulfate limitation of lineages carrying each allele. We show that this approach allows us to measure the fitness conferred by each allele and stratify functional and nonfunctional alleles. Additionally, we pinpoint which polymorphisms in both coding and noncoding regions are detrimental to fitness or are of small effect and result in intermediate phenotypes. Integrating these results with a phylogenetic tree, we observe how often loss-of-function occurs and whether or not there is an evolutionary pattern to our observable phenotypic results. This approach is easily applicable to other genes. Our results complement classic genotype-phenotype mapping strategies and demonstrate a high-throughput approach for understanding the effects of polymorphisms across an entire species which can greatly propel future investigations into quantitative traits.
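
One common way to turn barcode counts from a pooled competition into a fitness estimate is to fit the slope of the log ratio of a barcode's count to a reference count over generations. The sketch below shows that generic calculation; the abstract does not specify the authors' estimator, so this should not be read as their method. The function name `log_ratio_slope`, the `pseudocount` parameter, and the example counts are hypothetical.

```python
import math


def log_ratio_slope(generations, allele_counts, reference_counts, pseudocount=0.5):
    """Least-squares slope of ln(allele/reference) versus generations.

    A positive slope means the barcoded lineage outgrows the reference;
    the pseudocount guards against zero counts.
    """
    if not (len(generations) == len(allele_counts) == len(reference_counts)):
        raise ValueError("all inputs must have the same length")
    if len(generations) < 2:
        raise ValueError("need at least two time points")

    log_ratios = [
        math.log((a + pseudocount) / (r + pseudocount))
        for a, r in zip(allele_counts, reference_counts)
    ]
    n = len(generations)
    mean_x = sum(generations) / n
    mean_y = sum(log_ratios) / n
    sxx = sum((x - mean_x) ** 2 for x in generations)
    if sxx == 0:
        raise ValueError("time points must not all be identical")
    sxy = sum(
        (x - mean_x) * (y - mean_y) for x, y in zip(generations, log_ratios)
    )
    return sxy / sxx


# Hypothetical counts: the allele doubles relative to the reference
# every 5 generations.
generations = [0, 5, 10]
slope = log_ratio_slope(generations, [100, 200, 400], [100, 100, 100], pseudocount=0)
neutral = log_ratio_slope(generations, [100, 120, 90], [100, 120, 90])
```

Alleles could then be stratified into functional and nonfunctional groups by comparing their slopes with those of known controls.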
