Resolving microbial microdiversity with high accuracy full length 16S rRNA Illumina sequencing

2014 ◽  
Author(s):  
Catherine Burke ◽  
Aaron E Darling

We describe a method for sequencing full-length 16S rRNA gene amplicons using the high-throughput Illumina MiSeq platform. The resulting sequences have about 100-fold higher accuracy than standard Illumina reads and are chimera filtered using information from a single-molecule dual tagging scheme that boosts the signal available for chimera detection. We demonstrate that the data provide fine-scale phylogenetic resolution not available from Illumina amplicon methods targeting smaller variable regions of the 16S rRNA gene.

PeerJ ◽  
2016 ◽  
Vol 4 ◽  
pp. e2492 ◽  
Author(s):  
Catherine M. Burke ◽  
Aaron E. Darling

Background: The bacterial 16S rRNA gene has historically been used in defining bacterial taxonomy and phylogeny. However, there are currently no high-throughput methods to sequence full-length 16S rRNA genes present in a sample with precision.
Results: We describe a method for sequencing near full-length 16S rRNA gene amplicons using the high-throughput Illumina MiSeq platform and test it using DNA from human skin swab samples. Proof of principle of the approach is demonstrated, with the generation of 1,604 sequences greater than 1,300 nt from a single Nano MiSeq run, with accuracy estimated to be 100-fold higher than standard Illumina reads. The reads were chimera filtered using information from a single-molecule dual tagging scheme that boosts the signal available for chimera detection.
Conclusions: This method could be scaled up to generate many thousands of sequences per MiSeq run and could be applied to other sequencing platforms. This has great potential for populating databases with high-quality, near full-length 16S rRNA gene sequences from under-represented taxa and environments and facilitates analyses of microbial communities at higher resolution.
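
The abstract does not spell out how the dual tagging scheme is used for chimera filtering. The sketch below illustrates one plausible reading, under a stated assumption: every template molecule carries one unique tag at each end, so a given tag should co-occur with exactly one partner tag. The function name `flag_chimeric_pairs`, the abundance-based rule, and the toy tags are hypothetical and are not taken from the paper.

```python
from collections import Counter


def flag_chimeric_pairs(tag_pairs):
    """Split observed (left_tag, right_tag) pairs into trusted and suspect sets.

    Assumption (not from the paper): a tag pair is trusted only if each of
    its two tags has that pairing as its most abundant partner. Any other
    pairing is treated as a likely PCR chimera.
    """
    counts = Counter(tag_pairs)

    # Record the most abundant partner for every left tag and every right tag.
    best_for_left = {}
    best_for_right = {}
    for (left, right), n in counts.items():
        if n > best_for_left.get(left, (None, 0))[1]:
            best_for_left[left] = (right, n)
        if n > best_for_right.get(right, (None, 0))[1]:
            best_for_right[right] = (left, n)

    trusted, suspect = set(), set()
    for left, right in counts:
        if best_for_left[left][0] == right and best_for_right[right][0] == left:
            trusted.add((left, right))
        else:
            suspect.add((left, right))
    return trusted, suspect


# Toy input: tags A/X and B/Y mark two real molecules; A/Y is a rare recombinant.
reads = [("A", "X")] * 5 + [("B", "Y")] * 4 + [("A", "Y")]
trusted, suspect = flag_chimeric_pairs(reads)
```

In this toy input the rare `("A", "Y")` pairing lands in `suspect`, because both of its tags are seen far more often with other partners. A real pipeline would also need to tolerate sequencing errors within the tags themselves, which this sketch ignores.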


2020 ◽  
Vol 66 (9) ◽  
pp. 495-504
Author(s):  
Yan Zheng ◽  
Xiaolong Hu ◽  
Zhongjun Jia ◽  
Paul L.E. Bodelier ◽  
Zhiying Guo ◽  
...  

It is widely believed that the quality and characteristics of Chinese strong-flavor liquor (CSFL) are closely related to the age of the pit mud; CSFL produced from older pit mud tastes better. This study aimed to investigate the alteration and interaction of prokaryotic communities across an age gradient in pit mud. Prokaryotic microbes in different-aged pit mud (1, 6, and 10 years old) were analyzed by Illumina MiSeq sequencing of the 16S rRNA gene. Analysis of the 16S rRNA gene indicated that the prokaryotic community was significantly altered with pit mud age. There was a significant increase in the genera Methanosarcina, Methanobacterium, and Aminobacterium with increased age of pit mud, while the genus Lactobacillus showed a significant decreasing trend. Network analysis demonstrated that both synergetic co-occurrence and niche competition were dominated by 68 prokaryotic genera. These genera formed 10 hubs of co-occurrence patterns, mainly under the phyla Firmicutes, Euryarchaeota, and Bacteroidetes, playing important roles in the ecosystem stability of the pit mud. Environmental variables (pH, NH4+, available P, available K, and Ca2+) correlated significantly with prokaryotic community assembly. The interaction of prokaryotic communities in the pit mud ecosystem and the relationship among prokaryotic communities and environmental factors contribute to the higher quality of the pit mud in older fermentation pits.


2017 ◽  
Author(s):  
Garold Fuks ◽  
Michael Elgart ◽  
Amnon Amir ◽  
Amit Zeisel ◽  
Peter J. Turnbaugh ◽  
...  

Background: Most of our knowledge about the remarkable microbial diversity on Earth comes from sequencing the 16S rRNA gene. The use of next-generation sequencing methods has increased sample number and sequencing depth, but the read length of the most widely used sequencing platforms today is quite short, requiring the researcher to choose a subset of the gene to sequence (typically 16-33% of the total length). Thus, many bacteria may share the same amplified region and the resolution of profiling is inherently limited. Platforms that offer ultra-long read lengths, whole-genome shotgun sequencing approaches, and computational frameworks formerly suggested by us and by others all offer different ways to circumvent this problem, yet each suffers from various shortcomings. There is a need for a simple and low-cost 16S rRNA gene-based profiling approach that harnesses the short read length to provide much larger coverage of the gene and so allow high resolution, even in harsh conditions of low bacterial biomass and fragmented DNA.
Results: This manuscript suggests the Short MUltiple Regions Framework (SMURF), a method to combine sequencing results from different PCR-amplified regions to provide one coherent profiling. The de facto amplicon length is the total length of all amplified regions, thus providing much higher resolution compared to current techniques. Computationally, the method solves a convex optimization problem that allows extremely fast reconstruction and requires only moderate memory. We demonstrate the increase in resolution by in silico simulations and by profiling two mock mixtures and real-world biological samples. Reanalyzing a mock mixture from the Human Microbiome Project achieved about a two-fold improvement in resolution when combining two independent regions. Using a custom set of six primer pairs spanning about 1200 bp (80%) of the 16S rRNA gene, we were able to achieve a ~100-fold improvement in resolution compared to a single region, over a mock mixture of common human gut bacterial isolates. Finally, profiling of a Drosophila melanogaster microbiome using the set of six primer pairs provided a ~100-fold increase in resolution, thus enabling efficient downstream analysis.
Conclusions: SMURF enables identification of near full-length 16S rRNA gene sequences in microbial communities, with resolution superior to current techniques. It may be applied to standard sample preparation protocols with very few modifications. SMURF also paves the way to high-resolution profiling of low-biomass and fragmented DNA, e.g., in the case of formalin-fixed and paraffin-embedded samples, fossil-derived DNA, or DNA exposed to other degrading conditions. The approach is not restricted to combining amplicons of the 16S rRNA gene and may be applied to any set of amplicons, e.g., in Multilocus Sequence Typing (MLST).
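
The coverage figures quoted above can be checked with a little arithmetic. The sketch below assumes a gene length of about 1500 bp, which is consistent with 1200 bp being 80% of the gene. The six region coordinates are hypothetical placeholders chosen to total 1200 bp and are not the primer positions used by the authors.

```python
GENE_LENGTH_BP = 1500  # assumed approximate length of the 16S rRNA gene


def covered_length(regions):
    """Number of bases covered by half-open (start, end) regions.

    Overlapping or touching regions are merged so that no base is counted
    twice. This is a design choice of the sketch, not a detail from SMURF.
    """
    total = 0
    current_start = current_end = None
    for start, end in sorted(regions):
        if current_end is None or start > current_end:
            # Close the previous merged block before starting a new one.
            if current_end is not None:
                total += current_end - current_start
            current_start, current_end = start, end
        else:
            current_end = max(current_end, end)
    if current_end is not None:
        total += current_end - current_start
    return total


# Hypothetical coordinates for six amplified regions of 200 bp each.
six_regions = [(0, 200), (200, 400), (450, 650),
               (700, 900), (950, 1150), (1200, 1400)]

multi_bp = covered_length(six_regions)
multi_fraction = multi_bp / GENE_LENGTH_BP
single_fraction = covered_length([(0, 250)]) / GENE_LENGTH_BP
```

With these placeholder coordinates the six regions cover 1200 bp, or 80% of the assumed gene length, while a single 250 bp region covers roughly 17%, near the lower end of the 16-33% range quoted for single-region sequencing.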


Author(s):  
Patrick D Schloss ◽  
Matthew L Jenior ◽  
Charles C. Koumpouras ◽  
Sarah L Westcott ◽  
Sarah K Highlander

Over the past 10 years, microbial ecologists have largely abandoned sequencing 16S rRNA genes by the Sanger sequencing method and have instead adopted highly parallelized sequencing platforms. These new platforms, such as 454 and Illumina's MiSeq, have allowed researchers to obtain millions of high-quality but short sequences. The result of the added sequencing depth has been significant improvements in experimental design. The tradeoff has been a decline in the number of full-length reference sequences that are deposited into databases. To overcome this problem, we tested the ability of the PacBio Single Molecule, Real-Time (SMRT) DNA sequencing platform to generate sequence reads from the 16S rRNA gene. We generated sequencing data from the V4, V3-V5, V1-V3, V1-V5, V1-V6, and V1-V9 variable regions from within the 16S rRNA gene using DNA from a synthetic mock community and natural samples collected from human feces, mouse feces, and soil. The mock community allowed us to assess the actual sequencing error rate and how that error rate changed when different curation methods were applied. We developed a simple method based on sequence characteristics and quality scores to reduce the observed error rate for the V1-V9 region from 0.69% to 0.027%. This error rate is comparable to what has been observed for the shorter reads generated by 454 and Illumina's MiSeq sequencing platforms. Although the per-base sequencing cost is still significantly more than that of MiSeq, the prospect of supplementing reference databases with full-length sequences from organisms below the limit of detection from the Sanger approach is exciting.
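
To put the reported error rates in perspective, the short sketch below converts them into a fold reduction and into an expected number of errors per read. The 1500 bp read length is an assumption standing in for a full V1-V9 amplicon, and the function names are illustrative only.

```python
def fold_reduction(before_pct, after_pct):
    """How many times smaller the error rate is after curation."""
    return before_pct / after_pct


def expected_errors(error_pct, read_length_bp):
    """Expected number of erroneous bases in one read of the given length."""
    return error_pct / 100 * read_length_bp


READ_LENGTH_BP = 1500  # assumed length of a V1-V9 read, not stated in the abstract

reduction = fold_reduction(0.69, 0.027)
errors_before = expected_errors(0.69, READ_LENGTH_BP)
errors_after = expected_errors(0.027, READ_LENGTH_BP)
```

Under that assumption, curation lowers the error rate by a factor of roughly 25, from about ten expected errors per read to fewer than one.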


2019 ◽  
Vol 10 (1) ◽  
Author(s):  
Jethro S. Johnson ◽  
Daniel J. Spakowicz ◽  
Bo-Young Hong ◽  
Lauren M. Petersen ◽  
Patrick Demkowicz ◽  
...  

The 16S rRNA gene has been a mainstay of sequence-based bacterial analysis for decades. However, high-throughput sequencing of the full gene has only recently become a realistic prospect. Here, we use in silico and sequence-based experiments to critically re-evaluate the potential of the 16S gene to provide taxonomic resolution at species and strain level. We demonstrate that targeting of 16S variable regions with short-read sequencing platforms cannot achieve the taxonomic resolution afforded by sequencing the entire (~1500 bp) gene. We further demonstrate that full-length sequencing platforms are sufficiently accurate to resolve subtle nucleotide substitutions (but not insertions/deletions) that exist between intragenomic copies of the 16S gene. In consequence, we argue that modern analysis approaches must necessarily account for intragenomic variation between 16S gene copies. In particular, we demonstrate that appropriate treatment of full-length 16S intragenomic copy variants has the potential to provide taxonomic resolution of bacterial communities at species and strain level.


Microbiome ◽  
2014 ◽  
Vol 2 (1) ◽  
pp. 6 ◽  
Author(s):  
Douglas W Fadrosh ◽  
Bing Ma ◽  
Pawel Gajer ◽  
Naomi Sengamalay ◽  
Sandra Ott ◽  
...  

2016 ◽  
Author(s):  
Patrick D Schloss ◽  
Matthew L Jenior ◽  
Charles C. Koumpouras ◽  
Sarah L Westcott ◽  
Sarah K Highlander



Author(s):  
Jessica L. O’Callaghan ◽  
Dana Willner ◽  
Melissa Buttini ◽  
Flavia Huygens ◽  
Elise S. Pelzer

The endometrial cavity is an upper genital tract site previously thought to be sterile; however, advances in culture-independent, next-generation sequencing technology have revealed that this low-biomass site harbors a rich microbial community that includes multiple Lactobacillus species. These bacteria are considered to be the most abundant non-pathogenic genital tract commensals. Next-generation sequencing of the female lower genital tract has revealed significant variation in microbial community composition with respect to Lactobacillus sp. in samples collected from healthy women and women with urogenital conditions. The aim of this study was to evaluate our ability to characterize members of the genital tract microbial community to species-level taxonomy using variable regions of the 16S rRNA gene. Samples were interrogated for the presence of microbial DNA using next-generation sequencing technology that targets the V5–V8 regions of the 16S rRNA gene, and the results were compared to speciation using qPCR. We also performed re-analysis of published data using alternate variable regions of the 16S rRNA gene. In this analysis, we explore next-generation sequencing of clinical genital tract isolates as a method for high-throughput identification to species level of key Lactobacillus sp. Data revealed that characterization of genital tract taxa is hindered by the lack of a consensus protocol and of a common 16S rRNA gene target region that would allow comparison between studies.

