scholarly journals Combining 16S rRNA gene variable regions enables high-resolution microbial community profiling

2017 ◽  
Author(s):  
Garold Fuks ◽  
Michael Elgart ◽  
Amnon Amir ◽  
Amit Zeisel ◽  
Peter J. Turnbaugh ◽  
...  

AbstractBackgroundMost of our knowledge about the remarkable microbial diversity on Earth comes from sequencing the 16S rRNA gene. The use of next-generation sequencing methods has increased sample number and sequencing depth, but the read length of the most widely used sequencing platforms today is quite short, requiring the researcher to choose a subset of the gene to sequence (typically 16-33% of the total length). Thus, many bacteria may share the same amplified region and the resolution of profiling is inherently limited. Platforms that offer ultra long read lengths, whole genome shotgun sequencing approaches, and computational frameworks formerly suggested by us and by others, all allow different ways to circumvent this problem yet suffer various shortcomings. There is need for a simple and low cost 16S rRNA gene based profiling approach that harnesses the short read length to provide a much larger coverage of the gene to allow for high resolution, even in harsh conditions of low bacterial biomass and fragmented DNA.ResultsThis manuscript suggests Short MUltiple Regions Framework (SMURF), a method to combine sequencing results from different PCR-amplified regions to provide one coherent profiling. The de facto amplicon length is the total length of all amplified regions, thus providing much higher resolution compared to current techniques. Computationally, the method solves a convex optimization problem that allows extremely fast reconstruction and requires only moderate memory. We demonstrate the increase in resolution by in silico simulations and by profiling two mock mixtures and real-world biological samples. Reanalyzing a mock mixture from the Human Microbiome Project achieved about two-fold improvement in resolution when combing two independent regions. Using a custom set of six primer pairs spanning about 1200bp (80%) of the 16S rRNA gene we were able to achieve ~100 fold improvement in resolution compared to a single region, over a mock mixture of common human gut bacterial isolates. Finally, profiling of a Drosophila melanogaster microbiome using the set of six primer pairs provided a ~100 fold increase in resolution, and thus enabling efficient downstream analysis.ConclusionsSMURF enables identification of near full-length 16S rRNA gene sequences in microbial communities, having resolution superior compared to current techniques. It may be applied to standard sample preparation protocols with very little modifications. SMURF also paves the way to high-resolution profiling of low-biomass and fragmented DNA, e.g., in the case of Formalin-fixed and Paraffin-embedded samples, fossil-derived DNA or DNA exposed to other degrading conditions. The approach is not restricted to combining amplicons of the 16S rRNA gene and may be applied to any set of amplicons, e.g., in Multilocus Sequence Typing (MLST).

2010 ◽  
Vol 56 (12) ◽  
pp. 1040-1049 ◽  
Author(s):  
Michal Slany ◽  
Martina Vanerkova ◽  
Eva Nemcova ◽  
Barbora Zaloudikova ◽  
Filip Ruzicka ◽  
...  

High-resolution melting analysis (HRMA) is a fast (post-PCR) high-throughput method to scan for sequence variations in a target gene. The aim of this study was to test the potential of HRMA to distinguish particular bacterial species of the Staphylococcus genus even when using a broad-range PCR within the 16S rRNA gene where sequence differences are minimal. Genomic DNA samples isolated from 12 reference staphylococcal strains ( Staphylococcus aureus , Staphylococcus capitis , Staphylococcus caprae , Staphylococcus epidermidis , Staphylococcus haemolyticus , Staphylococcus hominis , Staphylococcus intermedius , Staphylococcus saprophyticus , Staphylococcus sciuri , Staphylococcus simulans , Staphylococcus warneri , and Staphylococcus xylosus ) were subjected to a real-time PCR amplification of the 16S rRNA gene in the presence of fluorescent dye EvaGreen™, followed by HRMA. Melting profiles were used as molecular fingerprints for bacterial species differentiation. HRMA of S. saprophyticus and S. xylosus resulted in undistinguishable profiles because of their identical sequences in the analyzed 16S rRNA region. The remaining reference strains were fully differentiated either directly or via high-resolution plots obtained by heteroduplex formation between coamplified PCR products of the tested staphylococcal strain and phylogenetically unrelated strain.


2014 ◽  
Author(s):  
Catherine Burke ◽  
Aaron E Darling

We describe a method for sequencing full-length 16S rRNA gene amplicons using the high throughput Illumina MiSeq platform. The resulting sequences have about 100-fold higher accuracy than standard Illumina reads and are chimera filtered using information from a single molecule dual tagging scheme that boosts the signal available for chimera detection. We demonstrate that the data provides fine scale phylogenetic resolution not available from Illumina amplicon methods targeting smaller variable regions of the 16S rRNA gene.


Microbiome ◽  
2018 ◽  
Vol 6 (1) ◽  
Author(s):  
Garold Fuks ◽  
Michael Elgart ◽  
Amnon Amir ◽  
Amit Zeisel ◽  
Peter J. Turnbaugh ◽  
...  

Author(s):  
Jessica L. O’Callaghan ◽  
Dana Willner ◽  
Melissa Buttini ◽  
Flavia Huygens ◽  
Elise S. Pelzer

The endometrial cavity is an upper genital tract site previously thought as sterile, however, advances in culture-independent, next-generation sequencing technology have revealed that this low-biomass site harbors a rich microbial community which includes multiple Lactobacillus species. These bacteria are considered to be the most abundant non-pathogenic genital tract commensals. Next-generation sequencing of the female lower genital tract has revealed significant variation amongst microbial community composition with respect to Lactobacillus sp. in samples collected from healthy women and women with urogenital conditions. The aim of this study was to evaluate our ability to characterize members of the genital tract microbial community to species-level taxonomy using variable regions of the 16S rRNA gene. Samples were interrogated for the presence of microbial DNA using next-generation sequencing technology that targets the V5–V8 regions of the 16S rRNA gene and compared to speciation using qPCR. We also performed re-analysis of published data using alternate variable regions of the 16S rRNA gene. In this analysis, we explore next-generation sequencing of clinical genital tract isolates as a method for high throughput identification to species-level of key Lactobacillus sp. Data revealed that characterization of genital tract taxa is hindered by a lack of a consensus protocol and 16S rRNA gene region target allowing comparison between studies.


2000 ◽  
Vol 38 (3) ◽  
pp. 953-959 ◽  
Author(s):  
M. S. Hughes ◽  
G. James ◽  
N. Ball ◽  
M. Scally ◽  
R. Malik ◽  
...  

PCR amplifications of the 16S rRNA gene were performed on 46 specimens obtained from 43 dogs with canine leproid granuloma syndrome to help determine its etiology. Sequence capture PCR was applied to 37 paraffin-embedded specimens from 37 dogs, and nested PCR was attempted on DNA from 9 fresh tissue specimens derived from 3 of the 37 aforementioned dogs and from an additional 6 dogs. Molecular analyses of the paraffin-embedded tissues and fresh tissue specimen analyses were performed at separate institutions. PCR products with identical sequences over a 350-bp region encompassing variable regions 2 and 3 of the 16S rRNA gene were obtained from 4 of 37 paraffin-embedded specimens and from all 9 specimens of fresh tissue originating from 12 of the 43 dogs. Identical sequences were determined from amplicons obtained from paraffin-embedded and fresh specimens from one dog. The consensus DNA sequence, amplified from paraffin-embedded tissue and represented by GenBank accession no. AF144747, shared highest nucleotide identity (99.4% over 519 bp) with mycobacterial strain IWGMT 90413 but did not correspond exactly to any EMBL or GenBank database sequence. With a probe derived from the V2 region of the novel canine sequence, reverse cross blot hybridization identified an additional four paraffin-embedded specimens containing the same novel sequence. In total, molecular methodologies identified the proposed novel mycobacterial sequence in 16 of 43 dogs with canine leproid granuloma syndrome, indicating that the species represented by this sequence may be the principal etiological agent of canine leproid granuloma syndrome.


2013 ◽  
Vol 79 (17) ◽  
pp. 5112-5120 ◽  
Author(s):  
James J. Kozich ◽  
Sarah L. Westcott ◽  
Nielson T. Baxter ◽  
Sarah K. Highlander ◽  
Patrick D. Schloss

ABSTRACTRapid advances in sequencing technology have changed the experimental landscape of microbial ecology. In the last 10 years, the field has moved from sequencing hundreds of 16S rRNA gene fragments per study using clone libraries to the sequencing of millions of fragments per study using next-generation sequencing technologies from 454 and Illumina. As these technologies advance, it is critical to assess the strengths, weaknesses, and overall suitability of these platforms for the interrogation of microbial communities. Here, we present an improved method for sequencing variable regions within the 16S rRNA gene using Illumina's MiSeq platform, which is currently capable of producing paired 250-nucleotide reads. We evaluated three overlapping regions of the 16S rRNA gene that vary in length (i.e., V34, V4, and V45) by resequencing a mock community and natural samples from human feces, mouse feces, and soil. By titrating the concentration of 16S rRNA gene amplicons applied to the flow cell and using a quality score-based approach to correct discrepancies between reads used to construct contigs, we were able to reduce error rates by as much as two orders of magnitude. Finally, we reprocessed samples from a previous study to demonstrate that large numbers of samples could be multiplexed and sequenced in parallel with shotgun metagenomes. These analyses demonstrate that our approach can provide data that are at least as good as that generated by the 454 platform while providing considerably higher sequencing coverage for a fraction of the cost.


2009 ◽  
Vol 107 (6) ◽  
pp. 2017-2028 ◽  
Author(s):  
T. Robertson ◽  
S. Bibby ◽  
D. O’Rourke ◽  
T. Belfiore ◽  
H. Lambie ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document