scholarly journals PhenoGMM: Gaussian mixture modelling of microbial cytometry data enables efficient predictions of biodiversity

2019 ◽  
Author(s):  
Peter Rubbens ◽  
Ruben Props ◽  
Frederiek-Maarten Kerckhof ◽  
Nico Boon ◽  
Willem Waegeman

AbstractMicrobial flow cytometry allows to rapidly characterize microbial communities. Recent research has demonstrated a moderate to strong connection between the cytometric diversity and taxonomic diversity based on 16S rRNA gene amplicon sequencing data. This creates the opportunity to integrate both types of data to study and predict the microbial community diversity in an automated and efficient way. However, microbial flow cytometry data results in a number of unique challenges that need to be addressed. The results of our work are threefold: i) We expand current microbial cytometry fingerprinting approaches by proposing and validating a model-based fingerprinting approach based upon Gaussian Mixture Models, which we called PhenoGMM. ii) We show that microbial diversity can be rapidly estimated by PhenoGMM. In combination with a supervised machine learning model, diversity estimations based on 16S rRNA gene amplicon sequencing data can be predicted. iii) We evaluate our method extensively by using multiple datasets from different ecosystems and compare its predictive power with a generic binning fingerprinting approach that is commonly used in microbial flow cytometry. These results demonstrate the strong connection between the genetic make-up of a microbial community and its phenotypic properties as measured by flow cytometry. Our workflow facilitates the study of microbial diversity and community dynamics using flow cytometry in a fast and quantitative way.ImportanceMicroorganisms are vital components in various ecoystems on Earth. In order to investigate the microbial diversity, researchers have largely relied on the analysis of 16S rRNA gene sequences from DNA. Flow cytometry has been proposed as an alternative technique to characterize microbial community diversity and dynamics. It is an optical technique, able to rapidly characterize a number of phenotypic properties of individual cells. So-called fingerprinting techniques are needed in order to describe microbial community diversity and dynamics based on flow cytometry data. In this work, we propose a more advanced fingerprinting strategy based on Gaussian Mixture Models. When samples have been analyzed by both flow cytometry and 16S rRNA gene amplicon sequencing, we show that supervised machine learning models can be used to find the relationship between the two types of data. We evaluate our workflow on datasets from different ecosystems, illustrating its general applicability for the analysisof microbial flow cytometry data. PhenoGMM facilitates the rapid characterization and predictive modelling of microbial diversity using flow cytometry.


2020 ◽  
Vol 11 ◽  
Author(s):  
Daniel Straub ◽  
Nia Blackwell ◽  
Adrian Langarica-Fuentes ◽  
Alexander Peltzer ◽  
Sven Nahnsen ◽  
...  


2020 ◽  
Vol 178 ◽  
pp. 115815 ◽  
Author(s):  
Theo Y.C. Lam ◽  
Ran Mei ◽  
Zhuoying Wu ◽  
Patrick K.H. Lee ◽  
Wen-Tso Liu ◽  
...  


2015 ◽  
Author(s):  
Alfonso Benítez-Páez ◽  
Kevin J. Portune ◽  
Yolanda Sanz

AbstractBackgroundThe miniaturised and portable DNA sequencer MinIONTM has been released to the scientific community within the framework of an early access programme to evaluate its application for a wide variety of genetic approaches. This technology has demonstrated great potential, especially in genome-wide analyses. In this study, we tested the ability of the MinIONTM system to perform amplicon sequencing in order to design new approaches to study microbial diversity using nearly full-length 16S rDNA sequences.ResultsUsing R7.3 chemistry, we generated more than 3.8 million events (nt) during a single sequencing run. These data were sufficient to reconstruct more than 90% of the 16S rRNA gene sequences for 20 different species present in a mock reference community. After read mapping and 16S rRNA gene assembly, consensus sequences and 2d reads were recovered to assign taxonomic classification down to the species level. Additionally, we were able to measure the relative abundance of all the species present in a mock community and detected a biased species distribution originating from the PCR reaction using ‘universal’ primers.ConclusionsAlthough nanopore-based sequencing produces reads with lower per-base accuracy compared with other platforms, the MinIONTM DNA sequencer is valuable for both high taxonomic resolution and microbial diversity analysis. Improvements in nanopore chemistry, such as minimising base-calling errors and the nucleotide bias reported here for 16S amplicon sequencing, will further deliver more reliable information that is useful for the specific detection of microbial species and strains in complex ecosystems.



mSphere ◽  
2021 ◽  
Vol 6 (1) ◽  
Author(s):  
Peter Rubbens ◽  
Ruben Props ◽  
Frederiek-Maarten Kerckhof ◽  
Nico Boon ◽  
Willem Waegeman

ABSTRACT Microbial flow cytometry can rapidly characterize the status of microbial communities. Upon measurement, large amounts of quantitative single-cell data are generated, which need to be analyzed appropriately. Cytometric fingerprinting approaches are often used for this purpose. Traditional approaches either require a manual annotation of regions of interest, do not fully consider the multivariate characteristics of the data, or result in many community-describing variables. To address these shortcomings, we propose an automated model-based fingerprinting approach based on Gaussian mixture models, which we call PhenoGMM. The method successfully quantifies changes in microbial community structure based on flow cytometry data, which can be expressed in terms of cytometric diversity. We evaluate the performance of PhenoGMM using data sets from both synthetic and natural ecosystems and compare the method with a generic binning fingerprinting approach. PhenoGMM supports the rapid and quantitative screening of microbial community structure and dynamics. IMPORTANCE Microorganisms are vital components in various ecosystems on Earth. In order to investigate the microbial diversity, researchers have largely relied on the analysis of 16S rRNA gene sequences from DNA. Flow cytometry has been proposed as an alternative technology to characterize microbial community diversity and dynamics. The technology enables a fast measurement of optical properties of individual cells. So-called fingerprinting techniques are needed in order to describe microbial community diversity and dynamics based on flow cytometry data. In this work, we propose a more advanced fingerprinting strategy based on Gaussian mixture models. We evaluated our workflow on data sets from both synthetic and natural ecosystems, illustrating its general applicability for the analysis of microbial flow cytometry data. PhenoGMM supports a rapid and quantitative analysis of microbial community structure using flow cytometry.



2021 ◽  
Vol 10 (2) ◽  
Author(s):  
Junho Lee ◽  
Ilwon Jeong ◽  
Jong-Oh Kim ◽  
Kyunghoi Kim

ABSTRACT The Yeosu New Harbor in the South Korean benthic environment shows a mesotrophic environment affected by the Tsushima Current and the Seomjin River. Here, we report microbial diversity in sediments of Yeosu New Harbor based on 16S rRNA gene amplicon sequencing. The dominant bacterial phylum was Proteobacteria (relative abundance, 72.5 to 78.1%).



PLoS ONE ◽  
2014 ◽  
Vol 9 (4) ◽  
pp. e93827 ◽  
Author(s):  
Rachel Poretsky ◽  
Luis M. Rodriguez-R ◽  
Chengwei Luo ◽  
Despina Tsementzi ◽  
Konstantinos T. Konstantinidis


2021 ◽  
Vol 10 (27) ◽  
Author(s):  
Nur Indradewi Oktavitri ◽  
Jong-Oh Kim ◽  
Kyunghoi Kim

Benthic microbial diversity in Tongyeong, South Korea, was analyzed using next-generation sequencing of the 16S rRNA genes, to reveal the effects of seasonal variations on the microbial community in sediment. Proteobacteria was the dominant phylum, with a relative abundance of 61.5 to 68.1%.



Author(s):  
Daniel Straub ◽  
Nia Blackwell ◽  
Adrian Langarica Fuentes ◽  
Alexander Peltzer ◽  
Sven Nahnsen ◽  
...  

AbstractOne of the major methods to identify microbial community composition, to unravel microbial population dynamics, and to explore microbial diversity in environmental samples is DNA- or RNA-based 16S rRNA (gene) amplicon sequencing. Subsequent bioinformatics analyses are required to extract valuable information from the high-throughput sequencing approach. However, manifold bioinformatics tools complicate their choice and might cause differences in data interpretation, making the selection of the pipeline a crucial step.Here, we compared the performance of most widely used 16S rRNA gene amplicon sequencing analysis tools (i.e. Mothur, QIIME1, QIIME2, and MEGAN) using mock datasets and environmental samples from contrasting terrestrial and freshwater sites. Our results showed that QIIME2 outcompeted all other investigated tools in sequence recovery (>10 times less false positives), taxonomic assignments (>22% better F-score) and diversity estimates (>5% better assessment), while there was still room for improvement e.g. imperfect sequence recovery (recall up to 87%) or detection of additional false sequences (precision up to 72%). Furthermore, we found that microbial diversity estimates and highest abundant taxa varied among analysis pipelines (i.e. only one in five genera was shared among all analysis tools) when analyzing environmental samples, which might skew biological conclusions.Our findings were subsequently implemented in a high-performance computing conformant workflow following the FAIR (Findable, Accessible, Interoperable, and Re-usable) principle, allowing reproducible 16S rRNA gene amplicon sequence analysis starting from raw sequence files. Our presented workflow can be utilized for future studies, thereby facilitating the analysis of high-throughput DNA- or RNA-based 16S rRNA (gene) sequencing data substantially.ImportanceMicroorganisms play an essential role in biogeochemical cycling events across the globe. Phylogenetic marker gene analysis is a widely used method to explore microbial community dynamics in space and time, to predict the ecological relevance of microbial populations, or to identify microbial key players in biogeochemical cycles. Several computational analysis methods were developed to aid 16S rRNA gene analysis but choosing the best method is not trivial. In this study, we compared popular analysis methods (i.e. Mothur, QIIME1 and 2, and MEGAN) using samples with known microbial composition (i.e. mock community samples) and environmental samples from contrasting habitats (i.e. groundwater, soil, sediment, and river water). Our findings provide guidance for choosing the currently optimal 16S rRNA gene sequencing analysis method and we implemented our recommended pipeline into a reproducible workflow, which follows highest bioinformatics standards and is open source and free to use.



Diversity ◽  
2021 ◽  
Vol 13 (10) ◽  
pp. 494
Author(s):  
Camila G. C. Lemes ◽  
Morghana M. Villa ◽  
Érica B. Felestrino ◽  
Luiza O. Perucci ◽  
Renata A. B. Assis ◽  
...  

The Iron Quadrangle (IQ) is one of the main iron ore producing regions of the world. The exploitation of its reserves jeopardizes the high biological endemism associated with this region. This work aimed to understand the diversity and bacterial potential associated with IQ caves. Floor and ceiling samples of seven ferruginous caves and one quartzite cave were collected, and their microbial relative abundance and diversity were established by 16S rRNA gene amplicon sequencing data. The results showed that ferruginous caves present higher microbial abundance and greater microbial diversity compared to the quartzite cave. Many species belonging to genera found in these caves, such as Pseudonocardia and Streptacidiphilus, are known to produce biomolecules of biotechnological interest as macrolides and polyketides. Moreover, comparative analysis of microbial diversity and metabolic potential in a biofilm in pendant microfeature revealed that the microbiota associated with this structure is more similar to the floor rather than ceiling samples, with the presence of genera that may participate in the genesis of these cavities, for instance, Ferrovum, Geobacter, and Sideroxydans. These results provide the first glimpse of the microbial life in these environments and emphasize the need of conservation programs for these areas, which are under intense anthropogenic exploration.



Sign in / Sign up

Export Citation Format

Share Document