scholarly journals Re-purposing software for functional characterization of the microbiome

Microbiome ◽  
2021 ◽  
Vol 9 (1) ◽  
Author(s):  
Laura-Jayne Gardiner ◽  
Niina Haiminen ◽  
Filippo Utro ◽  
Laxmi Parida ◽  
Ed Seabolt ◽  
...  

Abstract Background Widespread bioinformatic resource development generates a constantly evolving and abundant landscape of workflows and software. For analysis of the microbiome, workflows typically begin with taxonomic classification of the microorganisms that are present in a given environment. Additional investigation is then required to uncover the functionality of the microbial community, in order to characterize its currently or potentially active biological processes. Such functional analysis of metagenomic data can be computationally demanding for high-throughput sequencing experiments. Instead, we can directly compare sequencing reads to a functionally annotated database. However, since reads frequently match multiple sequences equally well, analyses benefit from a hierarchical annotation tree, e.g. for taxonomic classification where reads are assigned to the lowest taxonomic unit. Results To facilitate functional microbiome analysis, we re-purpose well-known taxonomic classification tools to allow us to perform direct functional sequencing read classification with the added benefit of a functional hierarchy. To enable this, we develop and present a tree-shaped functional hierarchy representing the molecular function subset of the Gene Ontology annotation structure. We use this functional hierarchy to replace the standard phylogenetic taxonomy used by the classification tools and assign query sequences accurately to the lowest possible molecular function in the tree. We demonstrate this with simulated and experimental datasets, where we reveal new biological insights. Conclusions We demonstrate that improved functional classification of metagenomic sequencing reads is possible by re-purposing a range of taxonomic classification tools that are already well-established, in conjunction with either protein or nucleotide reference databases. We leverage the advances in speed, accuracy and efficiency that have been made for taxonomic classification and translate these benefits for the rapid functional classification of microbiomes. While we focus on a specific set of commonly used methods, the functional annotation approach has broad applicability across other sequence classification tools. We hope that re-purposing becomes a routine consideration during bioinformatic resource development.

2015 ◽  
Vol 15 (6) ◽  
pp. 1403-1414 ◽  
Author(s):  
Johan Bengtsson-Palme ◽  
Martin Hartmann ◽  
Karl Martin Eriksson ◽  
Chandan Pal ◽  
Kaisa Thorell ◽  
...  

2021 ◽  
Author(s):  
Seth Commichaux ◽  
Kiran Javkar ◽  
Harihara Subrahmaniam Muralidharan ◽  
Padmini Ramachandran ◽  
Andrea Ottesen ◽  
...  

Abstract BackgroundMicrobial eukaryotes are nearly ubiquitous in microbiomes on Earth and contribute to many integral ecological functions. Metagenomics is a proven tool for studying the microbial diversity, functions, and ecology of microbiomes, but has been underutilized for microeukaryotes due to the computational challenges they present. For taxonomic classification, the use of a eukaryotic marker gene database can improve the computational efficiency, precision and sensitivity. However, state-of-the-art tools which use marker gene databases implement universal thresholds for classification rather than dynamically learning the thresholds from the database structure, impacting the accuracy of the classification process.ResultsHere we introduce taxaTarget, a method for the taxonomic classification of microeukaryotes in metagenomic data. Using a database of eukaryotic marker genes and a supervised learning approach for training, we learned the discriminatory power and classification thresholds for each 20 amino acid region of each marker gene in our database. This approach provided improved sensitivity and precision compared to other state-of-the-art approaches, with rapid runtimes and low memory usage. Additionally, taxaTarget was better able to detect the presence of multiple closely related species as well as species with no representative sequences in the database. One of the greatest challenges faced during the development of taxaTarget was the general sparsity of available sequences for microeukaryotes. Several algorithms were implemented, including threshold padding, which effectively handled the missing training data and reduced classification errors. Using taxaTarget on metagenomes from human fecal microbiomes, a broader range of genera were detected, including multiple parasites that the other tested tools missed.ConclusionData-driven methods for learning classification thresholds from the structure of an input database can provide granular information about the discriminatory power of the sequences and improve the sensitivity and precision of classification. These methods will help facilitate a more comprehensive analysis of metagenomic data and expand our knowledge about the diverse eukaryotes in microbial communities.


2018 ◽  
Vol 19 (S7) ◽  
Author(s):  
Antonino Fiannaca ◽  
Laura La Paglia ◽  
Massimo La Rosa ◽  
Giosue’ Lo Bosco ◽  
Giovanni Renda ◽  
...  

2018 ◽  
Author(s):  
Raphael Eisenhofer ◽  
Laura Susan Weyrich

The field of paleomicrobiology—the study of ancient microorganisms—is rapidly growing due to recent methodological and technological advancements. It is now possible to obtain vast quantities of DNA data from ancient specimens in a high-throughput manner and use this information to investigate the dynamics and evolution of past microbial communities. However, we still know very little about how the characteristics of ancient DNA influence our ability to accurately assign microbial taxonomies (i.e. identify species) within ancient metagenomic samples. Here, we use both simulated and published metagenomic data sets to investigate how ancient DNA characteristics affect alignment-based taxonomic classification. We find that nucleotide-to-nucleotide, rather than nucleotide-to-protein, alignments are preferable when assigning taxonomies to DNA fragment lengths routinely identified within ancient specimens (<60 bp). We determine that deamination (a form of ancient DNA damage) and random sequence substitutions corresponding to ~100,000 years of genomic divergence minimally impact alignment-based classification. We also test four different reference databases and find that database choice can significantly bias the results of alignment-based taxonomic classification in ancient metagenomic studies. Finally, we perform a reanalysis of previously published ancient dental calculus data, increasing the number of microbial DNA sequences assigned taxonomically by an average of 64.2-fold and identifying microbial species previously unidentified in the original study. Overall, this study enhances our understanding of how ancient DNA characteristics influence alignment-based taxonomic classification of ancient microorganisms and provides recommendations for future paleomicrobiological studies.


2019 ◽  
Vol 14 (1) ◽  
Author(s):  
Jolanta Kawulok ◽  
Michal Kawulok ◽  
Sebastian Deorowicz

Abstract Background Nowadays, not only are single genomes commonly analyzed, but also metagenomes, which are sets of, DNA fragments (reads) derived from microbes living in a given environment. Metagenome analysis is aimed at extracting crucial information on the organisms that have left their traces in an investigated environmental sample.In this study we focus on the MetaSUB Forensics Challenge (organized within the CAMDA 2018 conference) which consists in predicting the geographical origin of metagenomic samples. Contrary to the existing methods for environmental classification that are based on taxonomic or functional classification, we rely on the similarity between a sample and the reference database computed at a reads level. Results We report the results of our extensive experimental study to investigate the behavior of our method and its sensitivity to different parameters. In our tests, we have followed the protocol of the MetaSUB Challenge, which allowed us to compare the obtained results with the solutions based on taxonomic and functional classification. Conclusions The results reported in the paper indicate that our method is competitive with those based on taxonomic classification. Importantly, by measuring the similarity at the reads level, we avoid the necessity of using large databases with annotated gene sequences. Hence our main finding is that environmental classification of metagenomic data can be proceeded without using large databases required for taxonomic or functional classification. Reviewers This article was reviewed by Eran Elhaik, Alexandra Bettina Graf, Chengsheng Zhu, and Andre Kahles.


2015 ◽  
Vol 25 (4) ◽  
pp. 292-299 ◽  
Author(s):  
Neelam M. Nathani ◽  
Ramesh K. Kothari ◽  
Amrutlal K. Patel ◽  
Chaitanya G. Joshi

<b><i>Aim:</i></b> To reassemble <i>Prevotella ruminicola</i> genome from rumen metagenomic data of cattle and buffalo and compare with the published reference genome. <b><i>Method:</i></b> Rumen microbial communities from Mehsani buffaloes (n = 8) and Kankrej cattle (n = 8), each adapted to different proportions of a dry or green roughage diet, were subjected to metagenomic sequencing by Ion Torrent PGM, and subsequent reads were analyzed by MG-RAST. Using reference-guided assembly of the sequences against the published <i>P. ruminicola</i> strain 23, draft genomes of 2.56 and 2.46 Mb were reconstructed from Mehsani buffalo and Kankrej cows, respectively. The genomes were annotated using the RAST Server and carbohydrate active enzyme (CAZyme) analysis. <b><i>Results:</i></b> Taxonomic analysis by MG-RAST revealed <i>P. ruminicola </i>to be the most abundant species present among the rumen microflora. Functional annotation of reconstructed genomes using the RAST Server depicted the maximum assignment of coding sequences involved in the subsystems amino acid and derivatives and carbohydrate metabolism. CAZyme profiling revealed the glycoside hydrolases (GH) family to be the most abundant. GH family subclassification revealed that the extracted genomes had more sequence hits for GH2, GH3, GH92 and GH97 as compared to the reference. <b><i>Conclusion:</i></b> The results reflect the metabolic significance of rumen-adapted <i>P. ruminicola</i> in utilizing a coarse diet for animals based on acquisition of novel genetic elements.


2018 ◽  
Author(s):  
Raphael Eisenhofer ◽  
Laura Susan Weyrich

The field of paleomicrobiology—the study of ancient microorganisms—is rapidly growing due to recent methodological and technological advancements. It is now possible to obtain vast quantities of DNA data from ancient specimens in a high-throughput manner and use this information to investigate the dynamics and evolution of past microbial communities. However, we still know very little about how the characteristics of ancient DNA influence our ability to accurately assign microbial taxonomies (i.e. identify species) within ancient metagenomic samples. Here, we use both simulated and published metagenomic data sets to investigate how ancient DNA characteristics affect alignment-based taxonomic classification. We find that nucleotide-to-nucleotide, rather than nucleotide-to-protein, alignments are preferable when assigning taxonomies to DNA fragment lengths routinely identified within ancient specimens (<60 bp). We determine that deamination (a form of ancient DNA damage) and random sequence substitutions corresponding to ~100,000 years of genomic divergence minimally impact alignment-based classification. We also test four different reference databases and find that database choice can significantly bias the results of alignment-based taxonomic classification in ancient metagenomic studies. Finally, we perform a reanalysis of previously published ancient dental calculus data, increasing the number of microbial DNA sequences assigned taxonomically by an average of 64.2-fold and identifying microbial species previously unidentified in the original study. Overall, this study enhances our understanding of how ancient DNA characteristics influence alignment-based taxonomic classification of ancient microorganisms and provides recommendations for future paleomicrobiological studies.


PeerJ ◽  
2019 ◽  
Vol 7 ◽  
pp. e6594 ◽  
Author(s):  
Raphael Eisenhofer ◽  
Laura Susan Weyrich

The field of palaeomicrobiology—the study of ancient microorganisms—is rapidly growing due to recent methodological and technological advancements. It is now possible to obtain vast quantities of DNA data from ancient specimens in a high-throughput manner and use this information to investigate the dynamics and evolution of past microbial communities. However, we still know very little about how the characteristics of ancient DNA influence our ability to accurately assign microbial taxonomies (i.e. identify species) within ancient metagenomic samples. Here, we use both simulated and published metagenomic data sets to investigate how ancient DNA characteristics affect alignment-based taxonomic classification. We find that nucleotide-to-nucleotide, rather than nucleotide-to-protein, alignments are preferable when assigning taxonomies to short DNA fragment lengths routinely identified within ancient specimens (<60 bp). We determine that deamination (a form of ancient DNA damage) and random sequence substitutions corresponding to ∼100,000 years of genomic divergence minimally impact alignment-based classification. We also test four different reference databases and find that database choice can significantly bias the results of alignment-based taxonomic classification in ancient metagenomic studies. Finally, we perform a reanalysis of previously published ancient dental calculus data, increasing the number of microbial DNA sequences assigned taxonomically by an average of 64.2-fold and identifying microbial species previously unidentified in the original study. Overall, this study enhances our understanding of how ancient DNA characteristics influence alignment-based taxonomic classification of ancient microorganisms and provides recommendations for future palaeomicrobiological studies.


Physiotherapy ◽  
2013 ◽  
Vol 21 (3) ◽  
Author(s):  
Natalia Uścinowicz ◽  
Wojciech Seidel ◽  
Paweł Zostawa ◽  
Sebastian Klich

AbstractThe recent Olympic Games in London incited much interest in the competition of disabled athletes. Various people connected with swimming, including coaches and athletes, have speculated about the fairness of competitions of disabled athletes. A constant problem are the subjective methods of classification in disabled sport. Originally, athletes with disabilities were classified according to medical diagnosis. Due to the injustice which still affects the competitors, functional classification was created shortly after. In the present review, the authors show the anomalies in the structure of the classification. The presented discovery led to the suggestion to introduce objective methods, thanks to which it would be no longer necessary to rely on the subjective assessment of the classifier. According to the authors, while using objective methods does not completely rule out the possibility of fraud by disabled athletes in the classification process, it would certainly reduce their incidence. Some of the objective methods useful for the classification of disabled athletes are: posturography, evaluation of the muscle parameters, electrogoniometric assessment, surface electromyography, and analysis of kinematic parameters. These methods have provide objective evaluation in the diagnostic sense but only if they are used in tandem. The authors demonstrate the undeniable benefits of using objective methods. Unfortunately, there are not only advantages of such solution, there also several drawbacks to be found. The conclusion of the article is the statement by the authors that it is right to use objective methods which allow to further the most important rule in sport: fair-play.


2020 ◽  
Vol 73 (3) ◽  
pp. 358-367
Author(s):  
Júlio Cezar Rebés Azambuja Filho ◽  
Paulo Cesar de Faccio Carvalho ◽  
Olivier Jean François Bonnet ◽  
Denis Bastianelli ◽  
Magali Jouven

Sign in / Sign up

Export Citation Format

Share Document