PathoFact: A pipeline for the prediction of virulence factors and antimicrobial resistance genes in metagenomic data
AbstractBackgroundPathogenic microorganisms cause disease by invading, colonizing and damaging their host. Virulence factors including bacterial toxins contribute to their pathogenicity. Additionally, antimicrobial resistance genes allow pathogens to evade otherwise curative treatments. To understand causal relationships between microbiome compositions, functioning, and disease, it is therefore essential to identify virulence factors and antimicrobial resistance genes in metagenomic datasets. At present, there is a clear lack of computational approaches to simultaneously identifying these factors. Here we present PathoFact, a tool for the contextualized prediction of virulence factors and antimicrobial resistance genes in metagenomic data.ResultsPathoFact predicts virulence factors, bacterial toxins and antimicrobial resistance genes with high accuracy (0.92, 0.83 and 0.99) and specificity (0.96, 0.99 and 0.98), respectively. The performance of PathoFact was furthermore demonstrated on three publicly available case-control metagenomic datasets representing an actual infection as well as chronic diseases in which either pathogenic potential or bacterial toxins were predicted to play a role. With PathoFact, we identified virulence factors (including toxins) and antimicrobial resistance genes, and identified signature genes which differentiated between the disease and control groups.ConclusionPathoFact is an easy-to-use, modular, and reproducible pipeline for the identification of virulence factors, toxins and antimicrobial resistance genes in metagenomic data. Additionally, PathoFact combines the prediction of these pathogenicity factors with the identification of mobile genetic elements. This provides further depth to the analysis by considering the genomic context of the pertinent genes. Furthermore, each module (virulence factors, toxin and antimicrobial resistance genes) of PathoFact is also a standalone component, making it a flexible and versatile tool. PathoFact is freely available online at https://git-r3lab.uni.lu/laura.denies/PathoFact.