scholarly journals Overcoming the pitfalls of automatic interpretation of whole genome sequencing data by online tools for the prediction of pyrazinamide resistance in Mycobacterium tuberculosis

PLoS ONE ◽  
2019 ◽  
Vol 14 (2) ◽  
pp. e0212798 ◽  
Author(s):  
Tomotada Iwamoto ◽  
Yoshiro Murase ◽  
Shiomi Yoshida ◽  
Akio Aono ◽  
Makoto Kuroda ◽  
...  
2021 ◽  
Author(s):  
Einar Gabbasov ◽  
Miguel Moreno-Molina ◽  
Iñaki Comas ◽  
Maxwell Libbrecht ◽  
Leonid Chindelevitch

AbstractThe occurrence of multiple strains of a bacterial pathogen such as M. tuberculosis or C. difficile within a single human host, referred to as a mixed infection, has important implications for both healthcare and public health. However, methods for detecting it, and especially determining the proportion and identities of the underlying strains, from WGS (whole-genome sequencing) data, have been limited.In this paper we introduce SplitStrains, a novel method for addressing these challenges. Grounded in a rigorous statistical model, SplitStrains not only demonstrates superior performance in proportion estimation to other existing methods on both simulated as well as real M. tuberculosis data, but also successfully determines the identity of the underlying strains.We conclude that SplitStrains is a powerful addition to the existing toolkit of analytical methods for data coming from bacterial pathogens, and holds the promise of enabling previously inaccessible conclusions to be drawn in the realm of public health microbiology.Author summaryWhen multiple strains of a pathogenic organism are present in a patient, it may be necessary to not only detect this, but also to identify the individual strains. However, this problem has not yet been solved for bacterial pathogens processed via whole-genome sequencing. In this paper, we propose the SplitStrains algorithm for detecting multiple strains in a sample, identifying their proportions, and inferring their sequences, in the case of Mycobacterium tuberculosis. We test it on both simulated and real data, with encouraging results. We believe that our work opens new horizons in public health microbiology by allowing a more precise detection, identification and quantification of multiple infecting strains within a sample.


Data in Brief ◽  
2019 ◽  
Vol 26 ◽  
pp. 104445 ◽  
Author(s):  
Jaeyres Jani ◽  
Zainal Arifin Mustapha ◽  
Norfazirah Binti Jamal ◽  
Cheronie Shely Stanis ◽  
Chin Kai Ling ◽  
...  

2020 ◽  
Vol 64 (5) ◽  
Author(s):  
Theresa Enkirch ◽  
Jim Werngren ◽  
Ramona Groenheit ◽  
Erik Alm ◽  
Reza Advani ◽  
...  

ABSTRACT In this retrospective study, whole-genome sequencing (WGS) data generated on an Ion Torrent platform was used to predict phenotypic drug resistance profiles for first- and second-line drugs among Swedish clinical Mycobacterium tuberculosis isolates from 2016 to 2018. The accuracy was ∼99% for all first-line drugs and 100% for four second-line drugs. Our analysis supports the introduction of WGS into routine diagnostics, which might, at least in Sweden, replace phenotypic drug susceptibility testing in the future.


2019 ◽  
Vol 57 (6) ◽  
Author(s):  
R. C. Jones ◽  
L. G. Harris ◽  
S. Morgan ◽  
M. C. Ruddy ◽  
M. Perry ◽  
...  

ABSTRACT An inability to standardize the bioinformatic data produced by whole-genome sequencing (WGS) has been a barrier to its widespread use in tuberculosis phylogenetics. The aim of this study was to carry out a phylogenetic analysis of tuberculosis in Wales, United Kingdom, using Ridom SeqSphere software for core genome multilocus sequence typing (cgMLST) analysis of whole-genome sequencing data. The phylogenetics of tuberculosis in Wales have not previously been studied. Sixty-six Mycobacterium tuberculosis isolates (including 42 outbreak-associated isolates) from south Wales were sequenced using an Illumina platform. Isolates were assigned to principal genetic groups, single nucleotide polymorphism (SNP) cluster groups, lineages, and sublineages using SNP-calling protocols. WGS data were submitted to the Ridom SeqSphere software for cgMLST analysis and analyzed alongside 179 previously lineage-defined isolates. The data set was dominated by the Euro-American lineage, with the sublineage composition being dominated by T, X, and Haarlem family strains. The cgMLST analysis successfully assigned 58 isolates to major lineages, and the results were consistent with those obtained by traditional SNP mapping methods. In addition, the cgMLST scheme was used to resolve an outbreak of tuberculosis occurring in the region. This study supports the use of a cgMLST method for standardized phylogenetic assignment of tuberculosis isolates and for outbreak resolution and provides the first insight into Welsh tuberculosis phylogenetics, identifying the presence of the Haarlem sublineage commonly associated with virulent traits.


BMC Genomics ◽  
2020 ◽  
Vol 21 (1) ◽  
Author(s):  
Christine Anyansi ◽  
Arlin Keo ◽  
Bruce J. Walker ◽  
Timothy J. Straub ◽  
Abigail L. Manson ◽  
...  

2019 ◽  
Author(s):  
Christine Anyansi ◽  
Arlin Keo ◽  
Bruce Walker ◽  
Timothy J. Straub ◽  
Abigail L. Manson ◽  
...  

AbstractBackgroundMixed infections of Mycobacterium tuberculosis, and antibiotic heteroresistance, continue to complicate tuberculosis (TB) diagnosis and treatment. Detection of mixed infections has been limited to molecular genotyping techniques, which lack the sensitivity and resolution to accurately estimate the multiplicity of TB infections. In contrast, whole genome sequencing offers sensitive views of the genetic differences between strains of M. tuberculosis within a sample. Although metagenomic tools exist to classify strains in a metagenomic sample, most tools have been developed for more divergent species, and therefore cannot provide the sensitivity required to disentangle strains within closely related bacterial species such as M. tuberculosis.Here we present QuantTB, a method to identify and quantify individual M. tuberculosis strains in whole genome sequencing data. QuantTB uses SNP markers to determine the combination of strains that best explain the allelic variation observed in a sample. QuantTB outputs a list of identified strains, their corresponding relative abundances, as well as a list of drugs for which resistance-conferring mutations (or heteroresistance) has been predicted within the sample.ResultsWe show that QuantTB has a high degree of resolution, and is capable of differentiating communities differing by less than 25 SNPs and identifying strains down to 1× coverage. Using simulated data, we found QuantTB outperformed other metagenomic strain identification tools at detecting strains and quantifying strain multiplicity. In a real-world scenario, using a dataset of paired clinical isolates from a study of patients with either reinfections or relapses, we found that QuantTB could detect mixed infections and reinfections at rates concordant with a manually curated approach.ConclusionQuantTB can determine infection multiplicity, identify hetero-resistance patterns, enable differentiation between relapse and re-infection, and clarify transmission events across seemingly unrelated patients – even in low-coverage (1x) samples. QuantTB outperforms existing tools and promises to serve as a valuable resource for both clinicians and researchers working with clinical TB samples.


Sign in / Sign up

Export Citation Format

Share Document