scholarly journals Machine learning models to identify patient and microbial genetic factors associated with carbapenem-resistant Klebsiella pneumoniae infection

Author(s):  
Zena Lapp ◽  
Jennifer Han ◽  
Jenna Wiens ◽  
Ellie JC Goldstein ◽  
Ebbing Lautenbach ◽  
...  

AbstractBackgroundAmong patients colonized with carbapenem-resistant Klebsiella pneumoniae (CRKP), only a subset develop clinical infection. While patient characteristics may influence risk for infection, it remains unclear if the genetic background of CRKP strains contributes to this risk. We applied machine learning to quantify the capacity of patient characteristics and microbial genotypes to discriminate infection and colonization, and identified patient and microbial features associated with infection across multiple healthcare facilities.MethodsMachine learning models were built using whole-genome sequences and clinical metadata from 331 patients colonized or infected with CRKP across 21 long-term acute care hospitals. To quantify variation in performance, we built models using 100 different train/test splits of the entire dataset, and urinary and respiratory site-specific subsets, and evaluated predictive performance on each test split using the area under the receiver operating characteristics curve (AUROC). Patient and microbial features predictive of infection were identified as those consistently important for predicting infection based on average change in AUROC when included in the model.FindingsWe found that patient and genomic features were only weakly predictive of clinical CRKP infection vs. colonization (AUROC IQRs: patient=0·59-0·68, genomic=0·55-0·61, combined=0·62-0·68), and that one feature set did not consistently outperform the other (genomic vs. patient p=0·4). Comparable model performances were observed for anatomic site-specific models (combined AUROC IQRs: respiratory=0·61-0·71, urinary=0·54-0·64). Strong genomic predictors of infection included the presence of the ICEKp10 mobile genetic element carrying an iron acquisition system (yersiniabactin) and a toxin (colibactin), along with disruption of an O-antigen biosynthetic gene in a sub-lineage of the epidemic ST258 clone. Teasing apart sequential evolutionary steps in the context of clinical metadata indicated that altered O-antigen biosynthesis increased association with the respiratory tract, and subsequent acquisition of ICEKp10 was associated with increased virulence.InterpretationOur results support the need for rigorous machine learning frameworks to gain realistic estimates of the performance of clinical models of infection. Moreover, integrating microbial genomic and clinical data using such a framework can help tease apart the contribution of microbial genetic variation to clinical outcomes.FundingCenters for Disease Control and Prevention, National Institutes of Health, National Science FoundationResearch in contextEvidence before this studyWe searched PubMed for “crkp” OR “carbapenem resistant klebsiella pneumoniae” AND “infection” AND “machine learning” for papers published up to April 14, 2020 and found no results. Substituting “machine learning” with “bacterial genome-wide association studies” produced one relevant paper investigating pathogenicity-associated loci in K. pneumoniae clinical isolates. When we searched for “infection” AND “machine learning” AND “genom*” AND “clinical”, there was one relevant result - a study that used clinical and bacterial genomic features in a machine learning model to identify clonal differences related to Staphylococcus aureus infection outcome.Added value of this studyTo our knowledge, this is the first study to integrate clinical and genomic data to study anatomic site-specific colonization and infection across multiple healthcare facilities. Using this method, we identified clinical features associated with CRKP infection, as well as a sub-lineage of CRKP with potentially altered niche-specific adaptation and virulence. This method could be used for other organisms and other clinical outcomes to evaluate performance of predictive models and identify features that are consistently associated with clinical outcomes of interest across facilities or geographic regions.Implications of all the available evidenceFew studies have combined patient and microbial genomic data to study important clinical outcomes. However, those that have done this, including ours, have identified clinical and/or genomic features associated with the outcome of interest that provide a foundation for future epidemiological, clinical, and biological studies to better understand bacterial infections and clinical outcomes.

mSystems ◽  
2021 ◽  
Vol 6 (2) ◽  
Author(s):  
Zena Lapp ◽  
Jennifer H. Han ◽  
Jenna Wiens ◽  
Ellie J. C. Goldstein ◽  
Ebbing Lautenbach ◽  
...  

ABSTRACT Carbapenem-resistant Klebsiella pneumoniae (CRKP) is a critical-priority antibiotic resistance threat that has emerged over the past several decades, spread across the globe, and accumulated resistance to last-line antibiotic agents. While CRKP infections are associated with high mortality, only a subset of patients acquiring CRKP extraintestinal colonization will develop clinical infection. Here, we sought to ascertain the relative importance of patient characteristics and CRKP genetic background in determining patient risk of infection. Machine learning models classifying colonization versus infection were built using whole-genome sequences and clinical metadata from a comprehensive set of 331 CRKP extraintestinal isolates collected across 21 long-term acute-care hospitals over the course of a year. Model performance was evaluated based on area under the receiver operating characteristic curve (AUROC) on held-out test data. We found that patient and genomic features were predictive of clinical CRKP infection to similar extents (AUROC interquartile ranges [IQRs]: patient = 0.59 to 0.68, genomic = 0.55 to 0.61, combined = 0.62 to 0.68). Patient predictors of infection included the presence of indwelling devices, kidney disease, and length of stay. Genomic predictors of infection included presence of the ICEKp10 mobile genetic element carrying the yersiniabactin iron acquisition system and disruption of an O-antigen biosynthetic gene in a sublineage of the epidemic ST258 clone. Altered O-antigen biosynthesis increased association with the respiratory tract, and subsequent ICEKp10 acquisition was associated with increased virulence. These results highlight the potential of integrated models including both patient and microbial features to provide a more holistic understanding of patient clinical trajectories and ongoing within-lineage pathogen adaptation. IMPORTANCE Multidrug-resistant organisms, such as carbapenem-resistant Klebsiella pneumoniae (CRKP), colonize alarmingly large fractions of patients in regions of endemicity, but only a subset of patients develop life-threatening infections. While patient characteristics influence risk for infection, the relative contribution of microbial genetic background to patient risk remains unclear. We used machine learning to determine whether patient and/or microbial characteristics can discriminate between CRKP extraintestinal colonization and infection across multiple health care facilities and found that both patient and microbial factors were predictive. Examination of informative microbial genetic features revealed variation within the ST258 epidemic lineage that was associated with respiratory tract colonization and increased rates of infection. These findings indicate that circulating genetic variation within a highly prevalent epidemic lineage of CRKP influences patient clinical trajectories. In addition, this work supports the need for future studies examining the microbial genetic determinants of clinical outcomes in human populations, as well as epidemiologic and experimental follow-ups of identified features to discern generalizability and biological mechanisms.


2019 ◽  
Author(s):  
Qiqiang Liang ◽  
Fang Qian ◽  
Yibing Chen ◽  
Zhijun Xu ◽  
Zhijiang Xu ◽  
...  

Abstract Purpose To establish mortality prediction models in 14 days of Carbapenem-Resistant Klebsiella Pneumoniae bacteremia using Machine learning.Materials and Methods It is a single-center retrospective study. We collect the relevant clinical information of all patients with Carbapenem-Resistant Klebsiella Pneumoniae (CRKP) bacteremia in the past 5 years using the local database. Data analysis and verification are carried out by multiple logical regression, decision tree, random forest, support vector machine (SVM), and XGBoost.Result This study includes 187 patients with 40 related variables. In multiple logical regression, acute renal injury (P=0.003), Apache II score (P=0.036), immunodeficiency (P=0.025), severe thrombocytopenia (P=0.025) and septic shock (P=0.044) are the high-risk factors for 14 days mortality of CRKP bloodstream infections. According to the importance of those parameters, risk scoring is established to predict the survival rate of CRKP bacteremia. The analysis of the five models, with 70% training set and 30% test set, show the comprehensive performance of random forest (AUROC=0.953, precision=91.85%) is slightly better than that of XGBoost (AUROC=0.912, precision=86.41%) and SVM (AUROC=0.936, precision=79.89%) in predicting 14-day mortality of CRKP bacteremia. The multiple logical regression model (AUROC=0.825, precision=81.52%) is the second, and the decision tree model (AUROC=0.712, precision=79.89%) is not very ideal.Conclusion Machine learning has good performances in predicting 14-day mortality of CRKP bacteremia than multiple logical regression. Acute renal injury, severe thrombocytopenia, and septic shock are the high-risk factors of CRKP bacteremia mortality.


2021 ◽  
Vol 118 (48) ◽  
pp. e2110227118
Author(s):  
Melissa J. Martin ◽  
Brendan W. Corey ◽  
Filomena Sannio ◽  
Lindsey R. Hall ◽  
Ulrike MacDonald ◽  
...  

A protracted outbreak of New Delhi metallo-β-lactamase (NDM)–producing carbapenem-resistant Klebsiella pneumoniae started in Tuscany, Italy, in November 2018 and continued in 2020 and through 2021. To understand the regional emergence and transmission dynamics over time, we collected and sequenced the genomes of 117 extensively drug-resistant, NDM-producing K. pneumoniae isolates cultured over a 20-mo period from 76 patients at several healthcare facilities in southeast Tuscany. All isolates belonged to high-risk clone ST-147 and were typically nonsusceptible to all first-line antibiotics. Albeit sporadic, resistances to colistin, tigecycline, and fosfomycin were also observed as a result of repeated, independent mutations. Genomic analysis revealed that ST-147 isolates circulating in Tuscany were monophyletic and highly genetically related (including a network of 42 patients from the same hospital and sharing nearly identical isolates), and shared a recent ancestor with clinical isolates from the Middle East. While the blaNDM-1 gene was carried by an IncFIB-type plasmid, our investigations revealed that the ST-147 lineage from Italy also acquired a hybrid IncFIB/IncHIB–type plasmid carrying the 16S methyltransferase armA gene as well as key virulence biomarkers often found in hypervirulent isolates. This plasmid shared extensive homologies with mosaic plasmids circulating globally including from ST-11 and ST-307 convergent lineages. Phenotypically, the carriage of this hybrid plasmid resulted in increased siderophore production but did not confer virulence to the level of an archetypical, hypervirulent K. pneumoniae in a subcutaneous model of infection with immunocompetent CD1 mice. Our findings highlight the importance of performing genomic surveillance to identify emerging threats.


Author(s):  
Maria Burgos-Garay ◽  
Christine Ganim ◽  
Tom J.B. de Man ◽  
Terri Davy ◽  
Amy J. Mathers ◽  
...  

Abstract Background: Sink drains in healthcare facilities may provide an environment for antimicrobial-resistant microorganisms, including carbapenemase-producing Klebsiella pneumoniae (CPKP). Methods: We investigated the colonization of a biofilm consortia by CPKP in a model system simulating a sink-drain P-trap. Centers for Disease Control (CDC) biofilm reactors (CBRs) were inoculated with microbial consortia originally recovered from 2 P-traps collected from separate patient rooms (designated rooms A and B) in a hospital. Biofilms were grown on stainless steel (SS) or polyvinyl chloride (PVC) coupons in autoclaved municipal drinking water (ATW) for 7 or 28 days. Results: Microbial communities in model systems (designated CBR-A or CBR-B) were less diverse than communities in respective P-traps A and B, and they were primarily composed of β and γ Proteobacteria, as determined using 16S rRNA community analysis. Following biofilm development CBRs were inoculated with either K. pneumoniae ST45 (ie, strain CAV1016) or K. pneumoniae ST258 KPC+ (ie, strain 258), and samples were collected over 21 days. Under most conditions tested (CBR-A: SS, 7-day biofilm; CBR-A: PVC, 28-day biofilm; CBR-B: SS, 7-day and 28-day biofilm; CBR-B: PVC, 28-day biofilm) significantly higher numbers of CAV1016 were observed compared to 258. CAV1016 showed no significant difference in quantity or persistence based on biofilm age (7 days vs 28 days) or substratum type (SS vs PVC). However, counts of 258 were significantly higher on 28-day biofilms and on SS. Conclusions: These results suggest that CPKP persistence in P-trap biofilms may be strain specific or may be related to the type of P-trap material or age of the biofilm.


PLoS ONE ◽  
2021 ◽  
Vol 16 (2) ◽  
pp. e0247058
Author(s):  
Catarina Ferreira ◽  
Santosh K. Bikkarolla ◽  
Karolin Frykholm ◽  
Saga Pohjanen ◽  
Margarida Brito ◽  
...  

Carbapenem-resistant Klebsiella pneumoniae are a major global threat in healthcare facilities. The propagation of carbapenem resistance determinants can occur through vertical transmission, with genetic elements being transmitted by the host bacterium, or by horizontal transmission, with the same genetic elements being transferred among distinct bacterial hosts. This work aimed to track carbapenem resistance transmission by K. pneumoniae in a healthcare facility. The study involved a polyphasic approach based on conjugation assays, resistance phenotype and genotype analyses, whole genome sequencing, and plasmid characterization by pulsed field gel electrophoresis and optical DNA mapping. Out of 40 K. pneumoniae clinical isolates recovered over two years, five were carbapenem- and multidrug-resistant and belonged to multilocus sequence type ST147. These isolates harboured the carbapenemase encoding blaKPC-3 gene, integrated in conjugative plasmids of 140 kbp or 55 kbp, belonging to replicon types incFIA/incFIIK or incN/incFIIK, respectively. The two distinct plasmids encoding the blaKPC-3 gene were associated with distinct genetic lineages, as confirmed by optical DNA mapping and whole genome sequence analyses. These results suggested vertical (bacterial strain-based) transmission of the carbapenem-resistance genetic elements. Determination of the mode of transmission of antibiotic resistance in healthcare facilities, only possible based on polyphasic approaches as described here, is essential to control resistance propagation.


2015 ◽  
Vol 59 (9) ◽  
pp. 5226-5231 ◽  
Author(s):  
Luigi Garbari ◽  
Marina Busetti ◽  
Lucilla Dolzani ◽  
Vincenzo Petix ◽  
Anna Knezevich ◽  
...  

ABSTRACTHere, we report the first detection of aKlebsiella pneumoniaecarbapenemase 2 (KPC-2)-producingKlebsiella pneumoniaestrain belonging to sequence type 833 (ST833), collected in an Italian hospital from a patient coming from South America. ItsblaKPCdeterminant was carried by a ColE1 plasmid, pKBuS13, that showed the Tn4401b::blaKPC-2transposon inserted into the regulatory region of an Xer site-specific recombination locus. This interfered with the correct resolution of plasmid multimers into monomers, lowering plasmid stability and leading to overestimation of the number of plasmids harbored by a single host cell. Sequencing of the fragments adjacent to Tn4401bdetected a region that did not have significant matches in databases other than the genome of a carbapenem-resistantEscherichia colistrain collected during the same year at a hospital in Boston. This is interesting in an epidemiologic context, as it suggests that despite the absence oftragenes and the instability under nonselective conditions, the circulation of pKBuS13 or of analogous plasmids might be wider than reported.


2016 ◽  
Vol 3 (1) ◽  
Author(s):  
Sorabh Dhar ◽  
Emily T. Martin ◽  
Paul R. Lephart ◽  
John P. McRoberts ◽  
Teena Chopra ◽  
...  

Abstract A “high risk” clone of carbapenem-resistant Klebsiella pneumoniae (CRKP) identified by multilocus sequence typing (MLST) as sequence type (ST) 258 has disseminated worldwide. As the molecular epidemiology of the CRE pandemic continues to evolve, the clinical impact of non-ST258 strains is less well defined. We conducted an epidemiological investigation of CRKP based on strains MLST. Among 68 CRKP patients, 61 were ST258 and 7 belonged to non-ST258. Klebsiella pneumoniae ST258 strains were significantly associated with blaKPC production and with resistance to an increased number of antimicrobials. Clinical outcomes were not different. Based on this analysis, one cannot rely solely on the presence of blaKPC in order to diagnose CRKP.


Sign in / Sign up

Export Citation Format

Share Document