Integrating Heterogeneous Data for a Multi-disease Outbreak Detection Framework

SUMMARY: Antimicrobial resistance is a priority emerging public health threat, and the ability to promptly detect outbreaks caused by resistant pathogens is critical for resistance containment and disease control efforts. We describe and evaluate the use of an electronic laboratory data system (WHONET) and a space–time permutation scan statistic for semi-automated disease outbreak detection. In collaboration with WHONET-Argentina, the national network for surveillance of antimicrobial resistance, we applied the system to the detection of local and regional outbreaks of Shigella spp. We searched for clusters on the basis of genus, species, and resistance phenotype and identified 19 statistical ‘events’ in a 12-month period. Of the six known outbreaks reported to the Ministry of Health, four had good or suggestive agreement with SaTScan-detected events. The most discriminating analyses were those involving resistance phenotypes. Electronic laboratory-based disease surveillance incorporating statistical cluster detection methods can enhance infectious disease outbreak detection and response.


A new PSO-optimized geometry of spatial and spatio-temporal scan statistics for disease outbreak detection

Swarm and Evolutionary Computation ◽ 10.1016/j.swevo.2012.02.001 ◽ 2012 ◽ Vol 4 ◽ pp. 1-11 ◽ Cited By ~ 18

Author(s): Hesam Izakian ◽ Witold Pedrycz

Keyword(s): Disease Outbreak ◽ Scan Statistics ◽ Outbreak Detection ◽ Disease Outbreak Detection ◽ Spatio Temporal


A real-time temporal Bayesian architecture for event surveillance and its application to patient-specific multiple disease outbreak detection

Data Mining and Knowledge Discovery ◽ 10.1007/s10618-009-0151-4 ◽ 2009 ◽ Vol 20 (3) ◽ pp. 328-360 ◽ Cited By ~ 12

Author(s): Xia Jiang ◽ Gregory F. Cooper

Keyword(s): Real Time ◽ Disease Outbreak ◽ Outbreak Detection ◽ Patient Specific ◽ Disease Outbreak Detection


Using Ambulatory Syndromic Surveillance Data for Chronic Disease: A BMI Case Study

Online Journal of Public Health Informatics ◽ 10.5210/ojphi.v7i1.5723 ◽ 2015 ◽ Vol 7 (1)

Author(s): Andrew Walsh

Keyword(s): Syndromic Surveillance ◽ Disease Outbreak ◽ Surveillance Data ◽ Outbreak Detection ◽ Infectious Disease Outbreak ◽ Disease Outbreak Detection ◽ Health Domains ◽ Ambulatory Practice ◽ Practice Care

Ambulatory practice syndromic surveillance data needs to demonstrate utility beyond infectious disease outbreak detection to warrant integration into existing systems. The nature of ambulatory practice care makes it well suited for monitoring health domains not covered by emergency departments. This project demonstrates collection of height and weight measurements from ambulatory practice syndromic surveillance data. These data are used to calculate patient BMI, an important risk factor for many chronic diseases. This work is presented as a proof-of-principle for applying syndromic surveillance data to additional health domains.
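The BMI calculation mentioned in this abstract can be illustrated with a minimal sketch. It uses the standard definition (weight in kilograms divided by the square of height in metres). The record layout and the field names `weight_kg` and `height_m` are hypothetical, since the abstract does not describe how the measurements are stored or in which units they arrive.

```python
from typing import Iterable, List, Optional


def bmi(weight_kg: float, height_m: float) -> float:
    """Body mass index: weight in kilograms divided by height in metres squared."""
    if weight_kg <= 0 or height_m <= 0:
        raise ValueError("weight and height must be positive")
    return weight_kg / height_m ** 2


def bmi_for_records(records: Iterable[dict]) -> List[Optional[float]]:
    """Compute BMI for each record, returning None where a measurement is missing.

    The keys "weight_kg" and "height_m" are hypothetical field names
    chosen for this illustration only.
    """
    results: List[Optional[float]] = []
    for record in records:
        weight = record.get("weight_kg")
        height = record.get("height_m")
        if weight is None or height is None:
            results.append(None)  # incomplete visit record: no BMI
        else:
            results.append(bmi(weight, height))
    return results


if __name__ == "__main__":
    visits = [
        {"weight_kg": 70.0, "height_m": 1.75},
        {"weight_kg": 80.0},  # height not recorded
    ]
    print(bmi_for_records(visits))
```

Records with a missing measurement are kept as `None` rather than dropped, so the output stays aligned with the input visits.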


Supervised Learning for Automated Infectious-Disease-Outbreak Detection

Online Journal of Public Health Informatics ◽ 10.5210/ojphi.v11i1.9770 ◽ 2019 ◽ Vol 11 (1) ◽ Cited By ~ 1

Author(s): Stephane Ghozzi ◽ Benedikt Zacher ◽ Alexander Ullrich

Keyword(s): Infectious Disease ◽ Machine Learning ◽ Time Series ◽ Supervised Learning ◽ Expert Knowledge ◽ Disease Outbreak ◽ Outbreak Detection ◽ Infectious Disease Outbreak ◽ Count Time Series ◽ Disease Outbreak Detection

Objective: By systematically scoring algorithms and integrating outbreak data through statistical learning, evaluate and improve the performance of automated infectious-disease-outbreak detection. The improvements should be directly relevant to epidemiological practice. A broader objective is to explore the usefulness of machine-learning approaches in epidemiology.

Introduction: Within the traditional surveillance of notifiable infectious diseases in Germany, not only are individual cases reported to the Robert Koch Institute, but outbreaks themselves are also recorded: epidemiologists assign a label to each case, indicating whether it is part of an outbreak and, if so, which one. This expert knowledge represents, in the language of machine learning, a "ground truth" for the algorithmic task of detecting outbreaks from a stream of surveillance data. The integration of this kind of information in the design and evaluation of algorithms is called supervised learning.

Methods: Reported cases were aggregated weekly and divided into two count time series, one for endemic cases (not part of an outbreak) and one for epidemic cases. Two new algorithms were developed for the analysis of such time series. The first, farringtonOutbreak, is an adaptation of the standard method farringtonFlexible as implemented in the surveillance R package: it trains on endemic case counts but detects anomalies in total case counts. The second, hmmOutbreak, is based on a hidden Markov model (HMM): a binary hidden state indicates whether an outbreak was reported in a given week, the transition matrix for this state is learned from the outbreak data, and the state is integrated as a factor in a generalised linear model of the total case count. An explicit probability of being in a state of outbreak is then computed for each week (one week ahead), and a signal is generated if it is higher than a user-defined threshold.

To evaluate performance, we framed outbreak detection as a simple binary classification problem: Is there an outbreak in a given week, yes or no? Was a signal generated for this week, yes or no? One can thus count, for each time series, the true positives (outbreak data and signals agree), false positives, true negatives and false negatives. From those, classical performance scores can be computed, such as sensitivity, specificity, precision, F-score or area under the ROC curve (AUC).

For the evaluation with real-world data we used time series of reported cases of salmonellosis and campylobacteriosis for each of the 412 German counties over 9 years. We also ran simple simulations with different parameter sets, generating count time series and outbreaks with the sim.pointSource function of the surveillance R package.

Results: We have developed a supervised-learning framework for outbreak detection based on reported infections and outbreaks, proposing two algorithms and an evaluation method. hmmOutbreak performs overall much better than the standard farringtonFlexible, with e.g. a 60% improvement in sensitivity (0.5 compared to 0.3) at a fixed specificity of 0.9. The results were confirmed by simulations. Furthermore, the computation of explicit outbreak probabilities allows a better and clearer interpretation of detection results than the usual testing of the null hypothesis "is endemic".

Conclusions: Methods of machine learning can be usefully applied in the context of infectious-disease surveillance. Even a simple HMM shows large improvements and better interpretability; more refined methods, in particular semi-supervised approaches, therefore look very promising. The systematic integration of available expert knowledge, in this case the recording of outbreaks, allows an evaluation of algorithmic performance that is of direct relevance for epidemiological practice, in contrast to the usual intrinsic statistical metrics. Beyond that, this knowledge can be readily used to improve that performance and, in the future, gain insights into outbreak dynamics. Moreover, other types of labels will be similarly integrated in automated surveillance analyses, e.g. user feedback on whether a signal was relevant (reinforcement learning) or messages on specialised internet platforms that were found to be useful warnings of international epidemic events.
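The binary-classification evaluation described in this abstract (one outbreak label and one signal per week) can be sketched as follows. This is an illustration only, not the authors' code: the function name `score_signals`, the `Scores` container, and the toy weekly series are all assumptions made here, and the F-score is taken to be the balanced F1 score, which the abstract does not specify.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass
class Scores:
    tp: int
    fp: int
    tn: int
    fn: int
    sensitivity: Optional[float]
    specificity: Optional[float]
    precision: Optional[float]
    f1: Optional[float]


def _ratio(numerator: int, denominator: int) -> Optional[float]:
    """Return numerator/denominator, or None when the score is undefined."""
    return numerator / denominator if denominator else None


def score_signals(outbreak: Sequence[bool], signal: Sequence[bool]) -> Scores:
    """Compare weekly outbreak labels with weekly detection signals.

    outbreak[i] is True if an outbreak was recorded in week i;
    signal[i] is True if the algorithm raised a signal for week i.
    """
    if len(outbreak) != len(signal):
        raise ValueError("both series must cover the same weeks")
    tp = fp = tn = fn = 0
    for is_outbreak, is_signal in zip(outbreak, signal):
        if is_outbreak and is_signal:
            tp += 1
        elif is_signal:
            fp += 1
        elif is_outbreak:
            fn += 1
        else:
            tn += 1
    return Scores(
        tp=tp, fp=fp, tn=tn, fn=fn,
        sensitivity=_ratio(tp, tp + fn),       # share of outbreak weeks flagged
        specificity=_ratio(tn, tn + fp),       # share of endemic weeks left unflagged
        precision=_ratio(tp, tp + fp),         # share of signals that were outbreaks
        f1=_ratio(2 * tp, 2 * tp + fp + fn),   # harmonic mean of precision and sensitivity
    )


if __name__ == "__main__":
    # Toy ten-week series, invented for illustration.
    outbreak_weeks = [False, False, True, True, False, False, True, False, False, False]
    signal_weeks = [False, True, True, False, False, False, True, False, False, False]
    print(score_signals(outbreak_weeks, signal_weeks))
```

Scores whose denominator is zero (for example, sensitivity for a series with no recorded outbreak) are returned as `None` rather than as 0, so they can be excluded when averaging over many time series.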


First steps towards a sentinel network for Huanglongbing disease outbreak detection in the Mediterranean basin

Fruits ◽ 10.1051/fruits:2011036 ◽ 2011 ◽ Vol 66 (3) ◽ pp. A11-A12

Keyword(s): Mediterranean Basin ◽ Disease Outbreak ◽ Outbreak Detection ◽ Sentinel Network ◽ Disease Outbreak Detection ◽ The Mediterranean ◽ The Mediterranean Basin
