Streamlining Foodborne Disease Surveillance with Open-Source Data Management Software

Objective: The “ledsmanageR”, a data management platform built in R, aims to improve the timeliness and accuracy of national foodborne surveillance data submitted to the Laboratory-based Enteric Disease Surveillance (LEDS) system by automating the data processing, validating, and reporting workflow.Introduction: The National Surveillance Team in the Enteric Diseases Epidemiology Branch of the Centers for Disease Control and Prevention (CDC) collects electronic data from all state and regional public health laboratories on human infections caused by Campylobacter, Salmonella, Shiga toxin-producing E. coli, and Shigella in LEDS. These data inform annual estimates of the burden of illness, assessments of patterns in bacterial subtypes, and can be used to describe trends in incidence. Robust digital infrastructure is required to process, validate, and summarize data on approximately 60,000 infections annually while optimizing use of financial and personnel resources.Methods: We leveraged the robust and extensible programming facilities of the R programming language and the active community of R users to develop a data integration, processing, and reporting pipeline for LEDS via an internal software package we named “ledsmanageR”. We designed all data retrieval, cleaning, and provisioning algorithms using tools from RStudio software packages1–3 and tracked changes to source code and data using CDC’s internal Gitlab server. We automated data validation requests to reporting partners by generating customizable emails directly from the R console4. We streamlined the data reconciliation process using OpenRefine5, a point-and-click tool for cleaning big data. We automated generation of annual reports, a process that was previously manual, using parameterized RMarkdown documents. Staff epidemiologists performed design and implementation internally, requiring no external consulting.Results: Developing our free and open-source software platform for national foodborne surveillance data management has saved the Enteric Diseases Epidemiology Branch thousands of dollars because we no longer depend on proprietary software requiring annual licensing fees. This transition occurred without any disruption in surveillance operations. Partial automation of email-based data validation and annual report generation processes reduced employee time requirement from one full-time position to one part-time position. The modular nature of ledsmanageR permitted LEDS to collect an expanded set of data elements with no changes to the core data processing and reporting workflow.Conclusions: We developed and implemented a flexible tool that helps maintain the integrity of surveillance data and reduces the need for manual data cleaning, which can be laborious and error-prone. The user-friendly design features of ledsmanageR demonstrate that data management can be optimized using programming skills that are increasingly common among epidemiologists. Our work on improving the accuracy and efficiency of enteric disease surveillance has served as a proof of concept for plans to streamline data processing for other surveillance systems.

Download Full-text

Use of a Business Approach to Improve Disease Surveillance Data Management Systems and Information Technology Process in Florida's Bureau of STD Prevention and Control

Public Health Reports ◽

10.1177/00333549091240s214 ◽

2009 ◽

Vol 124 (2_suppl) ◽

pp. 98-102 ◽

Cited By ~ 7

Author(s):

Stacy A. Shiver ◽

Karla Schmitt ◽

Adrian Cooksey

Keyword(s):

Information Technology ◽

Data Management ◽

Disease Surveillance ◽

Prevention And Control ◽

Surveillance Data ◽

Management Systems ◽

Data Management Systems ◽

Technology Process ◽

Business Approach ◽

And Control

Download Full-text

Evaluating the Timeliness of Enteric Disease Surveillance in British Columbia, Canada, 2012-13

Canadian Journal of Infectious Diseases and Medical Microbiology ◽

10.1155/2017/9854103 ◽

2017 ◽

Vol 2017 ◽

pp. 1-7 ◽

Cited By ~ 1

Author(s):

Eleni Galanis ◽

Marsha Taylor ◽

Kamila Romanowski ◽

Olga Bitzikos ◽

Jennifer Jeyes ◽

...

Keyword(s):

British Columbia ◽

Case Report ◽

Disease Surveillance ◽

International Literature ◽

Enteric Disease ◽

E Coli ◽

Enteric Diseases ◽

Laboratory Results ◽

Laboratory Information Systems ◽

And Control

Timely surveillance of enteric diseases is necessary to identify and control cases and outbreaks. Our objective was to evaluate the timeliness of enteric disease surveillance in British Columbia, Canada, compare these results to other settings, and recommend improvements. In 2012 and 2013, information was collected from case report forms and laboratory information systems on 2615Salmonella, shigatoxin-producingE. coli,Shigella, andListeriainfections. Twelve date variables representing the surveillance process from onset of symptoms to case interview and final laboratory results were collected, and intervals were measured. The median time from onset of symptoms to reporting subtyping results to BC epidemiologists was 26–36 days and from onset of symptoms to case interview was 12–14 days. Our findings were comparable to the international literature except for a longer time (up to 29 day difference) to reporting of PFGE results to epidemiologists in BC. Such a delay may impact our ability to identify and solve outbreaks. Several process and system changes were implemented which should improve the timeliness of enteric disease surveillance.

Download Full-text

Polio Eradication Initiative contribution in strengthening immunization and integrated disease surveillance data management in WHO African region, 2014

Vaccine ◽

10.1016/j.vaccine.2016.05.057 ◽

2016 ◽

Vol 34 (43) ◽

pp. 5181-5186 ◽

Cited By ~ 3

Author(s):

Alain Poy ◽

Etienne Minkoulou ◽

Keith Shaba ◽

Ali Yahaya ◽

Peter Gaturuku ◽

...

Keyword(s):

Data Management ◽

Disease Surveillance ◽

Polio Eradication ◽

Surveillance Data ◽

African Region

Download Full-text

Structuring and visualization of indicators in multidimensional data cubes

Informacionno-technologicheskij vestnik ◽

10.21499/2409-1650-2018-4-79-87 ◽

2018 ◽

pp. 79-87

Author(s):

E. E. Akimkina

Keyword(s):

Decision Making ◽

Data Processing ◽

Data Management ◽

Multidimensional Data ◽

End User ◽

Processing Technologies ◽

Data Cubes ◽

Multidimensional Visualization ◽

Multidimensional Data Cubes ◽

Practical Recommendations

The problems of structuring of indicators in multidimensional data cubes with their subsequent processing with the help of end-user tools providing multidimensional visualization and data management are analyzed; the possibilities of multidimensional data processing technologies for managing and supporting decision making at a design and technological enterprise are shown; practical recommendations on the use of domestic computer environments for the structuring and visualization of multidimensional data cubes are given.

Download Full-text

Performance Analysis of Implementation Model Architecture Reference and Master Data Management using Open Source Platform

2020 3rd International Conference on Information and Communications Technology (ICOIACT) ◽

10.1109/icoiact50329.2020.9332046 ◽

2020 ◽

Author(s):

Immanuela Christiantari Perdana ◽

Tien Fabrianti Kusumasari ◽

Ekky Novriza Alam

Keyword(s):

Performance Analysis ◽

Data Management ◽

Open Source ◽

Implementation Model ◽

Master Data ◽

Master Data Management

Download Full-text

Case fatality risk estimated from routinely collected disease surveillance data: application to COVID–19

Biostatistics & Epidemiology ◽

10.1080/24709360.2021.1913708 ◽

2021 ◽

pp. 1-20

Author(s):

Ian C. Marschner

Keyword(s):

Disease Surveillance ◽

Case Fatality ◽

Surveillance Data ◽

Fatality Risk ◽

Data Application ◽

Case Fatality Risk

Download Full-text

Shock tube data processing tools using open source hardware and software platforms

Engineering Reports ◽

10.1002/eng2.12353 ◽

2020 ◽

Author(s):

K. Thirumalesh ◽

Salgeri Puttaswamy Raju ◽

Hiriyur Mallaiah Somashekarappa ◽

Kumaraswamy Swaroop

Keyword(s):

Data Processing ◽

Shock Tube ◽

Open Source ◽

Software Platforms ◽

Open Source Hardware

Download Full-text

Timeliness of Enteric Disease Surveillance in 6 US States

Emerging Infectious Diseases ◽

10.3201/eid1402.070666 ◽

2008 ◽

Vol 14 (2) ◽

pp. 311-313 ◽

Cited By ~ 34

Author(s):

Craig W. Hedberg ◽

Jesse F. Greenblatt ◽

Bela T. Matyas ◽

Jennifer Lemmings ◽

Donald J. Sharp ◽

...

Keyword(s):

Disease Surveillance ◽

Enteric Disease ◽

Us States

Download Full-text

A python code for automatic construction of Fischer plots using proxy data

Scientific Reports ◽

10.1038/s41598-021-90017-9 ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Daming Yang ◽

Yongjian Huang ◽

Zongyang Chen ◽

Qinghua Huang ◽

Yanguang Ren ◽

...

Keyword(s):

Data Processing ◽

Open Source ◽

Lake Level ◽

Gamma Ray ◽

Data Series ◽

Proxy Data ◽

Cycle Number ◽

Automatic Construction ◽

Natural Gamma ◽

Graphic Representations

AbstractFischer plots are widely used in paleoenvironmental research as graphic representations of sea- and lake-level changes through mapping linearly corrected variation of accumulative cycle thickness over cycle number or stratum depth. Some kinds of paleoenvironmental proxy data (especially subsurface data, such as natural gamma-ray logging data), which preserve continuous cyclic signals and have been largely collected, are potential materials for constructing Fischer Plots. However, it is laborious to count the cycles preserved in these proxy data manually and map Fischer plots with these cycles. In this paper, we introduce an original open-source Python code “PyFISCHERPLOT” for constructing Fischer Plots in batches utilizing paleoenvironmental proxy data series. The principle of constructing Fischer plots based on proxy data, the data processing and usage of the PyFISCHERPLOT code and the application cases of the code are presented. The code is compared with existing methods for constructing Fischer plots.

Download Full-text

Current Status of Cyber Security in the Surveillance Data Processing Systems in Europe

2018 XIII International Scientific Conference - New Trends in Aviation Development (NTAD) ◽

10.1109/ntad.2018.8551678 ◽

2018 ◽

Cited By ~ 1

Author(s):

Marian Jancik ◽

Simon Holoda ◽

Milan DZUNDA ◽

Branislav Kandera

Keyword(s):

Data Processing ◽

Cyber Security ◽

Surveillance Data ◽

Current Status

Download Full-text