Comparing Methods for Record Linkage for Public Health Action: Matching Algorithm Validation Study (Preprint)

2019 ◽  
Author(s):  
Tigran Avoundjian ◽  
Julia C Dombrowski ◽  
Matthew R Golden ◽  
James P Hughes ◽  
Brandon L Guthrie ◽  
...  

BACKGROUND Many public health departments use record linkage between surveillance data and external data sources to inform public health interventions. However, little guidance is available to inform these activities, and many health departments rely on deterministic algorithms that may miss many true matches. In the context of public health action, these missed matches lead to missed opportunities to deliver interventions and may exacerbate existing health inequities. OBJECTIVE This study aimed to compare the performance of record linkage algorithms commonly used in public health practice. METHODS We compared five deterministic (exact, Stenger, Ocampo 1, Ocampo 2, and Bosh) and two probabilistic record linkage algorithms (fastLink and beta record linkage [BRL]) using simulations and a real-world scenario. We simulated pairs of datasets, varying the number of errors per record and the number of matching records between the two datasets (ie, overlap). We matched the datasets using each algorithm and calculated their recall (ie, sensitivity, the proportion of true matches identified by the algorithm) and precision (ie, positive predictive value, the proportion of matches identified by the algorithm that were true matches). We estimated the average computation time by performing a match with each algorithm 20 times while varying the size of the datasets being matched. In a real-world scenario, HIV and sexually transmitted disease surveillance data from King County, Washington, were matched to identify people living with HIV who had a syphilis diagnosis in 2017. We calculated the recall and precision of each algorithm compared with a composite standard based on the agreement in matching decisions across all the algorithms and manual review. RESULTS In simulations, BRL and fastLink maintained a high recall at nearly all data quality levels, while being comparable with deterministic algorithms in terms of precision. Deterministic algorithms typically failed to identify matches in scenarios with low data quality. All the deterministic algorithms had a shorter average computation time than the probabilistic algorithms. BRL had the slowest overall computation time (14 min when both datasets contained 2000 records). In the real-world scenario, BRL had the lowest trade-off between recall (309/309, 100.0%) and precision (309/312, 99.0%). CONCLUSIONS Probabilistic record linkage algorithms maximize the number of true matches identified, reducing gaps in the coverage of interventions and maximizing the reach of public health action.
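
The recall and precision definitions in the abstract can be checked against the counts reported for BRL in the real-world scenario. This is a minimal sketch; the function and variable names are illustrative and do not come from the study.

```python
def recall(true_matches_found: int, true_matches_total: int) -> float:
    """Proportion of true matches that the algorithm identified (sensitivity)."""
    return true_matches_found / true_matches_total


def precision(true_matches_found: int, matches_declared: int) -> float:
    """Proportion of declared matches that were true matches (PPV)."""
    return true_matches_found / matches_declared


# Counts reported for BRL in the real-world scenario:
# 309 of 309 true matches found; 309 of 312 declared matches were true.
brl_recall = recall(309, 309)
brl_precision = precision(309, 312)

print(f"recall = {brl_recall:.1%}, precision = {brl_precision:.1%}")
# prints: recall = 100.0%, precision = 99.0%
```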

10.2196/15917 ◽  
2020 ◽  
Vol 6 (2) ◽  
pp. e15917
Author(s):  
Tigran Avoundjian ◽  
Julia C Dombrowski ◽  
Matthew R Golden ◽  
James P Hughes ◽  
Brandon L Guthrie ◽  
...  

Background Many public health departments use record linkage between surveillance data and external data sources to inform public health interventions. However, little guidance is available to inform these activities, and many health departments rely on deterministic algorithms that may miss many true matches. In the context of public health action, these missed matches lead to missed opportunities to deliver interventions and may exacerbate existing health inequities. Objective This study aimed to compare the performance of record linkage algorithms commonly used in public health practice. Methods We compared five deterministic (exact, Stenger, Ocampo 1, Ocampo 2, and Bosh) and two probabilistic record linkage algorithms (fastLink and beta record linkage [BRL]) using simulations and a real-world scenario. We simulated pairs of datasets, varying the number of errors per record and the number of matching records between the two datasets (ie, overlap). We matched the datasets using each algorithm and calculated their recall (ie, sensitivity, the proportion of true matches identified by the algorithm) and precision (ie, positive predictive value, the proportion of matches identified by the algorithm that were true matches). We estimated the average computation time by performing a match with each algorithm 20 times while varying the size of the datasets being matched. In a real-world scenario, HIV and sexually transmitted disease surveillance data from King County, Washington, were matched to identify people living with HIV who had a syphilis diagnosis in 2017. We calculated the recall and precision of each algorithm compared with a composite standard based on the agreement in matching decisions across all the algorithms and manual review. Results In simulations, BRL and fastLink maintained a high recall at nearly all data quality levels, while being comparable with deterministic algorithms in terms of precision. Deterministic algorithms typically failed to identify matches in scenarios with low data quality. All the deterministic algorithms had a shorter average computation time than the probabilistic algorithms. BRL had the slowest overall computation time (14 min when both datasets contained 2000 records). In the real-world scenario, BRL had the lowest trade-off between recall (309/309, 100.0%) and precision (309/312, 99.0%). Conclusions Probabilistic record linkage algorithms maximize the number of true matches identified, reducing gaps in the coverage of interventions and maximizing the reach of public health action.
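
To illustrate why an exact deterministic rule can miss true matches when records contain errors, the sketch below contrasts exact agreement on every field with a rule that tolerates one disagreeing field. Both rules, the field names, and the records are hypothetical. They are not the Stenger, Ocampo, or Bosh algorithms, and they are not the fastLink or BRL models, which take a probabilistic approach rather than counting agreements.

```python
FIELDS = ("first_name", "last_name", "dob")


def exact_match(a: dict, b: dict) -> bool:
    """Deterministic rule: every field must agree exactly."""
    return all(a[f] == b[f] for f in FIELDS)


def tolerant_match(a: dict, b: dict, max_disagreements: int = 1) -> bool:
    """Hypothetical relaxed rule: allow a limited number of disagreeing fields."""
    disagreements = sum(a[f] != b[f] for f in FIELDS)
    return disagreements <= max_disagreements


# Made-up records for the same person; the second has a typo in the last name.
surveillance = {"first_name": "maria", "last_name": "lopez", "dob": "1980-04-02"}
external = {"first_name": "maria", "last_name": "lopes", "dob": "1980-04-02"}

print(exact_match(surveillance, external))     # False: the typo breaks the exact rule
print(tolerant_match(surveillance, external))  # True: one disagreement is tolerated
```

A relaxed rule like this recovers the missed pair, but it can also link records of different people, which is the recall and precision trade-off the study measures.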


2020 ◽  
Author(s):  
Ignacio Garitano ◽  
Manuel Linares ◽  
Laura Santos ◽  
Ruth Gil ◽  
Elena Lapuente ◽  
...  

UNSTRUCTURED On 28th February, a case of COVID-19 was declared in Araba-Álava province, Spain. The Spanish Government established confinement and movement restrictions on 14th March 2020. We implemented a web-based tool to estimate the number of cases during the pandemic. We present the results for Araba-Álava province. We reached a response rate of 10.3% out of a population of 331,549. We found that 22.4% fulfilled the case definition. This tool proved useful for informing public health action.


Author(s):  
David A Savitz

Abstract Interpreting the results of epidemiologic studies calls for objectivity and rigorous scrutiny, acknowledging the limitations that temper the applicability of the findings to public health action. Current trends have posed new challenges to balancing the goal of scientific objectivity and validity with public health applications. The ongoing tension between epidemiology’s aspirations and capability has several sources: the need to overpromise in research proposals, compromising methodologic rigor because of public health importance, defending findings in the face of hostile critics, and appealing to core constituencies who have specific expectations from the research.


BMJ Open ◽  
2021 ◽  
Vol 11 (6) ◽  
pp. e048042
Author(s):  
Andrew Hayward ◽  
Ellen Fragaszy ◽  
Jana Kovar ◽  
Vincent Nguyen ◽  
Sarah Beale ◽  
...  

Introduction The coronavirus (COVID-19) pandemic has caused significant global mortality and impacted lives around the world. Virus Watch aims to provide evidence on which public health approaches are most likely to be effective in reducing transmission and impact of the virus, and will investigate community incidence, symptom profiles and transmission of COVID-19 in relation to population movement and behaviours. Methods and analysis Virus Watch is a household community cohort study of acute respiratory infections in England and Wales and will run from June 2020 to August 2021. The study aims to recruit 50 000 people, including 12 500 from minority ethnic backgrounds, for an online survey cohort and monthly antibody testing using home fingerprick test kits. Nested within this larger study will be a subcohort of 10 000 individuals, including 3000 people from minority ethnic backgrounds. This cohort of 10 000 people will have full blood serology taken between October 2020 and January 2021 and repeat serology between May 2021 and August 2021. Participants will also post self-administered nasal swabs for PCR assays of SARS-CoV-2 and will follow one of three different PCR testing schedules based on symptoms. Ethics and dissemination This study has been approved by the Hampstead National Health Service (NHS) Health Research Authority Ethics Committee (ethics approval number 20/HRA/2320). We are monitoring participant queries and using these to refine methodology where necessary, and are providing summaries and policy briefings of our preliminary findings to inform public health action by working through our partnerships with our study advisory group, Public Health England, NHS and government scientific advisory panels.


2018 ◽  
Vol 46 (22_suppl) ◽  
pp. 48-57 ◽  
Author(s):  
Ditte H. Holt ◽  
Gemma Carey ◽  
Morten H. Rod

Aims: This paper examines the role of organizational structure within government(s) in attempts to implement intersectoral action for health in Danish municipalities. We discuss the implications of structural reorganization and the governance structures that are established in order to ensure coordination and integration between policy sectors. Methods: The paper is based on 49 interviews with civil servants from health and non-health sectors of 10 municipalities. Based on participants’ experiences, cases have been described and analyzed in an iterative process consulting the literature on Health in All Policies and joined-up government. Results: Continuous and frequent processes of reorganizing were widespread in the municipalities. However, they appeared to have little effect on policy change. The two most common governance structures established to transcend organizational boundaries were the central unit and the intersectoral committee. According to the experiences of participants, paradoxically both of these organizational solutions tend to reproduce the organizational problems they are intended to overcome. Even if structural reorganization may succeed in dissolving some sector boundaries, it will inevitably create new ones. Conclusions: It is time to dismiss the idea that intersectoral action for health can be achieved by means of a structural fix. Rather than rearranging organizational boundaries it may be more useful to seek to manage the silos which exist in any organization, e.g. by promoting awareness of their implications for public health action and by enhancing the boundary spanning skills of public health officers.


2010 ◽  
Vol 362 (7) ◽  
pp. 650-652 ◽  
Author(s):  
Lawrence J. Appel ◽  
Cheryl A.M. Anderson
