validation sampling
Recently Published Documents


TOTAL DOCUMENTS

19
(FIVE YEARS 1)

H-INDEX

7
(FIVE YEARS 0)

BMC Medicine ◽  
2021 ◽  
Vol 19 (1) ◽  
Author(s):  
Rena C. Patel ◽  
Gustavo Amorim ◽  
Beatrice Jakait ◽  
Bryan E. Shepherd ◽  
A. Rain Mocello ◽  
...  

Abstract Background Preventing unintended pregnancies is paramount for women living with HIV (WLHIV). Previous studies have suggested that efavirenz-containing antiretroviral therapy (ART) reduces contraceptive effectiveness of implants, but there are uncertainties regarding the quality of the electronic medical record (EMR) data used in these prior studies. Methods We conducted a retrospective, cohort study of EMR data from 2011 to 2015 among WLHIV of reproductive age accessing HIV care in public facilities in western Kenya. We validated a large subsample of records with manual chart review and telephone interviews. We estimated adjusted incidence rate ratios (aIRRs) with Poisson regression accounting for the validation sampling using inverse probability weighting and generalized raking. Results A total of 85,324 women contributed a total of 170,845 women-years (w-y) of observation time; a subset of 5080 women had their charts reviewed, and 1285 underwent interviews. Among implant users, the aIRR of pregnancy for efavirenz- vs. nevirapine-containing ART was 1.9 (95% CI 1.6, 2.4) using EMR data only and 3.2 (95% CI 1.8, 5.7) when additionally using both chart review and interview validated data. Among efavirenz users, the aIRR of pregnancy for depomedroxyprogesterone acetate (DMPA) vs. implant use was 1.8 (95% CI 1.5, 2.1) in EMR only and 2.4 (95% CI 1.0, 6.1) using validated data. Conclusion Pregnancy rates are higher when contraceptive implants are concomitantly used with efavirenz-containing ART, though rates were similar to leading alternative contraceptive methods such as DMPA. Our data provides policymakers, program staff, and WLHIV greater confidence in guiding their decision-making around contraceptive and ART options. Our novel, 3-phase validation sampling provides an innovative tool for using routine EMR data to improve the robustness of data quality.



2020 ◽  
Vol 83 (10) ◽  
pp. 1656-1666 ◽  
Author(s):  
GENEVIEVE SULLIVAN ◽  
MARTIN WIEDMANN

ABSTRACT Listeria monocytogenes (LM) contamination of produce can often be traced back to the environment of packinghouses and fresh-cut facilities. Because there is limited information on the detection, prevalence, and distribution of this pathogen in produce operations, environmental “routine sampling” plans for LM and other Listeria spp. were developed and implemented in three packinghouses and five fresh-cut facilities in the United States. For routine sampling, a total of 2,014 sponge samples were collected over six to eight separate samplings per operation, performed over 1 year; vector and preproduction samples (n = 156) were also collected as needed to follow up on positive findings. In addition, a single “validation sampling” visit by an outside expert was used to evaluate the routine sampling. Among the 2,014 routine sponge samples collected, 35 and 30 were positive for LM and Listeria species other than LM (LS), respectively. LM prevalence varied from 0.8 to 5.8% for packinghouses and <0.4 to 1.6% for fresh-cut facilities. Among the 394 validation sponge samples, 23 and 13 were positive for LM and LS, respectively. Validation sampling found statistically significantly higher LM prevalence compared with routine sampling for three of eight operations. For all samples collected, up to eight isolates per sample were characterized by sequencing of sigB, which allowed for classification into sigB allelic types. Among the 97 samples with more than one Listeria isolate characterized, 28 had more than one sigB allelic type present, including 18 sponges that were positive for LM and another Listeria species and 13 sponges that were positive for more than one LM subtype. This indicates that collection of multiple isolates is necessary to capture Listeria diversity present in produce operations. Additionally, 17 of 77 sponges that were positive for LM were positive at only one enrichment time (i.e., 24 or 48 h), indicating that LM testing after two different enrichment times provides enhanced sensitivity. HIGHLIGHTS



2020 ◽  
Vol 4 (2) ◽  
pp. 1006-1016 ◽  
Author(s):  
Stephanie K Muir ◽  
Nick P Linden ◽  
Andrew Kennedy ◽  
Grace Calder ◽  
Gavin Kearney ◽  
...  

Abstract The development of feeding systems that can individually measure and control feed intake in a group-housed environment would allow a greater understanding of sheep intake without compromising animal welfare and behavior through the removal of social interactions between sheep. This study validated an automated feeding system for measuring feed intake of individual sheep when housed in groups. Validation of the feeding system was conducted during three separate experiments. The validation sampling involved the activation of four individual “feed events,” whereby four separate samples weighing approximately 50, 100, 200, and 400 g were removed from each feeder, with each feed event being linked to a specific radio frequency identification (RFID) tag. The feeder validation experiments evaluated the ability of the feeding system to 1) create a unique feed event every time a sample of pellets was collected from the feeder, 2) link the feed event to the correct RFID, and 3) accurately record the weight of feed that was manually removed. All feed events were initiated and logged in the feeding system with 100% of the events being linked to the correct test RFID. Concordance correlation coefficients between the feeding system-recorded feed weight and the manually removed weight were 0.99 within all three experiments. There was also no overall and little level-dependent bias between the weights measured by the feeding system and weights measured on the external scales. These results indicate the stability of the feeding system over time and consistency between the feeders within and across the three experiments. In conclusion, the automated feeding system developed for measuring individual animal feed intake was able to detect and record the unique electronic RFID associated with unique feed events and accurately capture the weight of feed removed. Furthermore, there was no change in the accuracy of the system from the start to the end of experimental periods, and the amount of feed removed in the feed event (or meal size) did not impact the accuracy of the results.



2019 ◽  
Vol 28 (2) ◽  
pp. 227-233
Author(s):  
Christopher A. Gravel ◽  
Patrick J. Farrell ◽  
Daniel Krewski


2018 ◽  
Vol 37 (27) ◽  
pp. 3887-3903
Author(s):  
Christopher A. Gravel ◽  
Anup Dewanji ◽  
Patrick J. Farrell ◽  
Daniel Krewski


The 2017 SIS Conference aims to highlight the crucial role of the Statistics in Data Science. In this new domain of ‘meaning’ extracted from the data, the increasing amount of produced and available data in databases, nowadays, has brought new challenges. That involves different fields of statistics, machine learning, information and computer science, optimization, pattern recognition. These afford together a considerable contribute in the analysis of ‘Big data’, open data, relational and complex data, structured and no-structured. The interest is to collect the contributes which provide from the different domains of Statistics, in the high dimensional data quality validation, sampling extraction, dimensional reduction, pattern selection, data modelling, testing hypotheses and confirming conclusions drawn from the data.



2016 ◽  
Vol 79 (12) ◽  
pp. 2095-2106 ◽  
Author(s):  
SARAH M. BENO ◽  
MATTHEW J. STASIEWICZ ◽  
ALEXIS D. ANDRUS ◽  
ROBERT D. RALYEA ◽  
DAVID J. KENT ◽  
...  

ABSTRACT Pathogen environmental monitoring programs (EMPs) are essential for food processing facilities of all sizes that produce ready-to-eat food products exposed to the processing environment. We developed, implemented, and evaluated EMPs targeting Listeria spp. and Salmonella in nine small cheese processing facilities, including seven farmstead facilities. Individual EMPs with monthly sample collection protocols were designed specifically for each facility. Salmonella was detected in only one facility, with likely introduction from the adjacent farm indicated by pulsed-field gel electrophoresis data. Listeria spp. were isolated from all nine facilities during routine sampling. The overall Listeria spp. (other than Listeria monocytogenes) and L. monocytogenes prevalences in the 4,430 environmental samples collected were 6.03 and 1.35%, respectively. Molecular characterization and subtyping data suggested persistence of a given Listeria spp. strain in seven facilities and persistence of L. monocytogenes in four facilities. To assess routine sampling plans, validation sampling for Listeria spp. was performed in seven facilities after at least 6 months of routine sampling. This validation sampling was performed by independent individuals and included collection of 50 to 150 samples per facility, based on statistical sample size calculations. Two of the facilities had a significantly higher frequency of detection of Listeria spp. during the validation sampling than during routine sampling, whereas two other facilities had significantly lower frequencies of detection. This study provides a model for a science- and statistics-based approach to developing and validating pathogen EMPs.



2015 ◽  
Vol 23 (e1) ◽  
pp. e71-e78 ◽  
Author(s):  
Liwen Ouyang ◽  
Daniel W Apley ◽  
Sanjay Mehrotra

Abstract Background and Objective Electronic medical record (EMR) databases offer significant potential for developing clinical hypotheses and identifying disease risk associations by fitting statistical models that capture the relationship between a binary response variable and a set of predictor variables that represent clinical, phenotypical, and demographic data for the patient. However, EMR response data may be error prone for a variety of reasons. Performing a manual chart review to validate data accuracy is time consuming, which limits the number of chart reviews in a large database. The authors’ objective is to develop a new design-of-experiments–based systematic chart validation and review (DSCVR) approach that is more powerful than the random validation sampling used in existing approaches. Methods The DSCVR approach judiciously and efficiently selects the cases to validate (i.e., validate whether the response values are correct for those cases) for maximum information content, based only on their predictor variable values. The final predictive model will be fit using only the validation sample, ignoring the remainder of the unvalidated and unreliable error-prone data. A Fisher information based D-optimality criterion is used, and an algorithm for optimizing it is developed. Results The authors’ method is tested in a simulation comparison that is based on a sudden cardiac arrest case study with 23 041 patients’ records. This DSCVR approach, using the Fisher information based D-optimality criterion, results in a fitted model with much better predictive performance, as measured by the receiver operating characteristic curve and the accuracy in predicting whether a patient will experience the event, than a model fitted using a random validation sample. Conclusions The simulation comparisons demonstrate that this DSCVR approach can produce predictive models that are significantly better than those produced from random validation sampling, especially when the event rate is low.



Sign in / Sign up

Export Citation Format

Share Document