scholarly journals Relative Inference for Paired Data: More than Meets the Eye

Author(s):  
Kenneth Gerow ◽  
David R. Stewart ◽  
Calvin Farris
Keyword(s):  
Author(s):  
Elizabette Johnson ◽  
Elizabeth Roth

Objective Our goal is to improve the wellness of our Family Medicine residents now and in the future by educating them on more efficient use of our electronic health record (EHR). Resident physician burnout is a significant problem and is correlated with time spent using an EHR after work hours. Family physicians have the highest rate of burnout of all specialties, and the EHR is a significant contributor to this burnout. Studies have shown that increased EHR education can improve job satisfaction. Method Over 5 months, we provided weekly brief (15 minute) educational sessions covering 6 topics twice and a one-hour individualized meeting of each resident physician with an EHR trainer. We evaluated our intervention with wellness surveys and objective measures of EHR efficiency both pre and post intervention. We further evaluated efficiency by comparing pre and post-intervention values of the following: average keystrokes, mouseclicks, accelerator use, minutes per encounter and percent closed encounters at month’s end. Results Resident questionnaires showed lessons increased knowledge and intention to use EHR accelerators, but this was not statistically significant. Analysis of objective data showed most efficiency metrics worsened, though most not to a degree that was statistically significant. Residents reported subjective increases in efficiency, and paired data from wellness surveys showed an overall decrease in burnout post-intervention vs. baseline. Conclusions Much of the data in this pilot study does not reach statistical significance, but is highly suggestive that increased EHR training can improve at least perceived efficiency and thereby resident wellness.


2021 ◽  
Vol 16 (1) ◽  
Author(s):  
Sujin Kang

Abstract Background The degree to which a validated instrument is able to detect clinically significant change over time is an important issue for the better management of hip or knee replacement surgery. This study examines the internal responsiveness of the EQ-5D-3L, the Oxford Hip Score (OHS), and the Oxford Knee Score (OKS) by various methods. Data from NHS patient-reported outcome measures (PROMs) linked to the Hospital Episodes Statistics (HES) dataset (2009–2015) was analysed for patients who underwent primary hip surgery (N = 181,424) and primary knee surgery (N = 191,379). Methods Paired data-specific univariate responsiveness was investigated using the standardized response mean (SRM), the standardized effect size (SES), and the responsiveness index (RI). Multivariate responsiveness was furthermore examined using the defined capacity of benefit score (i.e. paired data-specific MCID), adjusting baseline covariates such as age, gender, and comorbidities in the Box-Cox regression models. The observed and predicted percentages of patient improvement were examined both as a whole and by the patients' self-assessed transition level. Results The results showed that both the OHS and the OKS demonstrated great univariate and multivariate responsiveness. The percentages of the observed (predicted) total improvement were high: 51 (54)% in the OHS and 73 (58)% in OKS. The OHS and the OKS showed distinctive differences in improvement by the 3-level transition, i.e. a little better vs. about the same vs. a little worse. The univariate responsiveness of the EQ-5D-3L showed moderate effects in total by Cohen’s thresholds. The percentages of improvement in the EQ-5D-3L were moderate: 44 (48)% in the hip and 42 (44)% for the knee replacement population. Conclusions Distinctive percentage differences in patients’ perception of improvement were observed when the paired data-specific capacity of benefit score was applied to examine responsiveness. This is useful in clinical practice as rationale for access to surgery at the individual-patient level. This study shows the importance of analytic methods and instruments for investigation of the health status in hip and/or knee replacement surgery. The study finding also supports the idea of using a generic measure along with the disease-specific instruments in terms of cross-validation.


2021 ◽  
pp. 096228022199595
Author(s):  
Yalda Zarnegarnia ◽  
Shari Messinger

Receiver operating characteristic curves are widely used in medical research to illustrate biomarker performance in binary classification, particularly with respect to disease or health status. Study designs that include related subjects, such as siblings, usually have common environmental or genetic factors giving rise to correlated biomarker data. The design could be used to improve detection of biomarkers informative of increased risk, allowing initiation of treatment to stop or slow disease progression. Available methods for receiver operating characteristic construction do not take advantage of correlation inherent in this design to improve biomarker performance. This paper will briefly review some developed methods for receiver operating characteristic curve estimation in settings with correlated data from case–control designs and will discuss the limitations of current methods for analyzing correlated familial paired data. An alternative approach using conditional receiver operating characteristic curves will be demonstrated. The proposed approach will use information about correlation among biomarker values, producing conditional receiver operating characteristic curves that evaluate the ability of a biomarker to discriminate between affected and unaffected subjects in a familial paired design.


Atmosphere ◽  
2021 ◽  
Vol 12 (5) ◽  
pp. 577
Author(s):  
Gabriele Graffieti ◽  
Davide Maltoni

In this paper, we present a novel defogging technique, named CurL-Defog, with the aim of minimizing the insertion of artifacts while maintaining good contrast restoration and visibility enhancement. Many learning-based defogging approaches rely on paired data, where fog is artificially added to clear images; this usually provides good results on mildly fogged images but is not effective for difficult cases. On the other hand, the models trained with real data can produce visually impressive results, but unwanted artifacts are often present. We propose a curriculum learning strategy and an enhanced CycleGAN model to reduce the number of produced artifacts, where both synthetic and real data are used in the training procedure. We also introduce a new metric, called HArD (Hazy Artifact Detector), to numerically quantify the number of artifacts in the defogged images, thus avoiding the tedious and subjective manual inspection of the results. HArD is then combined with other defogging indicators to produce a solid metric that is not deceived by the presence of artifacts. The proposed approach compares favorably with state-of-the-art techniques on both real and synthetic datasets.


2021 ◽  
Vol 21 (1) ◽  
Author(s):  
Javier Caviedes-Bucheli ◽  
Nestor Rios-Osorio ◽  
Diana Usme ◽  
Cristian Jimenez ◽  
Adriana Pinzon ◽  
...  

Abstract Background The purpose of this study was to evaluate the changes in canal volume after root canal preparation in vivo with 3 different single-file techniques (Reciproc-Blue®, WaveOne-Gold® and XP-EndoShaper®), with a new method using CBCT and 3D reconstruction. Methods In this prospective study, thirty human lower premolars from healthy patients were used, in which extraction was indicated for orthodontic reasons. All the teeth used were caries- and restoration-free with complete root development, without signs of periodontal disease or traumatic occlusion, and with only one straight canal (up to 25º curvature). Teeth were randomly divided into three different groups: Reciproc-Blue, WaveOne-Gold and XP-EndoShaper. CBCT scans before root canal preparation were used to create a 3D reconstruction with RHINOCEROS 5.0 software to assess the initial canal volume, and then compared with 3D reconstructions after canal preparation to measure the increase in canal volume. Student’s t test for paired data were used to determine statistically significant differences between the before and after canal volumes. Anova test was used to determine statistically significant differences in the percentage of canal volume increase between the groups and Tukey's post-hoc test were used to paired comparison. Results Reciproc-Blue showed the higher increase in canal volume, followed by WaveOne-Gold and XP-EndoShaper (p = 0.003). XP-EndoShaper did not show a statistically significant increase in canal volume after root canal preparation (p = 0.06). Conclusion With this model, Reciproc-Blue showed higher increase in root canal volume, followed by WaveOne-Gold, while XP-EndoShaper did not significantly increase root canal volume during preparation.


2020 ◽  
Vol 41 (Supplement_2) ◽  
Author(s):  
P Codina ◽  
M De Antonio ◽  
E Santiago-Vacas ◽  
M Domingo ◽  
E Zamora ◽  
...  

Abstract Background Heart failure (HF) contemporary management has significantly improved over the past two decades leading to better survival. How application of the contemporary HF management guidelines affects the risk of death estimated by available web-based risk scores is not elucidated. Objective To assess changes in mortality risk prediction after a after a 12-month management period in a multidisciplinary HF Clinic. Methods Out of 1,689 consecutive patients with HF admitted at our ambulatory HF Clinic from May 2006 to November 2018, those who completed one year follow-up were considered for the study. Patients without NTproBNP measurement or with more than 3 missing variables for risk estimation were excluded. Three contemporary web-based HF risk scores were evaluated: MAGGIC-HF, Seattle HF Model (SHFM) and the Barcelona Bio-HF Calculator containing NTproBNP (BCN Bio-HF). Risk of all-cause death at one year and at 3 years were calculated at baseline and re-evaluated after 12-month management in a multidsisciplinary HF Clinic. Wilcoxon paired data test was used to compare changes in mortality risk estimation over time and test equality of matched pairs for comparing estimated change among tools. 442 patients used to derive the Barcelona Bio-HF Calculator were excluded for discrimination purposes. Results 1,157 patients were included (age 65.7±12.7 years, 70.4% men). A significant reduction in mortality risk estimation was observed with the three HF risk scores evaluated at 12-months (Table). The BCN Bio-HF model showed significantly different changes in risk estimation, fact that indeed was partnered with numerically better discrimination. AUC at 1 and 3 years, respectively, were: BCN Bio-HF (0.773 and 0.775), MAGGIC HF (0.686 and 0.748) and SHFM (0.773 and 0.739). Conclusions The three web-based risk scores evaluated showed a significant reduction in mortality risk estimation after 12 month management in a multidisciplinary HF Clinic. The BCN Bio-HF score showed higher reduction in estimated risk, together with better discrimination, likely because it incorporates contemporary treatment and use of biomarkers. Funding Acknowledgement Type of funding source: None


Author(s):  
Simon Lindner ◽  
Steffen Eitelbuss ◽  
Svetlana Hetjens ◽  
Joshua Gawlitza ◽  
Julia Hardt ◽  
...  

Abstract Purpose No clear consensus exists on how to routinely assess the integrity of the colorectal anastomosis prior to ileostomy reversal. The objective of this study was to evaluate the accuracy of contrast enema, endoscopic procedures, and digital rectal examination in rectal cancer patients in this setting. Methods A systematic literature search was performed. Studies assessing at least one index test for which a 2 × 2 table was calculable were included. Hierarchical summary receiver operating characteristic curves were calculated and used for test comparison. Paired data were used where parameters could not be calculated. Methodological quality was assessed with the QUADAS-2 tool. Results Two prospective and 11 retrospective studies comprising 1903 patients were eligible for inclusion. Paired data analysis showed equal or better results for sensitivity and specificity of both endoscopic procedures and digital rectal examination compared to contrast enema. Subgroup analysis of contrast enema according to methodological quality revealed that studies with higher methodological quality reported poorer sensitivity for equal specificity and vice versa. No case was described where a contrast enema revealed an anastomotic leak that was overseen in digital rectal examination or endoscopic procedures. Conclusions Endoscopy and digital rectal examination appear to be the best diagnostic tests to assess the integrity of the colorectal anastomosis prior to ileostomy reversal. Accuracy measures of contrast enema are overestimated by studies with lower methodological quality. Synopsis of existing evidence and risk–benefit considerations justifies omission of contrast enema in favor of endoscopic and clinical assessment. Trial registration https://www.crd.york.ac.uk/prospero/display_record.php?ID=CRD42019107771


2021 ◽  
Vol 17 (4) ◽  
pp. 1-16
Author(s):  
Xiaowe Xu ◽  
Jiawei Zhang ◽  
Jinglan Liu ◽  
Yukun Ding ◽  
Tianchen Wang ◽  
...  

As one of the most commonly ordered imaging tests, the computed tomography (CT) scan comes with inevitable radiation exposure that increases cancer risk to patients. However, CT image quality is directly related to radiation dose, and thus it is desirable to obtain high-quality CT images with as little dose as possible. CT image denoising tries to obtain high-dose-like high-quality CT images (domain Y ) from low dose low-quality CT images (domain X ), which can be treated as an image-to-image translation task where the goal is to learn the transform between a source domain X (noisy images) and a target domain Y (clean images). Recently, the cycle-consistent adversarial denoising network (CCADN) has achieved state-of-the-art results by enforcing cycle-consistent loss without the need of paired training data, since the paired data is hard to collect due to patients’ interests and cardiac motion. However, out of concerns on patients’ privacy and data security, protocols typically require clinics to perform medical image processing tasks including CT image denoising locally (i.e., edge denoising). Therefore, the network models need to achieve high performance under various computation resource constraints including memory and performance. Our detailed analysis of CCADN raises a number of interesting questions that point to potential ways to further improve its performance using the same or even fewer computation resources. For example, if the noise is large leading to a significant difference between domain X and domain Y , can we bridge X and Y with a intermediate domain Z such that both the denoising process between X and Z and that between Z and Y are easier to learn? As such intermediate domains lead to multiple cycles, how do we best enforce cycle- consistency? Driven by these questions, we propose a multi-cycle-consistent adversarial network (MCCAN) that builds intermediate domains and enforces both local and global cycle-consistency for edge denoising of CT images. The global cycle-consistency couples all generators together to model the whole denoising process, whereas the local cycle-consistency imposes effective supervision on the process between adjacent domains. Experiments show that both local and global cycle-consistency are important for the success of MCCAN, which outperforms CCADN in terms of denoising quality with slightly less computation resource consumption.


2018 ◽  
Vol 28 (5) ◽  
pp. 1477-1488
Author(s):  
Yaeji Lim ◽  
Ji Soo Choi ◽  
Kiyoun Kim ◽  
Mira Park ◽  
Seonwoo Kim

Diagnostic procedures are mostly used to detect a particular disease, and each procedure indicates the presence or absence of the disease in an individual. Sensitivity and positive predictive value, which are measures of the effectiveness of a diagnostic procedure, are simply calculated as the proportion of the individuals diagnosed with the disease by the test among the patients with the disease, and of the diseased persons among the individuals in whom the disease was detected by the test, respectively. For a diagnosis with such a binary result, sensitivity and the positive predictive value of diagnostic procedures can be compared using the chi-square statistic. However, in the treatment of cancer patients, it is important not only to diagnose the disease status of an individual patient but also to detect the correct location of the cancer. The tumor location may be incorrectly identified in some subjects diagnosed with cancer. It is therefore of interest whether a procedure that diagnoses cancer also correctly indicates the tumor location. In this paper, we re-define the sensitivity and the positive predictive value of tumor detection as the ratio of the number of cases with a correct diagnosis of the tumor location by the test to the number of cases of cancer, and as the ratio of patients with a correct diagnosis of the tumor location to the number of individuals diagnosed with cancer by the test, respectively. We refer to these parameters as ‘semi-sensitivity’ and ‘semi-positive predictive value’. To compare these ratios between diagnostic procedures, test statistics are developed from binary diagnostic results. Simulation studies conducted to evaluate the nominal level and power are presented, and two sets of example data are also analyzed using the new test statistic.


Sign in / Sign up

Export Citation Format

Share Document