scholarly journals Sensitivity and Specificity of Real-World Social Factor Screening Approaches

2021 ◽  
Vol 45 (12) ◽  
Author(s):  
Joshua R. Vest ◽  
Wei Wu ◽  
Eneida A. Mendonca
Circulation ◽  
2019 ◽  
Vol 140 (Suppl_2) ◽  
Author(s):  
Xavier J Szigethy ◽  
Connor J Willson ◽  
David D Salcido ◽  
Dylan A Defilippi ◽  
James J Menegazzi

Background: Automated external defibrillators (AEDs) perform rhythm analysis in order to facilitate defibrillation. The effectiveness of AEDs is dependent on the accuracy of their rhythm classification, which includes differentiation of shockable rhythms from non-shockable rhythms Independent (i.e. non-industry) evaluation of the performance of AEDs against real-world ECG could lead to improvements in their performance. Objective: To evaluate the sensitivity and specificity characteristics of commercial AEDs with respect to quantitative properties of the ECG waveform in several rhythm presentations using real world ECG data. Methods: We conducted a prospective simulation study evaluating three commercially available AEDs from Defibtech, Phillips, and Zoll on the determination of ECG rhythm shockability. Performance was evaluated for 181 human ECG recordings (101 ventricular fibrillation-VF, 55 PEA, and 25 asystole) ranging widely in signal characteristics, obtained from the Pittsburgh site of the Resuscitation Outcomes Consortium. We used a commercially available digital-to-analog converter (National Instruments USB-6001) to inject the recordings into each AED through a direct lead-wire interface, recording shock advisement decisions in a best-out-of-three approach for each device/rhythm pairing. We calculated the sensitivity and specificity for discriminating VF and non-VF rhythms for each device and overall. VF signal characteristics were calculated, including peak frequency, median amplitude, and peak amplitude, and the VF quantitative waveform measures AMSA and median slope. Results: The 101 VF trials featured signals with mean peak frequency 10.02 Hz(IQR 4.80 Hz), mean AMSA 9.13(IQR 7.29), mean median slope 6.72 (IQR 3.66). The sensitivities were: Defibtech 99.0%; Philips 97.0%; Zoll 98.0%. The specificities were: Defibtech 98.7%; Philips 96.2%; Zoll 97.4%. Defibtech recorded 5 discordant advisements and Philips and Zoll recorded eight each. The overall sensitivity was 98.0%, and the specificity 97.4%. Conclusion: Evaluated against a wide variety of real-world signal presentations, commercial AEDs demonstrated a high degree of sensitivity and specificity for shockable ECG rhythms.


2017 ◽  
Vol 32 (4) ◽  
pp. 449-464 ◽  
Author(s):  
Valerie B. Shapiro ◽  
B. K. Elizabeth Kim ◽  
Jennifer L. Robitaille ◽  
Paul A. LeBuffe

PLoS ONE ◽  
2020 ◽  
Vol 15 (12) ◽  
pp. e0244023
Author(s):  
Darunee Chotiprasitsakul ◽  
Pataraporn Pewloungsawat ◽  
Chavachol Setthaudom ◽  
Pitak Santanirand ◽  
Prapaporn Pornsuriyasak

Background PCR is more sensitive than immunofluorescence assay (IFA) for detection of Pneumocystis jirovecii. However, PCR cannot always distinguish infection from colonization. This study aimed to compare the performance of real-time PCR and IFA for diagnosis of P. jirovecii pneumonia (PJP) in a real-world clinical setting. Methods A retrospective cohort study was conducted at a 1,300-bed hospital between April 2017 and December 2018. Patients whose respiratory sample (bronchoalveolar lavage or sputum) were tested by both Pneumocystis PCR and IFA were included. Diagnosis of PJP was classified based on multicomponent criteria. Sensitivity, specificity, 95% confidence intervals (CI), and Cohen's kappa coefficient were calculated. Results There were 222 eligible patients. The sensitivity and specificity of PCR was 91.9% (95% CI, 84.0%–96.7%) and 89.7% (95% CI, 83.3%–94.3%), respectively. The sensitivity and specificity of IFA was 7.0% (95% CI, 2.6%–14.6%) and 99.2% (95% CI, 95.6%–100.0%), respectively. The percent agreement between PCR and IFA was 56.7% (Cohen's kappa -0.02). Among discordant PCR-positive and IFA-negative samples, 78% were collected after PJP treatment. Clinical management would have changed in 14% of patients using diagnostic information, mainly based on PCR results. Conclusions PCR is highly sensitive compared with IFA for detection of PJP. Combining clinical, and radiological features with PCR is useful for diagnosis of PJP, particularly when respiratory specimens cannot be promptly collected before initiation of PJP treatment.


2020 ◽  
Vol 38 (15_suppl) ◽  
pp. 2051-2051
Author(s):  
Jeffrey J. Kirshner ◽  
Kelly Cohn ◽  
Steven Dunder ◽  
Karri Donahue ◽  
Madeline Richey ◽  
...  

2051 Background: Efforts to facilitate patient identification for clinical trials in routine practice, such as automating electronic health record (EHR) data reviews, are hindered by the lack of information on metastatic status in structured format. We developed a machine learning tool that infers metastatic status from unstructured EHR data, and we describe its real-world implementation. Methods: This machine learning model scans EHR documents, extracting features from text snippets surrounding key words (ie, ‘Metastatic’ ‘Progression’ ‘Local’). A regularized logistic regression model was trained, and used to classify patients across 5 metastatic status inference categories: highly-likely and likely positive, highly-likely and likely negative, and unknown. The model accuracy was characterized using the Flatiron Health EHR-derived de-identified database of patients with solid tumors, where manually abstracted information served as standard accurate reference. We assessed model accuracy using sensitivity and specificity (patients in the ‘unknown’ category omitted from numerator), negative and positive predictive values (NPV, PPV; patients ‘unknown’ included in denominator), and its performance in a real-world dataset. In a separate validation, we evaluated the accuracy gained upon additional user review of the model outputs after integration of this tool into workflows. Results: This metastatic status inference model was characterized using a sample of 66,532 patients. The model sensitivity and specificity (95%CI) were 82.% (82, 83) and 95% (95, 96), respectively; PPV was 89% (89, 90) and NPV was 94% (94, 94). In the validation sample (N = 200 originated from 5 distinct care sites), and after user review of model outputs, values increased to 97% (85, 100) for sensitivity, 98% (95, 100) for specificity, 92 (78, 98) for PPV and 99% (97, 100) for NPV. The model assigned 163/200 patients to the highly-likely categories, which were deemed not to require further EHR review by users. The prevalence of errors was 4% without user review, and 2% after user review. Conclusions: This machine learning model infers metastatic status from unstructured EHR data with high accuracy. The tool assigns metastatic status with high confidence in more than 75% of cases without requiring additional manual review, allowing more efficient identification of clinical trial candidates and clinical trial matching, thus mitigating a key barrier for clinical trial participation in community clinics.


2021 ◽  
Author(s):  
Victor Henrique Alves Ribeiro ◽  
Gabriela Steinhaus ◽  
Evair Borges Severo ◽  
José Raniery Ferreira Junior ◽  
Luiz José Lucas Barbosa ◽  
...  

The world currently suffers from the global COVID-19 pandemic. Billions of people have been impacted, and millions of casualties have already occurred. Therefore, it is of extreme importance to identify individuals contaminated by SARS-CoV-2, allowing governments to plan actions to reduce further impacts. In this context, this work employed machine learning to improve the detection of SARS-CoV-2 antibodies in blood exams. Models have been developed in a real-world scenario with 500 thousand exams and were deployed in a remote laboratory for experiments. Results indicate that the models averaged sensitivity and specificity of 95%, and thus, they could aid COVID-19 antibody detection and the decision-making process of biomedical specialists.


PLoS ONE ◽  
2021 ◽  
Vol 16 (8) ◽  
pp. e0256290
Author(s):  
Taehan Koo ◽  
Moon Hwan Kim ◽  
Mihn-Sook Jue

Direct microscopic examination with potassium hydroxide is generally used as a screening method for diagnosing superficial fungal infections. Although this type of examination is faster than other diagnostic methods, it can still be time-consuming to evaluate a complete sample; additionally, it possesses the disadvantage of inconsistent reliability as the accuracy of the reading may differ depending on the performer’s skill. This study aims at detecting hyphae more quickly, conveniently, and consistently through deep learning using images obtained from microscopy used in real-world practice. An object detection convolutional neural network, YOLO v4, was trained on microscopy images with magnifications of 100×, 40×, and (100+40)×. The study was conducted at the Department of Dermatology at Veterans Health Service Medical Center, Seoul, Korea between January 1, 2019 and December 31, 2019, using 3,707 images (1,255 images for training, 1,645 images for testing). The average precision was used to evaluate the accuracy of object detection. Precision recall curve analysis was performed for the hyphal location determination, and receiver operating characteristic curve analysis was performed on the image classification. The F1 score, sensitivity, and specificity values were used as measures of the overall performance. The sensitivity and specificity were, respectively, 95.2% and 100% in the 100× data model, and 99% and 86.6% in the 40× data model; the sensitivity and specificity in the combined (100+40)× data model were 93.2% and 89%, respectively. The performance of our model had high sensitivity and specificity, indicating that hyphae can be detected with reliable accuracy. Thus, our deep learning-based autodetection model can detect hyphae in microscopic images obtained from real-world practice. We aim to develop an automatic hyphae detection system that can be utilized in real-world practice through continuous research.


PLoS Medicine ◽  
2020 ◽  
Vol 17 (11) ◽  
pp. e1003381
Author(s):  
Seung Seog Han ◽  
Ik Jun Moon ◽  
Seong Hwan Kim ◽  
Jung-Im Na ◽  
Myoung Shin Kim ◽  
...  

Background The diagnostic performance of convolutional neural networks (CNNs) for diagnosing several types of skin neoplasms has been demonstrated as comparable with that of dermatologists using clinical photography. However, the generalizability should be demonstrated using a large-scale external dataset that includes most types of skin neoplasms. In this study, the performance of a neural network algorithm was compared with that of dermatologists in both real-world practice and experimental settings. Methods and findings To demonstrate generalizability, the skin cancer detection algorithm (https://rcnn.modelderm.com) developed in our previous study was used without modification. We conducted a retrospective study with all single lesion biopsied cases (43 disorders; 40,331 clinical images from 10,426 cases: 1,222 malignant cases and 9,204 benign cases); mean age (standard deviation [SD], 52.1 [18.3]; 4,701 men [45.1%]) were obtained from the Department of Dermatology, Severance Hospital in Seoul, Korea between January 1, 2008 and March 31, 2019. Using the external validation dataset, the predictions of the algorithm were compared with the clinical diagnoses of 65 attending physicians who had recorded the clinical diagnoses with thorough examinations in real-world practice. In addition, the results obtained by the algorithm for the data of randomly selected batches of 30 patients were compared with those obtained by 44 dermatologists in experimental settings; the dermatologists were only provided with multiple images of each lesion, without clinical information. With regard to the determination of malignancy, the area under the curve (AUC) achieved by the algorithm was 0.863 (95% confidence interval [CI] 0.852–0.875), when unprocessed clinical photographs were used. The sensitivity and specificity of the algorithm at the predefined high-specificity threshold were 62.7% (95% CI 59.9–65.1) and 90.0% (95% CI 89.4–90.6), respectively. Furthermore, the sensitivity and specificity of the first clinical impression of 65 attending physicians were 70.2% and 95.6%, respectively, which were superior to those of the algorithm (McNemar test; p < 0.0001). The positive and negative predictive values of the algorithm were 45.4% (CI 43.7–47.3) and 94.8% (CI 94.4–95.2), respectively, whereas those of the first clinical impression were 68.1% and 96.0%, respectively. In the reader test conducted using images corresponding to batches of 30 patients, the sensitivity and specificity of the algorithm at the predefined threshold were 66.9% (95% CI 57.7–76.0) and 87.4% (95% CI 82.5–92.2), respectively. Furthermore, the sensitivity and specificity derived from the first impression of 44 of the participants were 65.8% (95% CI 55.7–75.9) and 85.7% (95% CI 82.4–88.9), respectively, which are values comparable with those of the algorithm (Wilcoxon signed-rank test; p = 0.607 and 0.097). Limitations of this study include the exclusive use of high-quality clinical photographs taken in hospitals and the lack of ethnic diversity in the study population. Conclusions Our algorithm could diagnose skin tumors with nearly the same accuracy as a dermatologist when the diagnosis was performed solely with photographs. However, as a result of limited data relevancy, the performance was inferior to that of actual medical examination. To achieve more accurate predictive diagnoses, clinical information should be integrated with imaging information.


2020 ◽  
Author(s):  
Wen-Cheng Liu ◽  
Chin-Sheng Lin ◽  
Chien-Sung Tsai ◽  
Tien-Ping Tsao ◽  
Cheng-Chung Cheng ◽  
...  

Abstract BackgroundThe initial detection and diagnosis of ST-segment or non-ST-segment elevation myocardial infarction (STEMI or NSTEMI) definitely rely on a 12-lead electrocardiogram (ECG). Delay or misdiagnosis is not unusual by subjective interpretation. Our aim is to develop a DLM as a diagnostic support tool to detect MI based on a 12-lead ECG and to evaluate the performance of this model.MethodsThis study included 1,051 ECGs from 737 coronary angiography (CAG)-validated STEMI patients, 697 ECGs from 287 CAG-validated NSTEMI patients, and 140,336 not-MI ECGs from 76,775 patients at emergency departments. DLM was trained and validated for the performance using 80% and 20% of the ECGs, respectively. A human-machine competition was conducted. The area under the receiver operating characteristic curve (AUC), sensitivity, and specificity were used to evaluate the performance of DLM and experts. STEMI versus not-STEMI, and MI versus not-MI were evaluated by DLM.ResultsThe AUCs of DLM for identifying STEMI and MI were 0.976 and 0.944 in the human-machine competition, respectively, which were significantly better than those of our best clinicians. In the real world setting, DLM presented with AUC of 0.995/0.916 with corresponding sensitivities of 96.9%/77.0%, and specificities of 96.2%/92.9% in the identification of STEMI and MI, respectively. Furthermore, DLM demonstrated sufficient diagnostic capacity for STEMI without the aid of troponin I (TnI) (AUC= 0.996) with corresponding sensitivity and specificity of 98.4% and 96.9%. The AUC of combined DLM and the first recorded TnI for the detection of NSTEMI were increased to 0.978 with corresponding sensitivity and specificity of 91.6% and 96.7%, which was better than that of DLM (0.877) or TnI (0.949) alone. ConclusionsDLM may serve as a diagnostic decision tool to assist intensive or emergency medical system-based networks and frontline physicians in identifying STEMI and NSTEMI in a timely and precise manner to prevent delay or misdiagnosis, and thereby to facilitate subsequent reperfusion therapy.


Sign in / Sign up

Export Citation Format

Share Document