Using the diagnostic odds ratio to select multivariate sequential patterns in order to build an interpretable pattern-based classifier in a clinical domain (Preprint)

Mapping Intimacies ◽

10.2196/preprints.32319 ◽

2021 ◽

Author(s):

Isidoro J. Casanova ◽

Manuel Campos ◽

Jose M. Juarez ◽

Antonio Gomariz ◽

Marta Lorente-Ros ◽

...

Keyword(s):

Confidence Intervals ◽

Odds Ratio ◽

Diagnostic Odds Ratio ◽

University Hospital ◽

Classification Model ◽

Sequential Patterns ◽

Pattern Selection ◽

Essential Property ◽

Frequency Properties ◽

Clinical Domain

BACKGROUND It is important to exploit all available data on patients in settings such as Intensive Care Burn Units (ICBUs), where several variables are recorded over time. It is possible to take advantage of the multivariate patterns that model the evolution of patients in order to predict their survival. However, pattern discovery algorithms generate a large number of patterns, of which only some are relevant for classification. The interpretability of the model is, moreover, an essential property in the clinical domain. OBJECTIVE We propose to use the Diagnostic Odds Ratio (DOR) to select the multivariate sequential patterns used in the classification in a clinical domain, rather than employing frequency properties. This makes it possible to employ a terminology closer to the language of clinicians, in which a pattern is considered to be a risk factor or to have a protection factor. METHODS We employ data obtained from the ICBU at the University Hospital of Getafe, where six temporal variables for 465 patients were registered every day during 5 days, and to model the evolution of these clinical variables we use multivariate sequential patterns. We compare four ways in which to employ the DOR for pattern selection: 1) We use it as a threshold in order to select patterns with a minimum DOR; 2) We select patterns whose differential DORs are higher than a threshold as regards their extensions; 3) We select patterns whose DOR confidence intervals do not overlap; and 4) We propose the combination of threshold and non-overlapping confidence intervals in order to select the most discriminative patterns. As a baseline, we compare our proposals with Jumping Emerging Patterns (JEPs), one of the most frequently used techniques for pattern selection that utilize frequency properties. RESULTS We have compared the number and length of the patterns eventually selected, classification performance, and pattern and model interpretability. We show that discretization has a great impact on the accuracy of the classification model, but that a trade off must be found between classification accuracy and the physicians' capacity to interpret the patterns obtained. We have, therefore, opted to use expert discretization without losing too much accuracy. We have also identified that the experiments combining threshold and non-overlapping confidence intervals (Option 4) obtained the fewest number of patterns but also with the smallest size, thus implying the loss of an acceptable accuracy as regards clinician interpretation. CONCLUSIONS A method for the classification of patients’ survival can benefit from the use of sequential patterns, since these patterns consider knowledge about the temporal evolution of the variables in the case of ICBU. We have proved that the DOR can be used in several ways, and that it is a suitable measure with which to select discriminative and interpretable quality patterns.

Download Full-text

Small Sample Confidence Intervals for the Odds Ratio

Communications in Statistics - Simulation and Computation ◽

10.1081/sac-200040691 ◽

2004 ◽

Vol 33 (4) ◽

pp. 1095-1113 ◽

Cited By ~ 27

Author(s):

Raef Lawson

Keyword(s):

Confidence Intervals ◽

Odds Ratio ◽

Small Sample

Download Full-text

Prevalence and risk factors of apical periodontitis in endodontically treated teeth in a selected population of Brazilian adults

Brazilian Dental Journal ◽

10.1590/s0103-64402008000100006 ◽

2008 ◽

Vol 19 (1) ◽

pp. 34-39 ◽

Cited By ~ 34

Author(s):

Carlos Estrela ◽

Cláudio Rodrigues Leles ◽

Augusto César Braz Hollanda ◽

Marcelo Sampaio Moura ◽

Jesus Djalma Pécora

Keyword(s):

Risk Factors ◽

Confidence Intervals ◽

Odds Ratio ◽

Apical Periodontitis ◽

Endodontic Treatment ◽

Technical Quality ◽

Endodontically Treated Teeth ◽

Chi Square ◽

Chi Square Test

The aim of this study was to assess the prevalence and risk factors of apical periodontitis in endodontically treated teeth in a selected population of Brazilian adults. A total of 1,372 periapical radiographs of endodontically treated teeth were analyzed based on the quality of root filling, status of coronal restoration and presence of posts associated with apical periodontitis (AP). Data were analyzed statistically using odds ratio, confidence intervals and chi-square test. The prevalence of AP with adequate endodontic treatment was low (16.5%). This percentage dropped to 12.1% in cases with adequate root filling and adequate coronal restoration. Teeth with adequate endodontic treatment and poor coronal restoration had an AP prevalence of 27.9%. AP increased to 71.7% in teeth with poor endodontic treatment associated with poor coronal restoration. When poor endodontic treatment was combined with adequate coronal restoration, AP prevalence was 61.8%. The prevalence of AP was low when associated with high technical quality of root canal treatment. Poor coronal restoration increased the risk of AP even when endodontic treatment was adequate (OR=2.80; 95%CI=1.87-4.22). The presence of intracanal posts had no influence on AP prevalence.

Download Full-text

Clinical utility of microRNA-451 as diagnostic biomarker for human cancers

Bioscience Reports ◽

10.1042/bsr20180653 ◽

2019 ◽

Vol 39 (1) ◽

Cited By ~ 4

Author(s):

Zhanzhan Li ◽

Yanyan Li ◽

Jun Fu ◽

Na Li ◽

Liangfang Shen

Keyword(s):

Likelihood Ratio ◽

Sensitivity And Specificity ◽

Odds Ratio ◽

Random Effect ◽

Random Effect Model ◽

Positive Likelihood Ratio ◽

Negative Likelihood Ratio ◽

Diagnostic Odds Ratio ◽

Diagnostic Ability ◽

Sensitivity Specificity

AbstractWe conducted comprehensive analyses to assess the diagnostic ability of miRNA-451 in cancers. A systematic online search was conducted in PubMed, Web of Science, China’s national knowledge infrastructure, and VIP databases from inception to July 31, 2017. The bivariate random effect model was used for calculating sensitivity, specificity, positive likelihood ratio, negative likelihood ratio, diagnostic odds ratio, and area under cure (AUC). The whole pooled sensitivity and specificity were 0.85 (0.77–0.90) and 0.85 (0.78–0.90) with their 95% confidence interval (95%CI), respectively. The pooled AUC was 0.91 (95%CI: 0.89–0.94). Positive likelihood ratio was 5.57 (95%CI: 3.74–8.31), negative likelihood ratio was 0.18 (95%CI: 0.11–0.28), and diagnostic odds ratio was 31.33 (95%CI: 15.19–64.61). Among Asian population, the sensitivity and specificity were 0.85 (95%CI: 0.77–0.91) and 0.86 (95%CI: 0.78–0.91), respectively. The positive likelihood ratio and negative likelihood ratio were 5.87 (95%CI: 3.78–9.12) and 0.17 (95%CI: 0.11–0.28). The diagnostic odds ratio and AUC were 34.31 (15.51–75.91) and 0.92 (0.89–0.94). The pooled sensitivity, specificity, positive likelihood ratio, negative likelihood ratio, diagnostic odds ratio, and AUC for digestive system cancer were 0.83, 0.88, 6.87, 0.20, 35.13, and 0.92, respectively. The other cancers were 0.87, 0.81, 4.55, 0.16, 28.51, and 0.90, respectively. For sample source, the results still remain consistent. Our results indicated miRNA-451 has a moderate diagnostic ability for cancers, and could be a potential early screening biomarker, and considered as an adjuvant diagnostic index when being combined with other clinical examinations.

Download Full-text

Furosemide stress test as a predictive marker of acute kidney injury progression or renal replacement therapy: a systemic review and meta-analysis

10.21203/rs.2.23355/v2 ◽

2020 ◽

Author(s):

Jia-Jin Chen ◽

Chih-Hsiang Chang ◽

Yen Ta Huang ◽

George Kuo

Keyword(s):

Acute Kidney Injury ◽

Renal Replacement Therapy ◽

Diagnostic Accuracy ◽

Replacement Therapy ◽

Sensitivity And Specificity ◽

Odds Ratio ◽

Diagnostic Performance ◽

Stress Test ◽

Kidney Injury ◽

Diagnostic Odds Ratio

Abstract Background: The use of the furosemide stress test (FST) as an acute kidney injury (AKI) severity marker has been described in several trials. However, the diagnostic performance of the FST in predicting AKI progression has not yet been fully discussed. Methods: In accordance with Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines, we searched the PubMed, Embase, Cochrane databases up to March, 2020. The diagnostic performance of the FST (in terms of sensitivity, specificity, number of events, true positive, false positive) was extracted and evaluated. Results: We identified eleven trials that enrolled a total of 1366 patients, including 517 patients and 1017 patients for whom the outcomes in terms of AKI stage progression and renal replacement therapy (RRT), respectively, were reported. The pooled sensitivity and specificity results of the FST for AKI progression prediction were 0.81 (95% CI: 0.74 - 0.87) and 0.88 (95% CI: 0.82- 0.92), respectively. The pooled positive likelihood ratio (LR) was 5.45 (95% CI: 3.96-7.50), the pooled negative LR was 0.26 (95% CI: 0.19-0.36), and the pooled diagnostic odds ratio (DOR) was 29.69 (95% CI: 17.00-51.85). The summary receiver operating characteristics (SROC) with pooled diagnostic accuracy was 0.88. The diagnostic performance of the FST in predicting AKI progression was not affected by different AKI criteria or underlying chronic kidney disease. The pooled sensitivity and specificity results of the FST for RRT prediction were 0.84 (95% CI: 0.72-0.91) and 0.77 (95% CI: 0.64-0.87), respectively. The pooled positive LR and pooled negative LR were 3.16 (95% CI: 2.06-4.86) and 0.25 (95% CI: 0.14-0.44), respectively. The pooled diagnostic odds ratio (DOR) was 13.59 (95% CI: 5.74-32.17) and SROC with pooled diagnostic accuracy was 0.86. The diagnostic performance of FST for RRT prediction is better in stage 1-2 AKI comparing to stage 3 AKI (relative DOR: 5.75, 95% CI: 2.51-13.33) Conclusion: The FST is a simple tool for the identification of AKI populations at high risk of AKI progression and the need for RRT and the diagnostic performance of FST in RRT prediction is better in early AKI population.

Download Full-text

Accuracy of Visual Tests for Primary Cervical Cancer Screening in Rural Nepal

Journal of Nepal Medical Association ◽

10.31729/jnma.3857 ◽

2018 ◽

Vol 56 (214) ◽

pp. 917-923

Author(s):

Niresh Thapa ◽

Muna Maharjan ◽

Girishma Shrestha ◽

Narayani Maharjan ◽

Deborah Lindell ◽

...

Keyword(s):

Cervical Cancer ◽

Cancer Screening ◽

Negative Predictive Value ◽

Cervical Cancer Screening ◽

Predictive Value ◽

Odds Ratio ◽

Screening Program ◽

Diagnostic Odds Ratio ◽

Screening Tests ◽

Resource Setting

Introduction: In Nepal, cervical cancer is the most common female cancer. Unfortunately, there is no uniform effective screening system available all around the country. The objective of this study is to evaluate the cytology, Visual Inspection with Acetic Acid and with Lugol’s Iodine alone or in combination to detect a pre-cancerous lesion in rural Nepal.Methods: It is an analytical cross-sectional study. Convenience sampling technique was used to select participants who were apparently healthy, married, non- pregnant women of aged 20-65 years for cervical cancer screening program. Screening tests were performed on all eligible women (n=2143) after socio-demographic and reproductive health data collection. A biopsy was applied as a gold standard test. Cross-tabulations were used to describe the test sensitivity, specificity, positive predictive value, and negative predictive value at a 95% confidence interval. Diagnostic odds ratio was also calculated. Results: A majority, 2143 (94%), of women accepted and participated in this study. The sensitivity vs specificity of cytology, VIA, and VILI was 57.1% vs 98.3%, 71.4% vs 88.8% and 78.6% vs 85.1%, and of the co-testing of ‘Both positive VIA and VILI’ and ‘Either positive VIA or VILI’ was 64.3% vs 85.7% and 90.1% vs 83.7% respectively. Negative predictive value of all tests exceeded 99.7%. Cytology had the highest Diagnostic odds ratio (64.9), followed by the co-test ‘Either positive VIA or VILI’ (27.7).Conclusions: Cervical cancer screening by co-testing ‘Either positive VIA or VILI’ is more useful than cytology; VIA and or VILI are easy, safe, feasible and well-accepted tests in a low resource setting, Nepal.

Download Full-text

Automatic ICD-10 Coding and Training System: Deep Neural Network Based on Supervised Learning

JMIR Medical Informatics ◽

10.2196/23230 ◽

2021 ◽

Vol 9 (8) ◽

pp. e23230

Author(s):

Pei-Fu Chen ◽

Ssu-Ming Wang ◽

Wei-Chih Liao ◽

Lu-Cheng Kuo ◽

Kuan-Chih Chen ◽

...

Keyword(s):

Neural Network ◽

Deep Learning ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Deep Neural Network ◽

University Hospital ◽

Classification Model ◽

Icd 10 ◽

And Training

Background The International Classification of Diseases (ICD) code is widely used as the reference in medical system and billing purposes. However, classifying diseases into ICD codes still mainly relies on humans reading a large amount of written material as the basis for coding. Coding is both laborious and time-consuming. Since the conversion of ICD-9 to ICD-10, the coding task became much more complicated, and deep learning– and natural language processing–related approaches have been studied to assist disease coders. Objective This paper aims at constructing a deep learning model for ICD-10 coding, where the model is meant to automatically determine the corresponding diagnosis and procedure codes based solely on free-text medical notes to improve accuracy and reduce human effort. Methods We used diagnosis records of the National Taiwan University Hospital as resources and apply natural language processing techniques, including global vectors, word to vectors, embeddings from language models, bidirectional encoder representations from transformers, and single head attention recurrent neural network, on the deep neural network architecture to implement ICD-10 auto-coding. Besides, we introduced the attention mechanism into the classification model to extract the keywords from diagnoses and visualize the coding reference for training freshmen in ICD-10. Sixty discharge notes were randomly selected to examine the change in the F1-score and the coding time by coders before and after using our model. Results In experiments on the medical data set of National Taiwan University Hospital, our prediction results revealed F1-scores of 0.715 and 0.618 for the ICD-10 Clinical Modification code and Procedure Coding System code, respectively, with a bidirectional encoder representations from transformers embedding approach in the Gated Recurrent Unit classification model. The well-trained models were applied on the ICD-10 web service for coding and training to ICD-10 users. With this service, coders can code with the F1-score significantly increased from a median of 0.832 to 0.922 (P<.05), but not in a reduced interval. Conclusions The proposed model significantly improved the F1-score but did not decrease the time consumed in coding by disease coders.

Download Full-text

Comparison of Exact, Mid-p, and Mantel–Haenszel Confidence Intervals for the Common Odds Ratio Across Several 2 × 2 Contingency Tables

The American Statistician ◽

10.1080/00031305.1992.10475871 ◽

1992 ◽

Vol 46 (2) ◽

pp. 146-150 ◽

Cited By ~ 2

Author(s):

Cyrus R. Mehta ◽

Stephen J. Walsh

Keyword(s):

Confidence Intervals ◽

Odds Ratio ◽

Contingency Tables ◽

Common Odds Ratio ◽

The Common

Download Full-text

Venous Thromboembolism in Patients With Thrombocytopenia: Risk Factors, Treatment, and Outcome

Clinical and Applied Thrombosis/Hemostasis ◽

10.1177/1076029615613158 ◽

2015 ◽

Vol 23 (4) ◽

pp. 345-350 ◽

Cited By ~ 4

Author(s):

Jens Kristian Baelum ◽

Espen Ellingsen Moe ◽

Mads Nybo ◽

Pernille Just Vinholt

Keyword(s):

Risk Factors ◽

Platelet Count ◽

Venous Thromboembolism ◽

Blood Cells ◽

Medical Records ◽

Odds Ratio ◽

Conditional Logistic Regression ◽

University Hospital ◽

Hospital Department ◽

Bleeding Episodes

Background: Venous thromboembolism (VTE) is a frequent and potentially lethal condition. Venous thrombi are mainly constituted of fibrin and red blood cells, but platelets also play an important role in VTE formation. Information about VTE in patients with thrombocytopenia is, however, missing. Objectives: To identify VTE risk factors and describe treatment and outcome (bleeding episodes and mortality) in patients with thrombocytopenia. Patients/Methods: Patients with thrombocytopenia (platelet count <100 × 109/L) admitted to Odense University Hospital, Denmark, between April 2000 and April 2012 were included. Fifty cases had experienced VTE. Controls without VTE were matched 3:1 with cases on sex and hospital department. Medical records were examined, and data were analyzed using conditional logistic regression. Results: In multivariate analysis, platelet count <50 × 109/L (odds ratio [OR] 0.22, P < .05) and chronic liver disease (OR 0.05, 95% confidence interval [CI] 0.01-0.58) reduced the risk of VTE. Surgery (OR 6.44, 95% CI 1.37-30.20) and previous thromboembolism (OR 6.16, 95% CI 1.21-31.41) were associated with an increased VTE risk. Ninety-two percent of cases were treated with anticoagulants. There was no difference in bleeding incidence between cases and controls. Conclusions: Several known VTE risk factors also seems to apply in patients with thrombocytopenia. Also, patients with thrombocytopenia may be VTE risk stratified based on platelet count and comorbidities. Finally, patients having thrombocytopenia with VTE seem to be safely treated with anticoagulants without increased occurrence of bleeding.

Download Full-text

NIMG-22. PREDICTION OF GLIOBLASTOMA CELLULAR INFILTRATION AND RECURRENCE USING MACHINE LEARNING AND MULTI-PARAMETRIC MRI ANALYSIS: RESULTS FROM THE MULTI-INSTITUTIONAL RESPOND CONSORTIUM

Neuro-Oncology ◽

10.1093/neuonc/noab196.522 ◽

2021 ◽

Vol 23 (Supplement_6) ◽

pp. vi132-vi133

Author(s):

Hamed Akbari ◽

Suyash Mohan ◽

Jose A Garcia ◽

Anahita Fathi Kazerooni ◽

Chiharu Sako ◽

...

Keyword(s):

Odds Ratio ◽

Tumor Recurrence ◽

De Novo ◽

Predictive Biomarkers ◽

Neoplastic Cell ◽

University Hospital ◽

Cell Infiltration ◽

New York University ◽

Single Institution ◽

Peritumoral Tissue

Abstract PURPOSE Multi-parametric MRI and artificial intelligence (AI) methods were previously used to predict peritumoral neoplastic cell infiltration and risk of future recurrence in glioblastoma, in single-institution studies. We hypothesize that important characteristics of peritumoral tissue heterogeneity captured, engineered/selected, and quantified by these methods relate to predictions generalizable in the multi-institutional ReSPOND (Radiomics Signatures for PrecisiON Diagnostics) consortium. METHODS To support further development, generalization, and clinical translation of our proposed method, we trained the AI model on a retrospective cohort of 29 de novo glioblastoma patients from the Hospital of the University of Pennsylvania (UPenn) (Male/Female:20/9, age:22-78 years) followed by evaluation on a prospective multi-institutional cohort of 84 glioblastoma patients (Male/Female:51/33, age:34-89 years) from Case Western Reserve University/University Hospitals (CWRU/UH, 25), New York University (NYU, 13), Ohio State University (OSU, 13), University Hospital Río Hortega (RH, 2), and UPenn (31). Features extracted from pre-resection MRI (T1, T1-Gd, T2, T2-FLAIR, ADC) were used to build our model predicting the spatial pattern of subsequent tumor recurrence. These predictions were evaluated against regions of pathology-confirmed post-resection recurrence. RESULTS Our model predicted the locations that later harbored tumor recurrence with sensitivity 83%, AUC 0.83 (99% CI, 0.73-0.93), and odds ratio 7.23 (99% CI, 7.09-7.37) in the prospective cohort. Odds ratio (99% CI)/AUC(99% CI) per institute were: CWRU/UH, 7.8(7.6-8.1)/0.82(0.75-0.89); NYU, 3.5(3.3-3.6)/0.84(074-0.93); OSU, 7.9(7.6-8.3)/0.8(0.67-0.94); RH, 22.7(20-25.1)/0.94(0.27-1); UPenn, 7.1(6.8-7.3)/0.83(0.76-0.91). CONCLUSION This is the first study that provides relatively extensive multi-institutional validated evidence that AI can provide good predictions of peritumoral neoplastic cell infiltration and future recurrence, by dissecting the MRI signal heterogeneity in peritumoral tissue. Our analyses leveraged the unique dataset of the ReSPOND consortium, which aims to develop and evaluate AI-based biomarkers for individualized prediction and prognostication, by moving from single-institution studies to generalizable, well-validated multi-institutional predictive biomarkers.

Download Full-text