cvauroc: Command to compute cross-validated area under the curve for ROC analysis after predictive modeling for binary outcomes

Author(s):  
Miguel Angel Luque-Fernandez ◽  
Daniel Redondo-Sánchez ◽  
Camille Maringe

Receiver operating characteristic (ROC) analysis is used for comparing predictive models in both model selection and model evaluation. ROC analysis is often applied in clinical medicine and social science to assess the tradeoff between model sensitivity and specificity. After fitting a binary logistic or probit regression model with a set of independent variables, the predictive performance of this set of variables can be assessed by the area under the curve (AUC) from an ROC curve. An important aspect of predictive modeling (regardless of model type) is the ability of a model to generalize to new cases. Evaluating the predictive performance (AUC) of a set of independent variables using all cases from the original analysis sample often results in an overly optimistic estimate of predictive performance. One can use K-fold cross-validation to generate a more realistic estimate of predictive performance in situations with a small number of observations. AUC is estimated iteratively for k samples (the “test” samples) that are independent of the sample used to predict the dependent variable (the “training” sample). cvauroc implements k-fold cross-validation for the AUC for a binary outcome after fitting a logit or probit regression model, averaging the AUCs corresponding to each fold, and bootstrapping the cross-validated AUC to obtain statistical inference and 95% confidence intervals. Furthermore, cvauroc optionally provides the cross-validated fitted probabilities for the dependent variable or outcome, contained in a new variable named _fit; the sensitivity and specificity for each of the levels of the predicted outcome, contained in two new variables named _sen and _spe; and the plot of the mean cross-validated AUC and k-fold ROC curves.

2016 ◽  
Author(s):  
Geoffrey Fouad ◽  
André Skupin ◽  
Christina L. Tague

Abstract. Percentile flows are statistics derived from the flow duration curve (FDC) that describe the flow equaled or exceeded for a given percent of time. These statistics provide important information for managing rivers, but are often unavailable since most basins are ungauged. A common approach for predicting percentile flows is to deploy regional regression models based on gauged percentile flows and related independent variables derived from physical and climatic data. The first step of this process identifies groups of basins through a cluster analysis of the independent variables, followed by the development of a regression model for each group. This entire process hinges on the independent variables selected to summarize the physical and climatic state of basins. Distributed physical and climatic datasets now exist for the contiguous United States (US). However, it remains unclear how to best represent these data for the development of regional regression models. The study presented here developed regional regression models for the contiguous US, and evaluated the effect of different approaches for selecting the initial set of independent variables on the predictive performance of the regional regression models. An expert assessment of the dominant controls on the FDC was used to identify a small set of independent variables likely related to percentile flows. A data-driven approach was also applied to evaluate two larger sets of variables that consist of either (1) the averages of data for each basin or (2) both the averages and statistical distribution of basin data distributed in space and time. The small set of variables from the expert assessment of the FDC and two larger sets of variables for the data-driven approach were each applied for a regional regression procedure. Differences in predictive performance were evaluated using 184 validation basins withheld from regression model development. The small set of independent variables selected through expert assessment produced similar, if not better, performance than the two larger sets of variables. A parsimonious set of variables only consisted of mean annual precipitation, potential evapotranspiration, and baseflow index. Additional variables in the two larger sets of variables added little to no predictive information. Regional regression models based on the parsimonious set of variables were developed using 734 calibration basins, and were converted into a tool for predicting 13 percentile flows in the contiguous US. Supplementary Material for this paper includes an R graphical user interface for predicting the percentile flows of basins within the range of conditions used to calibrate the regression models. The equations and performance statistics of the models are also supplied in tabular form.


Blood ◽  
2019 ◽  
Vol 134 (Supplement_1) ◽  
pp. 4914-4914
Author(s):  
Ikhwan Rinaldi ◽  
Arif Mansjoer

Background There are many factors associated with early mortality after CABG, including postoperative thrombocytopenia (Kertai, 2016). Many factors during CABG surgery, such as administration of heparin or cardio pulmonary bypass during surgery are related to thrombocyte count reduction (Hamid, Akhtar, Naqvi, & Ahsan, 2017; Arepally, 2017). However, it is possible for a post-CABG patient to suffer a significant thrombocyte reduction without reaching the thrombocytopenic state (thrombocyte count <150000/µL). Up to this time, there is still lack of study about association between thrombocyte reduction after surgery and 30-day mortality in patients undergo CABG. This study aim to determine cut off point for postoperative thrombocyte reduction as a predictor of 30-day mortality after CABG surgery. Method This is a retrospective cohort study using medical record of 263 adult patients who underwent CABG surgery in dr. Ciptomangunkusumo National Hospital on 2012-2015. Thrombocyte reduction was determined by substracting preoperative thrombocyte count from postoperative thrombocyte count. Receiver operating curve (ROC) analysis between percentage of thrombocyte reduction and 30-day mortality after surgery was done to obtain the sensitivity and specificity value of a particular degree of thrombocyte reduction. Cut off point was obtained from intersection between sensitivity and specificity value. Result Thirty-day mortality rate after CABG surgery in this study was 11.9%. Cut off point obtained from ROC analysis was 30% with area under the curve (AUC) 0.671. The sensitivity of this cut off point to predict early mortality after CABG surgery was 64.5%, while the specificity was 64.7% Conclusion Thrombocyte reduction more than or equal to 30% can be used as a predictor of 30-day mortality after CABG surgery. Figure Disclosures No relevant conflicts of interest to declare.


2021 ◽  
Vol 11 ◽  
Author(s):  
Tiansong Xie ◽  
Xuanyi Wang ◽  
Zehua Zhang ◽  
Zhengrong Zhou

ObjectivesTo investigate the value of CT-based radiomics analysis in preoperatively discriminating pancreatic mucinous cystic neoplasms (MCN) and atypical serous cystadenomas (ASCN).MethodsA total of 103 MCN and 113 ASCN patients who underwent surgery were retrospectively enrolled. A total of 764 radiomics features were extracted from preoperative CT images. The optimal features were selected by Mann-Whitney U test and minimum redundancy and maximum relevance method. The radiomics score (Rad-score) was then built using random forest algorithm. Radiological/clinical features were also assessed for each patient. Multivariable logistic regression was used to construct a radiological model. The performance of the Rad-score and the radiological model was evaluated using 10-fold cross-validation for area under the curve (AUC), sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV) and accuracy.ResultsTen screened optimal features were identified and the Rad-score was then built based on them. The radiological model was built based on four radiological/clinical factors. In the 10-fold cross-validation, the Rad-score was proved to be robust and reliable (average AUC: 0.784, sensitivity: 0.847, specificity: 0.745, PPV: 0.767, NPV: 0.849, accuracy: 0.793). The radiological model performed slightly less well in classification (average AUC: average AUC: 0.734 sensitivity: 0.748, specificity: 0.705, PPV: 0.732, NPV: 0.798, accuracy: 0.728.ConclusionsThe CT-based radiomics analysis provided promising performance for preoperatively discriminating MCN from ASCN and showed good potential in improving diagnostic power, which may serve as a novel tool for guiding clinical decision-making for these patients.


2009 ◽  
Vol 20 (05) ◽  
pp. 311-314 ◽  
Author(s):  
Julie A. Honaker ◽  
Thomas E. Boismier ◽  
Nathan P. Shepard ◽  
Neil T. Shepard

Background: A vestibulospinal test known as the Fukuda stepping test (FST) has been suggested to be a measure of asymmetrical labyrinthine function. However, an extensive review of the performance of this test to identify a peripheral vestibular lesion has not been reported. Purpose: The purpose of this study was to evaluate the sensitivity and specificity of the standard FST and a head shaking variation for identification of a peripheral vestibular system lesion. Research Design: In this retrospective review, we compared performance on the FST with and without a head shaking component to electronystagmography (ENG) caloric irrigation unilateral weakness results. Study Sample: We studied these factors in 736 chronic dizzy patients. Results: Receiving operating characteristics (ROC) analysis and area under the curve (AUC) indicated no significant benefit to performance from the head shaking variation compared to the standard FST in identifying labyrinthine weakness as classified by caloric unilateral weakness results. Conclusions: These findings suggest that the FST with and without head shake component is not a reliable screening tool for peripheral vestibular asymmetry in chronic dizzy patients; however, future research may hold promise for the FST as a tool for patients with acute unilateral disorders.


Author(s):  
Zahra Al-Busaidi ◽  
Hemesiri Kotagama1 ◽  
Houcine Boughanmi ◽  
Sunil Dharmapala ◽  
John Waelti

Small and Medium Businesses (SMB) have substantial potential to contribute to the growth of an economy through adoption of business related Information Technology. The Agricultural and Food Business (AFB) sector in Oman is dominated by SMBs. Factors influencing the adoption of e-commerce in the AFB sector in Oman were analyzed. The main hypothesis was that scale of the business influenced the adoption of e-commerce. Data were obtained from a random sample (n = 31) of AFBs in Oman and was analyzed using a Probit regression model. The dependent variable was whether the firm had a website, a proxy measure of adoption of e-commerce. The main independent variables were the scale of the business (measured by number of employees) and the scope of the business (international or national) and of variables related to manager’s perception of benefits of adopting e-commerce. It was found that 94% of AFBs in Oman used computers and about 52% of those had websites. The R-squared, of the estimated Probit  regression model was 0.65. The hypothesis that scale of the business influenced the adoption of e-commerce was not rejected, as the coefficient of the business scale variable was statistically  significant (t = 2.5, n = 31). The likelihood of adoption increases with increased scale of the  business. On the converse this suggests that the likelihood of adoption of e-commerce by small and  medium businesses is lower. Given the importance of small and medium businesses in the development  of particularly the AFB sector of Oman, policy maker would have to promote and provide incentives  to adopt e-commerce by SMBs. 


PeerJ ◽  
2016 ◽  
Vol 4 ◽  
pp. e1979 ◽  
Author(s):  
Saw Simeon ◽  
Ola Spjuth ◽  
Maris Lapins ◽  
Sunanta Nabu ◽  
Nuttapat Anuwongcharoen ◽  
...  

Aromatase, the rate-limiting enzyme that catalyzes the conversion of androgen to estrogen, plays an essential role in the development of estrogen-dependent breast cancer. Side effects due to aromatase inhibitors (AIs) necessitate the pursuit of novel inhibitor candidates with high selectivity, lower toxicity and increased potency. Designing a novel therapeutic agent against aromatase could be achieved computationally by means of ligand-based and structure-based methods. For over a decade, we have utilized both approaches to design potential AIs for which quantitative structure–activity relationships and molecular docking were used to explore inhibitory mechanisms of AIs towards aromatase. However, such approaches do not consider the effects that aromatase variants have on different AIs. In this study, proteochemometrics modeling was applied to analyze the interaction space between AIs and aromatase variants as a function of their substructural and amino acid features. Good predictive performance was achieved, as rigorously verified by 10-fold cross-validation, external validation, leave-one-compound-out cross-validation, leave-one-protein-out cross-validation and Y-scrambling tests. The investigations presented herein provide important insights into the mechanisms of aromatase inhibitory activity that could aid in the design of novel potent AIs as breast cancer therapeutic agents.


1995 ◽  
Vol 25 (1) ◽  
pp. 165-170 ◽  
Author(s):  
A. M. Van Hemert ◽  
M. Den Heijer ◽  
M. Vorstenbosch ◽  
J. H. Bolk

SynopsisIn this study we assessed the accuracy of the General Health Questionnaire in detecting psychiatric disorders in general medical out-patients. A total of 290 newly referred patients were interviewed with the Present State Examination. Prior to the interview, 112 patients completed the full GHQ-60, 100 completed the GHQ-30 and 78 completed the GHQ-12. Data from the first group were used to study the full GHQ-60, together with the GHQ-30 and and GHQ-12, when disembedded from the full questionnaire. In a comparison between the disembedded and the separate versions of the GHQ-30 and GHQ-12 we observed considerable variation in the cut-off scores where a certain sensitivity and specificity was attained. In ROC-analysis, the versions were not materially different in their discriminatory capacity (area under the curve). The use of different criteria to define a ‘case’ demonstrated that case severity was another source of increasing cut-off scores. Our data demonstrate that the use of disembedded or separate versions of the questionnaire, together with variation in the case criteria can be a major explanation for variation in cut-off scores that was observed in previous studies.


2021 ◽  
Vol 9 ◽  
Author(s):  
Deliang Sun ◽  
Haijia Wen ◽  
Jiahui Xu ◽  
Yalan Zhang ◽  
Danzhou Wang ◽  
...  

This study aims to develop a logistic regression model of landslide susceptibility based on GeoDetector for dominant-factor screening and 10-fold cross validation for training sample optimization. First, Fengjie county, a typical mountainous area, was selected as the study area since it experienced 1,522 landslides from 2001 to 2016. Second, 22 factors were selected as the initial conditioning factors, and a geospatial database was established with a grid of 30 m precision. Factor detection of the geographic detector and the stepwise regression method included in logistic regression were used to screen out the dominant factors from the database. Then, based on the sample dataset with a 1:10 ratio of landslides and nonlandslides, 10-fold cross validation was used to select the optimized sample to train the logistic regression model of landslide susceptibility in the study area. Finally, the accuracy and efficiency of the two models before and after screening out the dominant factors were evaluated and compared. The results showed that the total accuracy of the two models was both more than 0.9, and the area under the curve value of the receiver operating characteristic curve was more than 0.8, indicating that the models before and after screening factor both had high reliability and good prediction ability. Besides, the screened factors had an active leading role in the geospatial distribution of the historical landslide, indicating that the screened dominant factors have individual rationality. Improving the geospatial agreement between landslide susceptibility and actual landslide-prone by the screening of dominant factors and the optimization of the training samples, a simple, efficient, and reliable logistic-regression–based landslide susceptibility model can be constructed.


Author(s):  
Luis Rolando Guarneros-Nolasco ◽  
Nancy Aracely Cruz-Ramos ◽  
Giner Alor-Hernández ◽  
Lisbeth Rodríguez-Mazahua ◽  
José Luis Sánchez-Cervantes

CVDs are a leading cause of death globally. In CVDs, the heart is unable to deliver enough blood to other body regions. Since effective and accurate diagnosis of CVDs is essential for CVD prevention and treatment, machine learning (ML) techniques can be effectively and reliably used to discern patients suffering from a CVD from those who do not suffer from any heart condition. Namely, machine learning algorithms (MLAs) play a key role in the diagnosis of CVDs through predictive models that allow us to identify the main risks factors influencing CVD development. In this study, we analyze the performance of ten MLAs on two datasets for CVD prediction and two for CVD diagnosis. Algorithm performance is analyzed on top-two and top-four dataset attributes/features with respect to five performance metrics &ndash;accuracy, precision, recall, f1-score, and roc-auc &ndash; using the train-test split technique and k-fold cross-validation. Our study identifies the top two and four attributes from each CVD diagnosis/prediction dataset. As our main findings, the ten MLAs exhibited appropriate diagnosis and predictive performance; hence, they can be successfully implemented for improving current CVD diagnosis efforts and help patients around the world, especially in regions where medical staff is lacking.


Sign in / Sign up

Export Citation Format

Share Document