Application of three statistical models for predicting the risk of diabetes

Abstract Background At present, the proportion of undiagnosed diabetes in Chinese adults is as high as 15.5%. People with diabetes who are not treated and controlled in time may have various complications, such as cardiovascular and cerebrovascular diseases and diabetic foot disorders, which not only seriously affect the quality of life of people with diabetes but also impose a heavy burden on families and society. Therefore, prevention and control of type 2 diabetes is of great significance. Methods We constructed a logistic regression model, a neural network model and a decision tree model to analyse the risk factors for type 2 diabetes and then compared the prediction accuracy of the different models by calculating the area under the relative operating characteristic (ROC) curve and back-inputting the data into the model. Results The prevalence of type 2 diabetes in 4177 subjects who were not diagnosed with type 2 diabetes was 9.31%. The most influential factors associated with type 2 diabetes were triglyceride (TG) ≥ 1.17 mmol/L (odds ratio (OR) =2.233), age ≥ 70 years (OR = 1.734), hypertension (OR = 1.703), alcohol consumption (OR = 1.674), and total cholesterol≥5.2 mmol/L (TC) (OR = 1.463). The prediction accuracies of the three prediction models were 90.8, 91.2, and 90.7%, respectively, and the areas under curve (AUCs) were 0.711, 0.780, and 0.698, respectively. The differences in the AUCs after back propagation (BP) of the neural network model, logistic regression model and decision tree model were statistically significant (P < 0.05). Conclusion BP neural networks have a higher predictive power for identifying the associated risk factors of type 2 diabetes than the other two models, but it is necessary to select a suitable model for specific situations.

Download Full-text

Screening Risk Factors and Interaction Analysis of Hypertension in Overweight and Obesity Population based on Three Statistical Models

10.21203/rs.3.rs-390569/v1 ◽

2021 ◽

Author(s):

Li Lu Wei ◽

Yu jian

Keyword(s):

Neural Network ◽

Risk Factors ◽

Logistic Regression ◽

Regression Model ◽

Bp Neural Network ◽

Logistic Regression Model ◽

Classification Tree ◽

Overweight And Obesity ◽

Tree Model ◽

Classification Tree Model

Abstract Background Hypertension is a common chronic disease in the world, and it is also a common basic disease of cardiovascular and brain complications. Overweight and obesity are the high risk factors of hypertension. In this study, three statistical methods, classification tree model, logistic regression model and BP neural network, were used to screen the risk factors of hypertension in overweight and obese population, and the interaction of risk factors was conducted Analysis, for the early detection of hypertension, early diagnosis and treatment, reduce the risk of hypertension complications, have a certain clinical significance.Methods The classification tree model, logistic regression model and BP neural network model were used to screen the risk factors of hypertension in overweight and obese people.The specificity, sensitivity and accuracy of the three models were evaluated by receiver operating characteristic curve (ROC). Finally, the classification tree CRT model was used to screen the related risk factors of overweight and obesity hypertension, and the non conditional logistic regression multiplication model was used to quantitatively analyze the interaction.Results The Youden index of ROC curve of classification tree model, logistic regression model and BP neural network model were 39.20%,37.02% ,34.85%, the sensitivity was 61.63%, 76.59%, 82.85%, the specificity was 77.58%, 60.44%, 52.00%, and the area under curve (AUC) was 0.721, 0.734,0.733, respectively. There was no significant difference in AUC between the three models (P>0.05). Classification tree CRT model and logistic regression multiplication model suggested that the interaction between NAFLD and FPG was closely related to the prevalence of overweight and obese hypertension.Conclusion NAFLD,FPG,age,TG,UA, LDL-C were the risk factors of hypertension in overweight and obese people. The interaction between NAFLD and FPG increased the risk of hypertension.

Download Full-text

Associated Factors with the Mortality Rate in Patients with COVID-19 - Decision Trees Vs. Logistic Regression

Journal of Evolution of Medical and Dental Sciences ◽

10.14260/jemds/2021/756 ◽

2021 ◽

Vol 10 (44) ◽

pp. 3736-3741

Author(s):

Soraya Siabani ◽

Leila Solouki ◽

Mehdi Moradinazar ◽

Farid Najafi ◽

Ebrahim Shakiba

Keyword(s):

Risk Factors ◽

Cardiovascular Disease ◽

Logistic Regression ◽

Body Temperature ◽

Decision Tree ◽

Regression Model ◽

Roc Curve ◽

Logistic Regression Model ◽

Decision Tree Model ◽

Tree Model

BACKGROUND Given the global burden of COVID-19 mortality, this study intended to determine the factors affecting mortality in patients with COVID-19 using decision tree analysis and logistic regression model in Kermanshah province, 2020. METHODS This cross-sectional study was conducted on 7799 patients with COVID-19 admitted to the hospitals of Kermanshah province. Data gathered from February 18 to July 9, 2020, were obtained from the vice-chancellor for the health of Kermanshah University of Medical Sciences. The performance of the models was compared according to the sensitivity, specificity, and area under the receiver operating characteristic (ROC) curve. RESULTS According to the decision tree model, the most important risk factors for death due to COVID-19 were age, body temperature, admission to intensive care unit (ICU), prior hospital visit within the last 14 days, and cardiovascular disease. Also, the multivariate logistic regression model showed that the variables of age [OR = 4.47, 95 % CI: (3.16 -6.32)], shortness of breath [OR = 1.42, 95 % CI: (1.0-2.01)], ICU admission [OR = 3.75, 95 % CI: (2.47-5.68)], abnormal chest X-ray [OR = 1.93, 95 % CI: (1.06-3.41)], liver disease [OR = 5.05, 95 % CI (1.020-25.2)], body temperature [OR = 4.93, 95 % CI: (2.17-6.25)], and cardiovascular disease [OR = 2.15, 95 % CI: (1.27-3.06)] were significantly associated with the higher mortality of patients with COVID-19. The area under the ROC curve for the decision tree model and logistic regression was 0.77 and 0.75, respectively. CONCLUSIONS Identifying risk factors for mortality in patients with COVID-19 can provide more effective interventions in the early stages of treatment and improve the medical approaches provided by the medical staff. KEY WORDS COVID-19, Decision Tree, Logistic Regression, Mortality, Risk Factor

Download Full-text

Application and comparison of logistic regression model and neural network model in earthquake-induced landslides susceptibility mapping at mountainous region, China

Geomatics Natural Hazards and Risk ◽

10.1080/19475705.2018.1451399 ◽

2018 ◽

Vol 9 (1) ◽

pp. 501-523 ◽

Cited By ~ 4

Author(s):

Peng Xie ◽

Haijia Wen ◽

Chaochao Ma ◽

Laurie G. Baise ◽

Jialan Zhang

Keyword(s):

Neural Network ◽

Logistic Regression ◽

Regression Model ◽

Network Model ◽

Neural Network Model ◽

Logistic Regression Model ◽

Susceptibility Mapping ◽

Mountainous Region ◽

Landslides Susceptibility

Download Full-text

Risk of Hospitalisation among Patients with Type 2 Diabetes and its Determinants: A Logistic Regression Model

GEDRAG & ORGANISATIE REVIEW ◽

10.37896/gor33.02/546 ◽

2020 ◽

Vol 33 (02) ◽

Author(s):

Subha P P ◽

◽

Roy Scaria ◽

Keyword(s):

Type 2 Diabetes ◽

Logistic Regression ◽

Regression Model ◽

Logistic Regression Model

Download Full-text

A Comparison of MICU Survival Prediction Using the Logistic Regression Model and Artificial Neural Network Model

Journal of Nursing Research ◽

10.1097/01.jnr.0000387590.19963.8e ◽

2006 ◽

Vol 14 (4) ◽

pp. 306-314 ◽

Cited By ~ 3

Author(s):

Shu-Ping Lin ◽

Chi-Hsueh Lee ◽

Yang-Shu Lu ◽

Ling-Nu Hsu

Keyword(s):

Neural Network ◽

Artificial Neural Network ◽

Logistic Regression ◽

Regression Model ◽

Network Model ◽

Neural Network Model ◽

Artificial Neural Network Model ◽

Logistic Regression Model ◽

Survival Prediction ◽

Artificial Neural

Download Full-text

Artificial Neural Network Model for Predicting 5-year Mortality after Surgery for Hepatocellular Carcinoma and Performance Comparison with Logistic Regression Model: A Nationwide Taiwan Database Study

2012 Third International Conference on Innovations in Bio-Inspired Computing and Applications ◽

10.1109/ibica.2012.28 ◽

2012 ◽

Author(s):

Wan-Ting Hung ◽

King-Teh Lee ◽

Shih-Chin Wang ◽

Wen-Hsien Ho ◽

Su-Ching Chang ◽

...

Keyword(s):

Neural Network ◽

Hepatocellular Carcinoma ◽

Artificial Neural Network ◽

Logistic Regression ◽

Regression Model ◽

Neural Network Model ◽

Artificial Neural Network Model ◽

Logistic Regression Model ◽

Performance Comparison ◽

And Performance

Download Full-text

Predicting Korean lodging firm failures: An artificial neural network model along with a logistic regression model

International Journal of Hospitality Management ◽

10.1016/j.ijhm.2009.06.007 ◽

2010 ◽

Vol 29 (1) ◽

pp. 120-127 ◽

Cited By ~ 29

Author(s):

Hyewon Youn ◽

Zheng Gu

Keyword(s):

Neural Network ◽

Artificial Neural Network ◽

Logistic Regression ◽

Regression Model ◽

Network Model ◽

Neural Network Model ◽

Artificial Neural Network Model ◽

Logistic Regression Model ◽

Artificial Neural

Download Full-text

Comparison of the Levels of Accuracy of an Artificial Neural Network Model and a Logistic Regression Model for the Diagnosis of Acute Appendicitis

Journal of Medical Systems ◽

10.1007/s10916-007-9077-9 ◽

2007 ◽

Vol 31 (5) ◽

pp. 357-364 ◽

Cited By ~ 17

Author(s):

Shinya Sakai ◽

Kuriko Kobayashi ◽

Shin-ichi Toyabe ◽

Nozomu Mandai ◽

Tatsuo Kanda ◽

...

Keyword(s):

Neural Network ◽

Artificial Neural Network ◽

Logistic Regression ◽

Acute Appendicitis ◽

Regression Model ◽

Network Model ◽

Neural Network Model ◽

Artificial Neural Network Model ◽

Logistic Regression Model ◽

Artificial Neural

Download Full-text

Risk of Hospitalisation among Patients with Type 2 Diabetes and its Determinants: A Logistic Regression Model

GEDRAG & ORGANISATIE REVIEW ◽

10.37896/gor33.02/046 ◽

2020 ◽

Vol 33 (02) ◽

Author(s):

Subha P P ◽

◽

Roy Scaria ◽

Keyword(s):

Type 2 Diabetes ◽

Logistic Regression ◽

Regression Model ◽

Logistic Regression Model

Download Full-text

Application of monoexponential, biexponential, and stretched-exponential models of diffusion-weighted magnetic resonance imaging in the differential diagnosis of metastases and myeloma in the spine-Univariate and multivariate analysis of related parameters

British Journal of Radiology ◽

10.1259/bjr.20190891 ◽

2020 ◽

Vol 93 (1112) ◽

pp. 20190891

Author(s):

Xiaoying Xing ◽

Jiahui Zhang ◽

Yongye Chen ◽

Qiang Zhao ◽

Ning Lang ◽

...

Keyword(s):

Logistic Regression ◽

Differential Diagnosis ◽

Diffusion Coefficient ◽

Decision Tree ◽

Regression Model ◽

Logistic Regression Model ◽

Decision Tree Model ◽

Tree Model ◽

Stretched Exponential ◽

Exponential Models

Objective: To explore the value of related parameters in monoexponential, biexponential, and stretched-exponential models of diffusion-weighted imaging (DWI) in differentiating metastases and myeloma in the spine. Methods: 53 metastases and 16 myeloma patients underwent MRI with 10 b-values (0–1500 s/mm2). Parameters of apparent diffusion coefficient (ADC), true diffusion coefficient (D), pseudo-diffusion coefficient (D*), perfusion fraction (f), the distribution diffusion coefficient (DDC), and intravoxel water diffusion heterogeneity (α) from DWI were calculated. The independent sample t test and the Mann–Whiney U test were used to compare the statistical difference of the parameter values between the two. Receiver operating characteristics (ROC) curve analysis was used to identify the diagnostic efficacy. Then substituted each parameter into the decision tree model and logistic regression model, identified meaningful parameters, and evaluated their joint diagnostic performance. Results: The ADC, D, and α values of metastases were higher than those of myeloma, whereas the D* value was lower than that of myeloma, and the difference was significant (p < 0.05); the area under the ROC curve for the above parameters was 0.661, 0.710, 0.781, and 0.743, respectively. There was no significant difference in the f and DDC values (p > 0.05). D and α were found to conform to the decision tree model, and the accuracy of model diagnosis was 84.1%. ADC and α were found to conform to the logistic regression model, and the accuracy was 87.0%. Conclusion: The 3 models of DWI have certain values indifferentiating metastases and myeloma in spine, and the diagnostic performance of ADC, D, α and D*was better. Combining ADC with α may markedly aid in the differential diagnosis of the two. Advances in knowledge: Monoexponential, biexponential, and stretched-exponential models can offer additional information in the differential diagnosis of metastases and myeloma in the spine. Decision tree model and logistic regression model are effective methods to help further distinguish the two.

Download Full-text