scholarly journals Application of three statistical models for predicting the risk of diabetes

2019 ◽  
Vol 19 (1) ◽  
Author(s):  
Siyu Liu ◽  
Yue Gao ◽  
Yuhang Shen ◽  
Min Zhang ◽  
Jingjing Li ◽  
...  

Abstract Background At present, the proportion of undiagnosed diabetes in Chinese adults is as high as 15.5%. People with diabetes who are not treated and controlled in time may have various complications, such as cardiovascular and cerebrovascular diseases and diabetic foot disorders, which not only seriously affect the quality of life of people with diabetes but also impose a heavy burden on families and society. Therefore, prevention and control of type 2 diabetes is of great significance. Methods We constructed a logistic regression model, a neural network model and a decision tree model to analyse the risk factors for type 2 diabetes and then compared the prediction accuracy of the different models by calculating the area under the relative operating characteristic (ROC) curve and back-inputting the data into the model. Results The prevalence of type 2 diabetes in 4177 subjects who were not diagnosed with type 2 diabetes was 9.31%. The most influential factors associated with type 2 diabetes were triglyceride (TG) ≥ 1.17 mmol/L (odds ratio (OR) =2.233), age ≥ 70 years (OR = 1.734), hypertension (OR = 1.703), alcohol consumption (OR = 1.674), and total cholesterol≥5.2 mmol/L (TC) (OR = 1.463). The prediction accuracies of the three prediction models were 90.8, 91.2, and 90.7%, respectively, and the areas under curve (AUCs) were 0.711, 0.780, and 0.698, respectively. The differences in the AUCs after back propagation (BP) of the neural network model, logistic regression model and decision tree model were statistically significant (P < 0.05). Conclusion BP neural networks have a higher predictive power for identifying the associated risk factors of type 2 diabetes than the other two models, but it is necessary to select a suitable model for specific situations.

2021 ◽  
Author(s):  
Li Lu Wei ◽  
Yu jian

Abstract Background Hypertension is a common chronic disease in the world, and it is also a common basic disease of cardiovascular and brain complications. Overweight and obesity are the high risk factors of hypertension. In this study, three statistical methods, classification tree model, logistic regression model and BP neural network, were used to screen the risk factors of hypertension in overweight and obese population, and the interaction of risk factors was conducted Analysis, for the early detection of hypertension, early diagnosis and treatment, reduce the risk of hypertension complications, have a certain clinical significance.Methods The classification tree model, logistic regression model and BP neural network model were used to screen the risk factors of hypertension in overweight and obese people.The specificity, sensitivity and accuracy of the three models were evaluated by receiver operating characteristic curve (ROC). Finally, the classification tree CRT model was used to screen the related risk factors of overweight and obesity hypertension, and the non conditional logistic regression multiplication model was used to quantitatively analyze the interaction.Results The Youden index of ROC curve of classification tree model, logistic regression model and BP neural network model were 39.20%,37.02% ,34.85%, the sensitivity was 61.63%, 76.59%, 82.85%, the specificity was 77.58%, 60.44%, 52.00%, and the area under curve (AUC) was 0.721, 0.734,0.733, respectively. There was no significant difference in AUC between the three models (P>0.05). Classification tree CRT model and logistic regression multiplication model suggested that the interaction between NAFLD and FPG was closely related to the prevalence of overweight and obese hypertension.Conclusion NAFLD,FPG,age,TG,UA, LDL-C were the risk factors of hypertension in overweight and obese people. The interaction between NAFLD and FPG increased the risk of hypertension.


2021 ◽  
Vol 10 (44) ◽  
pp. 3736-3741
Author(s):  
Soraya Siabani ◽  
Leila Solouki ◽  
Mehdi Moradinazar ◽  
Farid Najafi ◽  
Ebrahim Shakiba

BACKGROUND Given the global burden of COVID-19 mortality, this study intended to determine the factors affecting mortality in patients with COVID-19 using decision tree analysis and logistic regression model in Kermanshah province, 2020. METHODS This cross-sectional study was conducted on 7799 patients with COVID-19 admitted to the hospitals of Kermanshah province. Data gathered from February 18 to July 9, 2020, were obtained from the vice-chancellor for the health of Kermanshah University of Medical Sciences. The performance of the models was compared according to the sensitivity, specificity, and area under the receiver operating characteristic (ROC) curve. RESULTS According to the decision tree model, the most important risk factors for death due to COVID-19 were age, body temperature, admission to intensive care unit (ICU), prior hospital visit within the last 14 days, and cardiovascular disease. Also, the multivariate logistic regression model showed that the variables of age [OR = 4.47, 95 % CI: (3.16 -6.32)], shortness of breath [OR = 1.42, 95 % CI: (1.0-2.01)], ICU admission [OR = 3.75, 95 % CI: (2.47-5.68)], abnormal chest X-ray [OR = 1.93, 95 % CI: (1.06-3.41)], liver disease [OR = 5.05, 95 % CI (1.020-25.2)], body temperature [OR = 4.93, 95 % CI: (2.17-6.25)], and cardiovascular disease [OR = 2.15, 95 % CI: (1.27-3.06)] were significantly associated with the higher mortality of patients with COVID-19. The area under the ROC curve for the decision tree model and logistic regression was 0.77 and 0.75, respectively. CONCLUSIONS Identifying risk factors for mortality in patients with COVID-19 can provide more effective interventions in the early stages of treatment and improve the medical approaches provided by the medical staff. KEY WORDS COVID-19, Decision Tree, Logistic Regression, Mortality, Risk Factor


2020 ◽  
Vol 93 (1112) ◽  
pp. 20190891
Author(s):  
Xiaoying Xing ◽  
Jiahui Zhang ◽  
Yongye Chen ◽  
Qiang Zhao ◽  
Ning Lang ◽  
...  

Objective: To explore the value of related parameters in monoexponential, biexponential, and stretched-exponential models of diffusion-weighted imaging (DWI) in differentiating metastases and myeloma in the spine. Methods: 53 metastases and 16 myeloma patients underwent MRI with 10 b-values (0–1500 s/mm2). Parameters of apparent diffusion coefficient (ADC), true diffusion coefficient (D), pseudo-diffusion coefficient (D*), perfusion fraction (f), the distribution diffusion coefficient (DDC), and intravoxel water diffusion heterogeneity (α) from DWI were calculated. The independent sample t test and the Mann–Whiney U test were used to compare the statistical difference of the parameter values between the two. Receiver operating characteristics (ROC) curve analysis was used to identify the diagnostic efficacy. Then substituted each parameter into the decision tree model and logistic regression model, identified meaningful parameters, and evaluated their joint diagnostic performance. Results: The ADC, D, and α values of metastases were higher than those of myeloma, whereas the D* value was lower than that of myeloma, and the difference was significant (p < 0.05); the area under the ROC curve for the above parameters was 0.661, 0.710, 0.781, and 0.743, respectively. There was no significant difference in the f and DDC values (p > 0.05). D and α were found to conform to the decision tree model, and the accuracy of model diagnosis was 84.1%. ADC and α were found to conform to the logistic regression model, and the accuracy was 87.0%. Conclusion: The 3 models of DWI have certain values indifferentiating metastases and myeloma in spine, and the diagnostic performance of ADC, D, α and D*was better. Combining ADC with α may markedly aid in the differential diagnosis of the two. Advances in knowledge: Monoexponential, biexponential, and stretched-exponential models can offer additional information in the differential diagnosis of metastases and myeloma in the spine. Decision tree model and logistic regression model are effective methods to help further distinguish the two.


Sign in / Sign up

Export Citation Format

Share Document