Diabetes Prediction using Logistic Regression and Feature Normalization

Author(s):  
V Ganesh ◽  
Johnson Kolluri ◽  
K. Vinay Kumar

in an event when there is lots of risk factor then the logistic regression is used for predicting the probability. For binary and ordinal data the medical researcher increase the use of logistic analysis. Several classification problems like spam detection used logistic regression. If a customer purchases a specific product in Diabetes prediction or they will inspire with any other competitor, whether customer click on given advertisement link or not are some example. For two class classification the Logistic Regression is one of the most simple and common machine Learning algorithms. For any binary classification problem it is very easy to use as a basic approach. Deep learning is also its fundamental concept. The relationship measurement and description between dependent binary variable and independent variables can be done by logistic regression.


2020 ◽  
Vol 8 (6) ◽  
pp. 3034-3039

Nowadays, a lot of research is going on in healthcare. One of the significant diseases increased all over the world is Diabetes Mellitus (DM). In this paper, the literature review is done on diabetes prediction using Machine Learning and Deep Learning techniques. Various ML algorithms are used using PIDD (Pima Indian diabetes dataset), and improved k- means using logistic regression among all algorithms achieved the highest accuracy. DL algorithms like CNN and LMST used in diabetic retinopathy images.


The state or disorder where the body cannot effectively use the insulin is called Diabetes. If the insulin levels are not maintained properly, the diabetes is one such disorder where it damages all other body parts. It is estimated that the diabetes is the 7th leading cause of deaths as per World Health Organisation report. Early recognition of diabetes, decreases the risk of serious ailments, which includes, heart diseases, brain stroke, eye related diseases, kidney diseases, nerve related diseases etc. In the present work, pima indians diabetes data set is considered as the best dataset and different models viz., hierarchical clustering with decision tree, hierarchical clustering with support vector machines, hierarchical clustering with logistic regression and k means with logistic regression are developed and implemented for identifying and predicting the diabetes. The accuracies of these prediction models range between 0.90 and 0.946. An Improved Diabetes Prediction Algorithm (IDPA) combining the hierarchical clustering algorithm and Naïve Bayes classification algorithm is developed to identify and predict the Type-II diabetes and has shown an accuracy of 0.96. In this IDPA, firstly, the grouping of data into two groups i.e. diabetes and non-diabetes is done by applying the hierarchical clustering algorithm. Then, the filtering is done by comparing the group value to the class value followed by applying Naïve Bayes classification algorithm for predicting diabetes. The results show that the proposed novel method i.e. IDPA can predict the diabetes with higher accuracy levels (0.96) than the traditional/existing methods and other methods which were implemented. This model can be used to predict diabetes early, thereby reducing the serious complications of diabetes.


Author(s):  
Li-Ying Lang ◽  
Zheng Gao ◽  
Xue-Guang Wang ◽  
Hui Zhao ◽  
Yan-Ping Zhang ◽  
...  

Diabetes is a disease that seriously endangers human health. Early detection and early treatment can reduce the likelihood of complications and mortality. The predictive model can effectively solve the above problems and provide helpful information for the clinic. Based on this, it is proposed to apply the idea of integrated algorithm in DBN algorithm, collect the hospital data by investigating its related factors, clean and process the collected data, and sample and model the processed data multiple times. It is shown that a single DBN classifier is better than support vector machine and logistic regression algorithm. The model established by the integrated deep confidence network has a significant improvement in classification accuracy compared to a single DBN classifier, and solves the unstable classification effect of a single DBN classifier.


2007 ◽  
Vol 23 (3) ◽  
pp. 157-165 ◽  
Author(s):  
Carmen Hagemeister

Abstract. When concentration tests are completed repeatedly, reaction time and error rate decrease considerably, but the underlying ability does not improve. In order to overcome this validity problem this study aimed to test if the practice effect between tests and within tests can be useful in determining whether persons have already completed this test. The power law of practice postulates that practice effects are greater in unpracticed than in practiced persons. Two experiments were carried out in which the participants completed the same tests at the beginning and at the end of two test sessions set about 3 days apart. In both experiments, the logistic regression could indeed classify persons according to previous practice through the practice effect between the tests at the beginning and at the end of the session, and, less well but still significantly, through the practice effect within the first test of the session. Further analyses showed that the practice effects correlated more highly with the initial performance than was to be expected for mathematical reasons; typically persons with long reaction times have larger practice effects. Thus, small practice effects alone do not allow one to conclude that a person has worked on the test before.


2012 ◽  
Vol 2 (2) ◽  
pp. 72-81
Author(s):  
Christina M. Rudin-Brown ◽  
Eve Mitsopoulos-Rubens ◽  
Michael G. Lenné

Random testing for alcohol and other drugs (AODs) in individuals who perform safety-sensitive activities as part of their aviation role was introduced in Australia in April 2009. One year later, an online survey (N = 2,226) was conducted to investigate attitudes, behaviors, and knowledge regarding random testing and to gauge perceptions regarding its effectiveness. Private, recreational, and student pilots were less likely than industry personnel to report being aware of the requirement (86.5% versus 97.1%), to have undergone testing (76.5% versus 96.1%), and to know of others who had undergone testing (39.9% versus 84.3%), and they had more positive attitudes toward random testing than industry personnel. However, logistic regression analyses indicated that random testing is more effective at deterring AOD use among industry personnel.


2001 ◽  
Vol 6 (1) ◽  
pp. 35-48 ◽  
Author(s):  
Michaela Kiernan ◽  
Helena C. Kraemer ◽  
Marilyn A. Winkleby ◽  
Abby C. King ◽  
C. Barr Taylor

Sign in / Sign up

Export Citation Format

Share Document