scholarly journals Chronic Kidney Disease Diagnosis using Decision Tree Algorithms

2020 ◽  
Author(s):  
Hamida Ilyas ◽  
Sajid Ali ◽  
Mahvish Ponum ◽  
Osman Hasan ◽  
Muhammad Tahir Mahmood

Abstract Chronic Kidney Disease (CKD), i.e., gradual decrease in the renal function spanning over a duration of several months to years without any major symptoms, is a life-threatening disease. It progresses in six stages according to the severity level. It is categorized into various stages based on the Glomerular Filtration Rate (GFR), which in turn utilizes several attributes, like age, sex, race and Serum Creatinine. Among multiple available models for estimating GFR value, Chronic Kidney Disease Epidemiology Collaboration (CKD-EPI), which is a linear model, has been found to be quite efficient because it allows detecting all CKD stages i.e., early stage to the last stage of kidney failure. Early detection and cure of CKD is extremely desirable as it can lead to the prevention of unwanted consequences. Machine learning are being extensively advocated for early detection of symptoms and diagnosis of several diseases recently. With the same motivation, the aim of this study is to predict the various stages of CKD using machine learning classification algorithms on the dataset obtained from the medical records of affected people. In particular, we have used the Random Forest and J48 algorithms to obtain a sustainable and practicable model to detect various stages of CKD with comprehensive medical accuracy. Comparative analysis of the results revealed that J48 predicted CKD in all stages better than random forest with a 85.5% accuracy. The study also showed that J48 shows improved performance over Random Forest, so, it may be used to build an automated system for the detection of severity of CKD.

2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Hamida Ilyas ◽  
Sajid Ali ◽  
Mahvish Ponum ◽  
Osman Hasan ◽  
Muhammad Tahir Mahmood ◽  
...  

Abstract Background Chronic Kidney Disease (CKD), i.e., gradual decrease in the renal function spanning over a duration of several months to years without any major symptoms, is a life-threatening disease. It progresses in six stages according to the severity level. It is categorized into various stages based on the Glomerular Filtration Rate (GFR), which in turn utilizes several attributes, like age, sex, race and Serum Creatinine. Among multiple available models for estimating GFR value, Chronic Kidney Disease Epidemiology Collaboration (CKD-EPI), which is a linear model, has been found to be quite efficient because it allows detecting all CKD stages. Methods Early detection and cure of CKD is extremely desirable as it can lead to the prevention of unwanted consequences. Machine learning methods are being extensively advocated for early detection of symptoms and diagnosis of several diseases recently. With the same motivation, the aim of this study is to predict the various stages of CKD using machine learning classification algorithms on the dataset obtained from the medical records of affected people. Specifically, we have used the Random Forest and J48 algorithms to obtain a sustainable and practicable model to detect various stages of CKD with comprehensive medical accuracy. Results Comparative analysis of the results revealed that J48 predicted CKD in all stages better than random forest with an accuracy of 85.5%. The study also showed that J48 shows improved performance over Random Forest. Conclusions The study concluded that it may be used to build an automated system for the detection of severity of CKD.


Author(s):  
Sandhya N. dhage, Dr. Vijay Kumar Garg

Qualitative and quantitative agricultural production leads to economic benefits which can be achieved by periodic monitoring of crop, detection and prevention of crop diseases and insects. Quality of crop production is reduced by pest infection and crop diseases. Existing measures involves manual detection of cotton diseases by farmers and experts which requires  regular monitoring and detection manifest at middle to later stage of infection which causes many disadvantages such as becoming  too late for diseases to be cured.  Lack of early detection of diseases causes the diseases to be spread in nearby crops in the field and also spraying of pesticides is done on entire field for minimizing the infection of disease. The main goal of proposed research topic is to find the solution to the agriculture problem which involves detecting disease in cotton plant at early stage and classify the disease based on symptoms. Early detection of disease at an early stage prevent it from spreading to another area and preventive measures can be taken by farmers by spraying pesticides to control its growth which helps to increase the cotton yield production. Automatic identification of the different diseases affecting cotton crop will give many benefits to the farmers so that time, money will be saved and also gives healthy life to the crop. The contribution of this paper is to present the machine learning approach used for cotton crop disease diagnosis and classification.


Chronic Kidney Disease (CKD) is a worldwide concern that influences roughly 10% of the grown-up population on the world. For most of the people the early diagnosis of CKD is often not possible. Therefore, the utilization of present-day Computer aided supported strategies is important to help the conventional CKD finding framework to be progressively effective and precise. In this project, six modern machine learning techniques namely Multilayer Perceptron Neural Network, Support Vector Machine, Naïve Bayes, K-Nearest Neighbor, Decision Tree, Logistic regression were used and then to enhance the performance of the model Ensemble Algorithms such as ADABoost, Gradient Boosting, Random Forest, Majority Voting, Bagging and Weighted Average were used on the Chronic Kidney Disease dataset from the UCI Repository. The model was tuned finely to get the best hyper parameters to train the model. The performance metrics used to evaluate the model was measured using Accuracy, Precision, Recall, F1-score, Mathew`s Correlation Coefficient and ROC-AUC curve. The experiment was first performed on the individual classifiers and then on the Ensemble classifiers. The ensemble classifier like Random Forest and ADABoost performed better with 100% Accuracy, Precision and Recall when compared to the individual classifiers with 99.16% accuracy, 98.8% Precision and 100% Recall obtained from Decision Tree Algorithm


Author(s):  
Harsh Vardhan Singh

Chronic Kidney Disease (CKD) is a disease which doesn't shows symptoms at all or in some cases it doesn't show any disease specific symptoms it is hard to predict, detect and prevent such a disease and this could be lead to permanently health damage, but machine learning can be hope in this problem it is best in prediction and analysis. The objective of paper is to build the model for predicting the Chronic Kidney Disease using various machine learning classification algorithm. Classification is a powerful machine learning technique that is commonly used for prediction. Some of the classification algorithm are Logistic Regression, Support Vector Machine, Naïve Bayes, Random Forest Classifier, KNN. This paper investigate which algorithm is used for the improving the accuracy in the prediction of Chronic Kidney Disease. And, a comparative analysis on the accuracy and mean squared error is to done for predicting the best model.


Diagnostics ◽  
2022 ◽  
Vol 12 (1) ◽  
pp. 116
Author(s):  
Vijendra Singh ◽  
Vijayan K. Asari ◽  
Rajkumar Rajasekaran

Diabetes and high blood pressure are the primary causes of Chronic Kidney Disease (CKD). Glomerular Filtration Rate (GFR) and kidney damage markers are used by researchers around the world to identify CKD as a condition that leads to reduced renal function over time. A person with CKD has a higher chance of dying young. Doctors face a difficult task in diagnosing the different diseases linked to CKD at an early stage in order to prevent the disease. This research presents a novel deep learning model for the early detection and prediction of CKD. This research objectives to create a deep neural network and compare its performance to that of other contemporary machine learning techniques. In tests, the average of the associated features was used to replace all missing values in the database. After that, the neural network’s optimum parameters were fixed by establishing the parameters and running multiple trials. The foremost important features were selected by Recursive Feature Elimination (RFE). Hemoglobin, Specific Gravity, Serum Creatinine, Red Blood Cell Count, Albumin, Packed Cell Volume, and Hypertension were found as key features in the RFE. Selected features were passed to machine learning models for classification purposes. The proposed Deep neural model outperformed the other four classifiers (Support Vector Machine (SVM), K-Nearest Neighbor (KNN), Logistic regression, Random Forest, and Naive Bayes classifier) by achieving 100% accuracy. The proposed approach could be a useful tool for nephrologists in detecting CKD.


2021 ◽  
Vol 317 ◽  
pp. 05030
Author(s):  
Siti Noor Chotimah ◽  
Budi Warsito ◽  
Bayu Surarso

The number of factors that can be categorized into the diagnosis of Chronic Kidney Disease (CKD) at an early stage makes information about the diagnosis of the disease divided into information that has many influences and has little influence. This study aims to select diagnoses in medical records with the most influential information on chronic kidney disease. The first step is to select a diagnosis with much influence by implementing the Sequential Backward Feature Selection (SBFS). This algorithm eliminates features that are considered to have little influence when compared to other features. In the second step, the features of the best diagnoses are used as input to the Artificial Neural Network (ANN) classification algorithm. The results obtained from this study are information in the form of the best diagnoses that have much influence on chronic kidney disease and the accuracy results based on the selected diagnoses. Based on the study results, 15 features are considered the best of the 18 features used to achieve 88% accuracy results. Compared with conventional methods, this method still requires consideration from the medical staff because it is not a final diagnosis for patients.


Sign in / Sign up

Export Citation Format

Share Document