scholarly journals Comparative analysis of classification models for diagnosis Type 2 Diabetes

Author(s):  
Daniah Almadni

Diabetes mellitus type 2 has become one of the major causes of premature diseases and death in many countries. It accounts for the majority of diabetes cases around the world. Thus, we need to develop a system that diagnoses type 2 diabetes. In this thesis, a fuzzy expert system is proposed using the Mamdani fuzzy inference system to diagnose type 2 diabetes effectively. In order to evaluate the performance of our system, a comparative study has been initiated, and will contrast the proposed system with data mining algorithms, namely J48 Decision tree, multilayer perceptron, support vector machine, and Naïve Bayes. The developed fuzzy expert system and the data mining algorithms are validated with real data from the UCI machine learning datasets. Moreover, the performance of the fuzzy expert system is evaluated by comparing it to related work that used the Mamdani inference system to diagnose the incidence of type 2 diabetes. Alternate title: Comparative analysis of data mining algorithms for diagnosis Type 2 Diabetes

2021 ◽  
Author(s):  
Daniah Almadni

Diabetes mellitus type 2 has become one of the major causes of premature diseases and death in many countries. It accounts for the majority of diabetes cases around the world. Thus, we need to develop a system that diagnoses type 2 diabetes. In this thesis, a fuzzy expert system is proposed using the Mamdani fuzzy inference system to diagnose type 2 diabetes effectively. In order to evaluate the performance of our system, a comparative study has been initiated, and will contrast the proposed system with data mining algorithms, namely J48 Decision tree, multilayer perceptron, support vector machine, and Naïve Bayes. The developed fuzzy expert system and the data mining algorithms are validated with real data from the UCI machine learning datasets. Moreover, the performance of the fuzzy expert system is evaluated by comparing it to related work that used the Mamdani inference system to diagnose the incidence of type 2 diabetes. Alternate title: Comparative analysis of data mining algorithms for diagnosis Type 2 Diabetes


2018 ◽  
Vol 22 (5) ◽  
pp. 303-311 ◽  
Author(s):  
Habibollah Esmaeily ◽  
Maryam Tayefi ◽  
Majid Ghayour-Mobarhan ◽  
Alireza Amirabadizadeh ◽  
◽  
...  

Author(s):  
Sinan Adnan Diwan Alalwan

<p><span>Diabetes is a fast spreading illness, which makes to worry millions of people around the globe. The people affected by type-2 diabetes are rapidly increasing and there are no effective diagnostic systems to control the diabetics. As per global health statistics, in western countries, population effected by type 2 diabetics are higher in rate and cost factor for treatment is increasing. There are no effective methods to eradicate the diabetes and it leads to carry out an investigative study on this disease. In existing reviews, researchers are using data analysis approaches to link the cause for diabetes with the patients based on the diet, life style, inheritance details, age factor, medical history, etc. to identify the root cause of the problem. By having multiple key factors and historical datasets, there are some data mining tools were developed, to generate new rules on the root cause of the disease and discover new knowledge from the past data’s, but the accuracy was not promising. The main objective of this paper is to carry out a detail literature review and design a conceptual data mining method at initial stage and implement it to improve the result accuracy compared to other classifiers. <br /> In this research, two data-mining algorithm were proposed at conceptual level: Self Organizing Map (SOM) and Random Forest Algorithm, which is applied on adult population datasets. The data set used for this research are from UCI machine Learning Repository: Diabetes Dataset. In this paper, <br /> data mining algorithms were discussed and implementation results were evaluated. Based on the result performance evaluation, Self-organizing maps have performed better compared to the Random Forest and other data mining algorithms such as naïve Bayes, decision tree, SVM and MLP for diagnosing the diabetes with better accuracy. In future, once system is implemented, <br /> it can be integrated with diabetic detector device for faster diagnosis of TYPE 2 diabetes disease.</span></p>


Author(s):  
Sushma Jaiswal ◽  
Tarun Jaiswal

Introduction: The expansion of an actual diabetes judgement structure by the fascinating improvement of computational intellect is observed as a chief objective currently. Numerous tactics based on the artificial network and machine-learning procedures have been established and verified alongside diabetes datasets, which remained typically associated with the entities of Pima Indian derivation. Nevertheless, extraordinary accuracy up to 99-100% in forecasting the precise diabetes judgement, none of these methods has touched scientific presentation so far. Various tools such as Machine Learning (ML) and Data Mining are used for correct identification of diabetes. These tools improve the diagnosis process associated with T2DM. Diabetes mellitus type 2 (DMT2) is a major problem in several developing countries but its early diagnosis can provide enhanced treatment and can save several people life. Accordingly, we have to develop a structure that diagnoses type 2 diabetes. In this paper, a fuzzy expert system is proposed that present the Mamdani fuzzy inference structure (MFIS) to diagnose type 2 diabetes meritoriously. For necessary evaluation of the proposed structure, a proportional revision has been originated, that provide the anticipated structure with Machine Learning algorithms, specifically J48 Decision-tree (DT), multilayer perceptron (MLP), support-vector-machine (SVM), and Naïve- Bayes (NB), fusion and mixed fusion-based methods. The advanced fuzzy expert system (FES) and the machine learning algorithms are authenticated with actual data commencing the UCI machine learning datasets. Furthermore, the concert of the fuzzy expert structure is appraised by equating it to connected work that used the MFIS to detect the occurrence of type 2 diabetes. Objective: This survey paper presents a review of recent advances in the area of machine learning based classification models for diagnosis of diabetes. Methods: This paper presents an extensive work done in the field of machine learning based classification models for diagnosis of type 2 diabetes where modified fusion of machine learning methods are compared to the basic models i.e. Radial basis function, K-nearest neighbor, support vector machine, J48, logistic regression, classification and regression tress etc. based on training and testing. Results: Fig. 3 and Fig. 4 summarizes the result based on prediction accurateness for each classifier of training and testing. Conclusion: The fuzzy expert system is the best among its rival classifiers; SVM performs very poorly with a very low true positive rate, i.e. a very high number of positive cases misclassified as (Non-diabetic) negative. Based on the evaluation it is clear that the fuzzy expert system has the highest precision value. However, J48 is the least accurate classifier. It has the highest number of false positives relative to the other classifiers mentioned in the testing part. The results show that the fuzzy expert system has the uppermost cost for both precision and recall. Thus, it has the uppermost value for F-measure in the training and testing datasets. J48 is considered the second-best classifier for the training dataset, whereas Naïve Bayes comes in the second rank in the testing dataset.


Author(s):  
Efat Jabarpour ◽  
Amin Abedini ◽  
Abbasali Keshtkar

Introduction: Osteoporosis is a disease that reduces bone density and loses the quality of bone microstructure leading to an increased risk of fractures. It is one of the major causes of inability and death in elderly people. The current study aims at determining the factors influencing the incidence of osteoporosis and providing a predictive model for the disease diagnosis to increase the diagnostic speed and reduce diagnostic costs. Methods: An Individual's data including personal information, lifestyle, and disease information were reviewed. A new model has been presented based on the Cross-Industry Standard Process CRISP methodology. Besides, Support Vector Machine (SVM) and Bayes methods (Tree Augmented Naïve Bayes (TAN)) and Clementine12 have been used as data mining tools. Results: Some features have been detected to affect this disease. The rules have been extracted that can be used as a pattern for the prediction of the patients' status. Classification precision was calculated to be 88.39% for SVM, and 91.29% for  (TAN) when the precision of  TAN  is higher comparing to other methods. Conclusion: The most effective factors concerning osteoporosis are detected and can be used for a new sample with defined characteristics to predict the possibility of osteoporosis in a person.  


2021 ◽  
Vol ahead-of-print (ahead-of-print) ◽  
Author(s):  
Thiago Cesar de Oliveira ◽  
Lúcio de Medeiros ◽  
Daniel Henrique Marco Detzel

Purpose Real estate appraisals are becoming an increasingly important means of backing up financial operations based on the values of these kinds of assets. However, in very large databases, there is a reduction in the predictive capacity when traditional methods, such as multiple linear regression (MLR), are used. This paper aims to determine whether in these cases the application of data mining algorithms can achieve superior statistical results. First, real estate appraisal databases from five towns and cities in the State of Paraná, Brazil, were obtained from Caixa Econômica Federal bank. Design/methodology/approach After initial validations, additional databases were generated with both real, transformed and nominal values, in clean and raw data. Each was assisted by the application of a wide range of data mining algorithms (multilayer perceptron, support vector regression, K-star, M5Rules and random forest), either isolated or combined (regression by discretization – logistic, bagging and stacking), with the use of 10-fold cross-validation in Weka software. Findings The results showed more varied incremental statistical results with the use of algorithms than those obtained by MLR, especially when combined algorithms were used. The largest increments were obtained in databases with a large amount of data and in those where minor initial data cleaning was carried out. The paper also conducts a further analysis, including an algorithmic ranking based on the number of significant results obtained. Originality/value The authors did not find similar studies or research studies conducted in Brazil.


2015 ◽  
Vol 813-814 ◽  
pp. 1104-1113 ◽  
Author(s):  
A. Sumesh ◽  
Dinu Thomas Thekkuden ◽  
Binoy B. Nair ◽  
K. Rameshkumar ◽  
K. Mohandas

The quality of weld depends upon welding parameters and exposed environment conditions. Improper selection of welding process parameter is one of the important reasons for the occurrence of weld defect. In this work, arc sound signals are captured during the welding of carbon steel plates. Statistical features of the sound signals are extracted during the welding process. Data mining algorithms such as Naive Bayes, Support Vector Machines and Neural Network were used to classify the weld conditions according to the features of the sound signal. Two weld conditions namely good weld and weld with defects namely lack of fusion, and burn through were considered in this study. Classification efficiencies of machine learning algorithms were compared. Neural network is found to be producing better classification efficiency comparing with other algorithms considered in this study.


Author(s):  
Moloud Abdar ◽  
Sharareh R. Niakan Kalhori ◽  
Tole Sutikno ◽  
Imam Much Ibnu Subroto ◽  
Goli Arji

Heart diseases are among the nation’s leading couse of mortality and moribidity. Data mining teqniques can predict the likelihood of patients getting a heart disease. The purpose of this study is comparison of different data mining algorithm on prediction of heart diseases. This work applied and compared data mining techniques to predict the risk of heart diseases. After feature analysis, models by five algorithms including decision tree (C5.0), neural network, support vector machine (SVM), logistic regression and k-nearest neighborhood (KNN) were developed and validated. C5.0 Decision tree has been able to build a model with greatest accuracy 93.02%, KNN, SVM, Neural network have been 88.37%, 86.05% and 80.23% respectively. Produced results of decision tree can be simply interpretable and applicable; their rules can be understood easily by different clinical practitioner.


Sign in / Sign up

Export Citation Format

Share Document