A Review Study on Data Mining Algorithms for Prediction Diseases

Heart diseases are among the nation’s leading couse of mortality and moribidity. Data mining teqniques can predict the likelihood of patients getting a heart disease. The purpose of this study is comparison of different data mining algorithm on prediction of heart diseases. This work applied and compared data mining techniques to predict the risk of heart diseases. After feature analysis, models by five algorithms including decision tree (C5.0), neural network, support vector machine (SVM), logistic regression and k-nearest neighborhood (KNN) were developed and validated. C5.0 Decision tree has been able to build a model with greatest accuracy 93.02%, KNN, SVM, Neural network have been 88.37%, 86.05% and 80.23% respectively. Produced results of decision tree can be simply interpretable and applicable; their rules can be understood easily by different clinical practitioner.

Download Full-text

Osteoporosis Risk Prediction Using Data Mining Algorithms

Journal of Community Health Research ◽

10.18502/jchr.v9i2.3401 ◽

2020 ◽

Author(s):

Efat Jabarpour ◽

Amin Abedini ◽

Abbasali Keshtkar

Keyword(s):

Data Mining ◽

Personal Information ◽

Disease Diagnosis ◽

Support Vector ◽

Data Mining Algorithms ◽

Industry Standard ◽

Disease Information ◽

Increased Risk ◽

Using Data ◽

Mining Algorithms

Introduction: Osteoporosis is a disease that reduces bone density and loses the quality of bone microstructure leading to an increased risk of fractures. It is one of the major causes of inability and death in elderly people. The current study aims at determining the factors influencing the incidence of osteoporosis and providing a predictive model for the disease diagnosis to increase the diagnostic speed and reduce diagnostic costs. Methods: An Individual's data including personal information, lifestyle, and disease information were reviewed. A new model has been presented based on the Cross-Industry Standard Process CRISP methodology. Besides, Support Vector Machine (SVM) and Bayes methods (Tree Augmented Naïve Bayes (TAN)) and Clementine12 have been used as data mining tools. Results: Some features have been detected to affect this disease. The rules have been extracted that can be used as a pattern for the prediction of the patients' status. Classification precision was calculated to be 88.39% for SVM, and 91.29% for (TAN) when the precision of TAN is higher comparing to other methods. Conclusion: The most effective factors concerning osteoporosis are detected and can be used for a new sample with defined characteristics to predict the possibility of osteoporosis in a person.

Download Full-text

Acoustic Signature Based Weld Quality Monitoring for SMAW Process Using Data Mining Algorithms

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.813-814.1104 ◽

2015 ◽

Vol 813-814 ◽

pp. 1104-1113 ◽

Cited By ~ 5

Author(s):

A. Sumesh ◽

Dinu Thomas Thekkuden ◽

Binoy B. Nair ◽

K. Rameshkumar ◽

K. Mohandas

Keyword(s):

Neural Network ◽

Data Mining ◽

Welding Process ◽

Machine Learning Algorithms ◽

Steel Plates ◽

Support Vector ◽

Welding Parameters ◽

Process Data ◽

Data Mining Algorithms ◽

Mining Algorithms

The quality of weld depends upon welding parameters and exposed environment conditions. Improper selection of welding process parameter is one of the important reasons for the occurrence of weld defect. In this work, arc sound signals are captured during the welding of carbon steel plates. Statistical features of the sound signals are extracted during the welding process. Data mining algorithms such as Naive Bayes, Support Vector Machines and Neural Network were used to classify the weld conditions according to the features of the sound signal. Two weld conditions namely good weld and weld with defects namely lack of fusion, and burn through were considered in this study. Classification efficiencies of machine learning algorithms were compared. Neural network is found to be producing better classification efficiency comparing with other algorithms considered in this study.

Download Full-text

Perbandingan Teknik Klasifikasi Neural Network, Support Vector Machine, dan Naive Bayes dalam Mendeteksi Kanker Payudara

BINA INSANI ICT JOURNAL ◽

10.51211/biict.v7i1.1343 ◽

2020 ◽

Vol 7 (1) ◽

pp. 53

Author(s):

Derisma Derisma ◽

Fajri Febrian

Keyword(s):

Breast Cancer ◽

Neural Network ◽

Data Mining ◽

Support Vector Machine ◽

Naive Bayes ◽

Naïve Bayes ◽

Support Vector ◽

Accuracy Rate ◽

Cancer Disease ◽

Network Support

Abstrak: Kanker payudara merupakan jenis kanker yang sering ditemukan oleh kebanyakan wanita. Di Indonesia Kanker payudara menempati urutan pertama pada pasien rawat inap di seluruh rumah sakit. Tujuan dari penelitian ini adalah melakukan diagnosis penyakit kanker payudara berbasis komputasi yang dapat menghasilkan bagaimana kondisi kanker seseorang berdasarkan akurasi algoritma. Penelitian ini menggunakan pemrograman orange python dan dataset Wisconsin Breast Cancer untuk pemodelan klasifikasi kanker payudara. Metode data mining yang diterapkan yaitu Neural Network, Support Vector Machine, dan Naive Bayes. Dalam penelitian ini didapat algoritma klasifikasi terbaik yaitu algoritma Kernel SVM dengan tingkat akurasi sebesar 98.9 % dan algoritma terendah yaitu Naive Bayes senilai 96.1 %. Kata kunci: kanker payudara, neural network, support vector machine, naive bayes Abstract: Breast cancer is a type of cancer that mostly found in many women. In Indonesia, breast cancer ranks first in hospitalized patients at every hospital. This study aimed to conduct a computation-based diagnose of breast cancer disease that could produce the state of cancer of an individual based on the accuracy of algorithm. This study used python orange programming and Wisconsin Breast Cancer dataset for a modeling and application of breast cancer classification. The data mining methods that were applied in this study were Neural Network, Support Vector Machine, dan Naive Bayes. In this study, Kernel SVM’s algorithm was the best classification algorithm of breast cancer disease with 98.9 % accuracy rate and Naïve Beyes was the lowest with 96.1 % of accuracy rate. Keywords: breast cancer, neural network, support vector machine, naive bayes

Download Full-text

Data mining, fuzzy AHP and TOPSIS for optimizing taxpayer supervision

Indonesian Journal of Electrical Engineering and Computer Science ◽

10.11591/ijeecs.v18.i1.pp75-87 ◽

2020 ◽

Vol 18 (1) ◽

pp. 75

Author(s):

M. Jupri ◽

Riyanarto Sarno

Keyword(s):

Data Mining ◽

Nearest Neighbor ◽

Fuzzy Ahp ◽

Support Vector ◽

K Nearest Neighbor ◽

Data Mining Algorithms ◽

Using Data ◽

Time Required ◽

Mining Algorithms

The achievement of accepting optimal tax need effective and efficient tax supervision can be achieved by classifying taxpayer compliance to tax regulations. Considering this issue, this paper proposes the classification of taxpayer compliance using data mining algorithms; i.e. C4.5, Support Vector Machine, K-Nearest Neighbor, Naive Bayes, and Multilayer Perceptron based on the compliance of taxpayer data. The taxpayer compliance can be classified into four classes, which are (1) formal and material compliant taxpayers, (2) formal compliant taxpayers, (3) material compliant taxpayers, and (4) formal and material non-compliant taxpayers. Furthermore, the results of data mining algorithms are compared by using Fuzzy AHP and TOPSIS to determine the best performance classification based on the criteria of Accuracy, F-Score, and Time required. Selection of the taxpayer's priority for more detailed supervision at each level of taxpayer compliance is ranked using Fuzzy AHP and TOPSIS based on criteria of dataset variables. The results show that C4.5 is the best performance classification and achieves preference value of 0.998; whereas the MLP algorithm results from the lowest preference value of 0.131. Alternative taxpayer A233 is the top priority taxpayer with a preference value of 0.433; whereas alternative taxpayer A051 is the lowest priority taxpayer with a preference value of 0.036.

Download Full-text

A Method for Classification Using Data Mining Technique for Diabetes

Psychology and Mental Health ◽

10.4018/978-1-5225-0159-6.ch030 ◽

2016 ◽

pp. 738-761

Author(s):

Ahmad Al-Khasawneh

Keyword(s):

Data Mining ◽

Support Vector Machine ◽

Classification Accuracy ◽

Health Information System ◽

Parameters Optimization ◽

Support Vector ◽

Data Mining Algorithms ◽

Predictive Data Mining ◽

Severity Of The Disease ◽

Using Data

Many researchers in the health information system field have been attracted to develop computer applications that help in the diagnosis process. Imperatively, data mining algorithms address the vital role in all of these applications. Many contributions were made in this area. There has always been a debate on the algorithm that gives the best classifier, the parameters to be used, the dataset pre-processing steps, etc. In this paper, the author largely emphasizes that the best way to build a predictive model with relatively high classification accuracy is to build several predictive models and to choose the model that gives the best results through parameters optimization. Diagnosing diabetes mellitus has gained considerable attention in the last few decades due to the increased severity of the disease. In this research, the author reviews four predictive data mining approaches that are being used in diagnosing diabetes. Four models were implemented to diagnose diabetes from PIMA dataset; k-nearest neighbour, support vector machine, multilayer perceptron neural network, and naive bayesian network. Giving the highest classification accuracy, support vector machine technique outperformed the others with a value of 78.83%.

Download Full-text

Performance Analysis of Data Mining Algorithms

Journal of Computational and Theoretical Nanoscience ◽

10.1166/jctn.2019.8260 ◽

2019 ◽

Vol 16 (9) ◽

pp. 3849-3853

Author(s):

Dar Masroof Amin ◽

Atul Garg

Keyword(s):

Data Mining ◽

Big Data ◽

Future Trend ◽

Easy Access ◽

Support Vector ◽

Linear Discriminant ◽

Data Mining Algorithms ◽

Data Files ◽

Using Data ◽

Mining Algorithms

The globalisation of Internet is creating enormous amount of data on servers. The data created during last two years is itself equivalent to the data created during all these years. This exponential creation of data is due to the easy access to devices based on Internet of things. This information has become a source of predictive analysis for future happenings. The versatile use of computing devices is creating data of diverse nature and the analysts are predicting the future trend using data of their respective domain. The technology used to analyse the data has become a bottleneck over the time. The main reason behind this is that the rate with which the data is getting created is much more than the technology used to access the same. There are various mining techniques used to explore the useful information. In this research there is detailed analysis of how data is used and perceived by various data mining algorithms. Mining algorithms like Naïve Bayes, Support Vector Machines, Linear Discriminant Analysis Algorithm, Artificial Neural Networks, C4.5, C5.0, K-Nearest Neighbour are analysed. The input data used in these algorithms is big data files. This research mainly focuses on how the existing data algorithms are interacting with big data files. The research has been done on twitter comments.

Download Full-text

Air Temperature Prediction Using Different Datamining Approaches In Sulaymaniyah City In Iraq

10.24271/psr.21 ◽

2021 ◽

Vol 3 (2) ◽

pp. 1-9

Author(s):

Yosra Mohammed ◽

Sherko Murad ◽

Brzu Tahir

Keyword(s):

Climate Change ◽

Data Mining ◽

Support Vector Machine ◽

Air Temperature ◽

Significant Feature ◽

Support Vector ◽

Temperature Prediction ◽

Data Mining Algorithms ◽

Air Temperature Prediction ◽

Mining Algorithms

Climate change has a historical impact at universal and local levels over the past era. Climate change is one of the greatest challenge issues in the globe for meteorological research. Air temperature estimation, in particular, has been measured as a significant feature in weather impression studies on industrial sectors, environmental, ecological, and agricultural. Accurately predicting air temperature guides to measure lifestyle, perform a key character for the government, industries, and public in development activities. In this paper, we investigate the use of various data mining approaches such as Support Vector Machine (SVM), Decision tree (DT), and Naïve Bayes for air temperature prediction within Sulaymaniyah City in Kurdistan, IRAQ. The metrological data is collected from the local Weather Forecast Department in the city within the range 2013 to 2018 inclusive. A dataset for the metrological data was developed and used to train the data mining algorithms. The proposed data mining algorithms were tested on the dataset to predict the air temperature and the performance of these algorithms were compared using standard performance metrics. Support vector machine has accomplished promising performance among using algorithms

Download Full-text

Decision Support System for Diabetes Classification Using Data Mining Techniques

Research Anthology on Decision Support Systems and Decision Management in Healthcare, Business, and Engineering ◽

10.4018/978-1-7998-9023-2.ch053 ◽

2021 ◽

pp. 1091-1113

Author(s):

Ahmad M. Al-Khasawneh

Keyword(s):

Data Mining ◽

Support Vector Machine ◽

Nearest Neighbor ◽

Support Vector ◽

K Nearest Neighbor ◽

Data Mining Algorithms ◽

Use Of Data ◽

Predictive Data Mining ◽

Severity Of The Disease ◽

Using Data

The use of data mining algorithms in health information systems has played a significant role in developing applications that help to diagnose different diseases. The type of the disease determines the selection of the algorithm, parameters to be used, and dataset pre-processing steps, etc. In this chapter, diagnosing diabetes mellitus is the target since it has gained significant attention in the last few decades due to the increased severity of the disease. Four predictive data mining approaches are being used in diagnosing diabetes. Four models were implemented to diagnose diabetes from PIMA dataset: k-nearest neighbor, support vector machine, multilayer perceptron neural network, and naive Bayesian network. Giving the highest classification accuracy, support vector machine technique outperformed the others with a value of 78.83%.

Download Full-text

Decision Support System for Diabetes Classification Using Data Mining Techniques

Advances in Healthcare Information Systems and Administration - Handbook of Research on Emerging Perspectives on Healthcare Information Systems and Informatics ◽

10.4018/978-1-5225-5460-8.ch012 ◽

2018 ◽

pp. 281-303

Author(s):

Ahmad M. Al-Khasawneh

Keyword(s):

Data Mining ◽

Support Vector Machine ◽

Nearest Neighbor ◽

Support Vector ◽

K Nearest Neighbor ◽

Data Mining Algorithms ◽

Use Of Data ◽

Predictive Data Mining ◽

Severity Of The Disease ◽

Using Data

The use of data mining algorithms in health information systems has played a significant role in developing applications that help to diagnose different diseases. The type of the disease determines the selection of the algorithm, parameters to be used, and dataset pre-processing steps, etc. In this chapter, diagnosing diabetes mellitus is the target since it has gained significant attention in the last few decades due to the increased severity of the disease. Four predictive data mining approaches are being used in diagnosing diabetes. Four models were implemented to diagnose diabetes from PIMA dataset: k-nearest neighbor, support vector machine, multilayer perceptron neural network, and naive Bayesian network. Giving the highest classification accuracy, support vector machine technique outperformed the others with a value of 78.83%.

Download Full-text

A Review Study on Data Mining Algorithms for Prediction Diseases

Comparing Performance of Data Mining Algorithms in Prediction Heart Diseases

Osteoporosis Risk Prediction Using Data Mining Algorithms

Acoustic Signature Based Weld Quality Monitoring for SMAW Process Using Data Mining Algorithms

Perbandingan Teknik Klasifikasi Neural Network, Support Vector Machine, dan Naive Bayes dalam Mendeteksi Kanker Payudara

Data mining, fuzzy AHP and TOPSIS for optimizing taxpayer supervision

A Method for Classification Using Data Mining Technique for Diabetes

Performance Analysis of Data Mining Algorithms

Air Temperature Prediction Using Different Datamining Approaches In Sulaymaniyah City In Iraq

Decision Support System for Diabetes Classification Using Data Mining Techniques

Decision Support System for Diabetes Classification Using Data Mining Techniques

Export Citation Format