An Ontology Driven System to Predict Diabetes with Machine Learning Techniques


Diabetes Prediction Using Machine Learning Techniques

Journal of Intelligent Systems with Applications ◽

10.54856/jiswa.202112183 ◽

2021 ◽

pp. 150-152

Author(s):

Seyma Kiziltas Koc ◽

Mustafa Yeniad

Keyword(s):

Machine Learning ◽

Support Vector Machine ◽

High Performance ◽

Nearest Neighbor ◽

Classification Performance ◽

Machine Learning Techniques ◽

Support Vector ◽

Classification Algorithms ◽

K Nearest Neighbor ◽

Machine Learning Classification

Technologies used in the healthcare industry are changing rapidly as technology continually evolves to improve people's lives. For instance, a variety of technological devices are used in the diagnosis and treatment of diseases, and advances in technology have shown that computer systems can diagnose disease. Machine learning algorithms are widely used in healthcare, as in many other fields, because of their high performance. The aim of this study is to investigate different machine learning classification algorithms that can be used in the diagnosis of diabetes and to compare them according to metrics used in the literature. Seven classification algorithms from the literature were used: Logistic Regression, K-Nearest Neighbor, Multilayer Perceptron, Random Forest, Decision Trees, Support Vector Machine and Naive Bayes. The classification performance of the algorithms was compared on the basis of accuracy, sensitivity, precision, and F1-score. The results showed that the support vector machine algorithm had the highest accuracy, at 78.65%.
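The four comparison metrics named in this abstract can be computed from the counts of true and false positives and negatives. The following is a minimal sketch and not the authors' code; the function name `classification_metrics` and the toy labels are assumptions made for illustration only.

```python
def classification_metrics(y_true, y_pred, positive=1):
    """Return accuracy, sensitivity, precision and F1 for binary labels.

    Hypothetical helper for illustration; it is not taken from the paper.
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)

    accuracy = (tp + tn) / len(pairs) if pairs else 0.0
    # Sensitivity (recall): share of actual positives that were detected.
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    # Precision: share of predicted positives that are actually positive.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    # F1: harmonic mean of precision and sensitivity.
    denom = precision + sensitivity
    f1 = 2 * precision * sensitivity / denom if denom else 0.0
    return {
        "accuracy": accuracy,
        "sensitivity": sensitivity,
        "precision": precision,
        "f1": f1,
    }


# Toy labels (made up): 1 = diabetic, 0 = non-diabetic.
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
y_pred = [1, 1, 0, 0, 0, 1, 0, 1]
metrics = classification_metrics(y_true, y_pred)
print(metrics)
```

Running the same function on each classifier's predictions over a shared test set would give a like-for-like comparison of the kind the abstract describes.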

A review of machine learning techniques using decision tree and support vector machine

2016 International Conference on Computing Communication Control and automation (ICCUBEA) ◽

10.1109/iccubea.2016.7860040 ◽

2016 ◽

Cited By ~ 14

Author(s):

Madan Somvanshi ◽

Pranjali Chavan ◽

Shital Tambade ◽

S. V. Shinde

Keyword(s):

Machine Learning ◽

Support Vector Machine ◽

Decision Tree ◽

Machine Learning Techniques ◽

Support Vector ◽

Learning Techniques

Spatial–Temporal Analysis of Land Cover Change at the Bento Rodrigues Dam Disaster Area Using Machine Learning Techniques

Remote Sensing ◽

10.3390/rs11212548 ◽

2019 ◽

Vol 11 (21) ◽

pp. 2548

Author(s):

Dong Luo ◽

Douglas G. Goodin ◽

Marcellus M. Caldas

Keyword(s):

Machine Learning ◽

Support Vector Machine ◽

Land Cover ◽

Decision Tree ◽

Machine Learning Algorithms ◽

Training Data ◽

Machine Learning Techniques ◽

Support Vector ◽

Disaster Area ◽

Mine Sites

Disasters change land use and land cover in unpredictable ways. Improving the accuracy of mapping a disaster area at different times is an essential step in analyzing the relationship between human activity and the environment. The goals of this study were to test the performance of different processing procedures and to examine the effect of adding the normalized difference vegetation index (NDVI) as an additional classification feature for mapping land cover changes due to a disaster. Using Landsat ETM+ and OLI images of the Bento Rodrigues mine tailing disaster area, we created two datasets, one with six bands and the other with six bands plus the NDVI. We used support vector machine (SVM) and decision tree (DT) algorithms to build classifier models and validated model performance using 10-fold cross-validation, resulting in accuracies higher than 90%. The processed results indicated that the accuracy could reach or exceed 80%, and that the support vector machine performed better than the decision tree. We also calculated each land cover type's sensitivity (true positive rate) and found that Agriculture, Forest and Mine sites had higher values, while Bareland and Water had lower values. We then visualized land cover maps for 2000 and 2017 and found that the Mine sites area had expanded to about twice its size, while Forest had decreased by 12.43%. Our findings showed that it is feasible to create a training data pool and use machine learning algorithms to classify a different year's Landsat products, and that NDVI can improve the classification of vegetation-covered land. Furthermore, this approach can provide a means of analyzing land pattern change in a disaster area over time.
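The "six bands plus the NDVI" dataset can be pictured as appending one derived value to each pixel's feature vector, using the standard definition NDVI = (NIR - Red) / (NIR + Red). This is a rough sketch only: the band ordering, the index positions and the reflectance values below are assumptions, not details from the study.

```python
def ndvi(nir, red):
    """Normalized difference vegetation index for one pixel."""
    total = nir + red
    # Guard against division by zero for pixels with no signal.
    return (nir - red) / total if total != 0 else 0.0


def add_ndvi_feature(bands, red_index, nir_index):
    """Return a new feature vector: the original bands plus NDVI.

    `red_index` and `nir_index` are hypothetical positions of the red and
    near-infrared bands within `bands`; they depend on how the bands are
    stacked and are not specified in the abstract.
    """
    return list(bands) + [ndvi(bands[nir_index], bands[red_index])]


# One made-up pixel with six reflectance values.
pixel = [0.1, 0.2, 0.1, 0.5, 0.3, 0.2]
features = add_ndvi_feature(pixel, red_index=2, nir_index=3)
print(len(features), round(features[-1], 4))
```

Densely vegetated pixels reflect strongly in the near infrared and weakly in red, so their NDVI is close to 1, which is one plausible reason the extra feature helps separate vegetation-covered classes.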

Prediction of CoVid-19 mortality in Iraq-Kurdistan by using Machine learning

UHD Journal of Science and Technology ◽

10.21928/uhdjst.v5n1y2021.pp66-70 ◽

2021 ◽

Vol 5 (1) ◽

pp. 66-70

Author(s):

Ardalan Husin Awlla ◽

Brzu T. Muhammed ◽

Sherko H. Murad ◽

Sabah N. Ahmad

Keyword(s):

Machine Learning ◽

Support Vector Machine ◽

Mortality Rate ◽

Decision Tree ◽

Data Analytics ◽

Naive Bayes ◽

Major Effect ◽

Support Vector ◽

Classification Algorithms ◽

Patient Death

This research analyzed different aspects of coronavirus disease (COVID-19) in infected patients to find out which aspects affect patient death. First, a literature review was conducted of previous research that analyzed coronavirus datasets using machine learning (ML) algorithms. Second, data analytics is applied to a dataset from Sulaymaniyah, Iraq, to find factors that affect the mortality rate of coronavirus patients. Third, classification algorithms are used on a dataset of 1365 samples provided by hospitals in Sulaymaniyah, Iraq, to diagnose COVID-19. Using ML algorithms allowed us to find the mortality rate of this disease and to detect which factor has a major effect on patient death. It is shown here that support vector machine (SVM), decision tree (DT), and naive Bayes algorithms can classify COVID-19 patients, and that DT is the best among them, with an accuracy of 96.7%.

Using Machine Learning to Perform Proximity Detection - Classifying Bluetooth Beacon RSSI Values

10.20944/preprints202009.0508.v1 ◽

2020 ◽

Author(s):

Karen Song

Keyword(s):

Machine Learning ◽

Support Vector Machine ◽

Smart Phones ◽

Support Vector ◽

Classification Algorithms ◽

Decision Tree Classifier ◽

Machine Learning Classification ◽

Proximity Detection ◽

Tree Classifier ◽

Testing Accuracy

This project focuses on using machine learning classification algorithms to determine whether or not two people are 6 feet apart. Two Raspberry Pis were used to simulate smart phones. RSSI values of the Bluetooth beacons transmitted between the Raspberry Pis were collected and recorded to train the classifier. The Gaussian Support Vector Machine Classifier yielded the highest testing accuracy, 79.670, and the Decision Tree Classifier yielded the highest AUC, 0.80.

Using Machine Learning to Perform Proximity Detection - Classifying Bluetooth Beacon RSSI Values

10.20944/preprints202009.0508.v2 ◽

2020 ◽

Author(s):

Karen Song

Keyword(s):

Machine Learning ◽

Support Vector Machine ◽

Smart Phones ◽

Support Vector ◽

Classification Algorithms ◽

Decision Tree Classifier ◽

Machine Learning Classification ◽

Proximity Detection ◽

Tree Classifier ◽

Testing Accuracy

This project focuses on using machine learning classification algorithms to determine whether or not two people are 6 feet apart. Two Raspberry Pis were used to simulate smart phones. RSSI values of the Bluetooth beacons transmitted between the Raspberry Pis were collected and recorded to train the classifier. The Gaussian Support Vector Machine Classifier yielded the highest testing accuracy, 79.670, and the Decision Tree Classifier yielded the highest AUC, 0.80.

COVID-19 World Vaccination Progress Using Machine Learning Classification Algorithms

Qubahan Academic Journal ◽

10.48161/qaj.v1n2a53 ◽

2021 ◽

Vol 1 (2) ◽

pp. 100-105

Author(s):

Nasiba M. Abdulkareem ◽

Adnan Mohsin Abdulazeez ◽

Diyar Qader Zeebaree ◽

Dathar A. Hasan

Keyword(s):

Machine Learning ◽

Decision Tree ◽

Vaccine Development ◽

Machine Learning Techniques ◽

Classification Algorithms ◽

Real World Data ◽

K Nearest Neighbors ◽

Machine Learning Classification ◽

Learning Techniques ◽

And Performance

In December 2019, coronavirus disease (COVID-19), caused by SARS-CoV-2, spread to all countries, infecting thousands of people and causing deaths. COVID-19 induces mild sickness in most cases, although it may make some people very ill. Vaccines are therefore in various phases of clinical progress, and some have been approved for national use. The current situation reveals a critical need for a quick and timely solution to COVID-19 vaccine development. Non-clinical methods such as data mining and machine learning techniques may help with this. This study focuses on COVID-19 world vaccination progress using machine learning classification algorithms. The findings of the paper show which algorithm is better for a given dataset. Weka is used to run tests on real-world data, and four classification algorithms (Decision Tree, K-nearest neighbors, Random Tree, and Naive Bayes) are used to analyze the data and draw conclusions. The comparison is based on accuracy and performance time, and it was found that the Decision Tree outperforms the other algorithms in terms of both time and accuracy.

Predicting Absenteeism at Work Using Machine Learning Algorithms

Muthanna Journal of Pure Science ◽

10.52113/2/07.01.2020/1-12 ◽

2019 ◽

Vol 7 (1) ◽

pp. 1-12

Author(s):

Samir Qaisar Ajmi

Keyword(s):

Machine Learning ◽

Support Vector Machine ◽

Decision Tree ◽

Machine Learning Algorithms ◽

Machine Learning Techniques ◽

Support Vector ◽

Tree Model ◽

Learning Techniques ◽

Business Market ◽

Commercial Environment

To work in the commercial environment, a company needs to be a major competitor in the business market, which depends mainly on the company's resources. One of the most important resources is its employees. Consequently, the absence of employees from work leads to deterioration and reduced production in institutions, which in turn leads to heavy losses. There are many reasons why employees are absent from work, including health problems and social occasions. The purpose of this paper was to apply machine learning techniques to predict absenteeism at work. Four methods were used in this research: neural network (NN), decision tree (DT), support vector machine (SVM) and logistic regression (LR). The decision tree model had the highest accuracy, 83.33% with an AUC of 0.834, and the support vector machine had the lowest accuracy, 68.47% with an AUC of 0.760.

Prototype Classification: Insights from Machine Learning

Neural Computation ◽

10.1162/neco.2009.01-07-443 ◽

2009 ◽

Vol 21 (1) ◽

pp. 272-300 ◽

Cited By ~ 8

Author(s):

Arnulf B. A. Graf ◽

Olivier Bousquet ◽

Gunnar Rätsch ◽

Bernhard Schölkopf

Keyword(s):

Machine Learning ◽

Support Vector Machine ◽

Relevance Vector Machine ◽

Support Vector ◽

Classification Algorithms ◽

Normal Vector ◽

Machine Learning Classification ◽

Formal Framework ◽

Soft Margin ◽

Prototype Classification

We shed light on the discrimination between patterns belonging to two different classes by casting this decoding problem into a generalized prototype framework. The discrimination process is then separated into two stages: a projection stage that reduces the dimensionality of the data by projecting it on a line and a threshold stage where the distributions of the projected patterns of both classes are separated. For this, we extend the popular mean-of-class prototype classification using algorithms from machine learning that satisfy a set of invariance properties. We report a simple yet general approach to express different types of linear classification algorithms in an identical and easy-to-visualize formal framework using generalized prototypes where these prototypes are used to express the normal vector and offset of the hyperplane. We investigate non-margin classifiers such as the classical prototype classifier, the Fisher classifier, and the relevance vector machine. We then study hard and soft margin classifiers such as the support vector machine and a boosted version of the prototype classifier. Subsequently, we relate mean-of-class prototype classification to other classification algorithms by showing that the prototype classifier is a limit of any soft margin classifier and that boosting a prototype classifier yields the support vector machine. While giving novel insights into classification per se by presenting a common and unified formalism, our generalized prototype framework also provides an efficient visualization and a principled comparison of machine learning classification.
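The two-stage view in this abstract (project onto a line, then threshold) can be illustrated with the classical mean-of-class prototype classifier, where the normal vector is the difference of the two class means and the threshold sits at their midpoint. The sketch below covers only that simplest case, not the paper's generalized framework; the function names and toy points are assumptions.

```python
def mean_vector(points):
    """Component-wise mean of a non-empty list of equal-length vectors."""
    n = len(points)
    return [sum(col) / n for col in zip(*points)]


def fit_prototype_classifier(pos_points, neg_points):
    """Return (w, b) for the mean-of-class prototype classifier.

    w is the difference of the class means (the normal vector of the
    separating hyperplane); b places the threshold at the midpoint
    between the two prototypes.
    """
    mu_pos = mean_vector(pos_points)
    mu_neg = mean_vector(neg_points)
    w = [p - n for p, n in zip(mu_pos, mu_neg)]
    midpoint = [(p + n) / 2 for p, n in zip(mu_pos, mu_neg)]
    b = -sum(wi * mi for wi, mi in zip(w, midpoint))
    return w, b


def predict(w, b, x):
    """Stage 1: project x onto w. Stage 2: threshold the projection."""
    projection = sum(wi * xi for wi, xi in zip(w, x))
    return 1 if projection + b > 0 else -1


# Toy two-dimensional data (made up for illustration).
pos = [[2.0, 2.0], [3.0, 3.0]]
neg = [[0.0, 0.0], [-1.0, -1.0]]
w, b = fit_prototype_classifier(pos, neg)
print(w, b, predict(w, b, [2.0, 2.0]), predict(w, b, [0.0, 0.0]))
```

Other linear classifiers mentioned in the abstract differ in how the normal vector and offset are chosen, but the prediction step keeps this same project-then-threshold shape.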
