Machine Learning Based Coronary Artery Disease Prediction

The world health organization (WHO) has assessed that the death of around 12 million people across the globe is observed each year because of diseases related to cardiovascular. The dangers associated with the cardiovascular disease can be identified effectively using machine learning techniques. As per survey, around 30% of the patient suffers no symptoms during heart attacks. But the bloodstream contains unique indications of the attack for days. The medical diagnosis of a patient remains a complex task due to several factors. The accurate medical diagnosis of a patient’s heart disease is critical as it significantly leads to the saving of millions of human lives. In this regard, the automation of the medical diagnosis is significant. The goal of this work is the development of a system for predicting the disease related to coronary artery in a patient with high accuracy utilizing machine learning (ML) techniques. Several algorithms like Naïve Bayes (NB), Support Vector Machine (SVM), and Decision Tree (DT) classifiers were implemented for predicting the disease. Extensive experiments demonstrated that the naïve Bayes achieved higher accuracy than the DT and SVM with regards to accuracy, precision, F-Measure, Recall, and receiver operating characteristic (ROC) performance metrics.

Download Full-text

Classification of Student Performance Dataset using Machine Learning Algorithms

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.b1114.1292s219 ◽

2019 ◽

Vol 9 (2S2) ◽

pp. 752-757

Keyword(s):

Machine Learning ◽

Decision Tree ◽

Student Performance ◽

Performance Metrics ◽

Naive Bayes ◽

Research Work ◽

Naïve Bayes ◽

Machine Learning Algorithms ◽

Machine Learning Techniques ◽

Support Vector

The scope of this research work is to identify the efficient machine learning algorithm for predicting the behavior of a student from the student performance dataset. We applied Support Vector Machines, K-Nearest Neighbor, Decision Tree and Naïve Bayes algorithms to predict the grade of a student and compared their prediction results in terms of various performance metrics. The students who visited many resources for reference, made academic related discussions and interactions in the class room, absent for minimum days, cared by parents care have shown great improvement in the final grade. Among the machine learning techniques we have used, SVM has shown more accuracy in terms of four important attribute. The accuracy rate of SVM after tuning is 0.80. The KNN and decision tree achieves the accuracy of 0.64, 0.65 respectively whereas the Naïve Bayes achieves 0.77.

Download Full-text

Sentiment Analysis using various Machine Learning and Deep Learning Techniques

Journal of the Nigerian Society of Physical Sciences ◽

10.46481/jnsps.2021.308 ◽

2021 ◽

pp. 385-394

Author(s):

V Umarani ◽

A Julian ◽

J Deepa

Keyword(s):

Machine Learning ◽

Deep Learning ◽

Sentiment Analysis ◽

Naive Bayes ◽

Naïve Bayes ◽

Supervised Machine Learning ◽

Machine Learning Techniques ◽

Support Vector ◽

Analysis Process ◽

Learning Techniques

Sentiment analysis has gained a lot of attention from researchers in the last year because it has been widely applied to a variety of application domains such as business, government, education, sports, tourism, biomedicine, and telecommunication services. Sentiment analysis is an automated computational method for studying or evaluating sentiments, feelings, and emotions expressed as comments, feedbacks, or critiques. The sentiment analysis process can be automated using machine learning techniques, which analyses text patterns faster. The supervised machine learning technique is the most used mechanism for sentiment analysis. The proposed work discusses the flow of sentiment analysis process and investigates the common supervised machine learning techniques such as multinomial naive bayes, Bernoulli naive bayes, logistic regression, support vector machine, random forest, K-nearest neighbor, decision tree, and deep learning techniques such as Long Short-Term Memory and Convolution Neural Network. The work examines such learning methods using standard data set and the experimental results of sentiment analysis demonstrate the performance of various classifiers taken in terms of the precision, recall, F1-score, RoC-Curve, accuracy, running time and k fold cross validation and helps in appreciating the novelty of the several deep learning techniques and also giving the user an overview of choosing the right technique for their application.

Download Full-text

Preliminary Screening of COVID-19 Infection Employing Machine Learning Techniques From Simple Blood Profile

International Journal of Quantitative Structure-Property Relationships ◽

10.4018/ijqspr.2021070103 ◽

2021 ◽

Vol 6 (3) ◽

pp. 35-47

Author(s):

Anirudh Reddy Cingireddy ◽

Robin Ghosh ◽

Supratik Kar ◽

Venkata Melapu ◽

Sravanthi Joginipeli ◽

...

Keyword(s):

Machine Learning ◽

Random Forest ◽

Naive Bayes ◽

Albert Einstein ◽

Naïve Bayes ◽

Machine Learning Techniques ◽

Support Vector ◽

Blood Profile ◽

Molecular Tests ◽

Large Populations

Frequent testing of the entire population would help to identify individuals with active COVID-19 and allow us to identify concealed carriers. Molecular tests, antigen tests, and antibody tests are being widely used to confirm COVID-19 in the population. Molecular tests such as the real-time reverse transcription-polymerase chain reaction (rRT-PCR) test will take a minimum of 3 hours to a maximum of 4 days for the results. The authors suggest using machine learning and data mining tools to filter large populations at a preliminary level to overcome this issue. The ML tools could reduce the testing population size by 20 to 30%. In this study, they have used a subset of features from full blood profile which are drawn from patients at Israelita Albert Einstein hospital located in Brazil. They used classification models, namely KNN, logistic regression, XGBooting, naive Bayes, decision tree, random forest, support vector machine, and multilayer perceptron with k-fold cross-validation, to validate the models. Naïve bayes, KNN, and random forest stand out as the most predictive ones with 88% accuracy each.

Download Full-text

Machine learning versus logistic regression methods for 2-year mortality prognostication in a small, heterogeneous glioma database

10.1101/472555 ◽

2018 ◽

Cited By ~ 2

Author(s):

Sandip S Panesar ◽

Rhett N D’Souza ◽

Fang-Cheng Yeh ◽

Juan C Fernandez-Miranda

Keyword(s):

Machine Learning ◽

Logistic Regression ◽

Machine Learning Techniques ◽

World Health ◽

Support Vector ◽

Molecular Characteristics ◽

Regression Methods ◽

Learning Techniques ◽

The World ◽

Health Organization

AbstractBackgroundMachine learning (ML) is the application of specialized algorithms to datasets for trend delineation, categorization or prediction. ML techniques have been traditionally applied to large, highly-dimensional databases. Gliomas are a heterogeneous group of primary brain tumors, traditionally graded using histopathological features. Recently the World Health Organization proposed a novel grading system for gliomas incorporating molecular characteristics. We aimed to study whether ML could achieve accurate prognostication of 2-year mortality in a small, highly-dimensional database of glioma patients.MethodsWe applied three machine learning techniques: artificial neural networks (ANN), decision trees (DT), support vector machine (SVM), and classical logistic regression (LR) to a dataset consisting of 76 glioma patients of all grades. We compared the effect of applying the algorithms to the raw database, versus a database where only statistically significant features were included into the algorithmic inputs (feature selection).ResultsRaw input consisted of 21 variables, and achieved performance of (accuracy/AUC): 70.7%/0.70 for ANN, 68%/0.72 for SVM, 66.7%/0.64 for LR and 65%/0.70 for DT. Feature selected input consisted of 14 variables and achieved performance of 73.4%/0.75 for ANN, 73.3%/0.74 for SVM, 69.3%/0.73 for LR and 65.2%/0.63 for DT.ConclusionsWe demonstrate that these techniques can also be applied to small, yet highly-dimensional datasets. Our ML techniques achieved reasonable performance compared to similar studies in the literature. Though local databases may be small versus larger cancer repositories, we demonstrate that ML techniques can still be applied to their analysis, though traditional statistical methods are of similar benefit.

Download Full-text

Preparedness and Mitigation by projecting the risk against COVID-19 transmission using Machine Learning Techniques

10.1101/2020.04.26.20080655 ◽

2020 ◽

Author(s):

Akshay Kumar ◽

Farhan Mohammad Khan ◽

Rajiv Gupta ◽

Harish Puppala

Keyword(s):

Machine Learning ◽

Gaussian Process Regression ◽

Machine Learning Techniques ◽

World Health ◽

Support Vector ◽

Learning Tools ◽

Learning Techniques ◽

Health Organization ◽

Respiratory Coronavirus ◽

Criticality Index

AbstractThe outbreak of COVID-19 is first identified in China, which later spread to various parts of the globe and was pronounced pandemic by the World Health Organization (WHO). The disease of transmissible person-to-person pneumonia caused by the extreme acute respiratory coronavirus 2 syndrome (SARS-COV-2, also known as COVID-19), has sparked a global warning. Thermal screening, quarantining, and later lockdown were methods employed by various nations to contain the spread of the virus. Though exercising various possible plans to contain the spread help in mitigating the effect of COVID-19, projecting the rise and preparing to face the crisis would help in minimizing the effect. In the scenario, this study attempts to use Machine Learning tools to forecast the possible rise in the number of cases by considering the data of daily new cases. To capture the uncertainty, three different techniques: (i) Decision Tree algorithm, (ii) Support Vector Machine algorithm, and (iii) Gaussian process regression are used to project the data and capture the possible deviation. Based on the projection of new cases, recovered cases, deceased cases, medical facilities, population density, number of tests conducted, and facilities of services, are considered to define the criticality index (CI). CI is used to classify all the districts of the country in the regions of high risk, low risk, and moderate risk. An online dashpot is created, which updates the data on daily bases for the next four weeks. The prospective suggestions of this study would aid in planning the strategies to apply the lockdown/ any other plan for any country, which can take other parameters to define the CI.

Download Full-text

Sentimen Analisis Terkait Lockdown pada Sosial Media Twitter

Indonesian Journal on Software Engineering (IJSE) ◽

10.31294/ijse.v6i2.8991 ◽

2020 ◽

Vol 6 (2) ◽

pp. 223-229

Author(s):

Muhammad Dwison Alizah ◽

Arifin Nugroho ◽

Ummu Radiyah ◽

Windu Gata

Keyword(s):

Support Vector Machine ◽

World Health Organization ◽

Predictive Modeling ◽

Naive Bayes ◽

Naïve Bayes ◽

World Health ◽

Support Vector ◽

The World ◽

Negative Impacts ◽

Health Organization

Abstract: Covid-19 has been set as a Pandemic by the World Health Organization (WHO). The very large impact and the infection that is fast enough are the reasons for making Covid-19 as a pandemic and efforts to overcome. One anticipation that can be done is to do lockdown. Making the decision to carry out a lockdown is intended to reduce the spread that occurs. Lockdown is certainly not a 100% good solution for all of individual. There are individual who agree that the lockdown will be implemented, also there are those who think that the lockdown is better not to be carried out considering the negative impacts that can occur. Therefore in this study will be presented the predictive modeling for sentiment analysis related to "lockdown" specially on social media Twitter. The method used to labeled was using Vader then the tweets are extracted using TF-IDF, and modeling is made for the prediction of sentiment using Naïve Bayes and Support Vector Machine. The results obtained from the two algorithms are more than 80%. Keywords: Covid-19, lockdown, TF-IDF, Naïve Bayes, Support Vector Machine Abstrak: Covid-19 telah ditetapkan sebagia Pandemi oleh World Health Organization (WHO). Dampak yang sangat besar dan penyebaran yang cukup cepat menjadi alsan untuk menjadikan Covid-19 sebagai Pandemi dan perlu dilakukan upaya penanggulangan. Salah satu upaya yang bisa dilakukan adalah dengan melakukan lockdown. Pengambilan keputusan untuk melakukan lockdown diperuntukan guna mengurangi penyebaran yang terjadi. Lockdown tentunya bukanlah solusi yang 100% baik bagi segala pihak. Terdapat pihak - pihak yang menyetujui akan dilaksanakannya lockdown, ada pula yang beranggapan bahwa lockdown lebih baik tidak dilaksanakan dengan pertimbangan dampak negatif yang bisa terjadi. Oleh karena itu, pada penelitian ini akan disampaikan mengenai pembuatan pemodelan prediksi terkait analisa sentimen terkait “Lockdown” yang dikhususkan pada media sosial Twitter. Metode yang digunakan adalah dengan melakukan labeling menggunakan Vader dan selanjutnya tweet tersebut dilakukan ekstraksi menggunakan TF-IDF, dan dibuatkan pemodelan untuk prediksi sentimen menggunakan Naïve Bayes dan Support Vector Machine. Hasil evaluasi yang didapat dari kedua algoritma tersebut ialah mencapai lebih dari 80%. Kata kunci: Covid-19, lockdown, TF-IDF, Naïve Bayes, Support Vector Machine Abstract: Covid-19 has been set as a Pandemic by the World Health Organization (WHO). The very large impact and the infection that is fast enough are the reasons for making Covid-19 as a pandemic and efforts to overcome. One anticipation that can be done is to do lockdown. Making the decision to carry out a lockdown is intended to reduce the spread that occurs. Lockdown is certainly not a 100% good solution for all of individual. There are individual who agree that the lockdown will be implemented, also there are those who think that the lockdown is better not to be carried out considering the negative impacts that can occur. Therefore in this study will be presented the predictive modeling for sentiment analysis related to "lockdown" specially on social media Twitter. The method used to labeled was using Vader then the tweets are extracted using TF-IDF, and modeling is made for the prediction of sentiment using Naïve Bayes and Support Vector Machine. The results obtained from the two algorithms are more than 80%. Keywords:Covid-19, lockdown, TF-IDF, Naïve Bayes, Support Vector Machine Abstrak: Covid-19 telah ditetapkan sebagia Pandemi oleh World Health Organization (WHO). Dampak yang sangat besar dan penyebaran yang cukup cepat menjadi alsan untuk menjadikan Covid-19 sebagai Pandemi dan perlu dilakukan upaya penanggulangan. Salah satu upaya yang bisa dilakukan adalah dengan melakukan lockdown. Pengambilan keputusan untuk melakukan lockdown diperuntukan guna mengurangi penyebaran yang terjadi. Lockdown tentunya bukanlah solusi yang 100% baik bagi segala pihak. Terdapat pihak - pihak yang menyetujui akan dilaksanakannya lockdown, ada pula yang beranggapan bahwa lockdown lebih baik tidak dilaksanakan dengan pertimbangan dampak negatif yang bisa terjadi. Oleh karena itu, pada penelitian ini akan disampaikan mengenai pembuatan pemodelan prediksi terkait analisa sentimen terkait “Lockdown” yang dikhususkan pada media sosial Twitter. Metode yang digunakan adalah dengan melakukan labeling menggunakan Vader dan selanjutnya tweet tersebut dilakukan ekstraksi menggunakan TF-IDF, dan dibuatkan pemodelan untuk prediksi sentimen menggunakan Naïve Bayes dan Support Vector Machine. Hasil evaluasi yang didapat dari kedua algoritma tersebut ialah mencapai lebih dari 80%. Kata kunci: Covid-19, lockdown, TF-IDF, Naïve Bayes, Support Vector Machine

Download Full-text

Analisis Sentimen Opini Terhadap Vaksin Covid - 19 pada Media Sosial Twitter Menggunakan Support Vector Machine dan Naive Bayes

Jurnal Komtika ◽

10.31603/komtika.v5i1.5185 ◽

2021 ◽

Vol 5 (1) ◽

pp. 19-25

Author(s):

Frizka Fitriana ◽

Ema Utami ◽

Hanif Al Fatta

Keyword(s):

Support Vector Machine ◽

Naive Bayes ◽

Naïve Bayes ◽

World Health ◽

Support Vector ◽

Support Vector Machine Algorithm ◽

Bayes Algorithm ◽

Health Organization ◽

The Right ◽

The Impact

The corona virus outbreak, commonly referred to as COVID-19, has been officially designated a global pandemic by the World Health Organization (WHO). To minimize the impact caused by the virus, one of the right steps is to develop a vaccine, however, with the vaccination for the Indonesian people, it is controversial so that it invites many people to give an opinion assessment, but the limited space makes it difficult for the public to express their opinion, because Therefore, people choose social media as a place to channel public opinion. Support vector machine algorithm has better performance in terms of accuracy, precision and recall with values of 90.47%, 90.23%, 90.78% with performance values on the Bayes algorithm, namely 88.64%, 87.32%, 88, 13%, with a difference of 1.83% accuracy, 2.91% precision and 2.65% recall, while for time the Naive Bayes algorithm has a better performance level with a value of 8.1 seconds and the Support vector machine algorithm gets a time speed of 11 seconds with a difference of 2, 9 seconds. With the results of sentiment analysis neutral 8.76%, negative 42.92% and positive 48.32% for Bayes and neutral 10.56%, negative 41.28% and positive 48.16% for SVM.

Download Full-text

On the Analysis of Machine Learning Classifiers to Detect Traffic Congestion in Vehicular Networks

10.5753/eniac.2019.9290 ◽

2019 ◽

Author(s):

Lucas Carvalho ◽

Maycon Silva ◽

Edimilson Santos ◽

Daniel Guidoni

Keyword(s):

Machine Learning ◽

Traffic Congestion ◽

Vehicular Networks ◽

Naive Bayes ◽

Naïve Bayes ◽

Machine Learning Techniques ◽

Support Vector ◽

K Nearest Neighbors ◽

Applied Machine Learning ◽

Routing Methods

Problems related to traffic congestion and management have become common in many cities. Thus, vehicle re-routing methods have been proposed to minimize the congestion. Some of these methods have applied machine learning techniques, more specifically classifiers, to verify road conditions and detect congestion. However, better results may be obtained by applying a classifier more suitable to domain. In this sense, this paper presents an evaluation of different classifiers applied to the identification of the level of road congestion. Our main goal is to analyze the characteristics of each classifier in this task. The classifiers involved in the experiments here are: Multiple Layer Neural Network (MLP), K-Nearest Neighbors (KNN), Decision Trees (J48), Support Vector Machines (SVM), Naive Bayes and Tree Augment Naive Bayes.

Download Full-text

A Comparative Analysis to Visualize the Behavior of Different Machine Learning Algorithms for Normalized and Un-Normalized Data in Predicting Alzheimer’s Disease

Journal of Computational and Theoretical Nanoscience ◽

10.1166/jctn.2019.8259 ◽

2019 ◽

Vol 16 (9) ◽

pp. 3840-3848

Author(s):

Neeraj Kumar ◽

Jatinder Manhas ◽

Vinod Sharma

Keyword(s):

Machine Learning ◽

Naive Bayes ◽

Neurodegenerative Disorder ◽

Learning Algorithms ◽

Naïve Bayes ◽

Machine Learning Algorithms ◽

Machine Learning Techniques ◽

Support Vector ◽

Linear Discriminant ◽

Age Related

Advancement in technology has helped people to live a long and better life. But the increased life expectancy has also elevated the risk of age related disorders, especially the neurodegenerative disorders. Alzheimer’s is one such neurodegenerative disorder, which is also the leading contributor towards dementia in elderly people. Despite of extensive research in this field, scientists have failed to find a cure for the disease till date. This makes early diagnosis of Alzheimer’s very crucial so as to delay its progression and improve the condition of the patient. Various techniques are being employed for diagnosing Alzheimer’s which include neuropsychological tests, medical imaging, blood based biomarkers, etc. Apart from this, various machine learning algorithms have been employed so far to diagnose Alzheimer’s in its early stages. In the current research, authors compared the performance of various machine learning techniques i.e., Linear Discriminant Analysis (LDA), K-Nearest Neighbour (KNN), Naïve Bayes (NB), Support Vector Machines (SVM), Decision Trees (DT), Random Forests (RF) and Multi Layer Perceptron (MLP) on Alzheimer’s dataset. This paper experimentally demonstrated that normalization exhibits a predominant role in enhancing the efficiency of some machine learning algorithms. Therefore it becomes imperative to choose the algorithms as per the available data. In this paper, the efficiency of the given machine learning methods was compared in terms of accuracy and f1-score. Naïve Bayes gave a better overall performance for both accuracy and f1-score and it also remained unaffected with the normalization of data along with LDA, DT and RF. Whereas KNN, SVM and MLP showed a drastic (17% to 86%) improvement in the performance when they are given normalized data as compared to un-normalized data from Alzheimer’s dataset.

Download Full-text

Implementasi Metode Naive Bayes Untuk Memprediksi Resiko Penyakit Jantung

Patria Artha Technological Journal ◽

10.33857/patj.v4i2.351 ◽

2020 ◽

Vol 4 (2) ◽

pp. 66-70

Author(s):

firman Tempola

Keyword(s):

Machine Learning ◽

World Health Organization ◽

Naive Bayes ◽

Naïve Bayes ◽

World Health ◽

Health Organization

Kematian akibat penyakit jantung terus meningkat dan tak mengenal usia muda dan tua. World Health Organization menyebutkan 7,3 juta penduduk dunia meninggal akibat dari penyakit jantung. Bahkan disebutkan penyakit jantung adalah salah penyakit nomor satu paling mematikan. Untuk itu penting diketahui resiko dari pentakit jantung dengan menerapkan model-model yang ada pada machine learning. Tujuan dari penelitian ini yaitu untuk mengimplementasikan metode Naive Bayes untuk memprediksi penyakit jantung, serta dilakukan ujii kinerja algoritma dengan menghitung presisi, recall dan akurasi. Adapun Kriteria-kriteria yang digunakan pada penelitian ini yaitu umur, jenis kelamin, jenis sakit dada, tekanan darah, kolestrol, kadar gula, elektrokardiografi, tekanan jantung, angina induksi, old-peak, segmen_st, Fluoroskopi, denyut jantung. Sedangkan class yang diprediksi ada 2 beresiko dan tidak beresiko. Hasil dalam penelitian ini menunjukan bahwa metode berhasil memprediksi atau mengklasifikasi pasien beresiko penyakit jantung dan tidak beresiko penyakit jantung dengan persentase precision 90%, recall 100% serta mendapatkan akurasi 92.85% dan termaksuk exellent classification.

Download Full-text