Comparative Analysis of Machine Learning
Algorithms with and without Feature Extraction

Image recognition is one of the core disciplines in Computer Vision. It is one of the most widely researched topics of the last few decades. Many advances in image recognition in the past decade, has made it one of the most efficient and powerful disciplines of all, having its applications in every sector including Finance, Healthcare, Security services, Agriculture and many more. Feature extraction is an integral part of image recognition. It helps in training the model more efficiently and with a higher accuracy, by getting rid of any unwanted or unnecessary features, thus reducing the dimensionality of the input image. This also helps in reducing the computational resources required by the algorithm to train, thus making it affordable for people with low end setups. Here we compare the accuracies of different machine learning classification algorithms, and their training times, with and without using feature Extraction. For the purpose of extracting features, a convolutional neural network was used. The model was trained and tested on the data of 12 classes containing a total of 2,175 images. For comparisons, we chose the Logistic regression, K-Nearest Neighbors Classifier, Random forest Classifier, and Support Vector Machine Classifier.

Download Full-text

Traffic Signs Detection Using Machine Learning Algorithms

International Journal for Modern Trends in Science and Technology - RTT2020 ◽

10.46501/ijmtst061119 ◽

2020 ◽

Vol 6 (11) ◽

pp. 109-112

Author(s):

Yugam Bajaj and Shallu Bashambu

Keyword(s):

Machine Learning ◽

Automobile Industry ◽

Autonomous Vehicle ◽

Machine Learning Algorithms ◽

Support Vector ◽

K Nearest Neighbors ◽

Traffic Signs ◽

Conventional Vehicle ◽

Machine Learning Classification ◽

Main Challenge

With the rapid advancement and developments in the Automobile industry, that day is not far when each of us would be owning their own Autonomous Vehicle. Although manufacturing of a full proof Autonomous Vehicle has its own fair share of challenges. The main challenge that lies in front of us, is imbibing the latest technologies and advancements into the conventional vehicles we already have. This paper discusses one such technology that we can incorporate in our vehicle, to direct the Conventional Vehicle into becoming an Autonomous Vehicle in future. The user would be able to classify Traffic Signs on Road, which would help him/her to understand what that sign signifies, i.e. what rules the driver must follow while driving on that particular road. We use Machine Learning Classification Algorithms like k-Nearest Neighbors, Random Forest and Support Vector Machine on our dataset, to compute the best accuracies in the process as well.

Download Full-text

Predicting Student’s Performance Using Machine Learning Algorithm

International Journal of Advanced Research in Science, Communication and Technology ◽

10.48175/ijarsct-1209 ◽

2021 ◽

pp. 53-58

Author(s):

Sheela Rani P ◽

Dhivya S ◽

Dharshini Priya M ◽

Dharmila Chowdary A

Keyword(s):

Machine Learning ◽

Support Vector Machine ◽

Prediction Model ◽

Naive Bayes ◽

Learning Algorithm ◽

Naïve Bayes ◽

Machine Learning Algorithms ◽

Support Vector ◽

Learning Approaches ◽

K Nearest Neighbors

Machine learning is a new analysis discipline that uses knowledge to boost learning, optimizing the training method and developing the atmosphere within which learning happens. There square measure 2 sorts of machine learning approaches like supervised and unsupervised approach that square measure accustomed extract the knowledge that helps the decision-makers in future to require correct intervention. This paper introduces an issue that influences students' tutorial performance prediction model that uses a supervised variety of machine learning algorithms like support vector machine , KNN(k-nearest neighbors), Naïve Bayes and supplying regression and logistic regression. The results supported by various algorithms are compared and it is shown that the support vector machine and Naïve Bayes performs well by achieving improved accuracy as compared to other algorithms. The final prediction model during this paper may have fairly high prediction accuracy .The objective is not just to predict future performance of students but also provide the best technique for finding the most impactful features that influence student’s while studying.

Download Full-text

Classifications of Breast Cancer Diagnosis using Machine Learning

International Journal of Computers ◽

10.46300/9108.2020.14.13 ◽

2020 ◽

Vol 14 ◽

Keyword(s):

Breast Cancer ◽

Machine Learning ◽

Random Forest ◽

Breast Cancer Diagnosis ◽

Performance Comparison ◽

Support Vector ◽

Breast Cancer Dataset ◽

K Nearest Neighbors ◽

Cancer Dataset ◽

Machine Learning Classification

Breast Cancer (BC) is amongst the most common and leading causes of deaths in women throughout the world. Recently, classification and data analysis tools are being widely used in the medical field for diagnosis, prognosis and decision making to help lower down the risks of people dying or suffering from diseases. Advanced machine learning methods have proven to give hope for patients as this has helped the doctors in early detection of diseases like Breast Cancer that can be fatal, in support with providing accurate outcomes. However, the results highly depend on the techniques used for feature selection and classification which will produce a strong machine learning model. In this paper, a performance comparison is conducted using four classifiers which are Multilayer Perceptron (MLP), Support Vector Machine (SVM), K-Nearest Neighbors (KNN) and Random Forest on the Wisconsin Breast Cancer dataset to spot the most effective predictors. The main goal is to apply best machine learning classification methods to predict the Breast Cancer as benign or malignant using terms such as accuracy, f-measure, precision and recall. Experimental results show that Random forest is proven to achieve the highest accuracy of 99.26% on this dataset and features, while SVM and KNN show 97.78% and 97.04% accuracy respectively. MLP shows the least accuracy of 94.07%. All the experiments are conducted using RStudio as the data mining tool platform.

Download Full-text

Machine Learning Classification Algorithms to Predict aGvHD following Allo-HSCT: A Systematic Review

Methods of Information in Medicine ◽

10.1055/s-0040-1709150 ◽

2019 ◽

Vol 58 (06) ◽

pp. 205-212

Author(s):

Cirruse Salehnasab ◽

Abbas Hajifathali ◽

Farkhondeh Asadi ◽

Elham Roshandel ◽

Alireza Kazemi ◽

...

Keyword(s):

Machine Learning ◽

Systemic Review ◽

Predictor Variables ◽

Support Vector ◽

Classification Algorithms ◽

K Nearest Neighbors ◽

Hematopoietic Stem ◽

Machine Learning Classification ◽

Graft Versus Host ◽

Meta Analyses

Abstract Background The acute graft-versus-host disease (aGvHD) is the most important cause of mortality in patients receiving allogeneic hematopoietic stem cell transplantation. Given that it occurs at the stage of severe tissue damage, its diagnosis is late. With the advancement of machine learning (ML), promising real-time models to predict aGvHD have emerged. Objective This article aims to synthesize the literature on ML classification algorithms for predicting aGvHD, highlighting algorithms and important predictor variables used. Methods A systemic review of ML classification algorithms used to predict aGvHD was performed using a search of the PubMed, Embase, Web of Science, Scopus, Springer, and IEEE Xplore databases undertaken up to April 2019 based on Preferred Reporting Items for Systematic reviews and Meta-Analyses (PRISMA) statements. The studies with a focus on using the ML classification algorithms in the process of predicting of aGvHD were considered. Results After applying the inclusion and exclusion criteria, 14 studies were selected for evaluation. The results of the current analysis showed that the algorithms used were Artificial Neural Network (79%), Support Vector Machine (50%), Naive Bayes (43%), k-Nearest Neighbors (29%), Regression (29%), and Decision Trees (14%), respectively. Also, many predictor variables have been used in these studies so that we have divided them into more abstract categories, including biomarkers, demographics, infections, clinical, genes, transplants, drugs, and other variables. Conclusion Each of these ML algorithms has a particular characteristic and different proposed predictors. Therefore, it seems these ML algorithms have a high potential for predicting aGvHD if the process of modeling is performed correctly.

Download Full-text

Feature-Based Opinion Mining and Managed Machine Learning with Sentiment Classification Models

International Journal of Engineering and Advanced Technology - Regular Issue ◽

10.35940/ijeat.b4555.129219 ◽

2019 ◽

Vol 9 (2) ◽

pp. 3992-3998

Keyword(s):

Machine Learning ◽

Sentiment Analysis ◽

Language Processing ◽

Opinion Mining ◽

Machine Learning Algorithms ◽

Support Vector ◽

K Nearest Neighbors ◽

Data Intensive ◽

Learning Tasks ◽

Feature Based

Sentiment Analysis is individuals' opinions and feedbacks study towards a substance, which can be items, services, movies, people or events. The opinions are mostly expressed as remarks or reviews. With the social network, gatherings and websites, these reviews rose as a significant factor for the client’s decision to buy anything or not. These days, a vast scalable computing environment provides us with very sophisticated way of carrying out various data-intensive natural language processing (NLP) and machine-learning tasks to examine these reviews. One such example is text classification, a compelling method for predicting the clients' sentiment. In this paper, we attempt to center our work of sentiment analysis on movie review database. We look at the sentiment expression to order the extremity of the movie reviews on a size of 0(highly disliked) to 4(highly preferred) and perform feature extraction and ranking and utilize these features to prepare our multilabel classifier to group the movie review into its right rating. This paper incorporates sentiment analysis utilizing feature-based opinion mining and managed machine learning. The principle center is to decide the extremity of reviews utilizing nouns, verbs, and adjectives as opinion words. In addition, a comparative study on different classification approaches has been performed to determine the most appropriate classifier to suit our concern problem space. In our study, we utilized six distinctive machine learning algorithms – Naïve Bayes, Logistic Regression, SVM (Support Vector Machine), RF (Random Forest) KNN (K nearest neighbors) and SoftMax Regression.

Download Full-text

Predicting Vasovagal Syncope for Paraplegia Patients Using Average Weighted Ensemble Technique

Journal of Mobile Multimedia ◽

10.13052/jmm1550-4646.1817 ◽

2021 ◽

Author(s):

V. Vinodhini ◽

Akula Vishalakshi ◽

G. Naga Chandrika ◽

S. Sankar ◽

Somula Ramasubbareddy

Keyword(s):

Machine Learning ◽

Vasovagal Syncope ◽

Correct Diagnosis ◽

Machine Learning Algorithms ◽

Support Vector ◽

Ensemble Technique ◽

Machine Learning Classification ◽

Severe Fatigue ◽

Artery Disease ◽

Serious Disease

Vasovagal syncope (VVS) refers to fainting of people with a drop in blood flow to the brain more serious disease in paraplegia patients. Precognitive diagnoses are characterized by lightheadedness, nausea, severe fatigue, and an elevated heart rate. As a result, it’s important to seek care as soon as possible after experiencing syncope. Since receiving a correct diagnosis and appropriate care, the majority of patients may avoid complications with syncope. Syncope appears to be a sign of COVID 19 in people with coronary artery disease. Furthermore, a sudden heart attack might result in acute syncope. In a few circumstances, machine learning classification techniques may not be precise. For paraplegia patients, prediction vasovagal syncope needs more precise results in order to save their lives. The aim of this paper is to use the ensemble technique to improve the accuracy of conventional machine learning algorithms. EEG (ElectroEncephaloGram) brainwave dataset from kaggle is used to implement it. The accuracy of the proposed AWET algorithm is 82%. It improves the accuracy by 17% compare to Support Vector Machine, Random Forest, Naive Bayes, and MultiLayer Perceptron classifiers.

Download Full-text

Glass Classification based on Machine Learning Algorithms

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.h6819.0991120 ◽

2020 ◽

Vol 9 (11) ◽

pp. 139-142

Keyword(s):

Machine Learning ◽

Random Forest ◽

Amorphous Solid ◽

Machine Learning Algorithms ◽

Support Vector ◽

K Nearest Neighbors ◽

X Ray ◽

Svm Algorithm ◽

Artificial Neural Network Ann ◽

Logistic Regression Algorithm

Glass Industry is considered one of the most important industries in the world. The Glass is used everywhere, from water bottles to X-Ray and Gamma Rays protection. This is a non-crystalline, amorphous solid that is most often transparent. There are lots of uses of glass, and during investigation in a crime scene, the investigators need to know what is type of glass in a scene. To find out the type of glass, we will use the online dataset and machine learning to solve the above problem. We will be using ML algorithms such as Artificial Neural Network (ANN), K-nearest neighbors (KNN) algorithm, Support Vector Machine (SVM) algorithm, Random Forest algorithm, and Logistic Regression algorithm. By comparing all the algorithm Random Forest did the best in glass classification.

Download Full-text

Effectiveness of Classification Methods on the Diabetes System

Asian Journal of Research in Computer Science ◽

10.9734/ajrcos/2021/v12i330287 ◽

2021 ◽

pp. 33-43

Author(s):

Ahmed T. Shawky ◽

Ismail M. Hagag

Keyword(s):

Machine Learning ◽

Naive Bayes ◽

Early Stage ◽

Research Paper ◽

Naïve Bayes ◽

Machine Learning Algorithms ◽

Support Vector ◽

K Nearest Neighbor ◽

Machine Learning Classification ◽

Bayes Algorithm

In today’s world using data mining and classification is considered to be one of the most important techniques, as today’s world is full of data that is generated by various sources. However, extracting useful knowledge out of this data is the real challenge, and this paper conquers this challenge by using machine learning algorithms to use data for classifiers to draw meaningful results. The aim of this research paper is to design a model to detect diabetes in patients with high accuracy. Therefore, this research paper using five different algorithms for different machine learning classification includes, Decision Tree, Support Vector Machine (SVM), Random Forest, Naive Bayes, and K- Nearest Neighbor (K-NN), the purpose of this approach is to predict diabetes at an early stage. Finally, we have compared the performance of these algorithms, concluding that K-NN algorithm is a better accuracy (81.16%), followed by the Naive Bayes algorithm (76.06%).

Download Full-text

Perbandingan Algoritma k-Nearest Neighbors (k-NN) dan Support Vector Machines (SVM) untuk Klasifikasi Pengenalan Citra Wajah

Jurnal ICT : Information Communication & Technology ◽

10.36054/jict-ikmi.v20i1.354 ◽

2021 ◽

Vol 20 (1) ◽

pp. 186-191

Author(s):

Parasian DP Silitonga ◽

Romanus Damanik

Keyword(s):

Image Recognition ◽

Machine Learning Algorithms ◽

Public Image ◽

Support Vector ◽

Facial Image ◽

K Nearest Neighbors ◽

Svm Algorithm ◽

Vector Machines ◽

Significant Research ◽

The Moment

Abstract- The study of face recognition is one of the areas of computer vision that requires significant research at the moment. Numerous researchers have conducted studies on facial image recognition using a variety of techniques or methods to achieve the highest level of accuracy possible when recognizing a person's face from existing images. However, recognizing the image of a human face is not easy for a computer. As a result, several approaches were taken to resolve this issue. This study compares two (two) machine learning algorithms for facial image recognition to determine which algorithm has the highest level of accuracy, precision, recall, and AUC. The comparison is carried out in the following steps: image acquisition, preprocessing, feature extraction, face classification, training, and testing. Based on the stages and experiments conducted on public image datasets, it is concluded that the SVM algorithm, on average, has a higher level of accuracy, precision, and recall than the k-NN algorithm when the dataset proportion is 90:10. While the k-NN algorithm has the highest similarity in terms of accuracy, precision, and recall at 80%: 20% and 70%: 30% of 99.20. However, for the highest AUC percentage level, the k-NN algorithm outperforms SVM at a dataset proportion of 80%: 20% at 100%.

Download Full-text

Three simple steps to improve the interpretability of EEG-SVM studies

10.1101/2021.12.14.472588 ◽

2021 ◽

Author(s):

Coralie Joucla ◽

Damien Gabriel ◽

Emmanuel Haffen ◽

Juan-Pablo Ortega

Keyword(s):

Machine Learning ◽

Model Development ◽

Research Literature ◽

Machine Learning Algorithms ◽

Support Vector ◽

Machine Learning Classification ◽

Diagnosis And Prognosis ◽

Eeg Data ◽

Clinical Adoption

Research in machine-learning classification of electroencephalography (EEG) data offers important perspectives for the diagnosis and prognosis of a wide variety of neurological and psychiatric conditions, but the clinical adoption of such systems remains low. We propose here that much of the difficulties translating EEG-machine learning research to the clinic result from consistent inaccuracies in their technical reporting, which severely impair the interpretability of their often-high claims of performance. Taking example from a major class of machine-learning algorithms used in EEG research, the support-vector machine (SVM), we highlight three important aspects of model development (normalization, hyperparameter optimization and cross-validation) and show that, while these 3 aspects can make or break the performance of the system, they are left entirely undocumented in a shockingly vast majority of the research literature. Providing a more systematic description of these aspects of model development constitute three simple steps to improve the interpretability of EEG-SVM research and, in fine, its clinical adoption.

Download Full-text