A Short Review of Classification Algorithms Accuracy for Data Prediction in Data Mining Applications

Journal of Data Analysis and Information Processing ◽

10.4236/jdaip.2021.93011 ◽

2021 ◽

Vol 09 (03) ◽

pp. 162-174

Author(s):

Ibrahim Ba’abbad ◽

Thamer Althubiti ◽

Abdulmohsen Alharbi ◽

Khalid Alfarsi ◽

Saim Rasheed

Keyword(s):

Data Mining ◽

Short Review ◽

Classification Algorithms ◽

Data Prediction

Comparative Study of Different Classification Algorithms for Stream Data Mining Using MOA

International Journal of Computer Sciences and Engineering ◽

10.26438/ijcse/v6i11.614616 ◽

2018 ◽

Vol 6 (11) ◽

pp. 614-616

Author(s):

Ashish P. Joshi ◽

Biraj V. Patel

Keyword(s):

Data Mining ◽

Comparative Study ◽

Classification Algorithms ◽

Stream Data ◽

Stream Data Mining

Early Detection of Red Palm Weevil, Rhynchophorus ferrugineus (Olivier), Infestation Using Data Mining

Plants ◽

10.3390/plants10010095 ◽

2021 ◽

Vol 10 (1) ◽

pp. 95

Author(s):

Heba Kurdi ◽

Amal Al-Aldawsari ◽

Isra Al-Turaiki ◽

Abdulrahman S. Aldawood

Keyword(s):

Data Mining ◽

Plant Size ◽

Support Vector ◽

Classification Algorithms ◽

Palm Tree ◽

Rhynchophorus Ferrugineus ◽

Red Palm Weevil ◽

Palm Weevil ◽

Using Data ◽

F Measure

In the past 30 years, the red palm weevil (RPW), Rhynchophorus ferrugineus (Olivier), a pest that is highly destructive to all types of palms, has rapidly spread worldwide. However, detecting infestation with the RPW is highly challenging because symptoms are not visible until the death of the palm tree is inevitable. In addition, the use of automated RPW identification tools to predict infestation is complicated by a lack of RPW datasets. In this study, we assessed the capability of 10 state-of-the-art data mining classification algorithms, Naive Bayes (NB), KSTAR, AdaBoost, bagging, PART, J48 decision tree, multilayer perceptron (MLP), support vector machine (SVM), random forest, and logistic regression, to use plant-size and temperature measurements collected from individual trees to predict RPW infestation in its early stages, before significant damage is caused to the tree. The performance of the classification algorithms was evaluated in terms of accuracy, precision, recall, and F-measure using a real RPW dataset. The experimental results showed that infestations with RPW can be predicted with an accuracy of up to 93%, precision above 87%, recall equal to 100%, and F-measure greater than 93% using data mining. Additionally, we found that temperature and circumference are the most important features for predicting RPW infestation. However, we strongly call for collecting and aggregating more RPW datasets to run more experiments to validate these results and provide more conclusive findings.
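The four evaluation measures named in the abstract above can be computed from the counts of a binary confusion matrix. The following is a minimal sketch under stated assumptions, not the authors' evaluation code: the function name `classification_metrics` and the counts passed to it are hypothetical and are not taken from the study.

```python
def classification_metrics(tp, fp, fn, tn):
    """Compute accuracy, precision, recall, and F-measure from confusion counts.

    tp/fp/fn/tn are true positives, false positives, false negatives, and
    true negatives for the positive class (here: "infested").
    """
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total if total else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    # F-measure is the harmonic mean of precision and recall.
    if precision + recall:
        f_measure = 2 * precision * recall / (precision + recall)
    else:
        f_measure = 0.0
    return {
        "accuracy": accuracy,
        "precision": precision,
        "recall": recall,
        "f_measure": f_measure,
    }


# Hypothetical counts for illustration only; they are NOT the study's data.
metrics = classification_metrics(tp=40, fp=10, fn=0, tn=50)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

With no false negatives, recall is 1.0 even though precision is lower, which is why the four measures are usually reported together rather than accuracy alone.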

A Comprehensive Survey of Classification Algorithms for Formulating Crop Yield Prediction Using Data Mining Techniques

10.1109/temsmet51618.2020.9557403 ◽

2020 ◽

Author(s):

C Chandana ◽

G Parthasarathy

Keyword(s):

Data Mining ◽

Crop Yield ◽

Classification Algorithms ◽

Yield Prediction ◽

Data Mining Techniques ◽

Comprehensive Survey ◽

Using Data

Comparisons Of Data Mining Classification Algorithms For Customers' Shopping Intention In E-Commerce

10.1109/aidas53897.2021.9574307 ◽

2021 ◽

Author(s):

Kek Zhi Xuan ◽

Shuhaida Ismail ◽

Intan Syazwani Noorain ◽

Nur Aliaa Dalila A. Muhaime

Keyword(s):

Data Mining ◽

Classification Algorithms

A method for improving the accuracy of data mining classification algorithms

Computers & Operations Research ◽

10.1016/j.cor.2008.12.011 ◽

2009 ◽

Vol 36 (10) ◽

pp. 2829-2839 ◽

Cited By ~ 31

Author(s):

Nikolaos Mastrogiannis ◽

Basilis Boutsinas ◽

Ioannis Giannikos

Keyword(s):

Data Mining ◽

Classification Algorithms

A Survey on Major Classification Algorithms and Comparative Analysis of Few Classification Algorithms on Contact Lenses Data Set Using Data Mining Tool

New Trends in Computational Vision and Bio-inspired Computing ◽

10.1007/978-3-030-41862-5_121 ◽

2020 ◽

pp. 1201-1209

Author(s):

Syed Nawaz Pasha ◽

D. Ramesh ◽

Mohammad Sallauddin

Keyword(s):

Data Mining ◽

Comparative Analysis ◽

Contact Lenses ◽

Classification Algorithms ◽

Data Set ◽

Data Mining Tool ◽

Mining Tool ◽

Using Data

Comparing sets of patterns with the Jaccard index

Australasian Journal of Information Systems ◽

10.3127/ajis.v22i0.1538 ◽

2018 ◽

Vol 22 ◽

Cited By ~ 2

Author(s):

Sam Fletcher ◽

Md Zahidul Islam

Keyword(s):

Data Mining ◽

Driving Force ◽

Prediction Accuracy ◽

Jaccard Index ◽

Classification Algorithms ◽

Single Element ◽

Temporal Data ◽

Real World Data ◽

Actionable Knowledge ◽

Computational Simplicity

The ability to extract knowledge from data has been the driving force of Data Mining since its inception, and of statistical modeling long before even that. Actionable knowledge often takes the form of patterns, where a set of antecedents can be used to infer a consequent. In this paper we offer a solution to the problem of comparing different sets of patterns. Our solution allows comparisons between sets of patterns that were derived from different techniques (such as different classification algorithms), or made from different samples of data (such as temporal data or data perturbed for privacy reasons). We propose using the Jaccard index to measure the similarity between sets of patterns by converting each pattern into a single element within the set. Our measure focuses on providing conceptual simplicity, computational simplicity, interpretability, and wide applicability. The results of this measure are compared to prediction accuracy in the context of a real-world data mining scenario.
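The idea in the abstract above, treating each pattern as a single set element and comparing two pattern sets with the Jaccard index |A ∩ B| / |A ∪ B|, might be sketched as follows. This is an illustrative sketch only: the representation of a pattern as an (antecedents, consequent) pair, the helper names, and the example patterns are all assumptions, and the paper's actual conversion of patterns into elements may differ.

```python
def pattern_to_element(antecedents, consequent):
    """Turn one pattern into a single hashable element.

    Assumption: antecedents are unordered, so they are stored in a frozenset.
    """
    return (frozenset(antecedents), consequent)


def jaccard_index(patterns_a, patterns_b):
    """Jaccard similarity between two collections of patterns."""
    set_a = {pattern_to_element(ante, cons) for ante, cons in patterns_a}
    set_b = {pattern_to_element(ante, cons) for ante, cons in patterns_b}
    union = set_a | set_b
    if not union:
        # Convention chosen for this sketch: two empty sets are identical.
        return 1.0
    return len(set_a & set_b) / len(union)


# Hypothetical patterns, e.g. rules read off two different classifiers.
rules_a = [
    (["age>30", "income=high"], "buys"),
    (["age<=30"], "no_buy"),
    (["student=yes"], "buys"),
]
rules_b = [
    (["income=high", "age>30"], "buys"),  # same pattern, different order
    (["age<=30"], "no_buy"),
    (["student=no"], "no_buy"),
]

similarity = jaccard_index(rules_a, rules_b)
print(similarity)
```

Two patterns are shared out of four distinct patterns overall, so the similarity here is 0.5. Exact matching of antecedent strings is a simplification; patterns with numeric thresholds that differ slightly would count as entirely different elements under this sketch.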

A Survey on Phishing Detection and The Importance of Feature Selection In Data Mining Classification Algorithms

Issue 4 - Journal of Science and Technology ◽

10.46243/jst.2020.v5.i6.pp11-18 ◽

2020 ◽

pp. 11-18

Keyword(s):

Data Mining ◽

Feature Selection ◽

Support Vector ◽

Classification Algorithms ◽

End User ◽

Preparation Methods ◽

Survey Paper ◽

Vector Machines ◽

Feature Selection Techniques ◽

Phishing Detection

In this era of the Internet, the security of information is a pressing issue. One of the main threats in the cyber world is the phishing attack, an email or website fraud method that targets a genuine webpage or email and hacks it without the consent of the end user. Various techniques help to classify whether a website or an email is legitimate or fake. The major contributors to the detection of these phishing frauds include classification algorithms, feature selection techniques or dataset preparation methods, and feature extraction, which plays an important role in both the detection and the prevention of these attacks. This survey paper studies the effect of all these contributors and the approaches applied in recent papers. The classification algorithms implemented include Decision Tree, Random Forest, Support Vector Machines, Logistic Regression, Lazy K Star, Naive Bayes, and J48.
