Performance Improvement of Decision Tree: A Robust Classifier Using Tabu Search Algorithm

Muhammad Asfand Hafeez; Muhammad Rashid; Hassan Tariq; Zain Ul Abideen; Saud S. Alotaibi; Mohammed H. Sinky

doi:10.3390/app11156728


Applied Sciences ◽

10.3390/app11156728 ◽

2021 ◽

Vol 11 (15) ◽

pp. 6728

Author(s):

Muhammad Asfand Hafeez ◽

Muhammad Rashid ◽

Hassan Tariq ◽

Zain Ul Abideen ◽

Saud S. Alotaibi ◽

...

Keyword(s):

Machine Learning ◽

Tabu Search ◽

Decision Tree ◽

Decision Trees ◽

Search Algorithm ◽

Learning Algorithms ◽

Performance Comparison ◽

Machine Learning Algorithms ◽

Supervised Machine Learning ◽

Tabu Search Algorithm

Classification and regression are the major applications of machine learning algorithms, which are widely used to solve problems in numerous domains of engineering and computer science. Various classifiers based on optimizing the decision tree have been proposed; however, the approach is still evolving. This paper presents a novel and robust classifier based on decision tree and tabu search algorithms. To improve performance, the proposed algorithm constructs multiple decision trees while employing a tabu search algorithm to continually monitor the leaf and decision nodes in the corresponding decision trees. Additionally, the tabu search algorithm is responsible for balancing the entropy of the corresponding decision trees. To train the model, we used clinical data of COVID-19 patients to predict whether a patient is suffering from the disease. The experimental results were obtained using our proposed classifier built on the scikit-learn library in Python. An extensive performance comparison against conventional supervised machine learning algorithms is presented using Big O and statistical analysis. Moreover, a performance comparison with optimized state-of-the-art classifiers is also presented. The achieved accuracy of 98%, the required execution time of 55.6 ms, and the area under the receiver operating characteristic (AUROC) of 0.95 for the proposed method show that the proposed classifier is suitable for large datasets.
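The abstract does not spell out how tabu search is coupled to the trees, so the following is only a generic sketch of the tabu search idea, not the authors' method. It assumes a hypothetical search space of two decision tree settings (`max_depth`, `min_samples_leaf`) and a made-up `toy_score` function standing in for validation accuracy; every name in it is illustrative.

```python
from collections import deque


def tabu_search(score, start, neighbors, n_iter=50, tabu_size=5):
    """Maximize score() with local moves while forbidding recently visited states."""
    current = best = start
    best_score = score(start)
    tabu = deque([start], maxlen=tabu_size)  # short-term memory of recent states
    for _ in range(n_iter):
        candidates = [c for c in neighbors(current) if c not in tabu]
        if not candidates:
            break  # every neighbor is tabu, so stop early
        # Move to the best admissible neighbor, even if it is worse than current.
        current = max(candidates, key=score)
        tabu.append(current)
        current_score = score(current)
        if current_score > best_score:
            best, best_score = current, current_score
    return best, best_score


def neighbors(state):
    """Hypothetical moves: change one setting by one step, within 1..10."""
    depth, leaf = state
    moves = [(depth + 1, leaf), (depth - 1, leaf), (depth, leaf + 1), (depth, leaf - 1)]
    return [(d, l) for d, l in moves if 1 <= d <= 10 and 1 <= l <= 10]


def toy_score(state):
    """Toy stand-in for validation accuracy, peaking at depth 6 and leaf size 3."""
    depth, leaf = state
    return 1.0 - 0.01 * (depth - 6) ** 2 - 0.01 * (leaf - 3) ** 2


best, best_score = tabu_search(toy_score, start=(1, 1), neighbors=neighbors)
print(best, round(best_score, 3))
```

The tabu list is what separates this from plain hill climbing: the search keeps moving after reaching a peak, but it cannot immediately step back onto recently visited states, and the best state seen so far is remembered separately.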


Encrypted DNP3 Traffic Classification Using Supervised Machine Learning Algorithms

Machine Learning and Knowledge Extraction ◽

10.3390/make1010022 ◽

2019 ◽

Vol 1 (1) ◽

pp. 384-399 ◽

Cited By ~ 2

Author(s):

Thais de Toledo ◽

Nunzio Torrisi

Keyword(s):

Machine Learning ◽

Support Vector Machine ◽

Decision Tree ◽

Smart Grids ◽

Learning Algorithms ◽

Electric Utility ◽

Machine Learning Algorithms ◽

Supervised Machine Learning ◽

Support Vector ◽

Communication Link

The Distributed Network Protocol (DNP3) is predominantly used by the electric utility industry and, consequently, in smart grids. The Peekaboo attack was created to compromise DNP3 traffic: a man-in-the-middle on a communication link can capture and drop selected encrypted DNP3 messages by using support vector machine learning algorithms. The communication networks of smart grids are an important part of their infrastructure, so it is critical to keep this communication secure and reliable. The main contribution of this paper is to compare the use of machine learning techniques to classify messages of the same protocol exchanged in encrypted tunnels. The study considers four simulated cases of encrypted DNP3 traffic scenarios and four different supervised machine learning algorithms: decision tree, nearest-neighbor, support vector machine, and naive Bayes. The results show that it is possible to extend a Peekaboo attack over multiple substations, using a decision tree learning algorithm, and to gather significant information from a system that communicates using encrypted DNP3 traffic.


Performance Evaluation of Machine Learning Techniques for Identifying Forged and Phony Uniform Resource Locators (URLs)

Nigerian Journal of Technological Development ◽

10.4314/njtd.v16i4.2 ◽

2019 ◽

Vol 16 (4) ◽

pp. 155-169

Author(s):

N. A. Azeez ◽

A. A. Ajayi

Keyword(s):

Machine Learning ◽

Decision Tree ◽

Financial Institutions ◽

Traditional Approach ◽

Learning Algorithms ◽

Performance Comparison ◽

Machine Learning Algorithms ◽

Machine Learning Techniques ◽

Reliable Solution ◽

F Measure

Since the invention of Information and Communication Technology (ICT), there has been a great shift across the globe from the traditional approach of handling information to the use of this innovation. Its application cuts across almost all areas of human endeavour. ICT is widely utilized in the education and production sectors as well as in various financial institutions. Many people use it legitimately to carry out their day-to-day activities, while others use it to perform nefarious activities to the detriment of other cyber users. According to several reports discussed in the introductory part of this work, millions of people have become victims of fake Uniform Resource Locators (URLs) sent to their mails by spammers. Financial institutions are not left out of the monumental losses recorded through this illicit act over the years. Despite the several approaches currently in place, none can confidently be confirmed to provide the best and most reliable solution. According to several research findings reported in the literature, researchers have demonstrated how machine learning algorithms can be employed to verify and confirm compromised and fake URLs in cyberspace. However, inconsistencies have been noticed in the researchers' findings, and their corresponding results are not dependable based on the values obtained and the conclusions drawn from them. Against this backdrop, the authors carried out a comparative analysis of three learning algorithms (Naïve Bayes, Decision Tree and Logistic Regression) for verification of compromised, suspicious and fake URLs and determined which is the best based on the metrics used for evaluation (F-Measure, Precision and Recall). Based on the confusion matrix measurements, the results show that the Decision Tree (ID3) algorithm achieves the highest values for recall, precision and F-measure. It provides an efficient and credible means of maximizing the detection of compromised and malicious URLs. Finally, for future work, the authors are of the opinion that two or more supervised learning algorithms can be hybridized to form a single, more effective and more efficient algorithm for fake URL verification.
Keywords: Learning-algorithms, Forged-URL, Phoney-URL, performance-comparison
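For reference, the three evaluation metrics named in this abstract can be computed directly from confusion matrix counts. The sketch below uses the standard definitions; the counts are made up for illustration and are not taken from the paper, and the function name is hypothetical.

```python
def precision_recall_f1(tp, fp, fn):
    """Compute precision, recall and F-measure (F1) from confusion matrix counts."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


# Illustrative counts only: 90 fake URLs caught, 10 false alarms, 30 missed.
p, r, f = precision_recall_f1(tp=90, fp=10, fn=30)
print(f"precision={p:.3f} recall={r:.3f} f1={f:.3f}")
```

The F-measure is the harmonic mean of precision and recall, so a classifier has to do well on both to score highly.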


Predictive model construction for prediction of soil fertility using decision tree machine learning algorithm

Kongunadu Research Journal ◽

10.26524/krj.2021.5 ◽

2021 ◽

Vol 8 (1) ◽

pp. 30-35

Author(s):

Jayalakshmi R ◽

Savitha Devi M

Keyword(s):

Machine Learning ◽

Decision Tree ◽

Soil Fertility ◽

Learning Algorithms ◽

Crop Productivity ◽

Machine Learning Algorithms ◽

Supervised Machine Learning ◽

Support Vector ◽

Severe Problem ◽

Agriculture Sector

The agriculture sector is recognized as the backbone of the Indian economy and plays a crucial role in the growth of the nation's economy. It depends on weather and other environmental aspects. Some of the factors on which agriculture is reliant are soil, climate, flooding, fertilizers, temperature, precipitation, crops, insecticides, and herbicides. Soil fertility depends on these factors and is hence difficult to predict. However, the agriculture sector in India faces the severe problem of increasing crop productivity. Farmers lack essential knowledge of the nutrient content of the soil and of how to select the crop best suited to it, and they also lack efficient methods for predicting crops well in advance so that appropriate methods can be used to improve crop productivity. This paper presents different supervised machine learning algorithms, namely Decision tree, K-Nearest Neighbor (KNN), and Support Vector Machine (SVM), to predict the fertility of soil based on the macro-nutrient and micro-nutrient status found in the dataset. The supervised machine learning algorithms are applied to the training dataset and tested with the test dataset, and they are implemented using the R tool. The performance of these algorithms is analyzed using different evaluation metrics such as mean absolute error, cross-validation, and accuracy. The analysis shows that the Decision tree produced the best accuracy of 99% with a very low mean square error (MSE) rate.
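The evaluation measures mentioned here (mean absolute error, mean square error, accuracy, and cross-validation) have simple standard definitions. The paper reports using R; the sketch below is written in Python purely for illustration, with made-up labels and hypothetical function names, and does not reproduce the paper's pipeline.

```python
def mean_absolute_error(y_true, y_pred):
    """Average absolute difference between true and predicted values."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def mean_squared_error(y_true, y_pred):
    """Average squared difference between true and predicted values."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true label exactly."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def k_fold_indices(n_samples, k):
    """Yield (train_idx, test_idx) pairs for k contiguous, unshuffled folds."""
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        test_idx = list(range(start, start + size))
        train_idx = list(range(0, start)) + list(range(start + size, n_samples))
        yield train_idx, test_idx
        start += size


# Made-up fertility classes (0 = low, 1 = medium, 2 = high) for five samples.
y_true = [0, 1, 2, 1, 0]
y_pred = [0, 1, 1, 1, 0]
mae = mean_absolute_error(y_true, y_pred)
mse = mean_squared_error(y_true, y_pred)
acc = accuracy(y_true, y_pred)
folds = list(k_fold_indices(n_samples=5, k=2))
print(mae, mse, acc, folds)
```

In cross-validation, each fold serves once as the test set while the remaining folds form the training set, and the metric is averaged over the folds. In practice the data is usually shuffled before splitting, which this sketch omits.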


Comprehensive Performance Comparison of Supervised Machine Learning Algorithms in Non-Intrusive Load Monitoring

2020 International Conference on Smart Energy Systems and Technologies (SEST) ◽

10.1109/sest48500.2020.9203443 ◽

2020 ◽

Author(s):

Ahmet Furkan Ersen ◽

Ayse Kubra Erenoglu ◽

Ozan Erdinc ◽

Ibrahim Sengor ◽

Joao P. S. Catalao

Keyword(s):

Machine Learning ◽

Learning Algorithms ◽

Performance Comparison ◽

Machine Learning Algorithms ◽

Supervised Machine Learning ◽

Comprehensive Performance ◽

Load Monitoring


Performance Comparison of Supervised Machine Learning Algorithms for Multiclass Transient Classification in a Nuclear Power Plant

Swarm, Evolutionary, and Memetic Computing - Lecture Notes in Computer Science ◽

10.1007/978-3-319-20294-5_10 ◽

2015 ◽

pp. 111-122

Author(s):

Manas Ranjan Prusty ◽

Jaideep Chakraborty ◽

T. Jayanthi ◽

K. Velusamy

Keyword(s):

Machine Learning ◽

Power Plant ◽

Nuclear Power Plant ◽

Nuclear Power ◽

Learning Algorithms ◽

Performance Comparison ◽

Machine Learning Algorithms ◽

Supervised Machine Learning


Denial of Service (DoS) Attack Detection: Performance Comparison of Supervised Machine Learning Algorithms

2020 IEEE Intl Conf on Dependable, Autonomic and Secure Computing, Intl Conf on Pervasive Intelligence and Computing, Intl Conf on Cloud and Big Data Computing, Intl Conf on Cyber Science and Technology Congress (DASC/PiCom/CBDCom/CyberSciTech) ◽

10.1109/dasc-picom-cbdcom-cyberscitech49142.2020.00088 ◽

2020 ◽

Author(s):

Zhuolin Li ◽

Hao Zhang ◽

Hossain Shahriar ◽

Dan Lo ◽

Kai Qian ◽

...

Keyword(s):

Machine Learning ◽

Denial Of Service ◽

Learning Algorithms ◽

Performance Comparison ◽

Detection Performance ◽

Attack Detection ◽

Machine Learning Algorithms ◽

Supervised Machine Learning ◽

Dos Attack


A Support Vector Machine and Decision Tree Based Breast Cancer Prediction

International Journal of Engineering and Advanced Technology - Regular Issue ◽

10.35940/ijeat.a1752.029320 ◽

2020 ◽

Vol 9 (3) ◽

pp. 2972-2976

Keyword(s):

Breast Cancer ◽

Machine Learning ◽

Support Vector Machine ◽

Decision Tree ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Classification Model ◽

Supervised Machine Learning ◽

Misclassification Rate ◽

Support Vector

The first step in the diagnosis of breast cancer is the identification of the disease. Early detection of breast cancer is important for reducing the mortality rate due to the disease. Machine learning algorithms can be used to identify breast cancer. Supervised machine learning algorithms such as the Support Vector Machine (SVM) and the Decision Tree are widely used in classification problems, such as the identification of breast cancer. In this study, a machine learning model is proposed that employs two learning algorithms, namely the support vector machine and the decision tree. A dataset from the Kaggle data repository consisting of 569 malignant and benign observations is used to develop the proposed model. Finally, the model is evaluated on the test set using accuracy, confusion matrix, precision, and recall as performance metrics. The analysis showed that the support vector machine (SVM) has better accuracy, a lower misclassification rate, and better precision than the decision tree algorithm. The average accuracy of the support vector machine (SVM) is 91.92% and that of the decision tree classification model is 87.12%.


An Example of Performance Comparison of Supervised Machine Learning Algorithms Before and After PCA and LDA Application: Breast Cancer Detection

2020 Innovations in Intelligent Systems and Applications Conference (ASYU) ◽

10.1109/asyu50717.2020.9259883 ◽

2020 ◽

Author(s):

Seda Kaya ◽

Mete Yaganoglu

Keyword(s):

Breast Cancer ◽

Machine Learning ◽

Cancer Detection ◽

Learning Algorithms ◽

Performance Comparison ◽

Breast Cancer Detection ◽

Machine Learning Algorithms ◽

Supervised Machine Learning ◽

Before And After


INTELLIGENT COOPERATIVE WEB CACHING POLICIES FOR MEDIA OBJECTS BASED ON J48 DECISION TREE AND NAÏVE BAYES SUPERVISED MACHINE LEARNING ALGORITHMS IN STRUCTURED PEER-TO-PEER SYSTEMS

Journal of Information and Communication Technology ◽

10.32890/jict2016.15.2.5 ◽

2016 ◽

Vol 15 (2) ◽

Author(s):

Hamidah Ibrahim ◽

Waheed Yasin ◽

Nur Izura Udzir ◽

Nor Asilah Wati Abdul Hamid

Keyword(s):

Machine Learning ◽

Decision Tree ◽

Learning Algorithms ◽

Peer To Peer ◽

Web Caching ◽

Machine Learning Algorithms ◽

Supervised Machine Learning ◽

Peer To Peer Systems ◽

Media Objects ◽

J48 Decision Tree


Application of Supervised Machine Learning Algorithms for Lithofacies Classification.

10.2523/19349-ms ◽

2019 ◽

Author(s):

Subhadeep Sarkar ◽

Chandan Majumdar

Keyword(s):

Machine Learning ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Supervised Machine Learning ◽

Lithofacies Classification
