Using Decision Tree Classifier for Analyzing Students’ Activities

The exponential growth in the use of computers over networks, together with the proliferation of applications that run on different platforms, has drawn attention to network security. Attacks in this setting take advantage of security flaws in all operating systems that are both technically difficult and costly to fix. As a result, intrusion has become a key threat worldwide to the credibility, availability, and confidentiality of computer resources. The Intrusion Detection System (IDS) is critical in detecting network anomalies and attacks. In this paper, the data mining principle is combined with IDS to efficiently and quickly identify important, hidden data of interest to the user. The proposed algorithm addresses four issues: data classification, high levels of human interaction, lack of labeled data, and the effectiveness of distributed denial of service attacks. We are also developing a decision tree classifier that has a variety of parameters. The previous algorithm classified IDS data correctly up to 90% of the time and was not appropriate for large data sets. Our proposed algorithm is designed to accurately classify large data sets. In addition, we quantify a few more decision tree classifier parameters.

Student Performance Predictions Using Knowledge Discovery Database and Data Mining, DPU Students Records as Sample

Academic Journal of Nawroz University ◽ 10.25007/ajnu.v10n3a875 ◽ 2021 ◽ Vol 10 (3) ◽ pp. 121-127

Author(s): Bareen Haval ◽ Karwan Jameel Abdulrahman ◽ Araz Rajab

Keyword(s): Data Mining ◽ Decision Tree ◽ Student Performance ◽ Educational Data Mining ◽ Data Sets ◽ Decision Tree Classifier ◽ Data Mining Techniques ◽ Academic History ◽ Tree Classifier ◽ Using Data

This article presents the results of applying educational data mining techniques to the academic performance of students. Three classification models (Decision Tree, Random Forest and Deep Learning) were developed to analyze data sets and predict the performance of students. The predictive performance of the three classifiers was calculated and compared. The academic history and data of the students from the Office of the Registrar were used to train the models. Our analysis aims to evaluate the results of students using various variables such as the student's grade. Data from 221 students with 9 different attributes were used. The results of this study provide a better understanding of student success assessments and stress the importance of data mining in education. The main purpose of this study is to show how student success can be forecast using data mining techniques in order to improve academic programs. The results indicate that the Decision Tree classifier outperforms the other two classifiers, achieving a total prediction accuracy of 97%.
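
As a rough illustration of how a "total prediction accuracy" can be calculated and compared across several classifiers, the sketch below scores three sets of predictions against the same held-out labels. The labels, predictions, and the helper names `accuracy` and `rank_models` are hypothetical and chosen only for illustration; they are not the study's data or code, and the abstract does not say how its evaluation was split.

```python
from typing import Dict, List, Sequence, Tuple


def accuracy(y_true: Sequence[str], y_pred: Sequence[str]) -> float:
    """Fraction of predictions that match the true labels."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("label sequences must be non-empty and equal in length")
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


def rank_models(
    y_true: Sequence[str], predictions: Dict[str, Sequence[str]]
) -> List[Tuple[str, float]]:
    """Return (model name, accuracy) pairs, best accuracy first."""
    scores = {name: accuracy(y_true, y_pred) for name, y_pred in predictions.items()}
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Hypothetical held-out labels and model outputs (illustrative only).
y_true = ["pass", "pass", "fail", "pass", "fail"]
predictions = {
    "decision_tree": ["pass", "pass", "fail", "pass", "fail"],
    "random_forest": ["pass", "fail", "fail", "pass", "fail"],
    "deep_learning": ["pass", "fail", "fail", "pass", "pass"],
}
ranking = rank_models(y_true, predictions)
```

With only 221 records, a single accuracy figure can vary noticeably with the train/test split, so in practice such a comparison would likely be repeated over several splits.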

DEVELOPING A PARALLEL CLASSIFIER FOR MINING IN BIG DATA SETS

IIUM Engineering Journal ◽ 10.31436/iiumej.v22i2.1541 ◽ 2021 ◽ Vol 22 (2) ◽ pp. 119-134

Author(s): Ahad Shamseen ◽ Morteza Mohammadi Zanjireh ◽ Mahdi Bahaghighat ◽ Qin Xin

Keyword(s): Data Mining ◽ Big Data ◽ Decision Tree ◽ Main Memory ◽ Experimental Results ◽ Primary Data ◽ Data Sets ◽ Decision Tree Classifier ◽ Vast Amount ◽ Tree Classifier

Data mining is the extraction of information and its roles from a vast amount of data. This topic is one of the most important topics these days. Nowadays, massive amounts of data are generated and stored each day. This data holds useful information in different fields that attracts the attention of programmers and engineers. One of the primary data mining classification algorithms is the decision tree. Decision tree techniques have several advantages but also present drawbacks. One of their main drawbacks is the need to keep the data resident in main memory. SPRINT is one of the decision tree classifiers that has proposed a fix for this problem. In this paper, we developed a new parallel decision tree classifier by building on the SPRINT results. Our experimental results show considerable improvements in runtime and memory requirements compared to the SPRINT classifier. Our proposed classifier algorithm can be implemented in serial and parallel environments and can deal with big data.

A Big Data-Based Data Mining Tool for Physical Education and Technical and Tactical Analysis

International Journal of Emerging Technologies in Learning (iJET) ◽ 10.3991/ijet.v14i22.11345 ◽ 2019 ◽ Vol 14 (22) ◽ pp. 220 ◽ Cited By ~ 1

Author(s): Lili Pan

Keyword(s): Data Mining ◽ Information Technology ◽ Big Data ◽ Data Analysis ◽ Physical Education ◽ Big Data Analysis ◽ Competitive Sports ◽ Data Mining Tool ◽ Research Findings ◽ Mining Tool

This paper attempts to develop a data mining tool to guide sports training, promote physical education and facilitate technical and tactical analysis. For this purpose, information techniques such as mathematical statistics and big data analysis were employed to collect and analyse information on competitive sports. Based on a database and computer algorithms, the author designed a data mining tool applicable to the information of various competitive sports. The proposed tool can mine valuable information from big data, enabling trainers to deliver targeted and efficient physical education. The mined information also helps improve the analysis of techniques and tactics of competitive sports. The research findings promote the application of information technology in physical education and competitive sports.

Z - CRIME: A data mining tool for the detection of suspicious criminal activities based on decision tree

2014 International Conference on Data Mining and Intelligent Computing (ICDMIC) ◽ 10.1109/icdmic.2014.6954268 ◽ 2014 ◽ Cited By ~ 7

Author(s): Mugdha Sharma

Keyword(s): Data Mining ◽ Decision Tree ◽ Data Mining Tool ◽ Mining Tool

Using T3, an Improved Decision Tree Classifier, for Mining Stroke-related Medical Data

Methods of Information in Medicine ◽ 10.1160/me0317 ◽ 2007 ◽ Vol 46 (05) ◽ pp. 523-529 ◽ Cited By ~ 8

Author(s): M. Saraee ◽ B. Theodoulidis ◽ J. A. Keane ◽ C. Tjortjis

Keyword(s): Data Mining ◽ Decision Tree ◽ Predictive Models ◽ Medical Data ◽ Classification Algorithm ◽ Medical Decision ◽ Classification Error ◽ Decision Tree Classifier ◽ Data Set ◽ Tree Classifier

Summary. Objectives: Medical data are a valuable resource from which novel and potentially useful knowledge can be discovered by using data mining. Data mining can assist and support medical decision making and enhance clinical management and investigative research. The objective of this work is to propose a method for building accurate descriptive and predictive models based on classification of past medical data. We also aim to compare this method with other well established data mining methods and identify strengths and weaknesses. Method: We propose T3, a decision tree classifier which builds predictive models based on known classes, by allowing for a certain amount of misclassification error in training in order to achieve better descriptive and predictive accuracy. We then experiment with a real medical data set on stroke, and various subsets, in order to identify strengths and weaknesses. We also compare performance with a very successful and well established decision tree classifier. Results: T3 demonstrated impressive performance when predicting unseen cases of stroke, resulting in as little as 0.4% classification error, while the state of the art decision tree classifier resulted in 33.6% classification error. Conclusions: This paper presents and evaluates T3, a classification algorithm that builds decision trees of depth at most three, and results in high accuracy whilst keeping the tree size reasonably small. T3 demonstrates strong descriptive and predictive power without compromising simplicity and clarity. We evaluate T3 based on real stroke register data and compare it with C4.5, a well-known classification algorithm, showing that T3 produces significantly more accurate and readable classifiers.
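
To make the idea of a depth-capped decision tree concrete, the following sketch grows a small binary tree with a `max_depth` limit and reports its classification error. It is not T3 and not C4.5: it assumes a plain greedy split on Gini impurity, and the one-feature data set, the function names, and the tuple-based tree layout are all hypothetical choices made for illustration.

```python
from collections import Counter


def gini(labels):
    """Gini impurity of a non-empty list of class labels."""
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())


def best_split(rows, labels):
    """Return the (feature index, threshold) that lowers weighted Gini most, or None."""
    best, best_score = None, gini(labels)
    for f in range(len(rows[0])):
        for threshold in sorted({r[f] for r in rows}):
            left = [y for r, y in zip(rows, labels) if r[f] <= threshold]
            right = [y for r, y in zip(rows, labels) if r[f] > threshold]
            if not left or not right:
                continue  # a split must send samples to both sides
            score = (len(left) * gini(left) + len(right) * gini(right)) / len(labels)
            if score < best_score:
                best, best_score = (f, threshold), score
    return best


def build_tree(rows, labels, max_depth=3):
    """Grow a tree; a leaf is a class label, an inner node is a 4-tuple."""
    split = best_split(rows, labels) if max_depth > 0 else None
    if split is None:
        return Counter(labels).most_common(1)[0][0]  # majority-class leaf
    f, threshold = split
    left = [(r, y) for r, y in zip(rows, labels) if r[f] <= threshold]
    right = [(r, y) for r, y in zip(rows, labels) if r[f] > threshold]
    return (
        f,
        threshold,
        build_tree([r for r, _ in left], [y for _, y in left], max_depth - 1),
        build_tree([r for r, _ in right], [y for _, y in right], max_depth - 1),
    )


def predict(tree, row):
    """Walk from the root to a leaf and return its class label."""
    while isinstance(tree, tuple):
        f, threshold, left, right = tree
        tree = left if row[f] <= threshold else right
    return tree


def depth(tree):
    """Number of decision levels between the root and the deepest leaf."""
    return 0 if not isinstance(tree, tuple) else 1 + max(depth(tree[2]), depth(tree[3]))


def error_rate(tree, rows, labels):
    """Fraction of rows the tree misclassifies."""
    return sum(predict(tree, r) != y for r, y in zip(rows, labels)) / len(labels)


# Hypothetical one-feature data: class "a" at both ends, class "b" in the middle.
rows = [(1.0,), (2.0,), (3.0,), (4.0,), (5.0,), (6.0,)]
labels = ["a", "a", "b", "b", "a", "a"]

tree = build_tree(rows, labels, max_depth=3)     # needs two levels to separate the classes
shallow = build_tree(rows, labels, max_depth=1)  # a single split leaves some training error
```

On this toy data the depth-3 cap is never reached, while the depth-1 tree accepts some training error in exchange for a smaller model, which is the kind of trade-off between tree size and misclassification that the abstract describes.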

Komparasi Algoritma Nonparametrik untuk Klasifikasi Citra Wajah Berdasarkan Suku di Indonesia

Jurnal Edukasi dan Penelitian Informatika (JEPIN) ◽ 10.26418/jp.v6i3.43268 ◽ 2020 ◽ Vol 6 (3) ◽ pp. 337

Author(s): Seno Hartono ◽ Anggi Perwitasari ◽ Herry Sujaini

Keyword(s): Data Mining ◽ Support Vector Machine ◽ Decision Tree ◽ Nearest Neighbor ◽ Support Vector ◽ K Nearest Neighbor ◽ Data Mining Tool ◽ Mining Tool

Classification is a data mining method used to organize and categorize data into different classes. This study aims to compare nonparametric algorithms and determine the best one for classifying face images. For classification, the study uses the nonparametric algorithms k-Nearest Neighbor (kNN), Support Vector Machine (SVM), Decision Tree, and AdaBoost to classify face images of Indonesian residents from the Batak, Dayak, Jawa (Javanese), Melayu (Malay), and Tionghoa (Chinese Indonesian) ethnic groups. The study uses the Orange Data Mining Tool to carry out the data mining process. Among the classification results obtained with k-Nearest Neighbor, Support Vector Machine, Decision Tree, and AdaBoost, SVM gives better accuracy than the other algorithms. The average precision values of the four algorithms are, in order, Support Vector Machine 37.5%, followed by k-Nearest Neighbor 31.55%, AdaBoost 30.25%, and Decision Tree 29.75%.
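
For readers unfamiliar with averaged precision in a multi-class setting, the sketch below computes per-class precision and takes an unweighted (macro) mean over the classes. This is one common definition and only an assumption here: the abstract does not state which averaging was used, and the labels and predictions are hypothetical, not the study's data.

```python
from collections import Counter


def macro_precision(y_true, y_pred):
    """Unweighted mean of per-class precision over all classes that occur.

    A class that is never predicted contributes a precision of 0.0.
    """
    classes = sorted(set(y_true) | set(y_pred))
    predicted = Counter(y_pred)  # how often each class was predicted
    true_pos = Counter(p for t, p in zip(y_true, y_pred) if t == p)
    per_class = [true_pos[c] / predicted[c] if predicted[c] else 0.0 for c in classes]
    return sum(per_class) / len(per_class)


# Hypothetical true and predicted ethnic-group labels for six face images.
y_true = ["Batak", "Dayak", "Jawa", "Melayu", "Tionghoa", "Jawa"]
y_pred = ["Batak", "Jawa", "Jawa", "Melayu", "Jawa", "Jawa"]
score = macro_precision(y_true, y_pred)
```

Because every class counts equally in a macro average, a classifier that ignores some classes entirely is penalized even if its overall accuracy looks acceptable.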

Application of Decision Tree as a Data Mining Tool in a Manufacturing System

Selected Readings on Database Technologies and Applications ◽ 10.4018/978-1-60566-098-1.ch011 ◽ 2011 ◽ pp. 234-251

Author(s): S. A. Oke

Keyword(s): Data Mining ◽ Decision Making ◽ Decision Tree ◽ Manufacturing Systems ◽ Manufacturing System ◽ Research Activity ◽ Data Mining Tool ◽ Mining Tool ◽ Classification Prediction ◽ Effective Decision Making

This work demonstrates the application of decision tree, a data mining tool, in the manufacturing system. Data mining has the capability for classification, prediction, estimation, and pattern recognition by using manufacturing databases. Databases of manufacturing systems contain significant information for decision making, which could be properly revealed with the application of appropriate data mining techniques. Decision trees are employed for identifying valuable information in manufacturing databases. Practically, industrial managers would be able to make better use of manufacturing data at little or no extra investment in data manipulation cost. The work shows that it is valuable for managers to mine data for better and more effective decision making. This work is therefore new in that it is the first time that proper documentation would be made in the direction of the current research activity.

Application of Decision Tree as a Data Mining Tool in a Manufacturing System

Database Technologies ◽ 10.4018/978-1-60566-058-5.ch054 ◽ 2009 ◽ pp. 940-955

Author(s): S. A. Oke

Keyword(s): Data Mining ◽ Decision Making ◽ Decision Tree ◽ Manufacturing Systems ◽ Manufacturing System ◽ Research Activity ◽ Data Mining Tool ◽ Mining Tool ◽ Classification Prediction ◽ Effective Decision Making

This work demonstrates the application of decision tree, a data mining tool, in the manufacturing system. Data mining has the capability for classification, prediction, estimation, and pattern recognition by using manufacturing databases. Databases of manufacturing systems contain significant information for decision making, which could be properly revealed with the application of appropriate data mining techniques. Decision trees are employed for identifying valuable information in manufacturing databases. Practically, industrial managers would be able to make better use of manufacturing data at little or no extra investment in data manipulation cost. The work shows that it is valuable for managers to mine data for better and more effective decision making. This work is therefore new in that it is the first time that proper documentation would be made in the direction of the current research activity.

Application of Decision Tree as a Data mining Tool in a Manufacturing System

Intelligent Databases ◽ 10.4018/978-1-59904-120-9.ch006 ◽ 2011 ◽ pp. 117-136

Author(s): S.A. Oke

Keyword(s): Data Mining ◽ Decision Making ◽ Decision Tree ◽ Manufacturing Systems ◽ Manufacturing System ◽ Research Activity ◽ Data Mining Tool ◽ Mining Tool ◽ Classification Prediction ◽ Effective Decision Making

This work demonstrates the application of decision tree, a data mining tool, in the manufacturing system. Data mining has the capability for classification, prediction, estimation, and pattern recognition by using manufacturing databases. Databases of manufacturing systems contain significant information for decision making, which could be properly revealed with the application of appropriate data mining techniques. Decision trees are employed for identifying valuable information in manufacturing databases. Practically, industrial managers would be able to make better use of manufacturing data at little or no extra investment in data manipulation cost. The work shows that it is valuable for managers to mine data for better and more effective decision making. This work is therefore new in that it is the first time that proper documentation would be made in the direction of the current research activity.
