Penerapan Algoritma C5.0 Untuk Prediksi Kelulusan Pembelajaran Mahasiswa Pada Matakuliah Arsitektur Sistem Komputer

Computer system architecture is a required course in the informatics engineering study program. Whether each student passes the course is an important aspect that the study program must evaluate every semester. A student passing the course indicates that the learning process is going well and that the material presented by the course lecturer can be digested by students. Whether a student passes can be predicted from the student's habit patterns. Data mining is one way to discover such habit patterns from collected data. Data mining is the process of extracting, from a collection of data, valuable information that companies, agencies or organizations can use in decision-making. Predicting course outcomes with data mining can be treated as a classification problem on the data set. The C5.0 algorithm is an improvement on the C4.5 algorithm; the process is almost the same, but C5.0 has advantages over its predecessor. The output of the C5.0 algorithm is a decision tree or a rule set formed from entropy and gain values. Prediction is based on C5.0 classification using four attributes: Attendance Value, Assignment Value, UTS (midterm exam) Value and UAS (final exam) Value. The final result of the classification process is a decision tree and the rules it contains. The C5.0 algorithm achieved an accuracy of 93.33%.
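
The entropy and gain values mentioned in the abstract above can be illustrated with a minimal Python sketch. The records, attribute names and labels below are hypothetical and are not taken from the paper's data set. The sketch shows only the basic entropy and information-gain computation shared by the C4.5/C5.0 family, not the full C5.0 procedure.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())


def information_gain(rows, labels, attribute):
    """Entropy reduction obtained by splitting the rows on one attribute."""
    total = len(labels)
    partitions = {}
    for row, label in zip(rows, labels):
        partitions.setdefault(row[attribute], []).append(label)
    # Weighted entropy of the subsets produced by the split.
    remainder = sum(len(p) / total * entropy(p) for p in partitions.values())
    return entropy(labels) - remainder


# Hypothetical toy records (assumption: values already discretized).
rows = [
    {"attendance": "high", "uas": "good"},
    {"attendance": "high", "uas": "poor"},
    {"attendance": "low", "uas": "good"},
    {"attendance": "low", "uas": "poor"},
]
labels = ["pass", "pass", "fail", "fail"]

gains = {name: information_gain(rows, labels, name) for name in ("attendance", "uas")}
best_attribute = max(gains, key=gains.get)
```

In this toy data the attendance attribute separates the classes perfectly, so it would be chosen as the first split. Real implementations also handle numeric thresholds, gain ratio and pruning, which are omitted here.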

PENERAPAN DATA MINING MENGGUNAKAN ALGORITMA C4.5 TEHADAP PENGARUH PENJUALAN KOPI PADA PT. JPW INDONESIA

Jurnal Sistem Informasi dan Informatika (Simika) ◽ 10.47080/simika.v3i1.836 ◽ 2020 ◽ Vol 3 (1) ◽ pp. 40-54

Author(s): Ikong Ifongki

Keyword(s): Data Mining ◽ Decision Tree ◽ Decision Rules ◽ Large Data ◽ Added Value ◽ Data Set ◽ Use Of Data ◽ Decision Tree Classification ◽ C4.5 Algorithm

Data mining is a series of processes for extracting added value from a data set in the form of knowledge that could not be found manually. Data mining techniques are expected to surface knowledge previously hidden in the data warehouse and turn it into valuable information. The C4.5 algorithm is a widely used decision tree classification algorithm because of its main advantages over other algorithms: it produces decision trees that are easy to interpret, has an acceptable level of accuracy, and handles both discrete and numeric attributes efficiently. The output of the C4.5 algorithm, as with other decision tree classification techniques, is a decision tree: a structure that divides a large data set into smaller sets of records by applying a series of decision rules, so that with each division the members of the resulting sets become more similar to one another. This case study examines the factors that affect coffee sales by processing 106 records out of 1087 coffee sales records at PT. JPW Indonesia. The sampled data are calculated manually using Microsoft Excel and with RapidMiner software. The C4.5 results show that the Quantity and Price attributes strongly affect coffee sales, so sales at PT. JPW Indonesia are still often unstable.

Data Mining Implementation Using Naïve Bayes Algorithm and Decision Tree J48 In Determining Concentration Selection

International Journal of Quantitative Research and Modeling ◽ 10.46336/ijqrm.v1i3.72 ◽ 2020 ◽ Vol 1 (3) ◽ pp. 123-134

Author(s): Budiman Budiman ◽ Reni Nursyanti ◽ R Yadi Rakhman Alamsyah ◽ Imannudin Akbar

Keyword(s): Data Mining ◽ Decision Tree ◽ Naive Bayes ◽ Naïve Bayes ◽ Training Data ◽ Study Program ◽ Data Set ◽ Lower Accuracy ◽ Accuracy Result ◽ Bayes Algorithm

The computerization of society has substantially improved the ability to generate and collect data from a variety of sources, and large amounts of data have flooded almost every aspect of people's lives. AMIK HASS Bandung has an Informatic Management Study Program with three areas of concentration that students can select in the fourth semester: Computerized Accounting, Computer Administration, and Multimedia. Concentration selection should be determined precisely from past data, so the academic section needs a pattern or rule for predicting it. This work applies the Naive Bayes and Decision Tree J48 data mining techniques using the WEKA tool. The data set contains 111 records and is evaluated in percentage-split mode, with 75% used as training data to build the models and 25% used as test data for both models. Naive Bayes obtained the highest accuracy, 71.4%, with 20 of the 28 test instances correctly classified. Decision Tree J48 had a lower accuracy of 64.3%, with 18 of the 28 test instances correctly classified. Decision Tree J48 produced 4 patterns or rules for determining concentration selection, which the academic section can use to assist students in choosing a concentration.
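
The split sizes and accuracy figures reported above can be checked with a short Python sketch. Deriving the training size by rounding 75% of the data set and taking the remainder as test data is an assumption about how the split was computed; the counts 111, 20, 18 and 28 come from the abstract.

```python
def accuracy_percent(correct, total):
    """Accuracy as a percentage, rounded to one decimal place."""
    return round(100 * correct / total, 1)


dataset_size = 111
# Assumption: training size is 75% rounded to the nearest integer,
# and the remaining records form the test set.
train_size = round(dataset_size * 0.75)
test_size = dataset_size - train_size

naive_bayes_accuracy = accuracy_percent(20, test_size)
j48_accuracy = accuracy_percent(18, test_size)
```

Under that assumption the test set has 28 instances, which matches the denominators behind the reported 71.4% and 64.3%.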

Penerapan Metode Klasifikasi Decision Tree dan Algoritma C4.5 dalam Memprediksi Kriteria Nasabah Kredit Mega Auto Finance

JURIKOM (Jurnal Riset Komputer) ◽ 10.30865/jurikom.v7i2.1762 ◽ 2020 ◽ Vol 7 (2) ◽ pp. 200

Author(s): Puji Santoso ◽ Rudy Setiawan

Keyword(s): Data Mining ◽ Decision Tree ◽ Microsoft Excel ◽ Customer Data ◽ Data Mining Techniques ◽ C4.5 Algorithm ◽ Marketing Costs ◽ Excel Format ◽ Data Mining Application

One of the tasks in marketing finance is to analyze customer data to find out which customers have the potential to take out credit again. The current method classifies all customers who have completed their credit installments as marketing targets, which causes high operational marketing costs. This research was therefore conducted to help solve that problem by designing a data mining application that predicts the criteria of credit customers with the potential to take out credit with Mega Auto Finance. The Mega Auto Finance Fund Section in Kotim Regency was chosen as the case study, on the assumption that it has experienced the problem described above. The data mining technique applied in the application is classification, the classification method is the decision tree, and the algorithm used to build the tree is C4.5. The data processed in this study are the installment data of Mega Auto Finance loan customers in July 2018, in Microsoft Excel format. The result of this study is an application that helps the Mega Auto Finance Fund Section obtain future credit marketing targets.

Improved differentiation classification of variable precision artificial intelligence higher education management

Journal of Intelligent & Fuzzy Systems ◽ 10.3233/jifs-219036 ◽ 2021 ◽ pp. 1-10

Author(s): Chao Dong ◽ Yan Guo

Keyword(s): Artificial Intelligence ◽ Higher Education ◽ Data Mining ◽ Decision Tree ◽ Classification Accuracy ◽ Attribute Selection ◽ Higher Education Management ◽ Education Management ◽ Decision Tree Classification

The wide application of artificial intelligence technology in various fields has accelerated the exploration of the information hidden in large amounts of data. People hope to use data mining methods to conduct effective research on higher education management. The decision tree classification algorithm, a data analysis method in data mining, offers high classification accuracy, intuitive decision results, and high generalization ability, which make it a well-suited method for higher education management. To address the sensitivity of data processing and decision tree classification to noisy data, this paper proposes a variable precision rough set attribute selection criterion based on a scale function, which considers both the weighted approximation accuracy of an attribute and the number of its attribute values. This improves robustness to noisy data, reduces bias in attribute selection, and improves classification accuracy. In addition, a suppression factor threshold, support, and confidence are introduced in the tree pre-pruning process, which simplifies the tree structure. Comparative experiments on standard data sets show that the improved algorithm outperforms other decision tree algorithms and can effectively realize differentiated classification for higher education management.

DECISION TREE C4.5 ALGORITHM FOR TUITION AID GRANT PROGRAM CLASSIFICATION (CASE STUDY: DEPARTMENT OF INFORMATION SYSTEM, UNIVERSITAS TEKNOKRAT INDONESIA)

Edutic - Scientific Journal of Informatics Education ◽ 10.21107/edutic.v7i1.8849 ◽ 2020 ◽ Vol 7 (1)

Author(s): Heni Sulistiani ◽ Ahmad Ari Aldino

Keyword(s): Decision Tree ◽ Classification Model ◽ Grant Program ◽ C4.5 Algorithm ◽ Long Time ◽ The Right ◽ Short Time ◽ Tuition Fee

In the pandemic era, almost everyone struggles to make a living, and college students are one example: they have difficulty paying tuition fees to continue their studies. In response, Universitas Teknokrat Indonesia offers a tuition fee aid program to students with good academic performance. The many variables used to determine the grant make it hard to reach a decision in a short time. To make it easier for management to decide which students should receive the grant, a classification model is needed. The purpose of this study is to classify grant recipients using the C4.5 decision tree algorithm, which determines whether a prospective student should be accepted as an awardee. The classification results are validated with ten-fold cross-validation, yielding accuracy, precision, and recall of 87% each. This means the model performs well enough to be implemented in a system.
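
The ten-fold cross-validation and the precision and recall metrics mentioned above can be sketched in plain Python. This is only an illustration under stated assumptions: the function names are hypothetical, the folds are contiguous and unshuffled, and the abstract does not say which tool or fold assignment the authors used.

```python
def k_fold_indices(n_samples, k=10):
    """Yield (train_indices, test_indices) pairs for k contiguous folds."""
    indices = list(range(n_samples))
    # Spread the remainder over the first folds so sizes differ by at most one.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        test = indices[start:start + size]
        train = indices[:start] + indices[start + size:]
        yield train, test
        start += size


def precision_recall(y_true, y_pred, positive):
    """Precision and recall for one class treated as the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


# Hypothetical usage: 25 samples split into 10 folds.
folds = list(k_fold_indices(25, k=10))
```

Each sample lands in exactly one test fold. In a full evaluation a model would be trained on each `train` split and scored on the matching `test` split, with the per-fold metrics averaged.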

Penerapan Metode Klasifikasi Decision Tree Pada Status Gizi Balita Di Kabupaten Simalungun

KESATRIA: Jurnal Penerapan Sistem Informasi (Komputer & Manajemen) ◽ 10.30645/kesatria.v1i2.23 ◽ 2020 ◽ Vol 1 (2) ◽ pp. 68-72

Author(s): Hananda Hafizan ◽ Anggita Nadia Putri

Keyword(s): Decision Tree ◽ Nutritional Status ◽ Public Health Service ◽ Children Under Five ◽ Under Five ◽ C4.5 Algorithm ◽ Decision Tree Method ◽ Nutritional Status Of Children ◽ Tree Method

One of the health problems in Indonesia is the nutritional status of children under five years of age. Malnutrition is not only a family problem but also a national problem. The nutritional status of children under five can be assessed by measuring the human body, known as anthropometry. Anthropometric examinations and measurements to determine the nutritional status of children under five are available at public health service posts such as the Posyandu. This study was conducted at the KENANGA Posyandu in Wonorejo, Kerasaan sub-district, Simalungun district. Its purpose is to test a model for classifying the nutritional status of children under five against the WHO-2005 reference standard by using data mining with the C4.5 decision tree algorithm.

Discriminating Developing versus Nondeveloping Tropical Disturbances in the Western North Pacific through Decision Tree Analysis

Weather and Forecasting ◽ 10.1175/waf-d-14-00023.1 ◽ 2015 ◽ Vol 30 (2) ◽ pp. 446-454 ◽ Cited By ~ 12

Author(s): Wei Zhang ◽ Bing Fu ◽ Melinda S. Peng ◽ Tim Li

Keyword(s): Decision Tree ◽ Air Temperature ◽ North Pacific ◽ Western North Pacific ◽ Temperature Anomaly ◽ Relative Vorticity ◽ Precipitation Rate ◽ C4.5 Algorithm ◽ Air Temperature Anomaly

This study investigates the classification of developing and nondeveloping tropical disturbances in the western North Pacific (WNP) through the C4.5 algorithm. A decision tree is built based on this algorithm and can be used as a tool to predict future tropical cyclone (TC) genesis events. The results show that the maximum 800-hPa relative vorticity, SST, precipitation rate, divergence averaged between 1000- and 500-hPa levels, and 300-hPa air temperature anomaly are the five most important variables for separating the developing and nondeveloping tropical disturbances. This algorithm also unravels the thresholds of the five variables (i.e., 4.2 × 10⁻⁵ s⁻¹ for maximum 800-hPa relative vorticity, 28.2°C for SST, 0.1 mm h⁻¹ for precipitation rate, −0.7 × 10⁻⁶ s⁻¹ for vertically averaged convergence, and 0.5°C for 300-hPa air temperature anomaly). Six rules are derived from the decision tree. The classification accuracy of this decision tree is 81.7% for the 2004–10 cases. The hindcast accuracy for the 2011–13 dataset is 84.6%.

Data-Driven Decision Tree Classification for Product Portfolio Design Optimization

Journal of Computing and Information Science in Engineering ◽ 10.1115/1.3243634 ◽ 2009 ◽ Vol 9 (4) ◽ Cited By ~ 25

Author(s): Conrad S. Tucker ◽ Harrison M. Kim

Keyword(s): Data Mining ◽ Decision Tree ◽ Engineering Design ◽ Optimization Techniques ◽ Product Portfolio ◽ Performance Expectations ◽ Data Set ◽ Tree Data ◽ Portfolio Design ◽ Product Concepts

The formulation of a product portfolio requires extensive knowledge about the product market space and also the technical limitations of a company’s engineering design and manufacturing processes. A design methodology is presented that significantly enhances the product portfolio design process by eliminating the need for an exhaustive search of all possible product concepts. This is achieved through a decision tree data mining technique that generates a set of product concepts that are subsequently validated in the engineering design using multilevel optimization techniques. The final optimal product portfolio evaluates products based on the following three criteria: (1) it must satisfy customer price and performance expectations (based on the predictive model) defined here as the feasibility criterion; (2) the feasible set of products/variants validated at the engineering level must generate positive profit that we define as the optimality criterion; (3) the optimal set of products/variants should be a manageable size as defined by the enterprise decision makers and should therefore not exceed the product portfolio limit. The strength of our work is to reveal the tremendous savings in time and resources that exist when decision tree data mining techniques are incorporated into the product portfolio design and selection process. Using data mining tree generation techniques, a customer data set of 40,000 responses with 576 unique attribute combinations (entire set of possible product concepts) is narrowed down to 46 product concepts and then validated through the multilevel engineering design response of feasible products. A cell phone example is presented and an optimal product portfolio solution is achieved that maximizes company profit, without violating customer product performance expectations.

CUDT: A CUDA Based Decision Tree Algorithm

The Scientific World JOURNAL ◽ 10.1155/2014/745640 ◽ 2014 ◽ Vol 2014 ◽ pp. 1-12 ◽ Cited By ~ 18

Author(s): Win-Tsung Lo ◽ Yue-Shan Chang ◽ Ruey-Kai Sheu ◽ Chun-Chieh Chiu ◽ Shyan-Ming Yuan

Keyword(s): Data Mining ◽ Decision Tree ◽ New Technology ◽ Large Data ◽ Decision Tree Algorithm ◽ Data Set ◽ Tree Algorithm ◽ Ubiquitous Sensing ◽ Device Architecture ◽ Huge Data

The decision tree is one of the best-known classification methods in data mining. Much research has focused on improving decision tree performance, but those algorithms were developed for and run on traditional distributed systems. Without the help of new technology, latency cannot be improved when processing the huge volumes of data generated by ubiquitous sensing nodes. To reduce data processing latency in large-scale data mining, this paper designs and implements a new parallelized decision tree algorithm on CUDA (compute unified device architecture), a GPGPU solution provided by NVIDIA. In the proposed system, the CPU is responsible for flow control while the GPU is responsible for computation. Experiments evaluating the performance of CUDT against a traditional CPU version show that CUDT is 5 to 55 times faster than Weka-j48 and achieves an 18-times speedup over SPRINT on large data sets.

Classification of Tumors in Brain MRI Images With Hybrid of Global and Local DWT Features using Decision Tree

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽ 10.35940/ijitee.c4659.1081219 ◽ 2019 ◽ Vol 8 (12) ◽ pp. 3072-3077

Keyword(s): Brain Tumor ◽ Brain Tumors ◽ Decision Tree ◽ Brain Mri ◽ Extraction Process ◽ Discrete Wavelet ◽ Global Features ◽ Tumor Identification ◽ The Brain

Automated brain tumor identification and classification is still an open research problem in medical image processing. A brain tumor is a mass of unwanted cells that develops in the brain; its growth takes up space within the skull and affects the normal functioning of the brain. Automated segmentation and detection of brain tumors is important in MRI scan analysis because it provides information about the neural architecture of the brain and about abnormal tissues, which is necessary for identifying an appropriate surgical plan. Automating this process is challenging because tumor tissues vary widely in appearance across patients and in many cases look very similar to normal tissue. Effective extraction of features that represent the tumor in a brain image is the key to better classification. This paper proposes a hybrid feature extraction process that combines local and global features of the brain MRI, first using the Discrete Wavelet Transformation and then using texture-based statistical features computed from the Gray Level Co-occurrence Matrix. The combined features are used to construct a decision tree that classifies brain tumors as benign or malignant.
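
As an illustration of the texture features mentioned above, the following minimal Python sketch builds a Gray Level Co-occurrence Matrix for a tiny image and derives one statistic, contrast, from it. The 3×3 patch, the three gray levels and the horizontal offset are hypothetical choices. The sketch does not reproduce the paper's pipeline: the wavelet step, the full feature set and the classifier are omitted.

```python
def glcm(image, levels, offset=(0, 1)):
    """Count how often gray level i has gray level j at the given offset.

    The matrix is not symmetrized or normalized.
    """
    d_row, d_col = offset
    matrix = [[0] * levels for _ in range(levels)]
    for r, row in enumerate(image):
        for c, value in enumerate(row):
            nr, nc = r + d_row, c + d_col
            # Skip neighbours that fall outside the image.
            if 0 <= nr < len(image) and 0 <= nc < len(image[nr]):
                matrix[value][image[nr][nc]] += 1
    return matrix


def contrast(matrix):
    """GLCM contrast: co-occurrence-weighted squared gray-level difference."""
    total = sum(sum(row) for row in matrix)
    if total == 0:
        return 0.0
    weighted = sum(
        count * (i - j) ** 2
        for i, row in enumerate(matrix)
        for j, count in enumerate(row)
    )
    return weighted / total


# Hypothetical 3x3 patch quantized to gray levels 0..2.
patch = [
    [0, 0, 1],
    [1, 2, 2],
    [0, 1, 2],
]
cooccurrence = glcm(patch, levels=3)
patch_contrast = contrast(cooccurrence)
```

Statistics like this one, computed per image, form a feature vector that a decision tree classifier can take as input.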