The Analysis and Application of the C4.5 Algorithm in Decision Tree Technology

2012 ◽  
Vol 457-458 ◽  
pp. 754-757
Author(s):  
Hong Yan Zhao

The Decision Tree technology, which is the main technology of the Data Mining classification and forecast, is the classifying rule that infers the Decision Tree manifestation through group of out-of-orders, the non-rule examples. Based on the research background of The Decision Tree’s concept, the C4.5 Algorithm and the construction of The Decision Tree, the using of C4.5 Decision Tree Algorithm was applied to result analysis of students’ score for the purpose of improving the teaching quality.

2013 ◽  
Vol 397-400 ◽  
pp. 2296-2300 ◽  
Author(s):  
Fei Shuai ◽  
Jun Quan Li

In current, there are complex relationship between the assets of information security product. According to this characteristic, we propose a new asset recognition algorithm (ART) on the improvement of the C4.5 decision tree algorithm, and analyze the computational complexity and space complexity of the proposed algorithm. Finally, we demonstrate that our algorithm is more precise than C4.5 algorithm in asset recognition by an application example whose result verifies the availability of our algorithm.Keywordsdecision tree, information security product, asset recognition, C4.5


Author(s):  
Sujuan Jia ◽  
Yajing Pang

Vast data in the higher education system are used to analyse and evaluate the teaching quality, so that the key factors that affect the quality of teaching can be predicted. Besides, the learner’s personalized behaviour can also become the data source for teaching result prediction. This paper proposes a decision tree model by taking the teaching quality data and the statistical analysis results of the learn-er’s personalized behaviour as inputs. This model was based on the improved C4.5 decision tree algorithm, which used the FAYYAD boundary point decision theorem for effectively reducing the computation time to the most threshold. In this algorithm, the iterative analysis mechanism was introduced in combination with the data change of the learner’s personalized behaviour, so as to dynamically adjust the final teaching evaluation result. Finally, according to the actual statisti-cal data of one academic year, the teaching quality evaluation was effectively completed and the direction of future teaching prediction was proposed.


2014 ◽  
Vol 926-930 ◽  
pp. 703-707
Author(s):  
Hu Yong

Aimed at the student the result problem, give student the result data scoops out the model. The decision tree method is a very valid classification method, in the data that scoop out. According to student the result data characteristics, adopted the C4.5 decision tree algorithm. C4.5 algorithm is the improvement algorithm of the decision trees core algorithm ID3, it construct in brief, the speed compare quickly, easy realization. Selection decision belongs to sex, scoop out the result enunciation, that algorithm can be right to get student the result data classification, and some worthy conclusion, provide the decision the analysis.


2020 ◽  
Vol 27 (3) ◽  
pp. 29-43
Author(s):  
Sihem Oujdi ◽  
Hafida Belbachir ◽  
Faouzi Boufares

Using data mining techniques on spatial data is more complex than on classical data. To be able to extract useful patterns, the spatial data mining algorithms must deal with the representation of data as stack of thematic layers and consider, in addition to the object of interest itself, its neighbors linked through implicit spatial relations. The application of the classification by decision trees combined with the visualization tools represents a convenient decision support tool for spatial data analysis. The purpose of this paper is to provide and evaluate an alternative spatial classification algorithm that supports the thematic-layered data organization, by the adaptation of the C4.5 decision tree algorithm to spatial data, named S-C4.5, inspired by the SCART and spatial ID3 algorithms and the adoption of the Spatial Join Index. Our work concerns both data organization and the algorithm adaptation. Decision tree construction was experimented on traffic accident dataset and benchmarked on both computation time and memory consumption according to different experimentations: study of phenomenon by a single and then by multiple other phenomena, including one or more spatial relations. Different approaches used show compromised and balanced results between memory usage and computation time.


2019 ◽  
Vol 7 (2) ◽  
Author(s):  
Dyah Wulandari ◽  
Nur Lutfiyana ◽  
Heny Sumarno

Abstract - Credit is the provision of money or equivalent claims, based on agreements or agreements on loans between banks and other parties which require the borrowing party to repay the debt after a certain period of time with the amount of interest, compensation or profit sharing. From the credit customer data available at BSM KCP Kemang Pratama still has Non Performing Financing (NPF) or Bad Credit.In analyzing a credit sometimes an analyst does an inaccurate analysis, so there are some customers who are less able to make credit payments, resulting in bad credit. So the researchers conducted an analysis using the C4.5 decision tree algorithm and Rapid Miner application for determining credit worthiness. From the analysis of credit customer data using the C4.5 decision tree algorithm method, the feasibility of credit recipient customers is very effective and produces a value of accuracy on Rapid Miner 5.3 of 80%, Precision of 100% and Recall of 0% so as to minimize the risk.Keywords— Credit, C4.5 Algorithm, Rapid Miner, Value AccuracyAbstrak - Kredit merupakan penyediaan uang atau tagihan yang dapat disamakan dengan hal itu, berdasarkan persetujuan atau kesepakatan pinjaman-pinjaman antara bank dengan pihak lain yang mewajibkan pihak peminjam untuk melunasi utangnya setelah jangka waktu tertentu dengan jumlah bunga, imbalan atau pembagian hasil keuntungan. Dari data nasabah kredit yang ada pada BSM KCP Kemang Pratama masih memiliki Non Performing Financing (NPF) atau Kredit Macet. Dalam menganalisa sebuah kredit terkadang seorang analis melakukan analisa tidak akurat, sehingga ada beberapa nasabah yang kurang mampu dalam melakukan pembayaran kredit, dan pada akhirnya mengakibatkan kredit macet. Peneliti melakukan analisis menggunakan algoritma decision tree C4.5 dan aplikasi Rapid Miner untuk penentuan kelayakan pemberian kredit. Dari analisis data nasabah kredit menggunakan metode Algoritma decision tree C4.5 menghasilkan kelayakan nasabah penerima kredit sangat efektif dan menghasilkan nilai akurasi pada Rapid Miner 5.3 sebesar 80%, Precision sebesar 100% dan Recall sebesar 0% sehingga dapat meminimalisir resiko yang terjadi.Kata kunci— Kredit, Algoritma C4.5, Rapid Miner, Nilai Akurasi


2014 ◽  
Vol 543-547 ◽  
pp. 1639-1642 ◽  
Author(s):  
Liang Li ◽  
Ying Zheng ◽  
Xiao Hua Sun ◽  
Fu Shun Wang

According to students' employment problem, employment data mining model of university graduates is presented. The decision tree is very effective means for classification, which is proposed according to the characteristics of employment data and C4.5 algorithm. The C4.5 algorithm is improved from ID3 algorithm that is the core algorithm in the decision tree. The C4.5 algorithm is suitable for its simple construction, high processing speed and easy implementation. The model includes preprocess of the data of employment selection of decision attributes, implementation of mining algorithm, and obtainment of rules from the decision tree. The rules point out which decision attributes decide the classification of employers. Case study shows that the decision tree algorithm applied to employment information data mining, can classify data of employment correctly with simple structure and faster speed, and find some valuable results for analysis and decision. so the proposed algorithm in this paper is effective.


2018 ◽  
Vol 7 (2) ◽  
pp. 200-210
Author(s):  
Ronaldo Syahputra ◽  
Wifra Safitri

The Karate sport is a kind of sport that is quite popular today. All regions in Indonesia are racing to improve the performance of their karate athletes. Various developments were carried out to be able to improve karate sports achievements. This research will later be used as a benchmark for developing and realizing good sports performance, especially in the karate by using the concept of data mining. To apply the concept of Data Mining, one way that can be done is to implement the C4.5 algorithm. C4.5 algorithm or also called decision tree algorithm, is a very strong and well-known classification and prediction method. The application of the concept of data mining with C4.5 algorithm is done by analyzing what factors support the achievement of karate sports. After that, the C4.5 algorithm is calculated to find out what is the most decisive factor in the development of karate sports achievements. This aims to maximize the role of these achievement supporting factors. The results of this study are expected to provide great benefits for the development and improvement of the achievements of karate athletes in West Sumatra.


2015 ◽  
Vol 4 (3) ◽  
pp. 173-182
Author(s):  
Salih Özsoy ◽  
Gökhan Gümüş ◽  
Savriddin KHALILOV

In this study, Data Mining, one of the latest technologies of the Information Systems, was introduced and Classification a Data Mining method and the Classification algorithms were discussed. A classification was applied by using C4.5 decision tree algorithm on a dataset about Labor Relations from http://archive.ics.uci.edu/ml/datasets.html. Finally, C4.5 algorithm was compared to some other decision tree algorithms. C4.5 was the one of the successful classifier.


2014 ◽  
Vol 538 ◽  
pp. 460-464
Author(s):  
Xue Li

Based on inter-correlation and permeability among disciplines, the author makes an attempt to apply the information science to cognitive linguistics to provide a new perspective for the study of foreign languages. The correlation between self-efficacy and such four factors as anxiety, learning strategies, motivation and learners’ past achievement is analyzed by means of data mining and the extent to which the above factors affect self-efficacy in language learning is explored in this paper. The paper employs the decision tree algorithm in SPSS Clementine. C5.0 decision tree algorithm is adopted to analyze data in the study. The results are elicited from the researches carried out in this paper. The increased anxiety is bound to weaken learners’ motivation over time. It is obvious that learners have low self-efficacy. It is very important to employ strategies in foreign language learning. Ignorance of using learning strategies may result in unplanned learning with unsatisfactory achievements in spite of more efforts involved. Self-efficacy in foreign language learning may be weakened accordingly. Learners’ past achievement is a reference dimension in measuring self-efficacy with weaker influence.


Sign in / Sign up

Export Citation Format

Share Document