HYPER HEURISTIC EVOLUTIONARY APPROACH FOR CONSTRUCTING DECISION TREE CLASSIFIERS

Sunil Kumar; Saroj Ratnoo; Jyoti Vashishtha

doi:10.32890/jict2021.20.2.5

Journal of Information and Communication Technology ◽

10.32890/jict2021.20.2.5 ◽

2021 ◽

Vol 20 (Number 2) ◽

pp. 249-276

Author(s):

Sunil Kumar ◽

Saroj Ratnoo ◽

Jyoti Vashishtha

Keyword(s):

Decision Tree ◽

Heuristic Approach ◽

Decision Tree Model ◽

Evolutionary Approach ◽

Optimal Decision ◽

Decision Tree Classifier ◽

Tree Model ◽

Tree Construction ◽

Tree Classifier ◽

Optimal Values

Decision tree models have earned a special status in predictive modeling because they are considered comprehensible for human analysis and insight. The Classification and Regression Tree (CART) algorithm is one of the best-known decision tree induction algorithms and addresses both classification and regression problems. Finding optimal values for the hyperparameters of a decision tree construction algorithm is a challenging issue. To build an effective decision tree classifier with high accuracy and comprehensibility, we need to set suitable values for hyperparameters such as the maximum size of the tree, the minimum number of instances required in a node for inducing a split, the node splitting criterion, and the amount of pruning. The hyperparameter setting influences the performance of the decision tree model, and no single setting works equally well for different datasets: a setting that gives an optimal decision tree for one dataset may produce a sub-optimal model for another. In this paper, we present a hyper heuristic approach for tuning the hyperparameters of Recursive and Partition Trees (rpart), a typical implementation of CART in the statistical and data analytics package R. We employ an evolutionary algorithm as the hyper heuristic for tuning the hyperparameters of the decision tree classifier. The approach is named Hyper heuristic Evolutionary Approach with Recursive and Partition Trees (HEARpart). The proposed approach is validated on 30 datasets. Statistical tests show that HEARpart performs significantly better than WEKA's J48 algorithm in terms of error rate, F-measure, and tree size. Further, the suggested hyper heuristic algorithm constructs significantly more comprehensible models than WEKA's J48, CART and other similar decision tree construction strategies. The results also show that the accuracy achieved by the hyper heuristic approach is slightly lower than that of the other approaches compared.
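To make the idea of evolutionary hyperparameter tuning concrete, here is a minimal Python sketch of an elitist evolutionary loop over a tree's hyperparameters. It is not the HEARpart algorithm: the search space, the operators, and the names `SPACE`, `evolve`, and `toy_fitness` are all assumptions made for illustration, and the fitness function is a synthetic placeholder rather than the error rate of a trained rpart tree.

```python
import random

# Hypothetical search space, loosely modelled on the hyperparameters the
# abstract lists (tree size, minimum split size, splitting criterion, pruning).
SPACE = {
    "max_depth": list(range(1, 31)),
    "min_split": list(range(2, 61)),
    "criterion": ["gini", "information"],
    "cp": [i / 1000 for i in range(0, 101)],  # pruning strength
}


def random_config(rng):
    """Draw one value per hyperparameter uniformly at random."""
    return {name: rng.choice(values) for name, values in SPACE.items()}


def mutate(config, rng, rate=0.3):
    """Resample each hyperparameter with probability `rate`."""
    child = dict(config)
    for name, values in SPACE.items():
        if rng.random() < rate:
            child[name] = rng.choice(values)
    return child


def crossover(a, b, rng):
    """Uniform crossover: each gene comes from one of the two parents."""
    return {name: rng.choice((a[name], b[name])) for name in SPACE}


def evolve(fitness, generations=30, pop_size=20, seed=0):
    """Minimise `fitness` with truncation selection and elitism."""
    rng = random.Random(seed)
    population = [random_config(rng) for _ in range(pop_size)]
    history = []  # best fitness seen at the start of each generation
    for _ in range(generations):
        population.sort(key=fitness)
        history.append(fitness(population[0]))
        parents = population[: pop_size // 2]  # keep the better half
        children = [
            mutate(crossover(rng.choice(parents), rng.choice(parents), rng), rng)
            for _ in range(pop_size - len(parents))
        ]
        population = parents + children  # parents survive unchanged
    return min(population, key=fitness), history


def toy_fitness(config):
    """Placeholder objective. A real run would instead return something like
    the cross-validated error rate of a tree trained with `config`."""
    return (
        abs(config["max_depth"] - 6) / 30
        + abs(config["min_split"] - 20) / 60
        + abs(config["cp"] - 0.01)
        + (0.0 if config["criterion"] == "gini" else 0.05)
    )


best_config, best_per_generation = evolve(toy_fitness)
```

Because the parents are carried over unchanged, the best fitness never gets worse from one generation to the next. Replacing `toy_fitness` with a function that trains and evaluates a tree is the step that would turn this sketch into an actual tuner.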

Design of a workspace temperature and humidity classification device using a decision tree model

Electro Luceat ◽

10.32531/jelekn.v6i2.228 ◽

2020 ◽

Vol 6 (2) ◽

pp. 169-178

Author(s):

Wahyu Setiady ◽

Y.B. Adyapaka Apatya

Keyword(s):

Decision Tree ◽

Decision Tree Model ◽

Raspberry Pi ◽

Decision Tree Classifier ◽

Tree Model ◽

Tree Classifier ◽

Temperature And Humidity

This work covers the design and construction of a workspace temperature and humidity classification device using a decision tree model. According to the standard table of technical planning procedures for energy conservation in buildings, the optimal comfort temperature lies in the range 22.8 °C to 25.8 °C, with an upper threshold of 28 °C and a humidity of 70%. Using a decision tree classifier, the room temperature and humidity detected by a DHT11 sensor are classified according to a model built with a Raspberry Pi 3 and Node-RED. The research was carried out in the computer laboratory of Politeknik Industri ATMI, which also serves as an applied research laboratory that collaborates with industry on automation software development. The study succeeded in building a workspace temperature and humidity classification device using a decision tree model that outputs the statuses cold, comfortably cool, optimally comfortable, comfortably warm, and hot, with a model prediction score of 0.983.

An Optimal Decision Tree Model for Diabetes Diagnosis

2019 4th International Conference on Computational Intelligence and Applications (ICCIA) ◽

10.1109/iccia.2019.00023 ◽

2019 ◽

Cited By ~ 1

Author(s):

Zhen Sun ◽

Songsen Yu ◽

Yang Zhang

Keyword(s):

Decision Tree ◽

Decision Tree Model ◽

Optimal Decision ◽

Diabetes Diagnosis ◽

Tree Model

Applying particle swarm optimization-based decision tree classifier for wart treatment selection

Complex & Intelligent Systems ◽

10.1007/s40747-021-00348-3 ◽

2021 ◽

Author(s):

Junhua Hu ◽

Xiangzhu Ou ◽

Pei Liang ◽

Bo Li

Keyword(s):

Decision Tree ◽

Particle Swarm ◽

Classification And Regression Tree ◽

Particle Swarm Algorithm ◽

Decision Tree Classifier ◽

Tree Model ◽

Proposed Model ◽

Tree Classifier ◽

Cart Algorithm ◽

Better Than

Warts are a disease caused by human papillomavirus, with common and plantar warts as the general forms. Commonly used methods to treat warts are immunotherapy and cryotherapy, and selecting the proper treatment is vital to curing them. This paper establishes a classification and regression tree (CART) model based on particle swarm optimisation (PSO) to help patients choose between immunotherapy and cryotherapy. The proposed model can accurately predict the response of patients to the two methods. Using an improved PSO algorithm to optimise the parameters of the model instead of the traditional pruning algorithm yields a more concise and more accurate model. Two experiments are conducted to verify the feasibility of the proposed model. On the one hand, five benchmarks are used to verify the performance of the improved PSO algorithm. On the other hand, an experiment on two wart datasets is conducted. Results show that the proposed model is effective. The proposed method classifies better than k-nearest neighbour, C4.5 and logistic regression. It also performs better than the conventional optimisation method for the CART algorithm. Moreover, the decision tree model established in this study is interpretable and understandable. Therefore, the proposed model can help patients and doctors reduce medical costs and improve the quality of treatment.

A Boosted Decision Tree Model for Predicting Loan Default in P2P Lending Communities

International Journal of Engineering and Advanced Technology - Regular Issue ◽

10.35940/ijeat.a9626.109119 ◽

2019 ◽

Vol 9 (1) ◽

pp. 1257-1261

Keyword(s):

Small Business ◽

Decision Tree ◽

Decision Tree Classifier ◽

Tree Model ◽

Loan Default ◽

Accuracy Profile ◽

Default Prediction ◽

Tree Classifier ◽

Social Lending ◽

Boosted Decision Tree

Loan default prediction for social lending is an emerging area of research in predictive analytics. The need for large amounts of data and the few available studies on current loan default prediction models for social lending suggest that other viable and easily implementable models should be investigated and developed. In view of this, this study developed a data mining model for predicting loan default among social lending patrons, specifically small business owners, using a boosted decision tree model. The United States Small Business Administration (USBA) publicly available loan administration dataset of 27 features and 899164 data instances was used in an 80:20 ratio for training and testing the model. Sixteen data features were finally used as predictors after data cleaning and feature engineering. The gradient boosting decision tree classifier recorded 99% accuracy, compared to 98% for the basic decision tree classifier. The model is further evaluated with (a) receiver operating characteristics (ROC) and area under curve (AUC), (b) cumulative accuracy profile (CAP), and (c) cumulative accuracy profile (CAP) under AUC. Each of these model performance evaluation metrics, especially ROC-AUC, showed the relationship between the true positives and false positives that implies the model is a good fit.

Knowledge discovery from gene expression dataset using bagging lasso decision tree

Indonesian Journal of Electrical Engineering and Computer Science ◽

10.11591/ijeecs.v21.i2.pp1151-1159 ◽

2021 ◽

Vol 21 (2) ◽

pp. 1151

Author(s):

Umu Sa'adah ◽

Masithoh Yessi Rochayani ◽

Ani Budi Astuti

Keyword(s):

Gene Expression ◽

Decision Tree ◽

Gene Expression Data ◽

High Dimensional Data ◽

Decision Tree Model ◽

High Dimensional ◽

Expression Data ◽

Tree Model ◽

Tree Classifier ◽

Cart Algorithm

Classifying high-dimensional data is a challenging task in data mining. Gene expression data is a type of high-dimensional data that has thousands of features. This study proposes a method to extract knowledge from high-dimensional gene expression data by selecting features and then classifying. Lasso was used for selecting features, and the classification and regression tree (CART) algorithm was used to construct the decision tree model. To examine the stability of the lasso decision tree, we performed bootstrap aggregating (bagging) with 50 replications. The gene expression data used was an ovarian tumor dataset that has 1,545 observations, 10,935 gene features, and a binary class. The findings showed that the lasso decision tree could produce an interpretable model that is theoretically correct and had an accuracy of 89.32%. Meanwhile, the model obtained from the majority vote gave an accuracy of 90.29%, an increase of about 1 percentage point over the single lasso decision tree model. This slight increase in accuracy shows that the lasso decision tree classifier is stable.
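The bagging step described above (fit one model per bootstrap sample, then combine predictions by majority vote) can be sketched in plain Python. This is only an illustration under stated assumptions: a one-feature threshold "stump" stands in for the lasso-selected CART tree, the toy data are invented, and the function names `fit_stump`, `bagged_stumps`, and `majority_vote` are hypothetical.

```python
import random
from collections import Counter


def fit_stump(rows):
    """Fit a one-feature threshold classifier: predict 1 when x >= t.

    Returns the candidate threshold with the fewest training errors.
    """
    best_t, best_errors = None, None
    for t in sorted({x for x, _ in rows}):
        errors = sum((x >= t) != bool(y) for x, y in rows)
        if best_errors is None or errors < best_errors:
            best_t, best_errors = t, errors
    return best_t


def bagged_stumps(rows, replications=50, seed=0):
    """Fit one stump per bootstrap sample (drawn with replacement)."""
    rng = random.Random(seed)
    return [
        fit_stump([rng.choice(rows) for _ in rows]) for _ in range(replications)
    ]


def majority_vote(thresholds, x):
    """Predict the class that most of the bagged stumps vote for."""
    votes = Counter(int(x >= t) for t in thresholds)
    return votes.most_common(1)[0][0]


# Invented toy data: (feature value, binary class label).
toy_rows = [
    (0.10, 0), (0.20, 0), (0.30, 0), (0.35, 0),
    (0.40, 1), (0.60, 1), (0.80, 1), (0.90, 1),
]
ensemble = bagged_stumps(toy_rows, replications=50)
low_prediction = majority_vote(ensemble, 0.05)
high_prediction = majority_vote(ensemble, 0.95)
```

Each stump sees a slightly different resample of the data, so comparing the vote of the ensemble with a single model gives a rough picture of how stable the base learner is, which is the use of bagging the abstract describes.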

FALL DETECTION USING THREE WEARABLE TRIAXIAL ACCELEROMETERS AND A DECISION-TREE CLASSIFIER

Biomedical Engineering Applications Basis and Communications ◽

10.4015/s1016237214500598 ◽

2014 ◽

Vol 26 (05) ◽

pp. 1450059 ◽

Cited By ~ 3

Author(s):

Kan Luo ◽

Jianqing Li ◽

Jianfeng Wu ◽

Hua Yang ◽

Gaozhi Xu

Keyword(s):

Decision Tree ◽

Detection System ◽

Human Movement ◽

Fall Detection ◽

The Body ◽

Body Tilt ◽

Decision Tree Classifier ◽

Tree Model ◽

Tree Classifier ◽

Unintentional Falls

Unintentional falls cause serious health problems and high medical costs, particularly among the elderly. Efficient fall detection can provide fallen subjects with timely rescue, less pain and lower health-care expenses. However, the accuracy of present fall detection systems with a single accelerometer does not meet the requirements of practical application. In this paper, a fall detection method using three wearable triaxial accelerometers and a decision-tree classifier is proposed. The three triaxial accelerometers are mounted on the head, the waist and the ankle, respectively, to capture the acceleration signals of human movement. A Kalman filter is adopted to estimate the body tilt angle. After the features are extracted, the trained decision-tree model is used to predict the fall. The efficiency improvement is evidenced by scripted and unscripted lateral fall experiments involving five young healthy volunteers (three males and two females; age: 23.3 ± 1 years). The classification of falls and activities of daily living (ADL) achieves recall, precision and F-value of 93.1%, 95.9%, and 94.5%, respectively. The experimental results indicate that the complementary movement information coming from three accelerometers can enhance the performance of fall detection. The proposed method is efficient, and it shows remarkable improvements over methods that use one or two accelerometers.
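As a quick consistency check on the reported figures, the F-value can be recomputed from precision and recall, assuming it is the usual F1 score (their harmonic mean). The abstract does not state which F-measure variant was used, so that is an assumption of this sketch.

```python
def f_measure(precision, recall):
    """Harmonic mean of precision and recall (the F1 score)."""
    return 2 * precision * recall / (precision + recall)


# Precision and recall as reported in the abstract above.
f_value = f_measure(precision=0.959, recall=0.931)
print(f"{f_value:.1%}")  # prints 94.5%
```

Under that assumption, the recomputed value matches the reported F-value of 94.5%.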

Selection of the Optimal Decision Tree Model Using Grid Search Method : Focusing on the Analysis of the Factors Affecting Job Satisfaction of Workplace Reserve Force Commanders

Journal of the Korean Operations Research and Management Science Society ◽

10.7737/jkorms.2015.40.2.019 ◽

2015 ◽

Vol 40 (2) ◽

pp. 19-29

Author(s):

Chulwoo Jeong ◽

Won Young Jeong ◽

David Shin

Keyword(s):

Job Satisfaction ◽

Decision Tree ◽

Search Method ◽

Decision Tree Model ◽

Optimal Decision ◽

Grid Search ◽

Tree Model ◽

Factors Affecting ◽

Grid Search Method ◽

Selection Of

Optimization of Smart Mobile Device Work Time Using an Optimal Decision Tree Classifier and Data Caching Technique in on Premise Network

International Journal of Computer Networks And Applications ◽

10.22247/ijcna/2021/210720 ◽

2021 ◽

Vol 8 (6) ◽

pp. 702

Author(s):

Sridhar S. K. ◽

J. Amutharaj

Keyword(s):

Decision Tree ◽

Mobile Device ◽

Optimal Decision ◽

Decision Tree Classifier ◽

Work Time ◽

Data Caching ◽

Tree Classifier ◽

Smart Mobile

A novel enhanced decision tree model for detecting chronic kidney disease

Network Modeling Analysis in Health Informatics and Bioinformatics ◽

10.1007/s13721-021-00302-w ◽

2021 ◽

Vol 10 (1) ◽

Author(s):

Avijit Kumar Chaudhuri ◽

Deepankar Sinha ◽

Dilip K. Banerjee ◽

Anirban Das

Keyword(s):

Chronic Kidney Disease ◽

Kidney Disease ◽

Decision Tree ◽

Decision Tree Model ◽

Tree Model

A decision tree classifier for credit assessment problems in big data environments

Information Systems and e-Business Management ◽

10.1007/s10257-021-00511-w ◽

2021 ◽

Author(s):

Ching-Chin Chern ◽

Weng-U Lei ◽

Kwei-Long Huang ◽

Shu-Yi Chen

Keyword(s):

Big Data ◽

Decision Tree ◽

Decision Tree Classifier ◽

Tree Classifier ◽

Assessment Problems
