Optimization of Smart Mobile Device Work Time Using an Optimal Decision Tree Classifier and Data Caching Technique in on Premise Network

Decision tree models hold a special status in predictive modeling because they are considered comprehensible for human analysis and insight. The Classification and Regression Tree (CART) algorithm is one of the best-known decision tree induction algorithms and addresses both classification and regression problems. Finding optimal values for the hyperparameters of a decision tree construction algorithm is challenging. To build an effective decision tree classifier with high accuracy and comprehensibility, we must set optimal values for hyperparameters such as the maximum size of the tree, the minimum number of instances required in a node to induce a split, the node splitting criterion, and the amount of pruning. The hyperparameter setting influences the performance of the decision tree model, and no single setting works equally well across datasets: a setting that gives an optimal decision tree for one dataset may produce a suboptimal model for another. In this paper, we present a hyper-heuristic approach for tuning the hyperparameters of Recursive and Partition Trees (rpart), a typical implementation of CART in the statistical and data analytics package R. We employ an evolutionary algorithm as the hyper-heuristic for tuning the hyperparameters of the decision tree classifier. The approach is named Hyper heuristic Evolutionary Approach with Recursive and Partition Trees (HEARpart). The proposed approach is validated on 30 datasets. It is shown statistically that HEARpart performs significantly better than WEKA’s J48 algorithm in terms of error rate, F-measure, and tree size. Further, the suggested hyper-heuristic algorithm constructs significantly more comprehensible models than WEKA’s J48, CART, and other similar decision tree construction strategies. The results show that the accuracy achieved by the hyper-heuristic approach is slightly lower than that of the other approaches compared.
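The evolutionary tuning loop described in the abstract can be pictured with a minimal Python sketch. It is not the HEARpart implementation: the search ranges, the population settings, and the helper names (`random_individual`, `mutate`, `crossover`, `evolve`, `toy_fitness`) are all assumptions made for illustration. The keys `minsplit`, `maxdepth`, and `cp` mirror arguments of `rpart.control` in R, and `toy_fitness` is only a stand-in for what would, in practice, be a cross-validated error rate of the fitted tree.

```python
import random

# Search space: name -> (low, high, is_integer). The names mirror rpart.control
# arguments; the ranges are illustrative assumptions, not values from the paper.
SPACE = {
    "minsplit": (2, 50, True),
    "maxdepth": (1, 30, True),
    "cp": (0.0, 0.1, False),
}

def random_individual(rng):
    """Draw one hyperparameter setting uniformly from the search space."""
    ind = {}
    for name, (lo, hi, is_int) in SPACE.items():
        ind[name] = rng.randint(lo, hi) if is_int else rng.uniform(lo, hi)
    return ind

def mutate(ind, rng, rate=0.3):
    """Resample each hyperparameter with probability `rate`."""
    child = dict(ind)
    for name, (lo, hi, is_int) in SPACE.items():
        if rng.random() < rate:
            child[name] = rng.randint(lo, hi) if is_int else rng.uniform(lo, hi)
    return child

def crossover(a, b, rng):
    """Uniform crossover: take each hyperparameter from either parent."""
    return {name: (a if rng.random() < 0.5 else b)[name] for name in SPACE}

def evolve(fitness, generations=30, pop_size=20, elite=4, seed=0):
    """Minimise `fitness` with a simple elitist evolutionary loop."""
    rng = random.Random(seed)
    pop = [random_individual(rng) for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=fitness)          # lower fitness is better
        parents = pop[:elite]          # elites survive unchanged
        children = [
            mutate(crossover(rng.choice(parents), rng.choice(parents), rng), rng)
            for _ in range(pop_size - elite)
        ]
        pop = parents + children
    return min(pop, key=fitness)

def toy_fitness(ind):
    # Placeholder objective (hypothetical): distance from an arbitrary target
    # setting. A real run would fit a tree and return its validation error.
    return (abs(ind["minsplit"] - 20) / 50
            + abs(ind["maxdepth"] - 6) / 30
            + abs(ind["cp"] - 0.01) * 10)

best = evolve(toy_fitness)
print(best, toy_fitness(best))
```

Because the elite settings are carried over unchanged, the best fitness in the population never gets worse from one generation to the next. Swapping `toy_fitness` for a function that trains and scores a tree is the only change needed to apply the loop to a real dataset.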


A decision tree classifier for credit assessment problems in big data environments

Information Systems and e-Business Management ◽

10.1007/s10257-021-00511-w ◽

2021 ◽

Author(s):

Ching-Chin Chern ◽

Weng-U Lei ◽

Kwei-Long Huang ◽

Shu-Yi Chen

Keyword(s):

Big Data ◽

Decision Tree ◽

Decision Tree Classifier ◽

Tree Classifier ◽

Assessment Problems


Improving the Performance of a Proxy Cache Using Very Fast Decision Tree Classifier

Procedia Computer Science ◽

10.1016/j.procs.2015.04.186 ◽

2015 ◽

Vol 48 ◽

pp. 304-312 ◽

Cited By ~ 6

Author(s):

P. Julian Benadit ◽

F. Sagayaraj Francis

Keyword(s):

Decision Tree ◽

Decision Tree Classifier ◽

Proxy Cache ◽

Tree Classifier ◽

Very Fast Decision Tree ◽

Fast Decision


Comparing learning accuracies of neural nets and decision-tree classifier systems

Proceedings of the 1990 Symposium on Applied Computing ◽

10.1109/soac.1990.82136 ◽

2002 ◽

Author(s):

A.K. Rigler ◽

D.C. St. Clair

Keyword(s):

Decision Tree ◽

Neural Nets ◽

Classifier Systems ◽

Decision Tree Classifier ◽

Tree Classifier


CMP: a fast decision tree classifier using multivariate predictions

Proceedings of 16th International Conference on Data Engineering (Cat. No.00CB37073) ◽

10.1109/icde.2000.839444 ◽

2002 ◽

Cited By ~ 10

Author(s):

H. Wang ◽

C. Zaniolo

Keyword(s):

Decision Tree ◽

Decision Tree Classifier ◽

Tree Classifier ◽

Fast Decision


Comparison of Land Cover Characterization Using EOS MISR and MODIS Data and a Decision Tree Classifier

Geocarto International ◽

10.1080/10106040608542389 ◽

2006 ◽

Vol 21 (3) ◽

pp. 19-26 ◽

Cited By ~ 2

Author(s):

Limin Yang

Keyword(s):

Land Cover ◽

Decision Tree ◽

Decision Tree Classifier ◽

Modis Data ◽

Tree Classifier


Traffic Prediction Using Decision Tree Classifier in Hive Metastore

Lecture Notes on Data Engineering and Communications Technologies - Proceeding of the International Conference on Computer Networks, Big Data and IoT (ICCBI - 2018) ◽

10.1007/978-3-030-24643-3_68 ◽

2019 ◽

pp. 571-578

Author(s):

D. Suvitha ◽

M. Vijayalakshmi

Keyword(s):

Decision Tree ◽

Traffic Prediction ◽

Decision Tree Classifier ◽

Tree Classifier


PERFORMANCE ANALYSIS OF BREAST CANCER CLASSIFICATION USING DECISION TREE CLASSIFIERS

International Journal of Current Pharmaceutical Research ◽

10.22159/ijcpr.2017v9i2.17383 ◽

2017 ◽

Vol 9 (2) ◽

pp. 19 ◽

Cited By ~ 6

Author(s):

P. Hamsagayathri ◽

P. Sampath

Keyword(s):

Breast Cancer ◽

Decision Tree ◽

Ductal Carcinoma ◽

Research Work ◽

The United States ◽

Breast Cancer Dataset ◽

Decision Tree Classifier ◽

Cancer Dataset ◽

Term Survival ◽

Tree Classifier

Breast cancer is one of the most dangerous cancers among women over 35 years of age worldwide. The breast is made up of lobules that secrete milk and thin milk ducts that carry milk from the lobules to the nipple. Breast cancer mostly occurs either in the lobules or in the milk ducts. The most common type of breast cancer is ductal carcinoma, which starts in the ducts and spreads across the lobules and surrounding tissues. According to the medical survey, each year in the United States about 125.0 new cases of breast cancer per 100,000 are diagnosed and 21.5 per 100,000 women die of this disease. Also, 246,660 new cases of women with cancer are estimated for the year 2016. Early diagnosis of breast cancer is a key factor in the long-term survival of cancer patients. Classification plays an important role in breast cancer detection and is used by researchers to analyse and classify medical data. In this research work, a priority-based decision tree classifier algorithm has been implemented for the Wisconsin Breast Cancer dataset. This paper analyzes different decision tree classifier algorithms for the Wisconsin original, diagnostic, and prognostic datasets using WEKA software. The performance of the classifiers is evaluated against parameters such as accuracy, Kappa statistic, entropy, RMSE, TP rate, FP rate, precision, recall, F-measure, ROC, specificity, and sensitivity.
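Several of the evaluation measures listed in the abstract follow directly from a binary confusion matrix. The Python sketch below shows how they relate to one another. The function name `binary_metrics` and the counts in the usage example are made up for illustration and are not results from the paper. The sketch assumes every denominator is non-zero.

```python
def binary_metrics(tp, fp, fn, tn):
    """Derive common classifier measures from binary confusion-matrix counts.

    Assumes all denominators are non-zero (both classes occur and the
    classifier predicts each class at least once).
    """
    n = tp + fp + fn + tn
    accuracy = (tp + tn) / n
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)            # also called sensitivity or TP rate
    specificity = tn / (tn + fp)
    fp_rate = fp / (fp + tn)           # equals 1 - specificity
    f_measure = 2 * precision * recall / (precision + recall)

    # Cohen's kappa: agreement beyond what chance alone would produce.
    p_yes = ((tp + fp) / n) * ((tp + fn) / n)
    p_no = ((fn + tn) / n) * ((fp + tn) / n)
    p_expected = p_yes + p_no
    kappa = (accuracy - p_expected) / (1 - p_expected)

    return {
        "accuracy": accuracy,
        "precision": precision,
        "recall": recall,
        "specificity": specificity,
        "fp_rate": fp_rate,
        "f_measure": f_measure,
        "kappa": kappa,
    }

# Hypothetical counts, chosen only to make the arithmetic easy to follow.
m = binary_metrics(tp=40, fp=10, fn=5, tn=45)
for name, value in m.items():
    print(f"{name}: {value:.4f}")
```

Sensitivity and recall are the same quantity here, and the FP rate is the complement of specificity, so the list in the abstract contains fewer independent measures than it first appears.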
