Tree aggregation for random forest class probability estimation

2020 ◽ Vol 13 (2) ◽ pp. 134-150
Author(s): Andrew J. Sage, Ulrike Genschel, Dan Nettleton

2012 ◽ Vol 26 ◽ pp. 239-245
Author(s): Liangxiao Jiang, Zhihua Cai, Dianhong Wang, Harry Zhang

Author(s): Han Liang, Yuhong Yan, Harry Zhang

In machine learning and data mining, traditional learning models aim for high classification accuracy. In many practical applications, however, such as medical diagnosis, accurate class probability prediction is more desirable than classification accuracy alone. Although decision trees can be adapted into class probability estimators in a variety of ways, and the resulting models are uniformly called Probability Estimation Trees (PETs), the performance of these PETs in class probability estimation has not yet been investigated. We begin by empirically studying PETs in terms of class probability estimation, measured by Log Conditional Likelihood (LCL). We also compare a PET called C4.4 with other representative models, including Naïve Bayes, Naïve Bayes Tree, Bayesian Network, KNN, and SVM, in terms of LCL. From our experiments we draw several conclusions. First, among the tree-based models, C4.4 yields the most precise class probability predictions as measured by LCL; we provide an explanation for this and examine the nature of LCL. Second, C4.4 also performs best against the non-tree-based models. Finally, LCL does not dominate another well-established relevant metric, AUC, which suggests that different decision-tree learning models should be used for different objectives. Our experiments are conducted on 36 UCI data sets, with all models run within the machine learning platform Weka.

We also explore an approach to improving the class probability estimation of Naïve Bayes Tree. We propose a greedy, recursive learning algorithm in which, at each step, LCL is used as the scoring function to expand the decision tree. The algorithm uses Naïve Bayes models created at the leaves to estimate class probabilities of test samples, so the whole tree encodes the posterior class probabilities in its structure. One benefit of improving class probability estimation is that both classification accuracy and AUC can potentially be improved as well. We call the new model LCL Tree (LCLT). Our experiments on 33 UCI data sets show that LCLT significantly outperforms all state-of-the-art learning models, such as Naïve Bayes Tree, in class probability prediction measured by LCL, as well as in classification accuracy and AUC.
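As a rough illustration of the metric used throughout the abstract, below is a minimal Python sketch (not the authors' Weka implementation) of Log Conditional Likelihood: the sum, over test cases, of the log of the probability the model assigns to each case's true class. The function name and the dict-based probability format are illustrative assumptions.

```python
import math

def log_conditional_likelihood(pred_probs, true_labels, eps=1e-9):
    """Sum of log P(true class | x) over all test cases.

    pred_probs:  list of dicts mapping class label -> predicted probability
                 (one dict per test case); illustrative format, not Weka's.
    true_labels: list of the corresponding true class labels.
    """
    lcl = 0.0
    for probs, y in zip(pred_probs, true_labels):
        # Clip at eps so a predicted probability of zero does not yield -inf.
        lcl += math.log(max(probs.get(y, 0.0), eps))
    return lcl

# Two binary test cases: a confident correct prediction and a milder one.
preds = [{"yes": 0.9, "no": 0.1}, {"yes": 0.3, "no": 0.7}]
labels = ["yes", "no"]
print(log_conditional_likelihood(preds, labels))  # log(0.9) + log(0.7) ≈ -0.462
```

A higher (less negative) LCL means the model concentrates more probability mass on the true classes, which is the sense in which the abstract reports C4.4 and LCLT as yielding more precise class probability predictions.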

