scholarly journals Boosted Regression (Boosting): An Introductory Tutorial and a Stata Plugin

Author(s):  
Matthias Schonlau

Boosting, or boosted regression, is a recent data-mining technique that has shown considerable success in predictive accuracy. This article gives an overview of boosting and introduces a new Stata command, boost, that implements the boosting algorithm described in Hastie, Tibshirani, and Friedman (2001, 322). The plugin is illustrated with a Gaussian and a logistic regression example. In the Gaussian regression example, the R2 value computed on a test dataset is R2 = 21.3% for linear regression and R2 = 93.8% for boosting. In the logistic regression example, stepwise logistic regression correctly classifies 54.1% of the observations in a test dataset versus 76.0% for boosted logistic regression. Currently, boost accommodates Gaussian (normal), logistic, and Poisson boosted regression. boost is implemented as a Windows C++ plugin.

2021 ◽  
Vol 2 (1) ◽  
pp. 33-44
Author(s):  
Sinta Septi Pangastuti ◽  
Kartika Fithriasari ◽  
Nur Iriawan ◽  
Wahyuni Suryaningtyas

data mining techniques in education sector have begun to evolve, along with the development of technology and the amount of data that can be stored in an education database storage system. One of them is a database of Bidikmisi scholarships in Indonesia. The Bidikmisi data used in this study will be classified using classification data mining technique. The technique that used in this study is random forest in combination with boosting algorithm and bagging algorithms. These algorithms also combine with SMOTE algorithm to handling the imbalance class in dataset. Based on the performance criteria G-mean and AUC, the algorithm combines with SMOTE tended to be better. The classification accuracy of each method being more than 90%


Author(s):  
Md. Sadeki Salman ◽  
Nazmun Naher Shila ◽  
Khalid Hasan ◽  
Piash Ahmed ◽  
Mumenunnessa Keya ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document