Telecom customer churn prediction based on imbalanced data re-sampling method

Author(s):  
Li Peng ◽  
Yu Xiaoyang ◽  
Sun Boyu ◽  
Huang Jiuling
2019 ◽  
Vol 5 ◽  
pp. 101-110
Author(s):  
Aayush Bhattarai ◽  
Elisha Shrestha ◽  
Ram Prasad Sapkota

Churners are customers who are about to transfer their business to a competitor or who simply cancel a subscription to a service. This paper focuses on one business sector, telecommunications. With a churn rate of 30%, the telecommunication sector takes first place on the list. We present advanced data mining methodologies that predict customer churn in the pre-paid mobile telecommunications industry using a call detail records dataset. To implement the predictive models, we propose and then apply four machine learning algorithms: Random Forest, Naïve Bayes, Logistic Regression, and XGBoost. We evaluate the models with several metrics to find the one best suited both to class-imbalanced data in general and to our business case. The paper can also be read as a comparative study of the most popular machine learning methods applied to the challenging problem of customer churn prediction.
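Why the choice of evaluation metric matters on class-imbalanced data can be illustrated with a small sketch. The abstract does not name the metrics used, so accuracy, precision, recall, and F1 are assumptions here. The labels are synthetic (100 customers, 30 churners, echoing the 30% churn rate), and the helper names `confusion_counts` and `churn_metrics` are hypothetical, not taken from the paper.

```python
def confusion_counts(y_true, y_pred):
    """Count TP, FP, FN, TN, treating label 1 (churner) as the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    return tp, fp, fn, tn


def churn_metrics(y_true, y_pred):
    """Return accuracy, precision, recall and F1 for the churn class."""
    tp, fp, fn, tn = confusion_counts(y_true, y_pred)
    accuracy = (tp + tn) / len(y_true)
    # Guard against division by zero when no churners are predicted or present.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


# Synthetic labels (assumption): 30 churners (1) followed by 70 non-churners (0).
y_true = [1] * 30 + [0] * 70

# A trivial baseline that never predicts churn.
always_stay = [0] * 100
baseline = churn_metrics(y_true, always_stay)

# A hypothetical model that catches 20 of the 30 churners with 10 false alarms.
model_pred = [1] * 20 + [0] * 10 + [1] * 10 + [0] * 60
model = churn_metrics(y_true, model_pred)

print("baseline:", baseline)
print("model:   ", model)
```

The baseline reaches 70% accuracy while identifying no churners at all, which is why recall and F1 on the churn class are usually more informative than accuracy when comparing classifiers on imbalanced data.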

