scholarly journals Deep Learning for Customer Churn Prediction in E-Commerce Decision Support

2021 ◽  
pp. 3-12
Author(s):  
Maciej Pondel ◽  
Maciej Wuczyński ◽  
Wiesława Gryncewicz ◽  
Łukasz Łysik ◽  
Marcin Hernes ◽  
...  

Churn prediction is a Big Data domain, one of the most demanding use cases of recent time. It is also one of the most critical indicators of a healthy and growing business, irrespective of the size or channel of sales. This paper aims to develop a deep learning model for customers’ churn prediction in e-commerce, which is the main contribution of the article. The experiment was performed over real e-commerce data where 75% of buyers are one-off customers. The prediction based on this business specificity (many one-off customers and very few regular ones) is extremely challenging and, in a natural way, must be inaccurate to a certain ex-tent. Looking from another perspective, correct prediction and subsequent actions resulting in a higher customer retention are very attractive for overall business performance. In such a case, predictions with 74% accuracy, 78% precision, and 68% recall are very promising. Also, the paper fills a research gap and contrib-utes to the existing literature in the area of developing a customer churn prediction method for the retail sector by using deep learning tools based on customer churn and the full history of each customer’s transactions.

Customer Churn Prediction (CCP) is a difficult problem found to be helpful to make decisions due to the rapid growth in the number of telecom providers. At present, deep learning models are familiar because of the significant improvement in different areas. In this paper, a deep learning based CCP is introduced by the use of Stochastic Gradient Boosting (SGD) with Logistic regression (LR) classifier model. By the integration of SGD and LR, effective classification can be accomplished. To further improve the classifier efficiency, misclassified instances are removed from the dataset. Then, the processed data is again provided as input to the classification model. The presented SGD-LR model is validated on a benchmark dataset and the results are examiner with respect to different measures. The experimental outcome pointed out the projected model is superior to available CCP models on the identical dataset.


Sign in / Sign up

Export Citation Format

Share Document