Predicting Employee Attrition using Supervised Learning Classification Models

Author(s):  
Amine Habous ◽  
El Habib Nfaoui ◽  
Youness Oubenaalla
2020 ◽  
Vol 28 (6) ◽  
pp. 1273-1291
Author(s):  
Nesreen El-Rayes ◽  
Ming Fang ◽  
Michael Smith ◽  
Stephen M. Taylor

Purpose The purpose of this study is to develop tree-based binary classification models to predict the likelihood of employee attrition based on firm cultural and management attributes. Design/methodology/approach A data set of resumes anonymously submitted through Glassdoor’s online portal is used in tandem with public company review information to fit decision tree, random forest and gradient boosted tree models to predict the probability of an employee leaving a firm during a job transition. Findings Random forest and decision tree methods are found to be the strongest attrition prediction models. In addition, compensation, company culture and senior management performance play a primary role in an employee’s decision to leave a firm. Practical implications This study may be used by human resources staff to better understand factors which influence employee attrition. In addition, techniques developed in this study may be applied to company-specific data sets to construct customized attrition models. Originality/value This study contains several novel contributions which include exploratory studies such as industry job transition percentages, distributional comparisons between factors strongly contributing to employee attrition between those who left or stayed with the firm and the first comprehensive search over binary classification models to identify which provides the strongest predictive performance of employee attrition.


2020 ◽  
pp. 1-31
Author(s):  
Yiping Jin ◽  
Dittaya Wanvarie ◽  
Phu T. V. Le

Abstract In real-world applications, text classification models often suffer from a lack of accurately labelled documents. The available labelled documents may also be out of domain, making the trained model not able to perform well in the target domain. In this work, we mitigate the data problem of text classification using a two-stage approach. First, we mine representative keywords from a noisy out-of-domain data set using statistical methods. We then apply a dataless classification method to learn from the automatically selected keywords and unlabelled in-domain data. The proposed approach outperformed various supervised learning and dataless classification baselines by a large margin. We evaluated different keyword selection methods intrinsically and extrinsically by measuring their impact on the dataless classification accuracy. Last but not least, we conducted an in-depth analysis of the behaviour of the classifier and explained why the proposed dataless classification method outperformed supervised learning counterparts.


2020 ◽  
Vol 6 (3) ◽  
pp. 376-379
Author(s):  
Seyed Amir Hossein Tabatabaei ◽  
David Pedrosa ◽  
Carsten Eggers ◽  
Max Wullstein ◽  
Urs Kleinholdermann ◽  
...  

AbstractIn this paper, the classification models for Idiopathic Parkinson's syndrome (iPS) detection through timed-up-and-go test performed on iPS-patients are given. The models are based on the supervised learning. The data are extracted via Myo gesture armband worn on two hands. The corresponding models are based on extracted features from signal data and raw signal data respectively. The achieved accuracy from both models are 0.91 and 0.93 with reasonable specificity and sensitivity.


2021 ◽  
Author(s):  
Sandeep Kumar Sunori ◽  
Pushpa Bhakuni Negi ◽  
Pradeep Juneja ◽  
M Niranjanamurthy ◽  
P.G. Om Prakash ◽  
...  

In this paper, the researcher study automatic speech recognition technology for the individual. We propose a new voice recognition system using a hybrid model GMM-HMM. HMM and GMM is a non-linear classification model. Each state in an HMM can be thought of as a GMM. HMM is consider observation for state. It is also known as time series classification model. In this model, samples have been trained independently and parameters consider jointly which provides better performance than other classification models. Speech recognition system consider two types of learning patterns such as supervised learning and unsupervised learning. In this context speaker dependent and speaker independent used for identifying the efficient and effective voice. In this paper researcher considered supervised learning model for recognize efficient voice. This new voice recognition system identifies incorrect phonemes and verifies the correctness of voice pronunciation. Using the GMM-HMM hybrid model produces better performance and effectiveness of voice


2015 ◽  
Vol 10 (8) ◽  
pp. 829
Author(s):  
Aswin Wibisurya ◽  
Ford Lumban Gaol ◽  
Kuncoro Wastuwibowo

2018 ◽  
Vol 2018 (15) ◽  
pp. 132-1-1323
Author(s):  
Shijie Zhang ◽  
Zhengtian Song ◽  
G. M. Dilshan P. Godaliyadda ◽  
Dong Hye Ye ◽  
Atanu Sengupta ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document