Employee Turnover Prediction by Machine Learning Techniques


  • Chyh Kae Ang
  • XinYing Chew Universiti Sains Malaysia (USM)
  • Johnson Olanrewaju Victor
  • Khai Wah Khaw Universiti Sains Malaysia (USM)


HR Analytic, HR Attrition, Machine Learning , Data Analytics, Data Science, Retention Period, Prediction


Employee turnover in Human Resource (HR) analytic is a term used to describe employees who leave the company due to termination, seek better job, or they are dealt with a bad working environment. Typically, a high turnover rate indicates that employees are dissatisfied with their current work environment. This leads to a high cost in terms of productivity,
time and money for the company as they were required to hire, rehire, and retrain the new employees to accustom themselves with their new work environment as well as the tasks assigned. In this paper, we propose a hybrid of machine learning algorithms and a Power BI model to design an Employee Turnover Prediction (ETP) application. Main factor influencing employee exit decisions and employee retention periods will be identified and the retention period for the employees or new applicants will be predicted. Employee dataset with the relevant features will be collected, processed, and analyzed. The analytics results (retention period) act as a benchmark for companies to determine whether they should hire applicants which also would possibly benefit to reduce the turnover rate of their company. 


H. Boushey and H. J. S. Glynn, “There Are Significant Business Costs to Replacing Employees,” Center for American Progress. November 2012. https://www.americanprogress.org/issues/economy/reports/2012/11/16/44464/thereare-significant-business-costs-to-replacing-employees/.

Identifying and Addressing Employee Turnover Issues. (n.d.). Wolters Kluwer. https://www.bizfilings.com/toolkit/research-topics/officehr/identifying-andaddressing-employee-turnover-issues.

K. Martinelli, “Causes of Employee Turnover and Strategies to Reduce it,” High Speed Training. October 13, 2017.


D. Whitelegg, “How do Recruitment Agencies Get Paid (and How Much),” Agency Central. Retrieved October, 2016.

https://www.agencycentral.co.uk/articles/2016-10/howrecruitmentagenciesgetpaid.htm#targetText=The%20cost%20of%20a%20recruitment,for %20hard%20to%20fill%20positions

Alienor, “What is a Data Silo and Why is It Bad for Your Organization?” Plixer. 2018. https://www.plixer.com/blog/data-silowhat-is-it-why-is-it-bad/

Z. A. Bilal. "Predicting customer churn in banking industry using neural networks." Interdisciplinary Description of Complex Systems: INDECS 14.2 (2016): 116-124.

P.C Patel, et al. "Retail store churn and performance–The moderating role of sales amplitude and unpredictability," International Journal of Production Economics 222 (2020): 107510.

G. G. Sundarkumar, R. Vadlamani and V. Siddeshwar, "One-class Support Vector Machine Based Undersampling: Application to churn prediction and insurance fraud detection," 2015 IEEE International Conference on Computational Intelligence and Computing Research (ICCIC). IEEE, 2015.

M. N. Z Milošević and A. Igor, "Early churn prediction with personalized targeting in mobile social games," Expert Systems with Applications 83 (2017): 326-332.

C. Günther, et al., "Modelling and Predicting Customer Churn from An Insurance Company." Scandinavian Actuarial Journal 2014.1 (2014): 58-71.

S. H. Dolatabadi and F. Keynia, "Designing of customer and employee churn prediction model based on data mining method and neural predictor," 2017 2nd International Conference on Computer and Communication Systems (ICCCS), 2017, pp. 74-77, doi:10.1109/CCOMS.2017.8075270.

W.C Hong, P.F. Pai, Y.Y. Huang, and S.L. Yang, “Application of Support Vector Machines in Predicting Employee Turnover Based on Job Performance,” International Conference on Natural Computation. 2005. 668-674. Springer.

A. H. Ali, Z. F. Hussain and S. N. Abd, “Big Data Classification Efficiency Based on Linear Discriminant Analysis,” Iraqi Journal for Computer Science and Mathematics. 2020. 1(1), 7-12.

A. H. Ali and M. Z. Abdullah, “A Novel Approach for Big Data Classification based on Hybrid Parallel Dimensionality Reduction using Spark Cluster,” Computer Science. 2019. 20(4).

A. H. Ali and M. Z. Abdullah, “A Parallel Grid Optimization of SVM Hyperparameter for Big Data Classification using Spark Radoop,” Karbala International Journal of Modern Science. 2020. 6(2), Article 3.

A. Huber, “Staff Attrition vs. Staff Turnover: What's the Difference?” Jobzology. Retrieved March 28, 2018. https://jobzology.com/staffattrition-vs-staff-turnover-whats-the-difference/.

S. E. Schaeffer and S. V. R. Sanchez, “Forecasting Client Retention: A Machine Learning Approach,” Journal of Retailing and Consumer Services. 2020. 52. (C).

V. Bewick, L. Cheek, and J. Ball, “Statistics review 14: Logistic regression.” Critical care. 2005. 9(1), 112.

R. Punnoose and P. Ajit, “Prediction of Employee Turnover in Organizations Using Machine Learning Algorithms,” Algorithms. 2016. 4(5), C5.

P. Chandrayan, “Logistic Regression for Dummies: A Detailed Explanation,” Towards Data Science. Retrieved August 5, 2019. https://towardsdatascience.com/logisticregression-for-dummies-adetailed-explanation-9597f76edf46.

N. Sharma, “People Analytics with Attrition Predictions,” Towards Data Science. Retrieved May 18.


T. Srivastava, “Introduction to KNN, K-Nearest Neighbors: Simplified,” Analytics Vidhya. Retrieved March 26, 2018.

D. S. Sisodia, S. Vishwakarma, and A. Pujahari, “Evaluation of Machine Learning Models for Employee Churn Prediction,” International Conference on Inventive Computing and Informatics (ICICI). 2017. 1016-1020. IEEE.

R. S. Brid, “Decision Trees - A simple way to visualize a decision Medium,” 2018. https://medium.com/greyatom/decision-trees-asimple-way-to-visualize-adecision-dc506a403aeb

H. Jantan, A. R. Hamdan, and Z. A. Othman, “Human Talent Prediction in HRM Using C4.5 Classification Algorithm,” International Journal on Computer Science and Engineering. 2010. 2(8), 2526-2534.

R. Gandhi, “Naive Bayes Classifier,” Towards Data Science. Retrieved May 6, 2018. https://towardsdatascience.com/naive-bayes-classifier-81d512f50a7c.

M. A. Valle, S. Varas, and G. A. Ruz, “Job performance prediction in a call center using a naive Bayes classifier,” Expert Systems with Applications. 2012, 39(11), 9939-9945.

W. Koehrsen, “Random Forest Simple Explanation. Medium.Retrieved December 28, 2017.


Pavansubash, “IBM HR Analytics Employee Attrition & Performance,” Kaggle. Retrieved March 31, 2017.


Y. Charfaoui, “Hands-on with Feature Selection Techniques: Embedded Methods,” Medium. Retrieved January 18, 2020. https://heartbeat.fritz.ai/hands-on-with-featureselection-techniquesembedded-methods-84747e814dab.




How to Cite

Ang, C. K., Chew, X., Victor, J. O. ., & Khaw, K. W. (2021). Employee Turnover Prediction by Machine Learning Techniques. Journal of Telecommunication, Electronic and Computer Engineering (JTEC), 13(4), 49–56. Retrieved from https://jtec.utem.edu.my/jtec/article/view/6148