Predicting Employee Attrition: A Machine Learning Approach in Human Resource Analytics

PDF (782KB), PP.89-101

Views: 0 Downloads: 0

Author(s)

Manjula K. A. 1 Karthikeyan P. 2,* Santhosh V. A. 3

1. Department of Computer Science, University of Calicut, Kerala, India

2. Department of Management Studies, Kannur University, Kerala, India

3. TKM Insitute of Management, Kollam, Kerala, India

* Corresponding author.

DOI: https://doi.org/10.5815/ijieeb.2026.02.06

Received: 22 Oct. 2025 / Revised: 5 Dec. 2025 / Accepted: 7 Jan. 2026 / Published: 8 Apr. 2026

Index Terms

Employee Attrition, Machine Learning, Logistic Regression, Random Forest, Decision Tree, Gradient Boosting, HR Analytics

Abstract

Employee attrition is an important factor that can affect organizations, both financially and operationally. Human Resource (HR) managers often find it difficult to identify exactly which employees might be planning to leave the organization and what is the root cause for their decision. With the recent advances in computing, Machine Learning (ML) techniques are available for analysing, understanding, and solving complex problems. This study analyses the IBM HR Analytics dataset using ML techniques to predict employee attrition and identify the key factors that influence attrition. Four ML models based on Logistic Regression, Decision Tree, Random Forest, and Gradient Boosting have been used for analysing attrition. It is found that Logistic Regression outperformed all other models in predicting attrition. At the same time, Decision Tree is found to be the weakest among the four techniques. On the analysis of feature importance, it is found that variables related to compensation (Monthly Income), career stage (Total Working Years, Age), and tenure at the organization are among the most significant factors influencing attrition.  The insight from this study is expected to help HR managers in developing effective, data-driven strategies to retain their talent in their organization.

Cite This Paper

Manjula K. A., Karthikeyan P., Santhosh V. A., "Predicting Employee Attrition: A Machine Learning Approach in Human Resource Analytics", International Journal of Information Engineering and Electronic Business(IJIEEB), Vol.18, No.2, pp. 89-101, 2026. DOI:10.5815/ijieeb.2026.02.06

Reference

[1]F.Fallucchi, M. Coladangelo, R. Giuliano, and E. W. De Luca, "Predicting employee attrition using machine learning techniques," Computers, vol. 9, no. 4, p. 86, 2020.
[2]Guerranti F, Dimitri GM. A Comparison of Machine Learning Approaches for Predicting Employee Attrition. Applied Sciences. 2023; 13(1):267.
[3]Sheth, Kalgi, Jaynil Patel, and Jaiprakash Verma. "Machine learning-based investigation of employee attrition prediction and analysis." In Emerging Technology Trends in Electronics, Communication and Networking: Select Proceedings of the Fourth International Conference, ET2ECN 2021, pp. 221-238. Singapore: Springer Nature Singapore, 2022.
[4]M. S. Amzad Basha, O. Varikunta, A. U. Devi and S. Raja, "Machine Learning in HR Analytics: A Comparative Study on the Predictive Accuracy of Attrition Models," 2024 2nd International Conference on Device Intelligence, Computing and Communication Technologies (DICCT), Dehradun, India, 2024, pp. 475-480.
[5]G. M. Alharbi, H. M. Alharthi, N. K. Alharthi, S. M. Alotaibi and H. Meshref, "Predicting Employee Attrition Using Machine Learning Approaches," 2025 Intelligent Methods, Systems, and Applications (IMSA), Giza, Egypt, 2025, pp. 650-655.
[6]H. Talebi, A. Khatibi Bardsiri, and V. K. Bardsiri, “ Machine Learning Approaches for Predicting Employee Turnover: A Systematic Review,” Engineering Reports 7, no. 8, 2025.
[7]Hom, Peter W., Thomas W. Lee, Jason D. Shaw, and John P. Hausknecht. "One hundred years of employee turnover theory and research." Journal of applied psychology 102, no. 3 (2017): 530.
[8]Holtom, Brooks C., Terence R. Mitchell, Thomas W. Lee, and Marion B. Eberly. "5 turnover and retention research: a glance at the past, a closer review of the present, and a venture into the future." The Academy of Management Annals 2, no. 1 (2008): 231-274.
[9]Hausknecht, John P., and Charlie O. Trevor. "Collective turnover at the group, unit, and organizational levels: Evidence, issues, and implications." Journal of management 37, no. 1 (2011): 352-388.
[10]Allen, David G., Phillip C. Bryant, and James M. Vardaman. "Retaining talent: Replacing misconceptions with evidence-based strategies." Academy of management Perspectives 24, no. 2 (2010): 48-64.
[11]Frye, Alex, Christopher Boomhower, Michael Smith, Lindsay Vitovsky, and Stacey Fabricant. "Employee attrition: what makes an employee quit?." SMU Data Science Review 1, no. 1 (2018)
[12]Y. Zhao, M. K. Hryniewicki, F. Cheng, B. Fu, and X. Zhu, “Employee turnover prediction with machine learning: A reliable approach,” in Proc. SAI Intelligent Systems Conf., Cham: Springer International Publishing, Sep. 2018, pp. 737–758.
[13]M. Al Akasheh, E. F. Malik, O. Hujran, and N. Zaki, “A decade of research on machine learning techniques for predicting employee turnover: A systematic literature review,” Expert Systems with Applications, vol. 238, p. 121794, 2024.
[14]Price, J. L., & Mueller, C. W.  Absenteeism and turnover of hospital employees. (1986): JAI Press.
[15]Lee, Thomas W., and Terence R. Mitchell. "An alternative approach: The unfolding model of voluntary employee turnover." Academy of management review 19, no. 1 (1994): 51-89.
[16]Mitchell, Terence R., Brooks C. Holtom, Thomas W. Lee, Chris J. Sablynski, and Miriam Erez. "Why people stay: Using job embeddedness to predict voluntary turnover." Academy of management journal 44, no. 6 (2001): 1102-1121.
[17]Demerouti, Evangelia, Arnold B. Bakker, Friedhelm Nachreiner, and Wilmar B. Schaufeli. "The job demands-resources model of burnout." Journal of Applied psychology 86, no. 3 (2001): 499.
[18]Bakker, Arnold B., and Evangelia Demerouti. "The job demands‐resources model: State of the art." Journal of Managerial Psychology 22, no. 3 (2007): 309-328.
[19]Bakker, Arnold, Evangelia Demerouti, and Wilmar Schaufeli. "Dual processes at work in a call centre: An application of the job demands–resources model." European Journal of work and organizational psychology 12, no. 4 (2003): 393-417.
[20]Qin, Xin, Peter Hom, Minya Xu, and Dong Ju. "Applying the job demands–resources model to migrant workers: Exploring how and when geographical distance increases quit propensity." Journal of occupational and organizational psychology 87, no. 2 (2014): 303-328.
[21]Carlson, John R., Dawn S. Carlson, Suzanne Zivnuska, Ranida B. Harris, and Kenneth J. Harris. "Applying the job demands resources model to understand technology as a predictor of turnover intentions." Computers in Human Behavior 77 (2017): 317-325.
[22]Chung, Doohee, Jinseop Yun, Jeha Lee, and Yeram Jeon. "Predictive model of employee attrition based on stacking ensemble learning." Expert Systems with Applications 215 (2023).
[23]Hosmer Jr, David W., Stanley Lemeshow, and Rodney X. Sturdivant. Applied logistic regression. John Wiley & Sons, 2013.
[24]Breiman, Leo, Jerome Friedman, Richard A. Olshen, and Charles J. Stone. Classification and regression trees. Chapman and Hall/CRC, 2017.
[25]Breiman, Leo. "Random forests." Machine learning 45, no. 1 (2001): 5-32.
[26]Chen, Tianqi. "XGBoost: A Scalable Tree Boosting System." Cornell University (2016).
[27]Hsu C-W, Chang C-C, Lin C-J. A practical guide to support vector classification. Department of Computer Science and Information Engineering, National Taiwan University, Taipei, 2016.
[28]Shavitt, Ira, and Eran Segal. "Regularization learning networks: deep learning for tabular datasets." Advances in neural information processing systems 31 (2018).
[29]Gorishniy, Yury, Ivan Rubachev, Valentin Khrulkov, and Artem Babenko. "Revisiting deep learning models for tabular data." Advances in neural information processing systems 34 (2021): 18932-18943.
[30]Jin, Soo-Bong, and Jong-Woo Lee. "Study on accident prediction models in urban railway casualty accidents using logistic regression analysis model." Journal of the Korean Society for Railway 20, no. 4 (2017): 482-490
[31]L. Breiman, J. H. Friedman, R. A. Olshen, and C. J. Stone, Classification and Regression Trees. Belmont, CA, USA: Wadsworth, 1984.
[32]J. R. Quinlan, "Induction of decision trees," Machine Learning, vol. 1, no. 1, pp. 81–106, 1986.
[33]J. R. Quinlan, C4.5: Programs for Machine Learning. San Mateo, CA, USA: Morgan Kaufmann, 1993.
[34]K. Thajeel, K. Samsudin, S. J. Hashim, and F. Hashim, “Machine and deep learning-based XSS detection approaches: A systematic literature review,” J. King Saud Univ.-Comput. Inf. Sci., vol. 35, Jul. 2023, Art. no. 101628.
[35]Y. Freund and R. E. Schapire, "A decision-theoretic generalization of on-line learning and an application to boosting," in Proc. European Conf. Computational Learning Theory, Berlin, Heidelberg: Springer, pp. 23–37, Mar. 1995.
[36]J. H. Friedman, "Greedy function approximation: a gradient boosting machine," Annals of Statistics, vol. 29, no. 5, pp. 1189–1232, 2001
[37]Zhou, Zhengze, and Giles Hooker. "Unbiased measurement of feature importance in tree-based methods." ACM Transactions on Knowledge Discovery from Data (TKDD) 15, no. 2 (2021): 1-21.
[38]Wang, Y. Analysis of Feature Importance Based on Random Forest for Stock Selection. In Proceedings of the 1st International Conference on E-commerce and Artificial Intelligence (ECAI 2024), 2024 pages 374-379
[39]Saarela, Mirka, and Susanne Jauhiainen. "Comparison of feature importance measures as explanations for classification models." SN Applied Sciences 3, no. 2 (2021): 272.
[40]IBM, "IBM HR Analytics Employee Attrition & Performance," Kaggle, 2017. [Online].
[41]Chawla, Nitesh V., Kevin W. Bowyer, Lawrence O. Hall, and W. Philip Kegelmeyer. "SMOTE: synthetic minority over-sampling technique." Journal of artificial intelligence research 16 (2002): 321-357.
[42]Fawcett, Tom. "An introduction to ROC analysis." Pattern recognition letters 27, no. 8 (2006): 861-874.