IJMECS Vol. 18, No. 2, 8 Apr. 2026
Cover page and Table of Contents: PDF (size: 1232KB)
PDF (1232KB), PP.147-162
Views: 0 Downloads: 0
K-means, Decision Tree, Random Forest, Student Stress Analysis, Machine Learning, Artificial Intelligence, Counselling, Education 4.0
Students attending school experience stress due to a variety of factors that can originate either on campus or from home. In managing stress and other related issues, educational institutions now have counselling units that operate as centres. Moreover, several institutions have designated academic counsellors in departments to address the increasing demand for counselling services due to an expanding student population. Timely detection and proactive counselling of stress among students can help avert dropouts, health issues, and other learner behaviours that are detrimental to academic work. This study proposes two approaches to facilitate the automation of student counselling for stress management. We first implemented the K-means algorithm and formed clusters using the elbow and the Silhouette methods. The clusters formed reveal three groups of students. The stressors significantly affected one group, making it vulnerable. The stressors moderately impacted another group, while the final group experiences minimal stress. In the second part of the study, we proposed a classification model to identify the cluster group of any new student. The results of the classification show superior performance for the Decision Tree algorithm with an accuracy of 97.64%. The improvement in the efficiency of the classification algorithms was attained through feature engineering using the Chi-square method.
Delali Kwasi Dake, "Cluster Pattern Analysis of Students Stress using Machine Learning Algorithms with Feature Engineering", International Journal of Modern Education and Computer Science(IJMECS), Vol.18, No.2, pp. 147-162, 2026. DOI:10.5815/ijmecs.2026.02.09
[1]R. De Filippis, • Abdullah, and A. Foysal, “Comprehensive analysis of stress factors affecting students: a machine learning approach,” Discov. Artif. Intell. 2024 41, vol. 4, no. 1, pp. 1–17, 2024, doi: 10.1007/s44163-024-00169-6.
[2]Z. J. Gazzaz et al., “Perceived stress, reasons for and sources of stress among medical students at Rabigh Medical College, King Abdulaziz University, Jeddah, Saudi Arabia,” BMC Med. Educ., vol. 18, no. 1, pp. 1–9, 2018, doi: 10.1186/s12909-018-1133-2.
[3]World Health Organisation (WHO) (2023) : https://www.who.int/news-room/questions-and-answers/item/stress
[4]United Nations Children's Fund (UNICEF) (2022): https://www.unicef.org/parenting/mental-health/what-is-stress
[5]R. Rois, M. Ray, A. Rahman, and S. K. Roy, “Prevalence and predicting factors of perceived stress among Bangladeshi university students using machine learning algorithms,” J. Heal. Popul. Nutr., vol. 40, no. 1, pp. 1–12, 2021, doi: 10.1186/s41043-021-00276-5.
[6]Y. Amanvermez et al., “Stress Management Interventions for College Students: A Systematic Review and Meta-Analysis,” Clin. Psychol. Sci. Pract., vol. 30, no. 4, pp. 423–444, 2023, doi: 10.1111/cpsp.12342.
[7]A. Drigas and E. Mitsea, “Metacognition, Stress – Relaxation Balance & Related Hormones,” Int. J. Recent Contrib. from Eng. Sci. IT, vol. 9, no. 1, p. 4, 2021, doi: 10.3991/ijes.v9i1.19623.
[8]Transforming Education (2024): https://transformingeducation.org/student-stress-statistics/
[9]Research (2024): https://research.com/education/student-stress-statistics
[10]D. Owusu, F. Arthur, R. Okyere-Dankwa, O. Affreh, F. Kwame Kumedzro, and R. S. Maison, “Academic stress and burnout among distance education students in a Ghanaian higher education institution,” Cogent Educ., vol. 11, no. 1, p., 2024, doi: 10.1080/2331186X.2024.2334686.
[11]Y. Amanvermez et al., “Sources of stress among domestic and international students: a cross-sectional study of university students in Amsterdam, The Netherlands,” Anxiety, Stress Coping, vol. 37, no. 4, pp. 428–445, 2024, doi: 10.1080/10615806.2023.2280701.
[12]J. Choi, “Impact of Stress Levels on Eating Behaviors among,” Nutrients, vol. 12, no. 5, pp. 1–8, 2020.
[13]M. de Witte, A. da S. Pinho, G. J. Stams, X. Moonen, A. E. R. Bos, and S. van Hooren, “Music therapy for stress reduction: a systematic review and meta-analysis,” Health Psychol. Rev., vol. 16, no. 1, pp. 134–159, 2022, doi: 10.1080/17437199.2020.1846580.
[14]S. H. Hamaideh, H. Al-Modallal, M. Tanash, and A. Hamdan-Mansour3, “Depression, anxiety and stress among undergraduate students during COVID-19 outbreak and ‘home-quarantine,’” Nurs. Open, vol. 9, no. 2, pp. 1423–1431, 2022, doi: 10.1002/nop2.918.
[15]World Bank (2024): https://www.worldbank.org/en/topic/tertiaryeducation
[16]K. P. Sinaga and M. S. Yang, “Unsupervised K-means clustering algorithm,” IEEE Access, vol. 8, pp. 80716–80727, 2020, doi: 10.1109/ACCESS.2020.2988796.
[17]S. Park et al., “Improving unsupervised image clustering with robust learning,” Proc. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit., no. 2, pp. 12273–12282, 2021, doi: 10.1109/CVPR46437.2021.01210.
[18]S. Križanić, “Educational data mining using cluster analysis and decision tree technique: A case study,” Int. J. Eng. Bus. Manag., vol. 12, pp. 1–9, 2020, doi: 10.1177/1847979020908675.
[19]A. Dutt, “Clustering Algorithms Applied in Educational Data Mining,” Int. J. Inf. Electron. Eng., no. March, 2015, doi: 10.7763/ijiee.2015.v5.513.
[20]M. C. Sáiz-Manzanares, J. J. Rodríguez-Díez, J. F. Díez-Pastor, S. Rodríguez-Arribas, R. Marticorena-Sánchez, and Y. P. Ji, “Monitoring of student learning in learning management systems: An application of educational data mining techniques,” Appl. Sci., vol. 11, no. 6, 2021, doi: 10.3390/app11062677.
[21]S. Tiwari, “Supervised Machine Learning: A Brief Introduction,” Proc. Int. Conf. Virtual Learn., vol. 17, no. 5, pp. 219–230, 2022, doi: 10.58503/icvl-v17y202218.
[22]S. Uddin, A. Khan, M. E. Hossain, and M. A. Moni, “Comparing different supervised machine learning algorithms for disease prediction,” BMC Med. Inform. Decis. Mak., vol. 19, no. 1, pp. 1–16, 2019, doi: 10.1186/s12911-019-1004-8.
[23]A. Khalaf Hamoud et al., “Supervised Learning Algorithms in Educational Data Mining: A Systematic Review,” Southeast Eur. J. Soft Comput., vol. 10, no. 1, pp. 55–70, 2021, [Online]. Available: http://scjournal.ius.edu.ba/index.php/scjournal/article/view/199.
[24]B. Albreiki, N. Zaki, and H. Alashwal, “Revisión bibliográfica sistemática de la predicción del rendimiento de los estudiantes mediante técnicas de aprendizaje automático,” Educ. Sci., vol. 11, no. 9, 2021.
[25]Y. Deng et al., “Family and Academic Stress and Their Impact on Students’ Depression Level and Academic Performance,” Front. Psychiatry, vol. 13, no. June, pp. 1–13, 2022, doi: 10.3389/fpsyt.2022.869337.
[26]A. Alkhawaldeh et al., “Stress Factors, Stress Levels, and Coping Mechanisms among University Students,” Sci. World J., vol. 2023, 2023, doi: 10.1155/2023/2026971.
[27]W. Thanoi, N. Vongsirimas, Y. Sitthimongkol, and P. Klainin-Yobas, “Examining Predictors of Psychological Well-Being among University Students: A Descriptive Comparative Study across Thailand and Singapore,” Int. J. Environ. Res. Public Health, vol. 20, no. 3, 2023, doi: 10.3390/ijerph20031875.
[28]S. Asif, A. Mudassar, T. Z. Shahzad, M. Raouf, and T. Pervaiz, “Frequency of depression, anxiety and stress among university students,” Pakistan J. Med. Sci., vol. 36, no. 5, pp. 971–976, 2020, doi: 10.12669/pjms.36.5.1873.
[29]K. J. Reddy, K. R. Menon, and A. Thattil, “Academic stress and its sources among university students,” Biomed. Pharmacol. J., vol. 11, no. 1, pp. 531–537, 2018, doi: 10.13005/bpj/1404.
[30]D. I. Khelil, D. Z. S. A. Altarawneh, and D. M. A. S. Khasawneh, “Thinking Patterns and their Impact on Coping with Psychological Stress among Jordanian Public University Students,” Migr. Lett., vol. 20, no. 6, pp. 1089–1102, 2023, doi: 10.59670/ml.v20i6.4920.
[31]F. G. Karaoglan Yilmaz, R. Yilmaz, and F. Erdogdu, “The Relationship Between Nomophobia, Emotional Intelligence, Interpersonal Problem-Solving, Perceived Stress, and Self-esteem Among Undergraduate Students,” J. Technol. Behav. Sci., vol. 8, no. 1, pp. 27–35, 2023, doi: 10.1007/s41347-022-00282-2.
[32]M. R. Shehada, G. Alfakhry, I. Jamous, A. A. Aljoujou, and M. Abdul_hak, “Major Stress Sources Amongst Dental Students at Damascus University, Syria,” Int. Dent. J., vol. 73, no. 2, pp. 205–211, 2023, doi: 10.1016/j.identj.2022.03.005.
[33]C. E. Brett, M. L. Mathieson, and A. M. Rowley, “Determinants of wellbeing in university students: The role of residential status, stress, loneliness, resilience, and sense of coherence,” Curr. Psychol., vol. 42, no. 23, pp. 19699–19708, 2023, doi: 10.1007/s12144-022-03125-8.
[34]L.-A. Hanna, M. Wilson, M. Hall, and A. Hanna, “A Questionnaire Study to Investigate Stress among Future Pharmacists by Gender and Year Group,” Pharmacy, vol. 6, no. 3, p. 75, 2018, doi: 10.3390/pharmacy6030075.
[35]J. M. Ngaboyishema, S. Gisanabagabo, J. Paul, and K. Kayonde, “Perception of Stress among Masters ’ Students at Adventist University of Central Africa ( AUCA ),” vol. 6, pp. 83–92, 2022.
[36]A. Pitt, F. Oprescu, G. Tapia, and M. Gray, “An exploratory study of students’ weekly stress levels and sources of stress during the semester,” Act. Learn. High. Educ., vol. 19, no. 1, pp. 61–75, 2018, doi: 10.1177/1469787417731194.
[37]D. S. et al. . Disha Sharma et al., “Stress Prediction of Students using Machine Learning,” Int. J. Mech. Prod. Eng. Res. Dev., vol. 10, no. 3, pp. 5609–5620, 2020, doi: 10.24247/ijmperdjun2020534.
[38]K. Bhatia, M. Kumar, R. K. Bhatia, A. Sharma, and K. Chhabra, “Machine learning based classification of academic stress factors,” Proc. - 2021 4th Int. Conf. Comput. Intell. Commun. Technol. CCICT 2021, pp. 45–50, 2021, doi: 10.1109/CCICT53244.2021.00020.
[39]M. A. Paul and M. Resija, “Stress Detection in College Students Using Machine Learning Algorithm,” Int. J. Res. Publ. Rev. J. homepage www.ijrpr.com, vol. 5, no. 2, pp. 3411–3417, 2024, [Online]. Available: www.ijrpr.com.
[40]R. Ahuja and A. Banga, “Mental stress detection in university students using machine learning algorithms,” Procedia Comput. Sci., vol. 152, pp. 349–353, 2019, doi: 10.1016/j.procs.2019.05.007.
[41]M. Firoz, M. M. Islam, M. Shidujaman, A. Islam, and M. T. Habib, “University student’s mental stress detection using machine learning,” vol. 12779, no. Icmir 2023, p. 113, 2023, doi: 10.1117/12.2690039.
[42]C. Ding, Y. Zhang, and T. Ding, “A systematic hybrid machine learning approach for stress prediction,” PeerJ Comput. Sci., vol. 9, pp. 1–25, 2023, doi: 10.7717/PEERJ-CS.1154.
[43]Dr. J V Gorabal, Mr. Vipul Y S, Mr. Yashas S Bharadwaj, Mr. Mohammed Anas, and Mr. Mohammed Nehad Ameem, “Predictive Analysis of Student Stress Level using Machine Learning,” Int. J. Adv. Res. Sci. Commun. Technol., vol. 12, no. 05, pp. 465–473, 2023, doi: 10.48175/ijarsct-10387.
[44]R. Madhuri, M. R. Murty, J. V. R. Murthy, P. V. G. D. P. Reddy, and S. C. Satapathy, “Cluster Analysis on Different Data Sets Using K-Modes and K-Prototype Algorithms,” Adv. Intell. Syst. Comput., vol. 249 VOLUME, pp. 137–144, 2014, doi: 10.1007/978-3-319-03095-1_15.
[45]M. Ahmed, R. Seraj, and S. M. S. Islam, “The k-means algorithm: A comprehensive survey and performance evaluation,” Electron., vol. 9, no. 8, pp. 1–12, 2020, doi: 10.3390/electronics9081295.
[46]Global Tech Council (2020): https://www.globaltechcouncil.org/data-science/k-means-clustering-vs-hierarchical-clustering/
[47]Blockchain Council (2024): https://www.blockchain-council.org/blogs/difference-between-k-means-and-hierarchical-clustering/
[48]J. Xu and K. Lange, “Power k-means clustering,” 36th Int. Conf. Mach. Learn. ICML 2019, vol. 2019-June, pp. 11977–11991, 2019.
[49]T. M. Ghazal et al., “Performances of k-means clustering algorithm with different distance metrics,” Intell. Autom. Soft Comput., vol. 30, no. 2, pp. 735–742, 2021, doi: 10.32604/iasc.2021.019067.
[50]M. Cui, “on the Elbow Method,” pp. 5–8, 2020, doi: 10.23977/accaf.2020.010102.
[51]A. Sarica, A. Cerasa, and A. Quattrone, “Random forest algorithm for the classification of neuroimaging data in Alzheimer’s disease: A systematic review,” Front. Aging Neurosci., vol. 9, no. OCT, pp. 1–12, 2017, doi: 10.3389/fnagi.2017.00329.
[52]M. Pal and S. Parija, “Prediction of Heart Diseases using Random Forest,” J. Phys. Conf. Ser., vol. 1817, no. 1, 2021, doi: 10.1088/1742-6596/1817/1/012009.
[53]M. Bansal, A. Goyal, and A. Choudhary, “A comparative analysis of K-Nearest Neighbor, Genetic, Support Vector Machine, Decision Tree, and Long Short Term Memory algorithms in machine learning,” Decis. Anal. J., vol. 3, no. November 2021, p. 100071, 2022, doi: 10.1016/j.dajour.2022.100071.
[54]Y. Zhong, “The analysis of cases based on decision tree,” Proc. IEEE Int. Conf. Softw. Eng. Serv. Sci. ICSESS, vol. 0, pp. 142–147, 2016, doi: 10.1109/ICSESS.2016.7883035.
[55]C. Hu, C. Zhang, D. Lei, T. Wu, X. Liu, and L. Zhu, “Achieving Privacy-Preserving and Verifiable Support Vector Machine Training in the Cloud,” IEEE Trans. Inf. Forensics Secur., vol. 18, pp. 3476–3491, 2023, doi: 10.1109/TIFS.2023.3283104.
[56]L. Y. Hu, M. W. Huang, S. W. Ke, and C. F. Tsai, “The distance function effect on k-nearest neighbor classification for medical datasets,” Springerplus, vol. 5, no. 1, 2016, doi: 10.1186/s40064-016-2941-7.
[57]P. Thanh Noi and M. Kappas, “Comparison of Random Forest, k-Nearest Neighbor, and Support Vector Machine Classifiers for Land Cover Classification Using Sentinel-2 Imagery,” Sensors (Basel)., vol. 18, no. 1, 2017, doi: 10.3390/s18010018.
[58]C. Shi, B. Wei, S. Wei, W. Wang, H. Liu, and J. Liu, “A quantitative discriminant method of elbow point for the optimal number of clusters in clustering algorithm,” Eurasip J. Wirel. Commun. Netw., vol. 2021, no. 1, 2021, doi: 10.1186/s13638-021-01910-w.
[59]M. Mehedi Hassan, S. Mollick, and F. Yasmin, “An unsupervised cluster-based feature grouping model for early diabetes detection,” Healthc. Anal., vol. 2, no. September, p. 100112, 2022, doi: 10.1016/j.health.2022.100112.
[60]C. L. Clayman, S. M. Srinivasan, and R. S. Sangwan, “K-means clustering and principal components analysis of microarray data of L1000 landmark genes,” Procedia Comput. Sci., vol. 168, no. 2019, pp. 97–104, 2020, doi: 10.1016/j.procs.2020.02.265.
[61]M. K. Islam, M. S. Ali, M. S. Miah, M. M. Rahman, M. S. Alam, and M. A. Hossain, “Brain tumor detection in MR image using superpixels, principal component analysis and template based K-means clustering algorithm,” Mach. Learn. with Appl., vol. 5, no. January, p. 100044, 2021, doi: 10.1016/j.mlwa.2021.100044.