Fuzzy Clustering of Educational Data with Automated Selection of Processing Parameters in System Analysis of Quality Education

PDF (1001KB), PP.101-111

Views: 0 Downloads: 0

Author(s)

Zhengbing Hu 1 Oleksandr Derevyanchuk 2 Serhiy Balovsyak 2 Yuriy Ushenko 2,* Hanna Kravchenko 3 Iryna Sapsai 4

1. School of Computer Science, Hubei University of Technology, Wuhan, China

2. Yuriy Fedkovych Chernivtsi National University, Chernivtsi, 58012, Ukraine

3. High State Educational Establishment «Chernivtsi transport college», Chernivtsi, 58000, Ukraine

4. Institute of Postgraduate Education, Borys Grinchenko Kyiv Metropolitan University, Kyiv, 02152, Ukraine

* Corresponding author.

DOI: https://doi.org/10.5815/ijmecs.2025.04.07

Received: 18 Apr. 2025 / Revised: 26 May 2025 / Accepted: 27 Jun. 2025 / Published: 8 Aug. 2025

Index Terms

Education Technology, Clustering methods, Data Mining, Educational Data, Fuzzy Logic, K-Means, System Analysis, Quality Education

Abstract

Clustering of educational data was performed in the space of two parameters using the K-Means method. Students who are characterized by grades in certain types of activities were used as objects of clustering. Software for fuzzy data clustering is implemented in the Python language in the Google Colab cloud service. The obtained clusters are described by fuzzy Gaussian membership functions, which allowed to reliably determine the membership of each object to a certain cluster, even if the clusters do not have clear boundaries. Due to clustering, the most important characteristics of the educational process for a certain task are obtained, that is, this is how Data Manning tasks are solved. Fuzzy membership functions implemented using the scikit-fuzzy library. The developed program can also be used for educational purposes, as it allows a better understanding of the principles of cluster analysis and fuzzy logic. The correctness of the work of the developed program was confirmed during the processing of test educational data. The determination of the number of clusters was performed by software, taking into account the intra-cluster and inter-cluster distances, as well as the shape of the clusters. Automated selection of the number of clusters and cluster boundaries allows to reduce data processing time. The developed clustering tools are designed to increase the efficiency of system analysis of quality education.

Cite This Paper

Zhengbing Hu, Oleksandr Derevyanchuk, Serhiy Balovsyak, Yuriy Ushenko, Hanna Kravchenko, Iryna Sapsai, "Fuzzy Clustering of Educational Data with Automated Selection of Processing Parameters in System Analysis of Quality Education", International Journal of Modern Education and Computer Science(IJMECS), Vol.17, No.4, pp. 101-111, 2025. DOI:10.5815/ijmecs.2025.04.07

Reference

[1]R. Ahuja, A. Jha, R. Maurya and R. Srivastava, Analysis of Educational Data Mining. In Harmony Search and Nature Inspired Optimization Algorithms, Springer: Singapore, 2019, pp. 897–907. doi: 10.1007/978-981-13-0761-4_85.
[2]H. Aldowah, H. Al-Samarraie and W.M. Fauzy, “Educational data mining and learning analytics for 21st century higher education: A review and synthesis”, Telemat. Inform., vol. 37, pp. 13-49, 2019. doi: 10.1016/j.tele.2019.01.007.
[3]Ihor Tereikovskyi, Zhengbing Hu, Denys Chernyshev, Liudmyla Tereikovska, Oleksandr Korystin, Oleh Tereikovskyi, “The Method of Semantic Image Segmentation Using Neural Networks”, International Journal of Image, Graphics and Signal Processing, vol. 14, no. 6, pp. 1-14, 2022. 
[4]S.V. Balovsyak, O.V. Derevyanchuk, I.M. Fodchuk, “Method of calculation of averaged digital image profiles by envelopes as the conic sections”, Advances in Intelligent Systems and Computing, Hu Z., Petoukhov S., Dychka I., He M. (Eds.), Springer International Publishing, vol. 754, pp. 204–212, 2019. doi: 10.1007/978-3-319-91008-6_21.
[5]S.V. Balovsyak, O.V. Derevyanchuk, I.M. Fodchuk, O.P. Kroitor, Kh.S. Odaiska, O.O. Pshenychnyi, A. Kotyra, A. Abisheva, “Adaptive oriented filtration of digital images in the spatial domain”, Proc. SPIE 11176, Photonics Applications in Astronomy, Communications, Industry, and High-Energy Physics Experiments, vol. 11176, pp 111761A-1–111761A-6, 2019. doi: https://doi.org/10.1117/12.2537165.
[6]S.V. Balovsyak, O.V. Derevyanchuk, H.O. Kravchenko, O.P. Kroitor, V.V. Tomash, “Computer system for increasing the local contrast of railway transport images”, Proc. SPIE, Fifteenth International Conference on Correlation Optics, vol. 12126, pp. 121261E1-7, 2021. doi: 10.1117/12.2615761.
[7]Derevyanchuk O.V., Kravchenko H.O., Derevianchuk Y.V., Tomash V.V. “Recognition images of broken window glass”, Proceedings of SPIE”, vol. 12938, pp. 210-213, 2024. doi: https://doi.org/10.1117/12.3012995.
[8]O. Berezsky, M. Zarichnyi, and O. Pitsun, "Development of a metric and the methods for quantitative estimation of the segmentation of biomedical images", Eastern-European Journal of Enterprise Technologies, vol. 6, no. 4(90), pp. 4–11, 2017. doi: 10.15587/1729-4061.2017.119493.
[9]Moez Ali. Clustering in Machine Learning: 5 Essential Clustering Algorithms. https://www.datacamp.com/blog/clustering-in-machine-learning-5-essential-clustering-algorithms.
[10]S.V. Balovsyak, Kh. S. Odaiska, “Automatic Determination of the Gaussian Noise Level on Digital Images by High-Pass Filtering for Regions of Interest”, Cybernetics and Systems Analysis, vol. 54, no. 4, pp. 662-670, 2018. doi: 10.1007/s10559-018-0067-3.
[11]M. Wang, X. Wei, “Research on Logistics Center Location-Allocation Problem Based on Two-Stage K-Means Algorithms”, Advances in Computer Science for Engineering and Education, Hu, Z., Petoukhov, S., Dychka, I., He, M. (Eds.), Springer International Publishing, vol. 1247, pp. 52-62, 2021. doi: 10.1007/978-3-030-55506-1_5. 
[12]YanPing Zhao, and XiaoLai Zhou, “K-means Clustering Algorithm and Its Improvement Research”, Journal of Physics: Conference Series, 2nd International Workshop on Electronic communication and Artificial Intelligence (IWECAI 2021) 12-14 March 2021, Nanjing, China, vol. 1873, 012074, pp. 1-5, 2021, doi: 10.1088/1742-6596/1873/1/012074.
[13]Guojun Gan, Chaoqun Ma, and Jianhong Wu, Data Clustering: Theory, Algorithms, and Applications, Second Edition. Society for Industrial and Applied Mathematics (SIAM), 2021.
[14]Jure Leskovec, Anand Rajaraman, and Jeffrey David Ullman, Mining of Massive Datasets, 3rd edition.  Stanford University, 2019.
[15]X. Li, Y. Zhang, H. Cheng, F. Zhou and B. Yin, “An Unsupervised Ensemble Clustering Approach for the Analysis of Student Behavioral Patterns”, IEEE Access, vol. 9, pp. 7076-7091, 2021, doi: 10.1109/ACCESS.2021.3049157.
[16]C. Maithri, H. Chandramouli, “Parallel DBSCAN Clustering Algorithm Using Hadoop Map-reduce Framework for Spatial Data”, International Journal of Information Technology and Computer Science, vol. 14, no. 6, pp. 1-12, 2022. 
[17]Mohamed Nafuri, Ahmad Fikri, Nor Samsiah Sani, Nur Fatin Aqilah Zainudin, Abdul Hadi Abd Rahman, and Mohd Aliff. “Clustering Analysis for Classifying Student Academic Performance in Higher Education”, Applied Sciences, vol. 12, no. 19, pp. 9467, 2022. doi: 10.3390/app12199467.
[18]Denon Arthur Richmond Gono, Bi Tra Goore, Yves Tiecoura, and Kouame Abel Assielou, "Multi-relational Matrix Factorization Approach for Educational Items Clustering," International Journal of Information and Education Technology, vol. 13, no. 1, pp. 42-47, 2023. doi: 10.18178/ijiet.2023.13.1.1778.
[19]Mary Jane Samonte, Gabriel Edrick O. Acuna, Luis Antonio Z. Alvarez, and Jeffrey M. Miraflores, "A Personality-Based Virtual Tutor for Adaptive Online Learning System," International Journal of Information and Education Technology, vol. 13, no. 6, pp. 899-905, 2023. doi: 10.18178/ijiet.2023.13.6.1885.
[20]Zareen Alamgir, Habiba Akram, Saira Karim, Aamir Wali, "Enhancing Student Performance Prediction via Educational Data Mining on Academic data", Informatics in Education. vol. 23, no. 1, pp. 1-24, 2024. doi: 10.15388/infedu.2024.04.
[21]N. Shakhovska, O. Vovk, R. Hasko, Y. Kryvenchuk, “The Method of Big Data Processing for Distance Educational System”, Advances in Intelligent Systems and Computing, Shakhovska N., Stepashko V. (Eds.), Springer International Publishing, vol. 689, pp. 461-473, 2018. doi: 10.1007/978-3-319-70581-1_33.
[22]Batool, S., Rashid, J., Nisar, M.W. et al., "Educational data mining to predict students' academic performance: A survey study", Educ. Inf. Technol, vol. 28, pp. 905-971, 2023. doi: 10.1007/s10639-022-11152-y.
[23]Sarsa, S., Leinonen, J., & Hellas, A., "Empirical Evaluation of Deep Learning Models for Knowledge Tracing: Of Hyperparameters and Metrics on Performance and Replicability", Journal of Educational Data Mining, vol. 14, no. 2, 2022. doi: 10.5281/zenodo.7086179.
[24]Nayak, P., Vaheed, S., Gupta, S. et al., "Predicting students' academic performance by mining the educational data through machine learning-based classification model", Educ. Inf. Technol, vol. 28, pp. 14611-14637, 2023. doi: 10.1007/s10639-023-11706-8.
[25]Delianidi, M., & Diamantaras, K. "KT-Bi-GRU: Student Performance Prediction with a Bi-Directional Recurrent Knowledge Tracing Neural Network", Journal of Educational Data Mining, vol. 15, no. 2, pp. 1-21. 2023. doi: 10.5281/zenodo.7808087.
[26]A. R. Fayek, “Fuzzy Logic and Fuzzy Hybrid Techniques for Construction Engineering and Management”, Journal of Construction Engineering and Management, vol. 146, no. 7, pp. 1-12, 2020. doi: 10.1061/(ASCE)CO.1943-7862.0001854.
[27]Tengku Zatul Hidayah Tengku Petra, Mohd Juzaiddin Ab Aziz, “Analysing Student Performance In Higher Education Using Fuzzy Logic Evaluation”, International journal of scientific & technology research, vol. 10, no. 01, pp. 322-327, 2021.
[28]A. Heni, I. Jdey and H. Ltifi, “K-means and fuzzy c-means fusion for object clustering”, 8th International Conference on Control, Decision and Information Technologies (CoDIT), Istanbul, Turkey, pp. 177-182, 2022. doi: 10.1109/CoDIT55151.2022.9804078.
[29]Serhiy Balovsyak, Oleksandr Derevyanchuk, Vasyl Kovalchuk, Hanna Kravchenko, Maryna Kozhokar, "Face Mask Recognition by the Viola-Jones Method Using Fuzzy Logic", International Journal of Image, Graphics and Signal Processing, vol.16, no.3, pp. 39-51, 2024. 
[30]Thamasan Suwanroj, Orawan Saeung, Punnee Leekitchwatana, and Kanaporn Kaewkamjan, "The Development of Professional Competency Test in Knowledge and Cognitive Skill for Computer Innovation and Digital Industry," International Journal of Information and Education Technology, vol. 13, no. 1, pp. 121-130, 2023. doi: 10.18178/ijiet.2023.13.1.1787.
[31]Omar T., Alzahrani A., Andzohdy M. "Clustering approach for analyzing the student’s efficiency and performance based on data", Journal of Data Analysis and Information Processing, vol. 8, no. 03, pp. 171–182, 2020. doi: 10.4236/jdaip.2020.83010.
[32]J.C. Russ, The Image Processing Handbook. Taylor and Francis Group, 2011.
[33]Google Colab. URL: https://colab.research.google.com.
[34]Derevyanchuk O.V., Balovsyak S.V. Certificate of copyright registration for the work, No. 123369, 31.01.2024. ID CR3206310124. Computer program " Data clustering using fuzzy logic" ("ClusterFuzzy23"). Ukrainian National Office for Intellectual Property and Innovations (IP Office). URL: https://sis.nipo.gov.ua/uk/services/original-document.
[35]scikit-learn. URL: https://scikit-learn.org
[36]Students performance in exams. URL: https://www.kaggle.com/datasets/spscientist/students-performance-in-exams.
[37]M. Zhang, Y. Cheng, J. Gu, Y. Yang, L. Chen, and B. Cui, “Digital Evaluation of Innovative Logistics Talents Based on Improved SAGA-FCM Algorithm,” In: Hu, Z., Zhang, Q., He, M. (eds), Advances in Artificial Systems for Logistics Engineering, Springer, Cham., vol. 223, pp. 374-383, 2024. doi: 10.1007/978-3-031-72017-8_34.
[38]Nadindra Dwi Ariyanta, and Anik Nur Handayani, “Sugeno Fuzzy Personality Prediction System: An Approach to Overcoming Psychological Measurement Uncertainty,” Indonesian Journal of Data and Science, vol. 5(3), pp. 216-228, 2024. doi: 10.56705/ijodas.v5i3.192.
[39]Free Personality Test. URL: https://personalitymax.com.
[40]Experience Based Learning Systems, LLC (EBLS). URL: https://learningfromexperience.com.
[41]Kolb’s Learning Styles and Test. URL: https://en.eduolog.com/kolbs-learning-styles.