Investigation of Student Dropout Problem by Using Data Mining Technique

Full Text (PDF, 1823KB), PP.43-61



Sadi Mohammad 1,*, Ibrahim Adnan Chowdhury 1, Niloy Roy 1, Md. Nazim Hasan 1, Dip Nandi 2

1. Department of Computer Science, Faculty of Science and Technology, American International University-Bangladesh (AIUB), Dhaka 1229, Bangladesh

2. Faculty of Science and Technology, American International University-Bangladesh (AIUB), Dhaka, Bangladesh

* Corresponding author.


Received: 5 Apr. 2023 / Revised: 23 May 2023 / Accepted: 25 Jun. 2023 / Published: 8 Oct. 2023

Index Terms

Data Mining, Machine Learning (ML) Algorithms, k-fold Cross-Validation, Dropout, Predictive Models, Systematic Literature Mapping (SLM)


Over the past twenty years, the number of schools and universities has increased sharply. Intense competition among major universities and schools draws students to apply for admission to these institutions. Early prediction of school dropout is a critical problem for learners and is hard to tackle, because a wide range of factors can affect student retention. To attain the best accuracy, the standard classification approach used for this problem often has to be applied at the conclusion of the program. Most organizations and courses launched by universities operate on an auto model and therefore prefer course enrollment over student caliber. As a result, many students stop taking the course after the first year. To help manage student dropout rates, this research presents a data mining application. Given updated data on new students, the predictive model can produce a list of the students most likely to need help from the student dropout program. The results indicate that the Random Forest classification algorithm can build a reliable prediction model from existing student academic data. Future research on student dropout rates will remain vital for informing policy decisions, identifying at-risk populations, evaluating interventions, enhancing support services, predicting trends, understanding long-term consequences, and promoting global learning and collaboration in education.

Cite This Paper

Sadi Mohammad, Ibrahim Adnan Chowdhury, Niloy Roy, Md. Nazim Hasan, Dip Nandi, "Investigation of Student Dropout Problem by Using Data Mining Technique", International Journal of Education and Management Engineering (IJEME), Vol.13, No.5, pp. 43-61, 2023. DOI:10.5815/ijeme.2023.05.04

