Dynamic Data Mining: Using Dynamic ID3 Algorithm to Solve Any Problem that Needs Decision Tree Support

PDF (1632KB), PP.127-145

Views: 0 Downloads: 0

Author(s)

Amir Amjad Gharbi 1,* Bassel Alkhatib 1,2

1. Syrian Virtual University, Damascus, Syria

2. Al-Sham Private University, Damascus, Syria

* Corresponding author.

DOI: https://doi.org/10.5815/ijmecs.2025.06.09

Received: 9 Dec. 2024 / Revised: 18 Feb. 2025 / Accepted: 24 Apr. 2025 / Published: 8 Dec. 2025

Index Terms

Dynamic Data Mining, Decision Tree ID3, Algorithm Parameters, Stored Dataset Criteria, Performance Measures

Abstract

Most programmers and users resort to find individual solution per problem depending on the data and nature of problem, that will lead to solve a specific problem using an algorithm without the ability of this solution to solve a new problem. This variance comes from the difference in algorithm parameters from one problem to another, as these parameters related to data nature, its size, and values it carried that can affect the way algorithm work. Individual solutions lead to increase in time cost and effort spent on solving a new problem, which the new problem requires to work on programming new criteria for algorithm solution. That is prompted us to highlight necessaries to develop main components for algorithms used in practical life, such as data mining algorithms so that a solution designed for one problem can be more easily adapted to new problems with different data structures, within the general scope of decision tree applicability. These algorithm components need control mechanism settings, so when using component to solve problem, there is no need to develop algorithm settings again, regardless data size and data structure. We found that the dynamic solution saves effort and time needed to solve problems with same algorithm. In this paper, we present our methodology for using ID3 decision tree algorithm to mine data dynamically, and the mechanism used to achieve the dynamic solution, that provides a flexible and reusable solution for a wide range of problems that require decision tree support, reducing the need to redesign or reimplement models for each new task The proposed model was tested on three datasets. The proposed model achieved an accuracy of 97%, 97%, and 93% on the breast cancer, heart disease, and diabetes datasets, respectively.

Cite This Paper

Amir Amjad Gharbi, Bassel Alkhatib, "Dynamic Data Mining: Using Dynamic ID3 Algorithm to Solve Any Problem that Needs Decision Tree Support", International Journal of Modern Education and Computer Science(IJMECS), Vol.17, No.6, pp. 127-145, 2025. DOI:10.5815/ijmecs.2025.06.09

Reference

[1]A. Rajeshkanna and K. Arunesh, "Role of Decision Tree Classification in Data Mining," International Journal of Pure and Applied Mathematics, vol. 119, no. 15, pp. 2533-2543, 2018. [Online]. Available: https://acadpubl.eu/hub/2018-119-15/2/265.pdf.
[2]H. Mahmud, A. K. M. N. Islam, S. I. Ahmed, and K. Smolander, "What influences algorithmic decision-making? A systematic literature review on algorithm aversion," Technological Forecasting and Social Change, vol. 175, p. 121390, 2022/02/01/ 2022, doi: https://doi.org/10.1016/j.techfore.2021.121390.
[3]I. H. Sarker, "Machine Learning: Algorithms, Real-World Applications and Research Directions," SN Computer Science, vol. 2, no. 3, p. 160, 2021/03/22 2021, doi: https://doi.org/10.1007/s42979-021-00592-x.
[4] A. Rajeshkanna and K. Arunesh, "ID3 Decision Tree Classification: An Algorithmic Perspective based on Error rate," in 2020 International Conference on Electronics and Sustainable Communication Systems (ICESC), Coimbatore, India, 2-4 July 2020 2020: IEEE, pp. 787-790, doi: https://doi.org/10.1109/ICESC48915.2020.9155578
[5]E. E. Ogheneovo and P. A. Nlerum, "Iterative Dichotomizer 3 (ID3) Decision Tree: A Machine Learning Algorithm for Data Classification and Predictive Analysis," International Journal of Advanced Engineering Research and Science, vol. 7, no. 4, pp. 514-521, 2020, doi: https://doi.org/10.22161/ijaers.74.60.
[6] C. Marsala, "Fuzzy decision trees for dynamic data," in 2013 IEEE Conference on Evolving and Adaptive Intelligent Systems (EAIS), Singapore, 16-19 April 2013 2013, pp. 17-24, doi: https://doi.org/10.1109/EAIS.2013.6604100
[7]G. Mathew and Z. Obradovic, "Dynamic distributed predictive learning models that preserve privacy for hospitals with insufficient labeled data," Network Modeling Analysis in Health Informatics and Bioinformatics, vol. 2, no. 4, pp. 245-255, 2013/12/01 2013, doi: https://doi.org/10.1007/s13721-013-0041-y.
[8]R. Chen, T. Dai, Y. Zhang, Y. Zhu, X. Liu, and E. Zhao, "GBDT-IL: Incremental Learning of Gradient Boosting Decision Trees to Detect Botnets in Internet of Things," Sensors, vol. 24, no. 7, p. 2083, 2024, doi: https://doi.org/10.3390/s24072083.
[9]A. Lourenço, J. Rodrigo, J. Gama, and G. Marreiros, "DFDT: Dynamic Fast Decision Tree for IoT Data Stream Mining on Edge Devices," arXiv preprint, vol. arXiv.2502.14011, 2025, doi: https://doi.org/10.48550/arXiv.2502.14011.
[10]F. Daghero, A. Burrello, E. Macii, P. Montuschi, M. Poncino, and D. J. Pagliari, "Dynamic Decision Tree Ensembles for Energy-Efficient Inference on IoT Edge Nodes," IEEE Internet of Things Journal, vol. 11, no. 1, pp. 742-757, 2024, doi: https://doi.org/10.1109/JIOT.2023.3286276.
[11]S. Yang, J.-Z. Guo, and J.-W. Jin, "An improved Id3 algorithm for medical data classification," Computers & Electrical Engineering, vol. 65, pp. 474-487, 2018/01/01/ 2018, doi: https://doi.org/10.1016/j.compeleceng.2017.08.005.
[12]T. Jyothirmayi and S. Reddy, "ID3 Algorithm: An Algorithm for better decision tree," International Journal on Computer Science and Engineering, vol. 2, no. 9, pp. 2827-2830, 2010. [Online]. Available: http://www.enggjournals.com/ijcse/doc/IJCSE10-02-09-050.pdf 
[13]M. F. Ak, "A Comparative Analysis of Breast Cancer Detection and Diagnosis Using Data Visualization and Machine Learning Applications," (in eng), Healthcare (Basel, Switzerland), vol. 8, no. 2, Apr 26 2020, doi: https://doi.org/10.3390/healthcare8020111.
[14]C. Kakaraparthi. "Prediction of Heart Disease by ML Techniques." Kaggle; Data science company. https://www.kaggle.com/code/charankakaraparthi/prediction-of-heart-disease-by-ml-techniques (accessed 21/November/2024, 2024).
[15]H. R. Marateb and S. Goudarzi, "A noninvasive method for coronary artery diseases diagnosis using a clinically-interpretable fuzzy rule-based system," (in eng), Journal of research in medical sciences : the official journal of Isfahan University of Medical Sciences, vol. 20, no. 3, pp. 214-23, Mar 2015. [Online]. Available: https://pmc.ncbi.nlm.nih.gov/articles/PMC4468223/.
[16]K. Karagöz. "Pima Indians Diabetes Db to Classification." Kaggle; Data science company. https://www.kaggle.com/code/kursatkaragoz29/pima-indians-diabetes-db-to-classification (accessed 21/11/2024, 2024).
[17]V. Chang, J. Bailey, Q. A. Xu, and Z. Sun, "Pima Indians diabetes mellitus classification based on machine learning (ML) algorithms," (in eng), Neural computing & applications, pp. 1-17, Mar 24 2022, doi: https://doi.org/10.1007/s00521-022-07049-z.