Adaptive Swarm-Optimized Ensemble Learning for Generalizable Heart Disease Risk Prediction

PDF (1088KB), PP.109-129

Views: 0 Downloads: 0

Author(s)

Ankit Maithani 1 Garima Verma 2,*

1. CSE Dept, School of Computing, DIT University, Dehradun, India

2. School of Computing, DIT University, Dehradun, India

* Corresponding author.

DOI: https://doi.org/10.5815/ijitcs.2026.03.08

Received: 8 Jan. 2026 / Revised: 27 Mar. 2026 / Accepted: 20 Apr. 2026 / Published: 8 Jun. 2026

Index Terms

Heart Disease, Modified Spider Monkey Optimization, KNN, Catboost, Cross-validation

Abstract

Machine learning (ML) has made it much easier to find and estimate the risk of early stage of cardiovascular illnesses by making it possible to analyses massive, various clinical datasets quickly and easily. In these kinds of datasets, demographic information, lifestyle characteristics, medical history, and diagnostic measurements are all included. These are all things that may not be easy to see through standard clinical examination. This study examines heart disease prediction through a series of hybrid ML models that integrate neighborhood-based classifiers, swarm intelligence-driven optimization, and ensemble learning, motivated by existing obstacles. There are four hybrid models being proposed: MSMO-KE and MSMO-KM, which combine Modified Spider Monkey Optimization (MSMO) with K-Nearest Neighbour classifiers that use Euclidean and Minkowski distance measures, respectively. There are also two ensemble variants, MSMO-KECB and MSMO-KMCB, which add CatBoost as a final prediction layer. To make sure it is strong and can be used in other situations, the proposed framework is tested on three separate cardiovascular datasets using a cross-validation method. The experimental findings show that the performance is always better than the baseline and the best models that are already used. The MSMO-KMCB model performs the best overall out of all the approaches tested. It has a cross-validated accuracy of 98.2% on Dataset-3 while keeping a high sensitivity. The comparative research demonstrates that the proposed MSMO-based ensemble models surpass current methodologies in predictive accuracy and recall, underscoring their promise for dependable and efficient heart disease risk prediction in clinical decision-support systems.

Cite This Paper

Ankit Maithani, Garima Verma, "Adaptive Swarm-Optimized Ensemble Learning for Generalizable Heart Disease Risk Prediction", International Journal of Information Technology and Computer Science(IJITCS), Vol.18, No.3, pp.109-129, 2026. DOI:10.5815/ijitcs.2026.03.08

Reference

[1]X. Wei, C. Rao, X. Xiao, L. Chen, and M. Goh, “Risk assessment of cardiovascular disease based on SOLSSA-CatBoost model,” Expert Systems with Applications, vol. 219, p. 119648, 2023. https://doi.org/10.1016/j.eswa.2023.119648
[2]M. U. Rehman, S. Naseem, A. U. R. Butt, T. Mahmood, A. R. Khan, I. Khan, J. Khan, and Y. Jung, “Predicting coronary heart disease with advanced machine learning classifiers for improved cardiovascular risk assessment,” Scientific Reports, vol. 15, no. 1, p. 13361, 2025. https://www.nature.com/articles/s41598-025-96437-1
[3]A. Kumar and M. Maben, “Heart disease prediction using machine learning algorithms,” in Proc. 3rd Int. Conf. Networks, Multimedia and Information Technology (NMITCON), Aug. 2025, pp. 1–6. https://ieeexplore.ieee.org/abstract/document/11187908/
[4]D. Asif, M. Bibi, M. S. Arif, and A. Mukheimer, “Enhancing heart disease prediction through ensemble learning techniques with hyperparameter optimization,” Algorithms, vol. 16, no. 6, p. 308, 2023. https://doi.org/10.3390/a16060308
[5]A. Maithani, R. Koli, R. Pal, D. S. Rai, A. Rohilla, and R. Kumar, “Agile approach of predicting cardiac disease using ANN based on feature selection,” in Proc. Int. Conf. Innovative Practices in Technology and Management (ICIPTM), Feb. 2021, pp. 243–248. https://doi.org/10.1109/ICIPTM52218.2021.9388325
[6]A. Maithani and G. Verma, “Hybrid model with improved score level fusion for heart disease classification,” Multimedia Tools and Applications, pp. 1–37, 2023. https://doi.org/10.1007/s11042-023-17599-6
[7]D. Rohan, G. P. Reddy, Y. P. Kumar, K. P. Prakash, and C. P. Reddy, “An extensive experimental analysis for heart disease prediction using artificial intelligence techniques,” Scientific Reports, vol. 15, no. 1, p. 6132, Feb. 2025. https://www.nature.com/articles/s41598-025-90530-1
[8]C. J. Harrison and C. J. Sidey-Gibbons, “Machine learning in medicine: A practical introduction to natural language processing,” BMC Medical Research Methodology, vol. 21, no. 1, pp. 1–11, 2021. https://doi.org/10.1186/s12874-021-01347-1
[9]A. Alqahtani, S. Alsubai, M. Sha, L. Vilcekova, and T. Javed, “Cardiovascular disease detection using ensemble learning,” Computational Intelligence and Neuroscience, vol. 2022, Art. no. 5267498, 2022. https://doi.org/10.1155/2022/5267498
[10]G. Verma and S. Kanrar, “Load balancing model for cloud environment using swarm intelligence technique,” Multiagent and Grid Systems, vol. 19, no. 3, pp. 211–229, 2023. https://content.iospress.com/articles/multiagent-and-grid-systems/mgs230021
[11]Kaur, B. S. Khehra, and A. Singh, “A state of the art survey of soft computing based approaches for heart disease detection and development of a novel PSO-fuzzy approach,” Archives of Computational Methods in Engineering, pp. 1–24, 2025. https://link.springer.com/article/10.1007/s11831-025-10360-2 
[12]S. Mohan, C. Thirumalai, and G. Srivastava, “Effective heart disease prediction using hybrid machine learning techniques,” IEEE Access, vol. 7, pp. 81542–81554, 2019. https://doi.org/10.1109/ACCESS.2019.2923707
[13]P. Rani, R. Kumar, N. M. S. Ahmed, and A. Jain, “A decision support system for heart disease prediction based upon machine learning,” Journal of Reliable Intelligent Environments, vol. 7, no. 3, pp. 263–275, 2021. https://doi.org/10.1007/s40860-021-00133-6
[14]F. Ahmed, M. Saleem, Z. Rajpoot, and A. Noor, “Intelligent heart disease prediction using CatBoost empowered with XAI,” International Journal of Computational and Innovative Sciences, vol. 2, no. 4, pp. 8–13, 2023. http://ijcis.com/index.php/IJCIS/article/view/99
[15]B. Dhananjay and J. Sivaraman, “Analysis and classification of heart rate using CatBoost feature ranking model,” Biomedical Signal Processing and Control, vol. 68, p. 102610, 2021. https://doi.org/10.1016/j.bspc.2021.102610
[16]I. D. Mienye, Y. Sun, and Z. Wang, “An improved ensemble learning approach for the prediction of heart disease risk,” Informatics in Medicine Unlocked, vol. 20, p. 100402, 2020. https://doi.org/10.1016/j.imu.2020.100402
[17]Y. Shaikh, V. K. Parvati, and S. R. Biradar, “Heart disease prediction using ensemble learning,” in Proc. Int. Conf. Applied Intelligence and Sustainable Computing (ICAISC), Jun. 2023, pp. 1–5. https://doi.org/10.1109/ICAISC58445.2023.10200375
[18]A. B. Majumder, S. Gupta, D. Singh, B. Acharya, V. C. Gerogiannis, A. Kanavos, and P. Pintelas, “Heart disease prediction using concatenated hybrid ensemble classifiers,” Algorithms, vol. 16, no. 12, p. 538, 2023. https://doi.org/10.3390/a16120538
[19]M. S. Pathan, A. Nag, M. M. Pathan, and S. Dev, “Analyzing the impact of feature selection on the accuracy of heart disease prediction,” Healthcare Analytics, vol. 2, p. 100060, 2022. https://www.sciencedirect.com/science/article/pii/S2772442522000235
[20]R. Mohammed, J. Rawashdeh, and M. Abdullah, “Machine learning with oversampling and undersampling techniques: Overview study and experimental results,” in Proc. 11th Int. Conf. Information and Communication Systems (ICICS), Apr. 2020, pp. 243–248. https://doi.org/10.1109/ICICS49469.2020.239556
[21]S. B. Imandoust and M. Bolandraftar, “Application of k-nearest neighbor (KNN) approach for predicting economic events: Theoretical background,” Int. J. Engineering Research and Applications, vol. 3, no. 5, pp. 605–610, 2013. https://api.semanticscholar.org/CorpusID:15532755
[22]A. Garg, B. Sharma, and R. Khan, “Heart disease prediction using machine learning techniques,” in IOP Conf. Series: Materials Science and Engineering, vol. 1022, no. 1, p. 012046, 2021. https://doi.org/10.1088/1757-899X/1022/1/012046
[23]A. Kooshari, M. Fartash, P. Mihannezhad, M. Chahardoli, J. Akbari Torkestani, and S. Nazari, “An optimization method in wireless sensor network routing and IoT with water strider algorithm and ant colony optimization algorithm,” Evolutionary Intelligence, 2023. https://doi.org/10.1007/s12065-023-00847-x
[24]L. Wu, X. Huang, J. Cui, C. Liu, and W. Xiao, “Modified adaptive ant colony optimization algorithm and its application for solving path planning of mobile robot,” Expert Systems with Applications, vol. 215, p. 119410, 2023. https://doi.org/10.1016/j.eswa.2022.119410
[25]G. Verma, “Load balancing in cloud environment using opposition-based spider monkey optimization,” Wireless Personal Communications, vol. 137, no. 2, pp. 977–996, 2024. https://link.springer.com/article/10.1007/s11277-024-11445-0
[26]R. Anudeep and S. J. J. Thangaraj, “Accurate prediction of myocardial infarction by comparing logistic regression algorithm with CatBoost classifier,” in E3S Web of Conferences, vol. 399, p. 04019, 2023. https://doi.org/10.1051/e3sconf/202339904019
[27]P. Anuradha and V. K. David, “Feature selection and prediction of heart diseases using gradient boosting algorithms,” in Proc. Int. Conf. Artificial Intelligence and Smart Systems (ICAIS), Mar. 2021, pp. 711–717. https://doi.org/10.1109/ICAIS50930.2021.9395819
[28]A. März, “CatBoostLSS—An extension of CatBoost to probabilistic forecasting,” arXiv preprint, arXiv:2001.02121, 2020. https://doi.org/10.48550/arXiv.2001.02121
[29]S. G. Taj and K. Kalaivani, “Hybrid prediction model with improved score level fusion for heart disease diagnosis,” Computational Biology and Chemistry, p. 108278, 2024. https://www.sciencedirect.com/science/article/abs/pii/S1476927124002664
[30]M. A. Bouqentar, O. Terrada, S. Hamida, S. Saleh, D. Lamrani, B. Cherradi, and A. Raihani, “Early heart disease prediction using feature engineering and machine learning algorithms,” Heliyon, vol. 10, no. 19, 2024. https://www.cell.com/heliyon/fulltext/S2405-8440(24)14762-7
[31]S. Arif, S. H. Son, H. Y. Kim, S.-C. Kim, and J. Y. Lee, “A diagnosis tool for early detection and classification of heart disease in individuals using transformer mechanisms,” Computer Methods and Programs in Biomedicine, p. 109248, 2026. https://www.sciencedirect.com/science/article/abs/pii/S0169260726000167
[32]O. Rainio, J. Tamminen, M. S. Venäläinen, J. Liedes, J. Knuuti, J. Kemppainen, and R. Klén, “Comparison of thresholds for a convolutional neural network classifying medical images,” International Journal of Data Science and Analytics, vol. 20, no. 3, pp. 2093–2099, 2025. https://link.springer.com/article/10.1007/s41060-024-00584-z