Pancreatic Cancer Prediction Using Machine Learning: An Investigation of Different Algorithms

PDF (798KB), PP.127-142

Views: 0 Downloads: 0

Author(s)

Radha Singh Jadaun 1 Nij Mehar Grover 1 K. Srinivas 1 A. Charan Kumari 1,*

1. Department of Electrical Engineering, Faculty of Engineering, Dayalbagh Educational Institute, Dayalbagh, Agra, India

* Corresponding author.

DOI: https://doi.org/10.5815/ijitcs.2025.06.07

Received: 14 Jul. 2025 / Revised: 6 Sep. 2025 / Accepted: 2 Oct. 2025 / Published: 8 Dec. 2025

Index Terms

Pancreatic Cancer, Machine Learning Algorithms, Biomarkers, OncologyBenign Hepatobiliary

Abstract

Pancreatic cancer, characterized by its high mortality rate and scarce treatment options, poses a formidable challenge in the field of oncology. Now, we live in a reality that requires immediate progress in diagnostic and prognostic methodologies to find pancreatic cancer early and understand its stage. This study deals with the pressing requirement for better diagnostic tools by evaluating and deciding the suitable machine learning (ML) algorithms for detecting pancreatic cancer at an early stage. This work uses a publicly available dataset with 590 urine samples which included control, benign hepatobiliary disease as well as Pancreatic Ductal Adenocarcinoma (PDAC) samples. The primary objectives of the research included developing a predictive model based on clinical data, examining various machine learning (ML) algorithms for their diagnostic precision, and improving the early detection rates for pancreatic cancer. The study assessed the efficacy of a broad array of ML algorithms in forecasting outcomes associated with pancreatic cancer. This analysis systematically explored Random Forest, Support Vector Machine, Decision Trees, K-Nearest Neighbours, XGBoost, ADABoost, CatBoost, and GradientBoost. The assessment focused on standard performance metrics such as accuracy, precision (also known as positive predicted value or PPV), recall (sometimes called sensitivity or true positive rate), F1-score, and support. Notably, CatBoost achieved the highest accuracy of 75%, outperforming other models such as Random Forest (74%) and XGBoost (74%), demonstrating its superior classification performance in distinguishing between pancreatic cancer, benign conditions, and non-cancerous cases. In addition to performance evaluation, this study integrates SHAP (Shapley Additive Explanations) analysis to enhance model interpretability, ensuring transparency in feature contributions. SHAP analysis revealed that Plasma CA19-9, LYVE1, and TFF1 were the most influential biomarkers across all classifications, reinforcing their diagnostic significance. This research emphasizes the critical importance of early detection, model interpretability, and clinical applicability, demonstrating that ML algorithms, particularly CatBoost, not only enhance diagnostic precision but also provide explainable predictions that support real-world medical decision-making.

Cite This Paper

Radha Singh Jadaun, Nij Mehar Grover, K. Srinivas, A. Charan Kumari, "Pancreatic Cancer Prediction Using Machine Learning: An Investigation of Different Algorithms", International Journal of Information Technology and Computer Science(IJITCS), Vol.17, No.6, pp.127-142, 2025. DOI:10.5815/ijitcs.2025.06.07

Reference

[1]M. A. Atkinson, M. Campbell-Thompson, I. Kusmartseva, and K. H. Kaestner, "Organisation of the human pancreas in health and in diabetes," Diabetologia, vol. 63, no. 10, pp. 1966-1973, sept 2020, doi: 10.1007/s00125-020-05203-7.
[2]Leung, P.S, “The Renin-Angiotensin System:Current Research Progress in The Pancreas”, Springer,vol 690, pp. 53, 2010. 
[3]Allison Rosenzweig,”Black Americans at Increased Risk for Pancreatic Cancer”, Pancreatic Cancer Action Network. https://pancan.org/news/black-americans-at-increased-risk-for-pancreatic-cancer/
[4]A. Moore and T. Donahue, "Pancreatic Cancer," JAMA, vol. 322, no. 14, pp. 1426-1426, oct 2019, doi: 10.1001/jama.2019.14699. 
[5]Wilson Bakasa and Serestina Viriri, “Pancreatic Cancer Survival Prediction: A Survey of the State-ofthe- Art”, Computational and Mathematical Methods in Medicine, pp. 2021:1188414, sept, 2021, doi: 10.1155/2021/11884.
[6]Wang, S., Zheng, Y., & Zhang, S., “Clinical application of artificial intelligence in pancreatic cancer” Cancer Letters, vol. 471, pp. 13-23, 2020. 
[7]Davide Placido1, Bo Yuan2 Jessica X. Hjaltelin, Amalie D. Haue1, Piotr J Chmura1, Chen Yuan, Jihye Kim, Renato Umeton, Gregory Antell, Alexander Chowdhury, Alexandra Franz, Lauren Brais, Elizabeth Andrews, Debora S. Marks, Aviv Regev, Peter Kraft, Brian M. Wolpin, Michael Rosenthal, Søren Brunak1, Chris Sander, “Pancreatic cancer risk predicted from disease trajectories using deep learning”,Nature Medicine,vol. 29, pp. 1113–1122, may 2023.[Online]. Available: https://www.nature.com/articles/s41591-023-02332-5 
[8]Davide Placido, Bo Yuan, Jessica X. Hjaltelin, Chunlei Zheng, Amalie D. HauePiotr J. Chmura, Chen Yuan, Jihye Kim, Renato Umeton, Gregory Antell, Alexander Chowdhury, Alexandra Franz, Lauren Brais, Elizabeth Andrews, Debora S. Marks, Aviv Regev, Siamack Ayandeh, Mary T. Brophy, Nhan V, Peter Kraft , Brian M. Wolpin, Michael H. Rosenthal, Nathanael R. Fillmore, Soren Brunak  & Chris, “A deep learning algorithm to predict risk of pancreatic cancer from disease trajectories.”, Nature Medicine, vol. 29, no. 5, pp. 1113-1122, May 2023, doi: 10.1038/s41591-023-02332-5.
[9]Bahrudeen shahul Hameed and Uma Maheswari Krishnan, “Artificial Intelligence-Driven Diagnosis of Pancreatic Cancer”, Cancers, vol.14, no. 21, pp. 5382, oct 2022, doi: 10.3390/cancers14215382.
[10]H.S.Saraswathi, Mohamed Rafi, ” Promising Urinary Biomarkers to predict Pancreatic Ductal Adenocarcinoma using Machine Learning Techniques”, European Chemical Bulletin, vol.12, no. 8, pp. 1858-1869,2023.[Online].Available: https://www.eurchembull.com/uploads/paper/341a314135b011716cd2cb97b5170140 
[11]Haidy Nasief, Cheng Zheng, Diane Schott, William Hall, Susan Tsai,Beth Erikson and X.Allen Li, “A machine learning based delta-radiomics process for early prediction of treatment response of pancreatic cancer”, npj Precision Oncology, no. 25, oct 2019. [Online].  Available: https://www.nature.com/articles/s41698-019-0096-z
[12]Seiya Yokoyama1, Taiji Hamada1, Michiyo Higashi1, Kei Matsuo1, Kosei Maemura2,3, Hiroshi Kurahara3, Michiko Horinouchi1, Tsubasa Hiraki1, “Predicted Prognosis of Patients with Pancreatic Cancer by Machine Learning”, Clinical Cancer Research, vol. 26, pp. 2411–2421, 2020, doi: 10.1158/1078-0432.CCR-19-1247. 
[13]Alexandra Corina Faur, Daniela Cornelia Lazar, and Laura Andreea Ghenciu, “Artificial intelligence as a noninvasive tool for pancreatic cancer prediction and diagnosis”, World Journal of Gastroenterology, vol.29, no. 12, pp. 1811-1823, march 2023, doi: 10.3748/wjg.v29.i12.1811.
[14]M.H. Osman, “Predicting survival of pancreatic cancer using supervised machine learning”, GASTROINTESTINAL TUMOURS, NON-COLORECTAL, vol. 29, pp. VIII255.[8], oct 2018, doi:10.1093/annonc/mdy282, 1905.
[15]Alvaro martin. (2023).,” Detect pancreatic cancer early using machine learning”, neural designer, Aug 2023. [Online]. Available: https://www.neuraldesigner.com/learning/examples/pancreatic-cancer/