IJEM Vol. 16, No. 3, 8 Jun. 2026
Cover page and Table of Contents: PDF (size: 1225KB)
PDF (1225KB), PP.412-430
Views: 0 Downloads: 0
Convolutional Neural Network, Long Short-Term memory, accuracy of detection, Soft Voting and Confusion Matrix
This paper presents a comparative study for disease detection that proposes to combine machine learning classifiers (kernel support vector machine, random forest, decision tree, and eXtreme gradient boosting) to form a stronger ensemble classifier and also deep learning classifiers (long short-term memory and convolutional neural networks) to make a decision on whether the deep learning classifiers individually work better or the ensemble classifier consisting of four machine learning classifiers following the feature extraction method bag of features. The main reason for the global crisis in agricultural production is the presence of various vegetable diseases. It damages food quality and reduces production. These diseases must be detected, which is a challenging task to perform manually. Using various algorithms, we can identify vegetable diseases. Recently, deep learning has demonstrated notable success in the field of precision agriculture for identifying vegetable diseases. In this paper, the detection of vegetable diseases is done using three techniques: Ensemble Machine Learning Classifier (Kernel support vector machine, Random Forest, Decision Tree, and eXtreme gradient Boosting), CNN, and LSTM. By using the Bag of Features feature extractor, we extract 500 feature words from each vegetable dataset. CNN, LSTM, these two deep learning algorithms, and the ensemble method of machine learning classifier are used to make classifications of healthy and disease-affected vegetable categories and generate a confusion matrix. Then, from the confusion matrix, the performance metrics (precision, recall, F1-score, and accuracy) are identified. By applying soft voting for each individual classifier of machine learning, we predict the average best accuracy for each of the datasets. At the end, compare the performance of the ensemble method with the two deep learning algorithms according to the accuracy value. For the Cauliflower dataset, the Ensemble Machine Learning Classifier gives the accuracy of 83%, the deep learning classification algorithm CNN presents the accuracy of 94.51%, and LSTM gives the accuracy of 92.95%. The potato dataset's ensemble method accuracy is 89%, Convolutional Neural Network's is 89.47%, and Long Short-Term Memory's is 84.35%.
Trisha Sarkar, Sadia Hossain, Imdadul Islam, "Disease Detection of Vegetables using Ensemble Machine Learning Classifier and Deep Learning with the Aid of Feature Words", International Journal of Engineering and Manufacturing (IJEM), Vol.16, No.3, pp.412-430, 2026. DOI:10.5815/ijem.2026.03.25
[1]A. Gargade and S. Khandekar, “A review: Custard apple leaf parameter analysis and leaf disease detection using digital image processing,” in 2019 3rd International Conference on Computing Methodologies and Communication (ICCMC), 2019, pp. 267–271.
[2]M. K. Tripathi and D. D. Maktedar, “Recent machine learning based approaches for disease detection and classification of agricultural products,” in 2016 International Conference on Computing Communication Control and Automation (ICCUBEA), 2016, pp. 1–6.
[3]M. Chun, H. Jeong, H. Lee, T. Yoo, and H. Jung, “Development of korean food image classification model using public food image dataset and deep learning methods,” IEEE Access, vol. 10, pp. 128 732–128 741, 2022.
[4]S. Al-Eidi, O. Darwish, Y. Chen, and G. Husari, “Snapcatch: Automatic detection of covert timing channels using image processing and machine learning,” IEEE Access, vol. 9, pp. 177–191, 2021.
[5]N. Gunathilaka, S. Lokuliyana, A. C. Udurawana, D. Dissanayaka, and J. A. Jayakody,“Efficient agricultural sensor network with disease detection,” in 2019 International Conference on Advancements in Computing (ICAC), 2019, pp. 446–451.
[6]Y. Safari, J. Nakatumba-Nabende, R. Nakasi, and R. Nakibuule, “A review on automated detection and assessment of fruit damage using machine learning,” IEEE Access, vol. 12, pp. 21 358–21 381, 2024.
[7]S. K. Behera, L. Jena, A. K. Rath, and P. K. Sethy, “Disease classification and grading of orange using machine learning and fuzzy logic,” in 2018 International Conference on Communication and Signal Processing (ICCSP), 2018, pp. 0678–0682.
[8]A. Bekkanti, V. S. R. K. P. Gunde, S. Itnal, G. Parasa, and C. M. A. K. Z. Basha, “Computer based classification of diseased fruit using k-means and support vector machine,”in 2020 Third International Conference on Smart Systems and Inventive Technology (IC-SSIT), 2020, pp. 1227–1232.
[9]S. R. Dubey and A. S. Jalal, “Detection and classification of apple fruit diseases using complete local binary patterns,” in 2012 Third International Conference on Computer and Communication Technology, 2012, pp. 346–351.
[10]K. Devi and Rathamani, “Image segmentation k-means clustering algorithm for fruit disease detection image processing,” in 2020 4th International Conference on Electronics,Communication and Aerospace Technology (ICECA), 2020, pp. 861–865.
[11]Q. Wang and F. Qi, “Tomato diseases recognition based on faster rcnn,” in 2019 10th International Conference on Information Technology in Medicine and Education (ITME), 2019, pp. 772–776.
[12]H. Akshay Koushik, R. B. Bharadwaj, R. P. E. Naik, G. Ramesh, M. J. Yogesh, and S. Habeeb, “Detection and classification of diseased mangoes,” in 2020 International Conference on Computer Science and Its Application in Agriculture (ICOSICA), 2020, pp. 1–8.
[13]K. H. Jin, M. T. McCann, E. Froustey, and M. Unser, “Deep convolutional neural network for inverse problems in imaging,” IEEE Transactions on Image Processing, vol. 26, no. 9, pp. 4509–4522, 2017.
[14]R. Ramya, P. Kumar, K. Sivanandam, and M. Babykala, “Detection and classification of fruit diseases using image processing cloud computing,” in 2020 International Conference on Computer Communication and Informatics (ICCCI), 2020, pp. 1–6.
[15]A. Magsi, J. A. Mahar, M. A. Razzaq, and S. H. Gill, “Date palm disease identification using features extraction and deep learning approach,” in 2020 IEEE 23rd International Multitopic Conference (INMIC), 2020, pp. 1–6.
[16]Puneet, Deepika, P. Singh, R. Bansal, and S. Sharma, “Coronary heart disease prediction using voting classifier ensemble learning,” in 2021 3rd International Conference on Advances in Computing, Communication Control and Networking (ICAC3N), 2021, pp. 181–185.
[17]A. Athar, S. Ali, M. M. Sheeraz, S. Bhattachariee, and H.-C. Kim, “Sentimental analysis of movie reviews using soft voting ensemble-based machine learning,” in 2021 Eighth International Conference on Social Network Analysis, Management and Security (SNAMS), 2021, pp. 01–05.
[18]A. Oad et al., "Plant Leaf Disease Detection Using Ensemble Learning and Explainable AI," in IEEE Access, vol. 12, pp. 156038-156049, 2024, doi: 10.1109/ACCESS.2024.3484574.
[19]J. Yang and F. Wang, "Auto-Ensemble: An Adaptive Learning Rate Scheduling Based Deep Learning Model Ensembling," in IEEE Access, vol. 8, pp. 217499-217509, 2020, doi: 10.1109/ACCESS.2020.3041525.
[20]A. Goyal and K. Lakhwani, "Integrating advanced deep learning techniques for enhanced detection and classification of citrus leaf and fruit diseases," Scientific Reports, vol. 15, Apr. 2025, doi: 10.1038/s41598-025-97159-0.
[21]S. Lee, A. S. Arora, and C. M. Yun, "Detecting strawberry diseases and pest infections in the very early stage with an ensemble deep-learning model," Frontiers in Plant Science, vol. 13, Oct. 2022, doi: 10.3389/fpls.2022.991134.
[22]MusabbirArrafi. 2022.VegNet-organized dataset of cauliflower disease. https://www.kaggle.com/datasets/musabbirarrafi/vegnet-organized-dataset-of-cauliflower-disease.
[23]Saswattulo. 2023.Potato disease detection using MobileNet. https://www.kaggle.com/code/saswattulo/potato-disease-detection-using-mobilenet