Gajanan K. Birajdar

Work place: Ramrao Adik Institute of Technology, Department of Computer Science and Engineering, Navi Mumbai, 400706, India

E-mail: gajanan.birajdar@rait.ac.in

Website: https://orcid.org/0000-0003-3531-3958

Research Interests:

Biography

Gajanan K. Birajdar obtained his M.Tech. (Electronics and Telecommunication Engineering) from Dr. Babasaheb Ambedkar Technological University, Maharashtra, India, in 2004 and Ph.D. in the area of blind image forensics from Nagpur University in 2018. He is working as a Professor in the Department of Computer Science and Engineering, Ramrao Adik Institute of Technology Nerul, Navi Mumbai, affiliated to D Y Patil deemed to be University. He is a member of various professional bodies like ISTE, IETE and IE (I). He has published several papers in the International/National Journals/Conferences and book chapters in the domain of Medical Signal/Image Processing, Image Processing and Machine Learning. He has 05 granted patents on his credit. He has published a book on Computational Intelligence in Image and Video Processing. His current research interests are Medical Health Informatics, Multimedia Forensics and Machine Learning in Biomedical Signal and Image Processing.

Author Articles
Classification and Shelf Life Prediction of Bananas Using Thermal Imaging with Vision Transformer and Random Forest

By Amey Kulkarni Sejal Pathrabe Hans Gupta Gajanan K. Birajdar Sangita Chaudhari

DOI: https://doi.org/10.5815/ijigsp.2026.02.12, Pub. Date: 8 Apr. 2026

Classifying and predicting banana shelf life is vital for optimizing storage and distribution in agriculture. Traditional methods, relying on subjective visual inspection, are inconsistent and time-intensive. This study presents a new, non-destructive approach combining thermal imaging, and machine learning to classify naturally ripened and artificially ripened bananas and forecast their shelf life. Preprocessed thermal images are flattened, segmented into fixed-size patches, and then linearly projected into feature tokens. Position embeddings are incorporated to retain spatial information, and the sequence is processed by a Vision Transformer (ViT) encoder, which leverages self-attention mechanisms to model relationships between patches. The [CLS] token output is subsequently processed through fully connected layers for final classification, achieving 97.59% accuracy. Validation using t-SNE visualization demonstrated clear class separability, and receiver operating characteristic (ROC) curves confirmed robust performance. With an MSE of 0.10, MAE of 0.18, and R2 score of 0.85, the random forest algorithm performed exceptionally well at predicting the shelf life of artificially ripened bananas. This approach offers significant advantages, including improved accuracy, reduced subjectivity, and efficiency in data processing. By integrating thermal imaging with advanced models, the proposed method enhances agricultural supply chain management and promotes precision in ripening classification and shelf life prediction.

[...] Read more.
Other Articles