International Journal of Image, Graphics and Signal Processing (IJIGSP)

ISSN: 2074-9074 (Print)

ISSN: 2074-9082 (Online)

DOI: https://doi.org/10.5815/ijigsp

Website: https://www.mecs-press.org/ijigsp

Published By: MECS Press

Frequency: 6 issues per year

Number(s) Available: 141

(IJIGSP) in Google Scholar Citations / h5-index

IJIGSP is committed to bridge the theory and practice of images, graphics, and signal processing. From innovative ideas to specific algorithms and full system implementations, IJIGSP publishes original, peer-reviewed, and high quality articles in the areas of images, graphics, and signal processing. IJIGSP is a well-indexed scholarly journal and is indispensable reading and references for people working at the cutting edge of images, graphics, and signal processing applications.

 

IJIGSP has been abstracted or indexed by several world class databases: Scopus, Google Scholar, Microsoft Academic Search, CrossRef, Baidu Wenku, IndexCopernicus, IET Inspec, EBSCO, JournalSeek, ULRICH's Periodicals Directory, WorldCat, Scirus, Academic Journals Database, Stanford University Libraries, Cornell University Library, UniSA Library, CNKI Scholar, ProQuest, J-Gate, ZDB, BASE, OhioLINK, iThenticate, Open Access Articles, Open Science Directory, National Science Library of Chinese Academy of Sciences, The HKU Scholars Hub, etc..

Latest Issue
Most Viewed
Most Downloaded

IJIGSP Vol. 18, No. 1, Feb. 2026

REGULAR PAPERS

Uncertainty-Aware Source-Free Domain Adaptation for Dental CBCT Image Segmentation

By Sviatoslav Dziubenko Tymur Dovzhenko Andriy Kyrylyuk Kamila Storchak

DOI: https://doi.org/10.5815/ijigsp.2026.01.01, Pub. Date: 8 Feb. 2026

The aim of this study is evaluating the efficacy of combining source-free domain adaptation techniques with quantitative uncertainty assessment, aimed at enhancing image segmentation in new domains. The research employs an uncertainty-aware source-free domain adaptation strategy, encompassing the generation of pseudo-labels, their filtration based on entropy and variance of predictions, alongside the involvement of an Exponential Moving Average (EMA) teacher and a tailored loss function. For validation purposes, segmentation models pre-trained on one image dataset were subsequently adapted to another dataset. A comprehensive comparative and ablation analysis, coupled with the visualization of the correlation between segmentation errors and the degree of uncertainty, was conducted. The ablation study corroborated that the complete configuration with the EMA teacher yielded the most favorable results. Data visualization elucidated a direct correlation between high uncertainty and an increased risk for segmentation errors. The findings of this study substantiate the viability of employing uncertainty assessment within the source-free domain adaptation process for clinical dentistry. The proposed methodology facilitates the adaptation of models to new conditions without necessitating retraining, thereby rendering the decision-making process more transparent. Future studies should consider assessing the efficacy of the proposed approach in additional dental visualization tasks, such as implant planning or orthodontic analysis.

[...] Read more.
Fuzzy-Enhanced U-Net with Dual Attention for Histopathological Image Analysis in High Grade Serous Ovarian Cancer

By Anandakumar K. Chandrasekar C

DOI: https://doi.org/10.5815/ijigsp.2026.01.02, Pub. Date: 8 Feb. 2026

High-quality image reconstruction plays an important part in histopathological image analysis, especially for HGSOC diagnosis, because of a great deal of fine cellular structures that should be clearly visible. In real scenarios, however, medical images usually face a series of problems due to acquisition limitations, which might obscure some significant diagnostic features. This work presents FUDA-NET, a new image denoising framework that enhances noisy histopathological images while maintaining the integrity of structure and texture. The architecture is based on an improved U-Net design integrated with a dual attention mechanism- Channel and Spatial attention, which enables the network to selectively emphasize meaningful features and suppress background noise. Additionally, a fuzzy logic layer is incorporated at the bottleneck to handle uncertainty and enhance contextual reasoning during feature extraction. This proposed FUDA-NET framework combines Mean Squared Error (MSE) and Structural Similarity Index Measures (SSIM) based loss function to ensure both pixel wise accuracy and perception similarity. Experiment conducted on 12,019 training images and 1188 testing images of High Grade Serous Ovarian Cancer, histopathological data set shows that FUDA-NET achieves superior denoising performance outperforming traditional and recent deep learning methods such as DnCNN, U-Net, U-Net with Attention and Noise2Noise in terms of PSNR, SSIM, MSE, MAE and FSIM. This approach contributes to improve visual clarity and diagnostic reliability in medical imaging.

[...] Read more.
An Effective Semi-Supervised Feature Extraction Model with Reduced Architectural Complexity for Image Forgery Classification

By Jisha K. R. Sabna N.

DOI: https://doi.org/10.5815/ijigsp.2026.01.03, Pub. Date: 8 Feb. 2026

A generalized deep learning approach tracking image forgeries of any category with reduced architectural complexity, without compromising the performance is presented in this paper. A convolutional encoder-decoder architecture-based image reconstruction model is framed to extract all the pertinent information from the images. Performance comparison of similar networks constructed with varying architectural complexity led to the selection of this design. The best reconstruction feature extractor showed faster convergence and improved accuracy, as observed from the training and validation performance curves. Dimensionally compressed information from the reconstruction model is utilized by dense layers and further classified. Experimenting with forgery datasets inclusive of different forgery types ensured the generalizability of the model. In comparison with the reconstruction models adopting transfer learning in the encoder side utilizing MobileNet, ResNet 50, and VGG 19, the proposed model exhibited competitive and consistently improved mean Precision and F1-score performance across multiple datasets, as validated through multi-seed experimentation. Additionally, with the reduced architecture, the proposed model performed on par than the state-of-the-art approaches against which it was compared.

[...] Read more.
Multi-Scale and Auxiliary-Supervised Architectures for Accurate Road Network Mapping

By Mohamed El Mehdi Imam Lila Meddeber Tarik Zouagui

DOI: https://doi.org/10.5815/ijigsp.2026.01.04, Pub. Date: 8 Feb. 2026

Automated road network extraction from satellite imagery represents a critical advancement for Geographic Information Systems (GIS) applications in infrastructure management and urban planning. This paper introduces two novel deep learning architectures based on LinkNet: RoadNet-MS (Multi-Scale) and RoadNet-AUX (Multi-Scale with Auxiliary Supervision), specifically designed to enhance road segmentation performance. RoadNet-MS incorporates Multi-Scale Contextual Blocks (CMS-Blocks) and hybrid blocks to effectively capture diverse contextual features at multiple scales, achieving F1-scores of 78.87% on the challenging DeepGlobe dataset and 82.30% on the Boston & Los Angeles dataset. RoadNet-AUX extends this architecture through auxiliary supervision, further improving performance with F1-scores of 79.14% on DeepGlobe and 82.33% on Boston-LA. Both proposed architectures demonstrate competitive performance and consistent improvements over existing methods, including the state-of-the-art NL-LinkNet, across both evaluation datasets. Notably, RoadNet-MS achieves the highest precision (83.55%) among all compared methods on DeepGlobe. These contributions provide a pathway toward more accurate and scalable road network mapping, essential for modern urban planning and infrastructure monitoring applications.

[...] Read more.
Segment Wise EEG Signal Compression Using LSTM Auto Encoder for Enhanced Efficiency

By Uma. M. Mohammed Javidh S. Ruchi Shah Prabhu Sethuramalingam M. M. Reddy

DOI: https://doi.org/10.5815/ijigsp.2026.01.05, Pub. Date: 8 Feb. 2026

Efficient compression of electroencephalogram (EEG) signals is crucial for enabling real-time monitoring, storage, and transmission in various medical and non-medical applications. This paper presents a segment-wise processing approach using temporal modeling-based auto encoders for EEG signal compression. By leveraging models such as Long Short-Term Memory (LSTM), Gated Recurrent Unit (GRU), Recurrent Neural Network (RNN), and Self-Attention, the proposed method effectively captures temporal dependencies in the EEG data. Segment-wise processing not only enhances compression efficiency but also significantly reduces the processing time of these sequence models. Extensive experiments demonstrate that GRU-based auto encoders offer the best performance, particularly at lower Data Reduction Factors (DRFs), achieving a minimal signal loss of 0.2% at a 50% compression ratio, making it suitable for medical applications. For non-medical scenarios, a higher compression ratio of 75% with a signal loss of 5.4% is found to be acceptable. The results indicate that the proposed approach achieves a favorable balance between compression efficiency, signal fidelity, and computational performance.

[...] Read more.
Automated PCOS Detection Using Fine-Grained Deep Feature Extraction and Explainable AI: A Transformer-Based Ensemble Approach

By Ifra Bilal Shah Pramod Kumar Yadav

DOI: https://doi.org/10.5815/ijigsp.2026.01.06, Pub. Date: 8 Feb. 2026

Polycystic Ovary Syndrome (PCOS) is a prevalent endocrine condition affecting women of reproductive age, hallmarked by hormonal abnormalities, ovarian cysts, and metabolic issues. Early diagnosis is essential to prevent long-term effects such as infertility, diabetes, and cardiovascular issues. Conventional diagnostic approaches relying on manual interpretation of ultrasound images are time-consuming and error-prone. To overcome these limitations, we propose an automated diagnostic framework leveraging deep feature extraction and ensemble learning. Initially, ResNet50 is utilized as a convolutional feature extractor, and its extracted features are classified using ensemble of Random Forest (RF) and Gradient Boosting (GB) classifiers. Subsequently, we also employed the Swin Transformer which is a hierarchical vision transformer to extract deep features from ultrasound images, which were fed to Random Forest and Gradient Boosting classifiers. These features were handled separately from those of ResNet50, and no feature concatenation was done. Compared to the ResNet50-based ensemble model, which achieved a classification accuracy of 99.2%, the Swin Transformer–based ensemble model performed better by attaining the accuracy of 99.87%. Furthermore, Explainable AI approaches (Grad-CAM) were applied to both ResNet50-based model and Swin Transformer-based model to highlight key regions contributing to the predictions. This scalable and interpretable system offers encouraging potential for advancing PCOS detection and other medical imaging applications.

[...] Read more.
Weighted Late Fusion based Deep Attention Neural Network for Detecting Multi-Modal Emotion

By Srinivas P. V. V. S. Shaik Nazeera Khamar Nohith Borusu Mohan Guru Raghavendra Kota Harika Vuyyuru Sampath Patchigolla

DOI: https://doi.org/10.5815/ijigsp.2026.01.07, Pub. Date: 8 Feb. 2026

In the field of affective computing research, multi-modal emotion detection has gained popularity as a way to boost recognition robustness and get around the constraints of processing a multiple type of data. Human emotions are utilized for defining a variety of methodologies, including physiological indicators, facial expressions, as well as neuroimaging tactics. Here, a novel deep attention mechanism is used for detecting multi-modal emotions. Initially, the data are collected from audio and video features. For dimensionality reduction, the audio features are extracted using Constant-Q chromagram and Mel-Frequency Cepstral Coefficients (MM-FC2). After extraction, the audio generation is carried out by a Convolutional Dense Capsule Network (Conv_DCN) is used. Next is video data; the key frame extraction is carried out using Enhanced spatial-temporal and Second-Order Gaussian kernels. Here, Second-Order Gaussian kernels are a powerful tool for extracting features from video data and converting it into a format suitable for image-based analysis. Next, for video generation, DenseNet-169 is used. At last, all the extracted features are fused, and  
emotions are detected using a Weighted Late Fusion Deep Attention Neural Network (WLF_DAttNN). Python tool is used for implementation, and the performance measure achieved an accuracy of 97% for RAVDESS and 96% for CREMA-D dataset.

[...] Read more.
A Novel Hybrid Approach Using MRMR-based Feature Selection and Bayesian Optimized Random Forest Classification for Accurate Fabric Defect Detection

By Ritu Juneja Anil Dudy

DOI: https://doi.org/10.5815/ijigsp.2026.01.08, Pub. Date: 8 Feb. 2026

The textile industry holds a central position in India's economy, contributing substantially to both employment and GDP. Despite technological advancements, maintaining stringent quality standards remains a persistent challenge due to defects such as cracks, stains, and inconsistencies in fabrics. Traditional manual inspection methods, while effective to a degree, are labor-intensive, time-consuming, and prone to human error. This paper proposes an innovative approach to address these challenges through the application of machine learning and computer vision techniques in fabric defect detection. Specifically, the research focuses on integrating advanced texture feature extraction methods—Gray-Level Co-occurrence Matrix (GLCM), Local Binary Patterns (LBP), and Histogram of Oriented Gradients (HOG)—with a robust classification framework using Bayesian optimized Random Forest. The methodology emphasizes efficient feature selection via Minimum Redundancy Maximum Relevance (MRMR), enhancing the system's accuracy and efficiency. By leveraging a comprehensive dataset from Kaggle encompassing diverse fabric defects, the proposed system aims to significantly improve defect detection accuracy, reduce manual intervention, and ensure consistent product quality across textile manufacturing processes. The highest accuracy achieved in the evaluation is 99.52%.

[...] Read more.
Diabetic Kidney Disease Prediction Using Hybrid Deep Learning Model

By Konne Madhavi Harwant Singh Arri

DOI: https://doi.org/10.5815/ijigsp.2026.01.09, Pub. Date: 8 Feb. 2026

Diabetic Kidney Disease (DKD) was recently identified as a significant microvascular consequence of diabetes. Many researchers are working on the classification of DKD from non-diabetic kidney disease (NDKD), but the required accuracy has not been achieved yet. This study aims to enhance diagnostic accuracy using a hybrid Deep Learning (DL) method, Convolutional Neural Network, and Long Short-Term Memory (CNN-LSTM). Clinical data on DKD were collected and preprocessed to address issues like missing values, duplicates, and outliers. Key preprocessing steps included imputation, z-score, min-max normalization, and feature encoding. Feature selection based on a correlation matrix identified the most relevant variables. Subsequently, both CNN-LSTM and Convolutional Neural Network (CNN) models were trained using processed data, with identical hyperparameters, as detailed in the methodology. Evaluation metrics such as Accuracy, Sensitivity, Specificity, Precision, F1-score, and ROC plots were employed to assess model performance. The CNN-LSTM model achieved a high Accuracy of 98%, surpassing the CNN model’s Accuracy of 96.5%. In addition to accuracy, all metrics showed that the CNN-LSTM outperformed the CNN.

[...] Read more.
Multimodal Emotion Recognition Using EEG and Facial Expressions with Potential Applications in Driver Monitoring

By Ch. Raga Madhuri Anideep Seelam Fatima Farheen Shaik Aadi Siva Kartheek Pamarthi Mohan Kireeti Krovi

DOI: https://doi.org/10.5815/ijigsp.2026.01.10, Pub. Date: 8 Feb. 2026

Mental conditions such as fatigue, distraction, and cognitive overload are known to contribute significantly to traffic accidents. Accurate recognition of these cognitive and emotional states is therefore important for the development of intelligent monitoring systems. In this study, a multimodal emotion recognition framework using electroencephalography (EEG) signals and facial expression features is proposed, with potential applications in driver monitoring. The approach integrates Long Short-Term Memory (LSTM) networks and Transformer architectures for EEG-based temporal feature extraction, along with Vision Transformers (ViT) for facial feature representation. Feature-level fusion is employed to combine physiological and visual modalities, enabling improved emotion classification performance compared to unimodal approaches. The model is evaluated using accuracy, precision, recall, and F1-score metrics, achieving an overall accuracy of 96.38%, demonstrating the effectiveness of multimodal learning. Although the experiments are conducted on general-purpose emotion datasets, the results indicate that the proposed framework can serve as a reliable foundation for driver monitoring applications, such as fatigue, distraction, and cognitive state assessment, in intelligent transportation systems.

[...] Read more.
Edibility Detection of Mushroom Using Ensemble Methods

By Nusrat Jahan Pinky S.M. Mohidul Islam Rafia Sharmin Alice

DOI: https://doi.org/10.5815/ijigsp.2019.04.05, Pub. Date: 8 Apr. 2019

Mushrooms are the most familiar delicious food which is cholesterol free as well as rich in vitamins and minerals. Though nearly 45,000 species of mushrooms have been known throughout the world, most of them are poisonous and few are lethally poisonous. Identifying edible or poisonous mushroom through the naked eye is quite difficult. Even there is no easy rule for edibility identification using machine learning methods that work for all types of data. Our aim is to find a robust method for identifying mushrooms edibility with better performance than existing works. In this paper, three ensemble methods are used to detect the edibility of mushrooms: Bagging, Boosting, and random forest. By using the most significant features, five feature sets are made for making five base models of each ensemble method. The accuracy is measured for ensemble methods using five both fixed feature set-based models and randomly selected feature set based models, for two types of test sets. The result shows that better performance is obtained for methods made of fixed feature sets-based models than randomly selected feature set-based models. The highest accuracy is obtained for the proposed model-based random forest for both test sets.

[...] Read more.
Evolutionary Image Enhancement Using Multi-Objective Genetic Algorithm

By Dhirendra Pal Singh Ashish Khare

DOI: https://doi.org/10.5815/ijigsp.2014.01.09, Pub. Date: 8 Nov. 2013

Image Processing is the art of examining, identifying and judging the significances of the Images. Image enhancement refers to attenuation, or sharpening, of image features such as edgels, boundaries, or contrast to make the processed image more useful for analysis. Image enhancement procedures utilize the computers to provide good and improved images for study by the human interpreters. In this paper we proposed a novel method that uses the Genetic Algorithm with Multi-objective criteria to find more enhance version of images. The proposed method has been verified with benchmark images in Image Enhancement. The simple Genetic Algorithm may not explore much enough to find out more enhanced image. In the proposed method three objectives are taken in to consideration. They are intensity, entropy and number of edgels. Proposed algorithm achieved automatic image enhancement criteria by incorporating the objectives (intensity, entropy, edges). We review some of the existing Image Enhancement technique. We also compared the results of our algorithms with another Genetic Algorithm based techniques. We expect that further improvements can be achieved by incorporating linear relationship between some other techniques.

[...] Read more.
Mobile-Based Skin Disease Diagnosis System Using Convolutional Neural Networks (CNN)

By M.W.P Maduranga Dilshan Nandasena

DOI: https://doi.org/10.5815/ijigsp.2022.03.05, Pub. Date: 8 Jun. 2022

This paper presents a design and development of an Artificial Intelligence (AI) based mobile application to detect the type of skin disease. Skin diseases are a serious hazard to everyone throughout the world. However, it is difficult to make accurate skin diseases diagnosis. In this work, Deep learning algorithms Convolution Neural Networks (CNN) is proposed to classify skin diseases on the HAM10000 dataset. An extensive review of research articles on object identification methods and a comparison of their relative qualities were given to find a method that would work well for detecting skin diseases. The CNN-based technique was recognized as the best method for identifying skin diseases. A mobile application, on the other hand, is built for quick and accurate action. By looking at an image of the afflicted area at the beginning of a skin illness, it assists patients and dermatologists in determining the kind of disease present. Its resilience in detecting the impacted region considerably faster with nearly 2x fewer computations than the standard MobileNet model results in low computing efforts. This study revealed that MobileNet with transfer learning yielding an accuracy of about 85% is the most suitable model for automatic skin disease identification. According to these findings, the suggested approach can assist general practitioners in quickly and accurately diagnosing skin diseases using the smart phone.

[...] Read more.
A Review of Self-supervised Learning Methods in the Field of Medical Image Analysis

By Jiashu Xu

DOI: https://doi.org/10.5815/ijigsp.2021.04.03, Pub. Date: 8 Aug. 2021

In the field of medical image analysis, supervised deep learning strategies have achieved significant development, while these methods rely on large labeled datasets. Self-Supervised learning (SSL) provides a new strategy to pre-train a neural network with unlabeled data. This is a new unsupervised learning paradigm that has achieved significant breakthroughs in recent years. So, more and more researchers are trying to utilize SSL methods for medical image analysis, to meet the challenge of assembling large medical datasets. To our knowledge, so far there still a shortage of reviews of self-supervised learning methods in the field of medical image analysis, our work of this article aims to fill this gap and comprehensively review the application of self-supervised learning in the medical field. This article provides the latest and most detailed overview of self-supervised learning in the medical field and promotes the development of unsupervised learning in the field of medical imaging. These methods are divided into three categories: context-based, generation-based, and contrast-based, and then show the pros and cons of each category and evaluates their performance in downstream tasks. Finally, we conclude with the limitations of the current methods and discussed the future direction.

[...] Read more.
Text Region Extraction: A Morphological Based Image Analysis Using Genetic Algorithm

By Dhirendra Pal Singh Ashish Khare

DOI: https://doi.org/10.5815/ijigsp.2015.02.06, Pub. Date: 8 Jan. 2015

Image analysis belongs to the area of computer vision and pattern recognition. These areas are also a part of digital image processing, where researchers have a great attention in the area of content retrieval information from various types of images having complex background, low contrast background or multi-spectral background etc. These contents may be found in any form like texture data, shape, and objects. Text Region Extraction as a content from an mage is a class of problems in Digital Image Processing Applications that aims to provides necessary information which are widely used in many fields medical imaging, pattern recognition, Robotics, Artificial intelligent Transport systems etc. To extract the text data information has becomes a challenging task. Since, Text extraction are very useful for identifying and analysis the whole information about image, Therefore, In this paper, we propose a unified framework by combining morphological operations and Genetic Algorithms for extracting and analyzing the text data region which may be embedded in an image by means of variety of texts: font, size, skew angle, distortion by slant and tilt, shape of the object which texts are on, etc. We have established our proposed methods on gray level image sets and make qualitative and quantitative comparisons with other existing methods and concluded that proposed method is better than others.

[...] Read more.
Image Denoising based on Enhanced Wavelet Global Thresholding Using Intelligent Signal Processing Algorithm

By Joseph Isabona Agbotiname Lucky Imoize Stephen Ojo

DOI: https://doi.org/10.5815/ijigsp.2023.05.01, Pub. Date: 8 Oct. 2023

Denoising is a vital aspect of image preprocessing, often explored to eliminate noise in an image to restore its proper characteristic formation and clarity. Unfortunately, noise often degrades the quality of valuable images, making them meaningless for practical applications. Several methods have been deployed to address this problem, but the quality of the recovered images still requires enhancement for efficient applications in practice. In this paper, a wavelet-based universal thresholding technique that possesses the capacity to optimally denoise highly degraded noisy images with both uniform and non-uniform variations in illumination and contrast is proposed. The proposed method, herein referred to as the modified wavelet-based universal thresholding (MWUT), compared to three state-of-the-art denoising techniques, was employed to denoise five noisy images. In order to appraise the qualities of the images obtained, seven performance indicators comprising the Root Mean Square Error (RMSE), Mean Absolute Error (MAE), Structural Content (SC), Peak Signal to Noise Ratio (PSNR), Structural Similarity Index Method (SSIM), Signal-to-Reconstruction-Error Ratio (SRER), Blind Spatial Quality Evaluator (NIQE), and Blind/Referenceless Image Spatial Quality Evaluator (BRISQUE) were employed. The first five indicators – RMSE, MAE, SC, PSNR, SSIM, and SRER- are reference indicators, while the remaining two – NIQE and BRISQUE- are referenceless. For the superior performance of the proposed wavelet threshold algorithm, the SC, PSNR, SSIM, and SRER must be higher, while lower values of NIQE, BRISQUE, RMSE, and MAE are preferred. A higher and better value of PSNR, SSIM, and SRER in the final results shows the superior performance of our proposed MWUT denoising technique over the preliminaries. Lower NIQE, BRISQUE, RMSE, and MAE values also indicate higher and better image quality results using the proposed modified wavelet-based universal thresholding technique over the existing schemes. The modified wavelet-based universal thresholding technique would find practical applications in digital image processing and enhancement.

[...] Read more.
An Efficient Brain Tumor Detection Algorithm Using Watershed & Thresholding Based Segmentation

By Anam Mustaqeem Engr Ali Javed Tehseen Fatima

DOI: https://doi.org/10.5815/ijigsp.2012.10.05, Pub. Date: 28 Sep. 2012

During past few years, brain tumor segmentation in magnetic resonance imaging (MRI) has become an emergent research area in the ?eld of medical imaging system. Brain tumor detection helps in finding the exact size and location of tumor. An efficient algorithm is proposed in this paper for tumor detection based on segmentation and morphological operators. Firstly quality of scanned image is enhanced and then morphological operators are applied to detect the tumor in the scanned image.

[...] Read more.
Restoration of Degraded Gray Images Using Genetic Algorithm

By Dhirendra Pal Singh Ashish Khare

DOI: https://doi.org/10.5815/ijigsp.2016.03.04, Pub. Date: 8 Mar. 2016

This Image deblurring aims to eliminate or decrease the degradations that has been occurred while the image has been obtained. In this paper, we proposed a unified framework for restoration process by enhancement and more quantified deblurred images with the help of Genetic Algorithm. The developed method uses an iterative procedure using evolutionary criteria and produce better images with most restored frequency-content. We have compared the proposed methods with Lucy-Richardson Restoration method, method proposed by W. Dong [34] and Inverse Filter Restoration Method; and demonstrated that the proposed method is more accurate by achieving high quality visualized restored images in terms of various statistical quality measures.

[...] Read more.
Improving Retinal Image Quality Using the Contrast Stretching, Histogram Equalization, and CLAHE Methods with Median Filters

By Erwin Dwi Ratna Ningsih

DOI: https://doi.org/10.5815/ijigsp.2020.02.04, Pub. Date: 8 Apr. 2020

This paper performs three different contrast testing methods, namely contrast stretching, histogram equalization, and CLAHE using a median filter. Poor quality images will be corrected and performed with a median filter removal filter. STARE dataset images that use images with different contrast values for each image. For this reason, evaluating the results of the three parameters tested are; MSE, PSNR, and SSIM. With the gray level scale image and contrast stretching which stretches the pixel value by stretching the stretchlim technique with the MSE result are 9.15, PSNR is 42.14 dB, and SSIM is 0.88. And the HE method and median filter with the results of the average value of MSE is 18.67, PSNR is 41.33 dB, and SSIM is 0.77. Whereas for CLAHE and median filters the average yield of MSE is 28.42, PSNR is 35.30 dB, and SSIM is 0.86. From the test results, it can be seen that the proposed method has MSE and PSNR values as well as SSIM values. 

[...] Read more.
Non-Invasive Blood Group Prediction Using Optimized EfficientNet Architecture: A Systematic Approach

By Nitin Sakharam Ujgare Nagendra Pratap Singh Prem Kumari Verma Madhusudan Patil Aryan Verma

DOI: https://doi.org/10.5815/ijigsp.2024.01.06, Pub. Date: 8 Feb. 2024

This research work proposed a non-invasive blood group prediction approach using deep learning. The ability to swiftly and accurately determine blood types plays a critical role in medical emergencies prior to administering red blood cell, platelet, and plasma transfusions. Even a minor error during blood transfer can have severe consequences, including fatality. Traditional methods rely on time-consuming automated blood analyzers for pathological assessment. However, these processes involve skin pricking, which can cause bleeding, fainting, and potential skin lacerations. The proposed approach circumvents noninvasive procedures by leveraging rich EfficientNet deep learning architecture to analyze images of superficial blood vessels found on the finger. By illuminating the finger with laser light, the optical image of blood vessels hidden on the finger skin surface area is captured, which incorporates specific antigen shapes such as antigen ‘A’ and antigen ‘B’ present on the surface. Captured shapes of different antigen further used to predict the blood group of humans. The system requires high-definition camera to capture the antigen pattern from the red blood cells surface for classification of blood type without piercing the skin of patient. The proposed solution is not only straightforward and easily implementable but also offers significant advantages in terms of cost-effectiveness and immediate identification of ABO blood groups. This approach holds great promise for medical emergencies, military battleground scenarios, and is particularly valuable when dealing with infants where invasive procedures pose additional risks.

[...] Read more.
Edibility Detection of Mushroom Using Ensemble Methods

By Nusrat Jahan Pinky S.M. Mohidul Islam Rafia Sharmin Alice

DOI: https://doi.org/10.5815/ijigsp.2019.04.05, Pub. Date: 8 Apr. 2019

Mushrooms are the most familiar delicious food which is cholesterol free as well as rich in vitamins and minerals. Though nearly 45,000 species of mushrooms have been known throughout the world, most of them are poisonous and few are lethally poisonous. Identifying edible or poisonous mushroom through the naked eye is quite difficult. Even there is no easy rule for edibility identification using machine learning methods that work for all types of data. Our aim is to find a robust method for identifying mushrooms edibility with better performance than existing works. In this paper, three ensemble methods are used to detect the edibility of mushrooms: Bagging, Boosting, and random forest. By using the most significant features, five feature sets are made for making five base models of each ensemble method. The accuracy is measured for ensemble methods using five both fixed feature set-based models and randomly selected feature set based models, for two types of test sets. The result shows that better performance is obtained for methods made of fixed feature sets-based models than randomly selected feature set-based models. The highest accuracy is obtained for the proposed model-based random forest for both test sets.

[...] Read more.
Evolutionary Image Enhancement Using Multi-Objective Genetic Algorithm

By Dhirendra Pal Singh Ashish Khare

DOI: https://doi.org/10.5815/ijigsp.2014.01.09, Pub. Date: 8 Nov. 2013

Image Processing is the art of examining, identifying and judging the significances of the Images. Image enhancement refers to attenuation, or sharpening, of image features such as edgels, boundaries, or contrast to make the processed image more useful for analysis. Image enhancement procedures utilize the computers to provide good and improved images for study by the human interpreters. In this paper we proposed a novel method that uses the Genetic Algorithm with Multi-objective criteria to find more enhance version of images. The proposed method has been verified with benchmark images in Image Enhancement. The simple Genetic Algorithm may not explore much enough to find out more enhanced image. In the proposed method three objectives are taken in to consideration. They are intensity, entropy and number of edgels. Proposed algorithm achieved automatic image enhancement criteria by incorporating the objectives (intensity, entropy, edges). We review some of the existing Image Enhancement technique. We also compared the results of our algorithms with another Genetic Algorithm based techniques. We expect that further improvements can be achieved by incorporating linear relationship between some other techniques.

[...] Read more.
Image Denoising based on Enhanced Wavelet Global Thresholding Using Intelligent Signal Processing Algorithm

By Joseph Isabona Agbotiname Lucky Imoize Stephen Ojo

DOI: https://doi.org/10.5815/ijigsp.2023.05.01, Pub. Date: 8 Oct. 2023

Denoising is a vital aspect of image preprocessing, often explored to eliminate noise in an image to restore its proper characteristic formation and clarity. Unfortunately, noise often degrades the quality of valuable images, making them meaningless for practical applications. Several methods have been deployed to address this problem, but the quality of the recovered images still requires enhancement for efficient applications in practice. In this paper, a wavelet-based universal thresholding technique that possesses the capacity to optimally denoise highly degraded noisy images with both uniform and non-uniform variations in illumination and contrast is proposed. The proposed method, herein referred to as the modified wavelet-based universal thresholding (MWUT), compared to three state-of-the-art denoising techniques, was employed to denoise five noisy images. In order to appraise the qualities of the images obtained, seven performance indicators comprising the Root Mean Square Error (RMSE), Mean Absolute Error (MAE), Structural Content (SC), Peak Signal to Noise Ratio (PSNR), Structural Similarity Index Method (SSIM), Signal-to-Reconstruction-Error Ratio (SRER), Blind Spatial Quality Evaluator (NIQE), and Blind/Referenceless Image Spatial Quality Evaluator (BRISQUE) were employed. The first five indicators – RMSE, MAE, SC, PSNR, SSIM, and SRER- are reference indicators, while the remaining two – NIQE and BRISQUE- are referenceless. For the superior performance of the proposed wavelet threshold algorithm, the SC, PSNR, SSIM, and SRER must be higher, while lower values of NIQE, BRISQUE, RMSE, and MAE are preferred. A higher and better value of PSNR, SSIM, and SRER in the final results shows the superior performance of our proposed MWUT denoising technique over the preliminaries. Lower NIQE, BRISQUE, RMSE, and MAE values also indicate higher and better image quality results using the proposed modified wavelet-based universal thresholding technique over the existing schemes. The modified wavelet-based universal thresholding technique would find practical applications in digital image processing and enhancement.

[...] Read more.
A Review of Self-supervised Learning Methods in the Field of Medical Image Analysis

By Jiashu Xu

DOI: https://doi.org/10.5815/ijigsp.2021.04.03, Pub. Date: 8 Aug. 2021

In the field of medical image analysis, supervised deep learning strategies have achieved significant development, while these methods rely on large labeled datasets. Self-Supervised learning (SSL) provides a new strategy to pre-train a neural network with unlabeled data. This is a new unsupervised learning paradigm that has achieved significant breakthroughs in recent years. So, more and more researchers are trying to utilize SSL methods for medical image analysis, to meet the challenge of assembling large medical datasets. To our knowledge, so far there still a shortage of reviews of self-supervised learning methods in the field of medical image analysis, our work of this article aims to fill this gap and comprehensively review the application of self-supervised learning in the medical field. This article provides the latest and most detailed overview of self-supervised learning in the medical field and promotes the development of unsupervised learning in the field of medical imaging. These methods are divided into three categories: context-based, generation-based, and contrast-based, and then show the pros and cons of each category and evaluates their performance in downstream tasks. Finally, we conclude with the limitations of the current methods and discussed the future direction.

[...] Read more.
Breast Cancer Classification from Ultrasound Images using VGG16 Model based Transfer Learning

By A. B. M. Aowlad Hossain Jannatul Kamrun Nisha Fatematuj Johora

DOI: https://doi.org/10.5815/ijigsp.2023.01.02, Pub. Date: 8 Feb. 2023

Ultrasound based breast screening is gaining attention recently especially for dense breast. The technological advancement, cancer awareness, and cost-safety-availability benefits lead rapid rise of breast ultrasound market. The irregular shape, intensity variation, and additional blood vessels of malignant cancer are distinguishable in ultrasound images from the benign phase. However, classification of breast cancer using ultrasound images is a difficult process owing to speckle noise and complex textures of breast. In this paper, a breast cancer classification method is presented using VGG16 model based transfer learning approach. We have used median filter to despeckle the images. The layers for convolution process of the pretrained VGG16 model along with the maxpooling layers have been used as feature extractor and a proposed fully connected two layers deep neural network has been designed as classifier. Adam optimizer is used with learning rate of 0.001 and binary cross-entropy is chosen as the loss function for model optimization. Dropout of hidden layers is used to avoid overfitting. Breast Ultrasound images from two databases (total 897 images) have been combined to train, validate and test the performance and generalization strength of the classifier. Experimental results showed the training accuracy as 98.2% and testing accuracy as 91% for blind testing data with a reduced of computational complexity. Gradient class activation mapping (Grad-CAM) technique has been used to visualize and check the targeted regions localization effort at the final convolutional layer and found as noteworthy. The outcomes of this work might be useful for the clinical applications of breast cancer diagnosis.

[...] Read more.
Mobile-Based Skin Disease Diagnosis System Using Convolutional Neural Networks (CNN)

By M.W.P Maduranga Dilshan Nandasena

DOI: https://doi.org/10.5815/ijigsp.2022.03.05, Pub. Date: 8 Jun. 2022

This paper presents a design and development of an Artificial Intelligence (AI) based mobile application to detect the type of skin disease. Skin diseases are a serious hazard to everyone throughout the world. However, it is difficult to make accurate skin diseases diagnosis. In this work, Deep learning algorithms Convolution Neural Networks (CNN) is proposed to classify skin diseases on the HAM10000 dataset. An extensive review of research articles on object identification methods and a comparison of their relative qualities were given to find a method that would work well for detecting skin diseases. The CNN-based technique was recognized as the best method for identifying skin diseases. A mobile application, on the other hand, is built for quick and accurate action. By looking at an image of the afflicted area at the beginning of a skin illness, it assists patients and dermatologists in determining the kind of disease present. Its resilience in detecting the impacted region considerably faster with nearly 2x fewer computations than the standard MobileNet model results in low computing efforts. This study revealed that MobileNet with transfer learning yielding an accuracy of about 85% is the most suitable model for automatic skin disease identification. According to these findings, the suggested approach can assist general practitioners in quickly and accurately diagnosing skin diseases using the smart phone.

[...] Read more.
Text Region Extraction: A Morphological Based Image Analysis Using Genetic Algorithm

By Dhirendra Pal Singh Ashish Khare

DOI: https://doi.org/10.5815/ijigsp.2015.02.06, Pub. Date: 8 Jan. 2015

Image analysis belongs to the area of computer vision and pattern recognition. These areas are also a part of digital image processing, where researchers have a great attention in the area of content retrieval information from various types of images having complex background, low contrast background or multi-spectral background etc. These contents may be found in any form like texture data, shape, and objects. Text Region Extraction as a content from an mage is a class of problems in Digital Image Processing Applications that aims to provides necessary information which are widely used in many fields medical imaging, pattern recognition, Robotics, Artificial intelligent Transport systems etc. To extract the text data information has becomes a challenging task. Since, Text extraction are very useful for identifying and analysis the whole information about image, Therefore, In this paper, we propose a unified framework by combining morphological operations and Genetic Algorithms for extracting and analyzing the text data region which may be embedded in an image by means of variety of texts: font, size, skew angle, distortion by slant and tilt, shape of the object which texts are on, etc. We have established our proposed methods on gray level image sets and make qualitative and quantitative comparisons with other existing methods and concluded that proposed method is better than others.

[...] Read more.
Retinal Image Segmentation for Diabetic Retinopathy Detection using U-Net Architecture

By Swapnil V. Deshmukh Apash Roy Pratik Agrawal

DOI: https://doi.org/10.5815/ijigsp.2023.01.07, Pub. Date: 8 Feb. 2023

Diabetic retinopathy is one of the most serious eye diseases and can lead to permanent blindness if not diagnosed early. The main cause of this is diabetes. Not every diabetic will develop diabetic retinopathy, but the risk of developing diabetes is undeniable. This requires the early diagnosis of Diabetic retinopathy. Segmentation is one of the approaches which is useful for detecting the blood vessels in the retinal image. This paper proposed the three models based on a deep learning approach for recognizing blood vessels from retinal images using region-based segmentation techniques. The proposed model consists of four steps preprocessing, Augmentation, Model training, and Performance measure. The augmented retinal images are fed to the three models for training and finally, get the segmented image. The proposed three models are applied on publically available data set of DRIVE, STARE, and HRF. It is observed that more thin blood vessels are segmented on the retinal image in the HRF dataset using model-3. The performance of proposed three models is compare with other state-of-art-methods of blood vessels segmentation of DRIVE, STARE, and HRF datasets.

[...] Read more.
A Review on Image Reconstruction through MRI k-Space Data

By Tanuj Kumar Jhamb Vinith Rejathalal V.K. Govindan

DOI: https://doi.org/10.5815/ijigsp.2015.07.06, Pub. Date: 8 Jun. 2015

Image reconstruction is the process of generating an image of an object from the signals captured by the scanning machine. Medical imaging is an interdisciplinary field combining physics, biology, mathematics and computational sciences. This paper provides a complete overview of image reconstruction process in MRI (Magnetic Resonance Imaging). It reviews the computational aspect of medical image reconstruction. MRI is one of the commonly used medical imaging techniques. The data collected by MRI scanner for image reconstruction is called the k-space data. For reconstructing an image from k-space data, there are various algorithms such as Homodyne algorithm, Zero Filling method, Dictionary Learning, and Projections onto Convex Set method. All the characteristics of k-space data and MRI data collection technique are reviewed in detail. The algorithms used for image reconstruction discussed in detail along with their pros and cons. Various modern magnetic resonance imaging techniques like functional MRI, diffusion MRI have also been introduced. The concepts of classical techniques like Expectation Maximization, Sensitive Encoding, Level Set Method, and the recent techniques such as Alternating Minimization, Signal Modeling, and Sphere Shaped Support Vector Machine are also reviewed. It is observed that most of these techniques enhance the gradient encoding and reduce the scanning time. Classical algorithms provide undesirable blurring effect when the degree of phase variation is high in partial k-space. Modern reconstructions algorithms such as Dictionary learning works well even with high phase variation as these are iterative procedures.

[...] Read more.
Real-Time Video based Human Suspicious Activity Recognition with Transfer Learning for Deep Learning

By Indhumathi .J Balasubramanian .M Balasaigayathri .B

DOI: https://doi.org/10.5815/ijigsp.2023.01.05, Pub. Date: 8 Feb. 2023

Nowadays, the primary concern of any society is providing safety to an individual. It is very hard to recognize the human behaviour and identify whether it is suspicious or normal. Deep learning approaches paved the way for the development of various machine learning and artificial intelligence. The proposed system detects real-time human activity using a convolutional neural network. The objective of the study is to develop a real-time application for Activity recognition using with and without transfer learning methods. The proposed system considers criminal, suspicious and normal categories of activities. Differentiate suspicious behaviour videos are collected from different peoples(men/women). This proposed system is used to detect suspicious activities of a person. The novel 2D-CNN, pre-trained VGG-16 and ResNet50 is trained on video frames of human activities such as normal and suspicious behaviour. Similarly, the transfer learning in VGG16 and ResNet50 is trained using human suspicious activity datasets. The results show that the novel 2D-CNN, VGG16, and ResNet50 without transfer learning achieve accuracy of 98.96%, 97.84%, and 99.03%, respectively. In Kaggle/real-time video, the proposed system employing 2D-CNN outperforms the pre-trained model VGG16. The trained model is used to classify the activity in the real-time captured video. The performance obtained on ResNet50 with transfer learning accuracy of 99.18% is higher than VGG16 transfer learning accuracy of 98.36%. 

[...] Read more.