International Journal of Image, Graphics and Signal Processing (IJIGSP)

ISSN: 2074-9074 (Print)

ISSN: 2074-9082 (Online)

DOI: https://doi.org/10.5815/ijigsp

Website: https://www.mecs-press.org/ijigsp

Published By: MECS Press

Frequency: 6 issues per year

Number(s) Available: 137

(IJIGSP) in Google Scholar Citations / h5-index

IJIGSP is committed to bridging the theory and practice of images, graphics, and signal processing. From innovative ideas to specific algorithms and full system implementations, IJIGSP publishes original, peer-reviewed, high-quality articles in the areas of images, graphics, and signal processing. IJIGSP is a well-indexed scholarly journal and an indispensable reference for people working at the cutting edge of image, graphics, and signal processing applications.

 

IJIGSP has been abstracted or indexed by several world-class databases: Scopus, Google Scholar, Microsoft Academic Search, CrossRef, Baidu Wenku, IndexCopernicus, IET Inspec, EBSCO, JournalSeek, ULRICH's Periodicals Directory, WorldCat, Scirus, Academic Journals Database, Stanford University Libraries, Cornell University Library, UniSA Library, CNKI Scholar, ProQuest, J-Gate, ZDB, BASE, OhioLINK, iThenticate, Open Access Articles, Open Science Directory, National Science Library of Chinese Academy of Sciences, The HKU Scholars Hub, etc.

Latest Issue

IJIGSP Vol. 17, No. 3, Jun. 2025

REGULAR PAPERS

LiteDVDNet: Optimizing FastDVDNet for High-Speed Video Denoising

By Andrii Ilchenko Sergii Stirenko

DOI: https://doi.org/10.5815/ijigsp.2025.03.01, Pub. Date: 8 Jun. 2025

The growing demand for high-quality video processing in real-time applications calls for efficient denoising techniques that can operate swiftly while maintaining visual fidelity. Conventional approaches often struggle to balance these competing requirements, especially when dealing with high-resolution video streams or resource-constrained environments. This study aims to develop methods for accelerating video denoising using deep convolutional neural networks while maintaining acceptable output quality. We selected the popular FastDVDNet denoising network, which operates on a sliding-window principle, as our baseline for comparison and as the starting point for our research. This paper proposes several modifications of FastDVDNet that significantly enhance computational efficiency. We introduce four key optimizations: caching intermediate denoising results, reducing intermediate channels in the input block, simplifying convolutional blocks, and halving the number of channels. We evaluated these modifications on the Set8 dataset and compared the results with the original model at various noise levels. Finally, we introduce LiteDVDNet, a fine-tuned version of the FastDVDNet model that balances processing speed and denoising performance. We developed two model variants: LiteDVDNet-32, which is 3× faster than the original model with only a 0.18 dB average PSNR reduction, and the more lightweight LiteDVDNet-16, which delivers a 5× speed improvement at the cost of a 0.61 dB average PSNR reduction.
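The first optimization listed above, caching intermediate denoising results, can be illustrated with a toy sketch. It assumes the commonly described FastDVDNet layout in which a first stage processes overlapping frame triplets and a second stage fuses three first-stage outputs; `stage1`, `stage2`, and `denoise_sequence` are hypothetical stand-ins operating on scalar "frames", not the authors' networks or code.

```python
def stage1(triplet):
    """Hypothetical stand-in for a first-stage block: averages three frames."""
    return sum(triplet) / 3.0


def stage2(features):
    """Hypothetical stand-in for the fusion block: averages three stage-1 outputs."""
    return sum(features) / 3.0


def denoise_sequence(frames, use_cache=True):
    """Slide a two-stage window over scalar 'frames'.

    Returns (denoised_frames, number_of_stage1_calls).
    """
    n = len(frames)
    cache = {}
    calls = 0

    def frame(i):
        # Clamp indices so border frames are repeated at the sequence ends.
        return frames[min(max(i, 0), n - 1)]

    def first_stage(center):
        nonlocal calls
        if use_cache and center in cache:
            return cache[center]
        calls += 1
        out = stage1((frame(center - 1), frame(center), frame(center + 1)))
        if use_cache:
            cache[center] = out
            # Results older than the current window are never needed again.
            cache.pop(center - 3, None)
        return out

    denoised = []
    for t in range(n):
        # Neighbouring windows share two of their three stage-1 results.
        fused = stage2([first_stage(t - 1), first_stage(t), first_stage(t + 1)])
        denoised.append(fused)
    return denoised, calls
```

In this sketch, caching leaves the output unchanged and cuts first-stage evaluations from roughly three per frame to roughly one per frame. The speed-ups reported in the abstract come from the authors' full set of optimizations, not from this toy model.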

Pelican Optimization based Histogram Equalization for Contrast Enhancement and Brightness Preservation

By Niveditta Thakur Nafis Uddin Khan Sunil Datt Sharma Abul Bashar

DOI: https://doi.org/10.5815/ijigsp.2025.03.02, Pub. Date: 8 Jun. 2025

Image contrast is an important visual characteristic that considerably affects the appearance of an image. In this paper, image contrast is enhanced optimally, using nature-inspired meta-heuristic algorithms, so that all the data in the image are accurately portrayed. Algorithms are devised and proposed to enhance the contrast of low-contrast images. Poor image contrast caused by a low-quality capturing device, biased user experience, or an unsuitable environment setting during image capture is the main problem encountered during the image enhancement process. Histogram Equalization (HE), a frequently used technique for contrast enhancement, typically produces images with unwanted artifacts and an unnatural, washed-out appearance. Global HE offers no control over the degree of enhancement. The quality of an image is crucial for human comprehension, making image contrast enhancement (ICE) a crucial pre-processing stage in image processing and analysis. In the current study, the Pelican Optimization Algorithm (POA), a contemporary nature-inspired meta-heuristic (MH) algorithm, is used as the foundation for the grayscale image contrast enhancement (GICE) approach. The proposed method is compared with existing contrast enhancement techniques on the basis of standard image quality metrics. The performance of the proposed algorithm on standard test images and the Kodak dataset demonstrates that the suggested POA-based image enhancement technique greatly improves both the overall image contrast and the information conveyed by the image.
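For context, the classical global HE baseline that the abstract criticizes can be sketched in a few lines. This is the textbook method, not the proposed POA-based approach; the function name `equalize_histogram` and the list-of-lists image layout are assumptions made for illustration.

```python
def equalize_histogram(image, levels=256):
    """Classical global histogram equalization for an integer grayscale image.

    `image` is a list of rows with values in [0, levels - 1].
    """
    flat = [p for row in image for p in row]
    if not flat:
        return [row[:] for row in image]

    # Histogram and cumulative distribution of gray levels.
    hist = [0] * levels
    for p in flat:
        hist[p] += 1
    cdf = []
    running = 0
    for count in hist:
        running += count
        cdf.append(running)

    total = len(flat)
    cdf_min = next(c for c in cdf if c > 0)
    if total == cdf_min:
        # Constant image: there is nothing to stretch.
        return [row[:] for row in image]

    # Map each level so the output CDF is as close to linear as possible.
    scale = (levels - 1) / (total - cdf_min)
    mapping = [int((c - cdf_min) * scale + 0.5) if c >= cdf_min else 0 for c in cdf]
    return [[mapping[p] for p in row] for row in image]
```

The mapping is fixed entirely by the image histogram, which is why global HE gives no handle on the degree of enhancement.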

Discriminative Sentimental NLP Model with Re-Enforcement Deep Learning Model for the Slogan Generation

By Shailesh S. Sangle Raghavendra R. Sedamkar

DOI: https://doi.org/10.5815/ijigsp.2025.03.03, Pub. Date: 8 Jun. 2025

Effective communication is paramount in election campaigns, and slogans are crucial in conveying messages and eliciting voter sentiment. This paper introduces CRFVReC (Conditional Random Field with Variable-Length Receptive Fields), a statistical modeling technique that categorizes and generates election campaign slogans by analyzing their sentiment. It is a novel approach to sentiment-based slogan generation and analysis in election campaigns. The aim was to choose datasets that precisely capture voter sentiment from a variety of sources, such as social media (SM) posts, public comments, and news articles. The datasets were meticulously chosen to encompass a broad spectrum of sentiments and issues that are pertinent to voters. The CRFVReC model was configured to maximize the performance of sentiment classification and slogan generation. Adjusting parameters such as the length of the receptive field to match the length of slogans enhanced the model's adaptability and increased its accuracy. Utilizing Conditional Random Fields (CRFs), CRFVReC classifies election campaign slogans into optimistic and pessimistic sentiments and generates slogans that resonate emotionally with voters. The key objectives of this study are twofold: first, to accurately classify election campaign slogans into two primary sentiment categories, optimistic and pessimistic, and second, to generate emotionally resonant slogans that can effectively connect with voters. Extensive experiments and sentiment analyses are conducted using a diverse dataset of election campaign slogans to assess the effectiveness of CRFVReC. The results highlight the model's precision in sentiment classification, demonstrating its capability to discriminate between optimistic and pessimistic sentiment in slogans. The model achieves high accuracy, precision, recall, and AUC scores in sentiment classification on a diverse dataset.
Furthermore, CRFVReC showcases its creative potential in generating slogans with emotionally compelling content. This capability holds significant promise for campaign strategists and political communicators seeking to craft slogans that resonate deeply and emotionally with voters. Additionally, the model's adaptability to slogans of varying lengths makes it a versatile tool for election campaign management and strategy development. CRFVReC emerges as a robust and adaptable solution for sentiment-based slogan generation and analysis in the complex landscape of election campaigns. Its contributions lie not only in accurate sentiment classification but also in its potential to shape the narrative of political campaigns through the creation of emotionally impactful slogans. This research contributes to the fields of political communication and campaign management, providing valuable tools and insights for practitioners and researchers.

Hybrid System for Image Storage and Retrieval in Big Data Environments

By Glib Tereshchenko Iryna Kyrychenko Victoria Vysotska Zhengbing Hu Yuriy Ushenko Mariia Talakh

DOI: https://doi.org/10.5815/ijigsp.2025.03.04, Pub. Date: 8 Jun. 2025

This paper presents a hybrid image storage model for big data environments. The model combines relational and non-relational (NoSQL) databases, file systems (IPFS), and blockchain technologies to ensure an optimal balance between performance, scalability, and security in image storage. The existing approaches to organising image data storage and image compression methods in decentralised systems are analysed. Optimised image indexing is proposed to accelerate data search and access. A prototype system based on the proposed model was developed, and an experimental study was conducted on various image datasets (medical, satellite, and digital art). The experimental results demonstrate that the hybrid model outperforms traditional approaches: image access time is reduced by ~30% compared to standalone storage systems, providing high scalability (with increased nodes, processing time decreases nonlinearly). The efficiency of image compression in reducing storage costs in blockchain-oriented systems is also confirmed: the WebP format allows file size to be reduced by 40–60% while maintaining acceptable quality (PSNR > 30 dB). The proposed solution is relevant for medical diagnostics, video surveillance systems, geographic information systems, and other fields requiring reliable storage and fast processing of large-scale image datasets.
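The quality criterion quoted above (PSNR > 30 dB) follows the standard definition PSNR = 10·log10(MAX² / MSE). A minimal sketch for 8-bit grayscale images stored as lists of rows is given below; the function names are illustrative and not taken from the paper.

```python
import math


def mean_squared_error(reference, test):
    """Mean squared error between two equally sized grayscale images."""
    diffs = [
        (a - b) ** 2
        for ref_row, test_row in zip(reference, test)
        for a, b in zip(ref_row, test_row)
    ]
    return sum(diffs) / len(diffs)


def psnr(reference, test, max_value=255):
    """Peak signal-to-noise ratio in dB; infinite for identical images."""
    err = mean_squared_error(reference, test)
    if err == 0:
        return math.inf
    return 10.0 * math.log10(max_value ** 2 / err)
```

For 8-bit images, a PSNR above 30 dB corresponds to a mean squared error below about 65, since 255² / 10³ ≈ 65.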

Colorization-based U-Net Architecture for Precise Liver Tumor Segmentation in Clinical CT Images

By Ika Novita Dewi Abu Salam Danang Wahyu Utomo

DOI: https://doi.org/10.5815/ijigsp.2025.03.05, Pub. Date: 8 Jun. 2025

Accurate liver and tumor segmentation from medical imaging plays an important role in effective diagnosis and appropriate treatment planning, especially in the case of liver cancer. This research proposes a novel U-Net architecture enhanced with image colorization techniques for precise liver tumor segmentation in clinical CT images. The proposed image colorization-based U-Net, which integrates both grayscale-based and RGB-based architectures, was tested on the LiTS dataset and real clinical data. This evaluation aimed to measure its effectiveness in liver and tumor segmentation across different imaging conditions. The grayscale-based U-Net achieved high segmentation accuracy, with a DICE coefficient of 99.95% for liver segmentation and 90.44% for tumor segmentation. This strong performance suggests its ability to precisely delineate anatomical structures. The model also achieved an RMSE of 0.019, a PSNR of 82.14, and a pixel accuracy of 0.316, reflecting its capability to reduce reconstruction error while preserving overall image quality. These findings further support the model's reliability in challenging imaging scenarios, suggesting its potential as an effective tool for liver tumor segmentation. To further validate its real-world applicability, the model was tested on clinical data, where it effectively segmented liver and tumor regions across diverse imaging conditions. By addressing challenges such as low contrast and variability in tumor characteristics, the use of grayscale-based colorization techniques enhances feature representation, leading to improved segmentation outcomes. The findings demonstrate the potential of the proposed approach to enhance liver and tumor localization, providing a robust framework for clinical applications.

Next-Gen Market Predictor: Transformed Moving Average Fast-RNN Hybrid with Advanced CNNS

By Swarnalata Rath Nilima R. Das Binod Kumar Pattanayak

DOI: https://doi.org/10.5815/ijigsp.2025.03.06, Pub. Date: 8 Jun. 2025

Stock price prediction anticipates future stock prices using historical data and computational models to assist and guide investment decisions. In financial forecasting, accuracy and efficacy in stock price prediction are essential for making better choices. This research describes a hybrid deep learning strategy for improving the extraction and interpretation of crucial details from stock price time series data. Traditional approaches face challenges such as computational complexity and the nonlinearity of stock prices. The suggested method pre-processes stock data with a Moving Average Z-Transformation, which emphasises long-term trends and reduces short-term fluctuations. It combines a Transformed Moving Average Fast-RNN Hybrid with Advanced CNNs to create an efficient computational framework. The Enhanced Deep-CNN layer comprises convolutional layers, batch normalisation, leaky ReLU activations, dropout, max pooling, and a dense layer. The performance of the model is quantified using metrics including Mean Absolute Error (MAE), Mean Square Error (MSE), Root Mean Square Error (RMSE), and R-squared (R²). It shows superior prediction accuracy with MAEs of 0.28, 0.15, 0.34, 0.17, and 0.13 for Kotak, ICICI, Axis, and SBI, respectively, outperforming previous models. These measurements provide detailed information about the model's predictive skills, demonstrating its ability to improve stock price forecast accuracy significantly.
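The four evaluation metrics named above have standard definitions, sketched here in plain Python. The function name `regression_metrics` and its dictionary output are illustrative choices, not part of the paper.

```python
import math


def regression_metrics(actual, predicted):
    """Return MAE, MSE, RMSE, and R-squared for two equal-length sequences."""
    n = len(actual)
    errors = [a - p for a, p in zip(actual, predicted)]

    mae = sum(abs(e) for e in errors) / n
    mse = sum(e * e for e in errors) / n
    rmse = math.sqrt(mse)

    # R-squared compares the residual error with the variance around the mean.
    mean_actual = sum(actual) / n
    ss_res = sum(e * e for e in errors)
    ss_tot = sum((a - mean_actual) ** 2 for a in actual)
    r2 = 1.0 - ss_res / ss_tot if ss_tot > 0 else float("nan")

    return {"mae": mae, "mse": mse, "rmse": rmse, "r2": r2}
```

MAE, MSE, and RMSE are scale-dependent, so values are only comparable across stocks when the series have been normalised in the same way.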

Binary Segmentation Dataset Distances for Transfer Learning

By Victor Sineglazov Kirill Riazanovskiy Olexander Klanovets

DOI: https://doi.org/10.5815/ijigsp.2025.03.07, Pub. Date: 8 Jun. 2025

This work is devoted to developing a novel transfer learning approach for solving binary semantic segmentation problems that often arise with small samples in the medical domain (segmentation of nodules in lungs, tumors, polyps, etc.) and other domains. The goal is to select the most suitable dataset from a different subject area with a feature space and distribution similar to those of the target data. Examples show that a severe disadvantage of transfer learning is the difficulty of selecting an initial training sample for pre-training a neural network. In this paper, we propose metrics for calculating the distance between binary segmentation datasets, allowing us to select the optimal initial training set for transfer learning. These metrics are based on estimating geometric distances between datasets using optimal transport, the Wasserstein distance for Gaussian mixture models, clustering, and their hybrids. Experiments on the Medical Segmentation Decathlon datasets, LIDC, and a private dataset of tuberculomas in the lungs are presented, showing a statistically strict correlation of these metrics with the relative increase in segmentation accuracy during transfer learning.
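As a small illustration of one building block mentioned above, the 2-Wasserstein distance between two Gaussians with diagonal covariances has the closed form W₂² = ‖m₁ − m₂‖² + Σᵢ (σ₁ᵢ − σ₂ᵢ)². The sketch below covers only this simplified diagonal case, which is an assumption made for brevity; the paper's metrics for Gaussian mixture models and their hybrids are more involved and are not reproduced here.

```python
import math


def gaussian_w2_diagonal(mean_a, std_a, mean_b, std_b):
    """2-Wasserstein distance between two Gaussians with diagonal covariance.

    Each argument is a sequence holding the per-dimension means or
    standard deviations of one Gaussian.
    """
    mean_term = sum((ma - mb) ** 2 for ma, mb in zip(mean_a, mean_b))
    spread_term = sum((sa - sb) ** 2 for sa, sb in zip(std_a, std_b))
    return math.sqrt(mean_term + spread_term)
```

With such a distance, candidate source datasets summarised by feature statistics could be ranked by their closeness to the target dataset. The choice of features is left open here.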

Multi-Stage Medical Image Encryption System Combining RSA and Steganography

By Jahin Ahmed Faizul Hakim Md. Asadur Rahman

DOI: https://doi.org/10.5815/ijigsp.2025.03.08, Pub. Date: 8 Jun. 2025

Data security has become a major concern in the present era of the communication revolution; in particular, maintaining the confidentiality of medical images is a prime concern in e-health establishments. As conventional techniques have numerous drawbacks, this study aims to develop an image encryption algorithm by combining two renowned methods: the RSA algorithm and steganography. The proposed algorithm builds on the conventional RSA algorithm and steganography to provide an attainable solution to this alarming issue. The RSA technique encrypts multiple medical images with distinct keys; these keys are then embedded in two images and transmitted secretly with the help of LSB steganography. The proposed system generates images with an unidentifiable pattern after encryption and decrypts those images without any loss. The claimed performance and robustness of the system are justified using different numerical and graphical measures such as PSNR, MSE, SSIM, NPCR, UACI, and histograms. This encryption method can be used for medical image transmission where image security is a vital concern.
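The two ingredients described above can be sketched together in miniature: textbook RSA applied to pixel values, and LSB steganography used to hide a key in a cover image. The tiny primes, the 16-bit key width, and all function names are assumptions chosen for illustration. Real systems use large keys and padding, and the authors' multi-stage scheme is not reproduced here.

```python
def rsa_keypair(p=61, q=53, e=17):
    """Toy RSA key pair from small primes (illustrative only, not secure)."""
    n = p * q
    phi = (p - 1) * (q - 1)
    d = pow(e, -1, phi)  # modular inverse, available in Python 3.8+
    return (e, n), (d, n)


def rsa_apply(values, key):
    """Encrypt or decrypt a list of integers (each below n) with an RSA key."""
    exponent, n = key
    return [pow(v, exponent, n) for v in values]


def int_to_bits(value, width):
    """Little-endian bit list of a non-negative integer."""
    return [(value >> i) & 1 for i in range(width)]


def bits_to_int(bits):
    """Inverse of int_to_bits."""
    return sum(bit << i for i, bit in enumerate(bits))


def lsb_embed(cover, bits):
    """Hide bits in the least significant bits of the first cover pixels."""
    if len(bits) > len(cover):
        raise ValueError("cover image is too small for the payload")
    stego = list(cover)
    for i, bit in enumerate(bits):
        stego[i] = (stego[i] & ~1) | bit
    return stego


def lsb_extract(stego, count):
    """Read back the first `count` hidden bits."""
    return [pixel & 1 for pixel in stego[:count]]
```

In this sketch the private exponent travels inside the cover pixels, each of which changes by at most one gray level. The encrypted values exceed the 8-bit range, so a practical scheme needs an encoding step that this example omits.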

Edibility Detection of Mushroom Using Ensemble Methods

By Nusrat Jahan Pinky S.M. Mohidul Islam Rafia Sharmin Alice

DOI: https://doi.org/10.5815/ijigsp.2019.04.05, Pub. Date: 8 Apr. 2019

Mushrooms are a familiar and delicious food that is cholesterol-free and rich in vitamins and minerals. Although nearly 45,000 species of mushrooms are known throughout the world, most of them are poisonous and a few are lethally poisonous. Identifying edible or poisonous mushrooms with the naked eye is quite difficult. Moreover, there is no easy rule for edibility identification using machine learning methods that works for all types of data. Our aim is to find a robust method for identifying mushroom edibility with better performance than existing works. In this paper, three ensemble methods are used to detect the edibility of mushrooms: bagging, boosting, and random forest. Using the most significant features, five feature sets are created to build five base models for each ensemble method. Accuracy is measured for ensemble methods built from both fixed feature set-based models and randomly selected feature set-based models, on two types of test sets. The results show that methods built from fixed feature set-based models perform better than those built from randomly selected feature set-based models. The highest accuracy is obtained by the proposed model-based random forest on both test sets.

Evolutionary Image Enhancement Using Multi-Objective Genetic Algorithm

By Dhirendra Pal Singh Ashish Khare

DOI: https://doi.org/10.5815/ijigsp.2014.01.09, Pub. Date: 8 Nov. 2013

Image processing is the art of examining, identifying, and judging the significance of images. Image enhancement refers to the attenuation or sharpening of image features such as edgels, boundaries, or contrast to make the processed image more useful for analysis. Image enhancement procedures use computers to provide improved images for study by human interpreters. In this paper, we propose a novel method that uses a Genetic Algorithm with multi-objective criteria to find a more enhanced version of an image. The proposed method has been verified on benchmark images in image enhancement. A simple Genetic Algorithm may not explore the search space enough to find a more enhanced image. In the proposed method, three objectives are taken into consideration: intensity, entropy, and number of edgels. By incorporating these objectives, the proposed algorithm achieves automatic image enhancement. We review some of the existing image enhancement techniques. We also compare the results of our algorithm with other Genetic Algorithm-based techniques. We expect that further improvements can be achieved by incorporating a linear relationship with some other techniques.

Image Denoising based on Enhanced Wavelet Global Thresholding Using Intelligent Signal Processing Algorithm

By Joseph Isabona Agbotiname Lucky Imoize Stephen Ojo

DOI: https://doi.org/10.5815/ijigsp.2023.05.01, Pub. Date: 8 Oct. 2023

Denoising is a vital aspect of image preprocessing, often explored to eliminate noise in an image and restore its proper characteristic formation and clarity. Unfortunately, noise often degrades the quality of valuable images, making them meaningless for practical applications. Several methods have been deployed to address this problem, but the quality of the recovered images still requires enhancement for efficient applications in practice. In this paper, a wavelet-based universal thresholding technique is proposed that can optimally denoise highly degraded noisy images with both uniform and non-uniform variations in illumination and contrast. The proposed method, herein referred to as the modified wavelet-based universal thresholding (MWUT), was employed to denoise five noisy images and compared with three state-of-the-art denoising techniques. To appraise the quality of the images obtained, eight performance indicators were employed: Root Mean Square Error (RMSE), Mean Absolute Error (MAE), Structural Content (SC), Peak Signal to Noise Ratio (PSNR), Structural Similarity Index Measure (SSIM), Signal-to-Reconstruction-Error Ratio (SRER), Naturalness Image Quality Evaluator (NIQE), and Blind/Referenceless Image Spatial Quality Evaluator (BRISQUE). The first six indicators (RMSE, MAE, SC, PSNR, SSIM, and SRER) are reference indicators, while the remaining two (NIQE and BRISQUE) are referenceless. For superior performance of the proposed wavelet threshold algorithm, SC, PSNR, SSIM, and SRER must be higher, while lower values of NIQE, BRISQUE, RMSE, and MAE are preferred. Higher values of PSNR, SSIM, and SRER in the final results show the superior performance of the proposed MWUT denoising technique over the existing schemes. Lower NIQE, BRISQUE, RMSE, and MAE values also indicate better image quality with the proposed technique. The modified wavelet-based universal thresholding technique would find practical applications in digital image processing and enhancement.
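For readers unfamiliar with the baseline being modified, classical universal thresholding sets λ = σ·√(2 ln N) and shrinks wavelet detail coefficients toward zero. The sketch below applies it to a one-level Haar transform of a 1-D signal. It illustrates the classical scheme only, not the authors' MWUT. The median-based noise estimate (MAD / 0.6745) is the customary choice and is assumed here, and all function names are illustrative.

```python
import math
from statistics import median


def haar_forward(signal):
    """One-level Haar transform of an even-length signal."""
    if len(signal) % 2:
        raise ValueError("signal length must be even")
    s = 1.0 / math.sqrt(2.0)
    approx = [(signal[i] + signal[i + 1]) * s for i in range(0, len(signal), 2)]
    detail = [(signal[i] - signal[i + 1]) * s for i in range(0, len(signal), 2)]
    return approx, detail


def haar_inverse(approx, detail):
    """Inverse of haar_forward."""
    s = 1.0 / math.sqrt(2.0)
    signal = []
    for a, d in zip(approx, detail):
        signal.extend([(a + d) * s, (a - d) * s])
    return signal


def soft_threshold(coefficient, lam):
    """Shrink a coefficient toward zero by lam."""
    return math.copysign(max(abs(coefficient) - lam, 0.0), coefficient)


def universal_threshold(detail, n):
    """Classical universal threshold with a median-based noise estimate."""
    sigma = median(abs(d) for d in detail) / 0.6745
    return sigma * math.sqrt(2.0 * math.log(n))


def denoise(signal):
    """Haar transform, soft-threshold the details, then reconstruct."""
    approx, detail = haar_forward(signal)
    lam = universal_threshold(detail, len(signal))
    return haar_inverse(approx, [soft_threshold(d, lam) for d in detail])
```

A single global threshold of this kind is what a modified scheme would adapt. Image denoising applies the same idea to 2-D wavelet sub-bands, usually over several decomposition levels.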

Text Region Extraction: A Morphological Based Image Analysis Using Genetic Algorithm

By Dhirendra Pal Singh Ashish Khare

DOI: https://doi.org/10.5815/ijigsp.2015.02.06, Pub. Date: 8 Jan. 2015

Image analysis belongs to the areas of computer vision and pattern recognition. These areas are also part of digital image processing, where researchers pay great attention to retrieving content information from various types of images with complex, low-contrast, or multi-spectral backgrounds. This content may be found in any form, such as texture data, shapes, and objects. Text region extraction as content from an image is a class of problems in digital image processing applications that aims to provide necessary information widely used in many fields, such as medical imaging, pattern recognition, robotics, and artificial intelligent transport systems. Extracting the text data has become a challenging task. Since text extraction is very useful for identifying and analyzing the information in an image, in this paper we propose a unified framework that combines morphological operations and Genetic Algorithms for extracting and analyzing text regions embedded in an image, which may vary in font, size, skew angle, distortion by slant and tilt, the shape of the object on which the text appears, etc. We have evaluated our proposed method on gray-level image sets, made qualitative and quantitative comparisons with other existing methods, and concluded that the proposed method performs better than the others.

Mobile-Based Skin Disease Diagnosis System Using Convolutional Neural Networks (CNN)

By M.W.P Maduranga Dilshan Nandasena

DOI: https://doi.org/10.5815/ijigsp.2022.03.05, Pub. Date: 8 Jun. 2022

This paper presents the design and development of an Artificial Intelligence (AI) based mobile application to detect the type of skin disease. Skin diseases are a serious hazard to everyone throughout the world. However, it is difficult to diagnose skin diseases accurately. In this work, a deep learning algorithm, the Convolutional Neural Network (CNN), is proposed to classify skin diseases on the HAM10000 dataset. An extensive review of research articles on object identification methods and a comparison of their relative qualities were carried out to find a method that would work well for detecting skin diseases. The CNN-based technique was identified as the best method for identifying skin diseases. A mobile application, in turn, is built for quick and accurate action. By examining an image of the afflicted area at the onset of a skin illness, it assists patients and dermatologists in determining the kind of disease present. It detects the affected region considerably faster, with nearly 2x fewer computations than the standard MobileNet model, which results in low computing effort. This study revealed that MobileNet with transfer learning, yielding an accuracy of about 85%, is the most suitable model for automatic skin disease identification. According to these findings, the suggested approach can assist general practitioners in quickly and accurately diagnosing skin diseases using a smartphone.

A Review of Self-supervised Learning Methods in the Field of Medical Image Analysis

By Jiashu Xu

DOI: https://doi.org/10.5815/ijigsp.2021.04.03, Pub. Date: 8 Aug. 2021

In the field of medical image analysis, supervised deep learning strategies have achieved significant development, but these methods rely on large labeled datasets. Self-supervised learning (SSL) provides a new strategy for pre-training a neural network with unlabeled data. It is a new unsupervised learning paradigm that has achieved significant breakthroughs in recent years. As a result, more and more researchers are trying to use SSL methods for medical image analysis to meet the challenge of assembling large medical datasets. To our knowledge, there is still a shortage of reviews of self-supervised learning methods in the field of medical image analysis; this article aims to fill that gap and comprehensively review the application of self-supervised learning in the medical field. It provides an up-to-date and detailed overview of self-supervised learning in the medical field and promotes the development of unsupervised learning in medical imaging. The methods are divided into three categories: context-based, generation-based, and contrast-based. For each category, we show the pros and cons and evaluate performance in downstream tasks. Finally, we conclude with the limitations of current methods and discuss future directions.

Radio Receiver with Internal Compression of Input Signals Using a Dispersive Delay Line with Bandpass Filters

By Roman Pantyeyev Felix Yanovsky Andriy Mykolushko Volodymyr Shutko

DOI: https://doi.org/10.5815/ijigsp.2023.06.01, Pub. Date: 8 Dec. 2023

This article proposes a receiving device in which arbitrary input signals undergo pre-detector processing so that broadband modulated pulses can subsequently be compressed with a matched filter to increase the signal-to-noise ratio and improve resolution. For this purpose, a model of a dispersive delay line is developed based on series-connected high-frequency time delay lines with taps in the form of bandpass filters, and this model is analysed as part of a radio receiving device with chirp signal compression. The article presents a mathematical description of the formation and compression of chirp signals based on their matched filtering using the developed model, and proposes a block diagram of a radio receiving device that uses the principle of compressing received signals. The proposed model can be implemented in devices for receiving unknown signals, in particular in passive radar. It can also be used for studying signal compression processes based on linear frequency modulation in traditional radar systems.
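The principle of chirp compression by matched filtering can be shown with a short digital sketch. It correlates a received sequence with the complex conjugate of a baseband linear-FM pulse, and the output peaks at the pulse delay. This is a generic discrete-time illustration with assumed parameters and function names, not a model of the dispersive delay line with bandpass-filter taps described in the article.

```python
import cmath
import math


def linear_chirp(length, bandwidth=0.5):
    """Baseband linear-FM pulse sweeping 0..bandwidth cycles per sample."""
    rate = bandwidth / length
    return [cmath.exp(1j * math.pi * rate * i * i) for i in range(length)]


def matched_filter(received, template):
    """Magnitude of the correlation of `received` with `template`.

    Output index k corresponds to the template starting at sample k.
    """
    n, m = len(received), len(template)
    output = []
    for lag in range(n - m + 1):
        acc = 0j
        for j in range(m):
            acc += received[lag + j] * template[j].conjugate()
        output.append(abs(acc))
    return output
```

The compressed peak has a height equal to the pulse length in samples. That concentration of pulse energy into a narrow peak is the source of the gain in signal-to-noise ratio and resolution.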

An Efficient Brain Tumor Detection Algorithm Using Watershed & Thresholding Based Segmentation

By Anam Mustaqeem Engr Ali Javed Tehseen Fatima

DOI: https://doi.org/10.5815/ijigsp.2012.10.05, Pub. Date: 28 Sep. 2012

During the past few years, brain tumor segmentation in magnetic resonance imaging (MRI) has become an emerging research area in the field of medical imaging systems. Brain tumor detection helps in finding the exact size and location of a tumor. An efficient algorithm for tumor detection based on segmentation and morphological operators is proposed in this paper. First, the quality of the scanned image is enhanced, and then morphological operators are applied to detect the tumor in the scanned image.
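The thresholding and morphological steps mentioned above can be sketched on a small grayscale grid: a global threshold produces a binary mask, and a morphological opening (erosion followed by dilation with a 3×3 structuring element) removes isolated bright pixels while keeping compact regions. The watershed stage named in the title is not shown. The threshold level, the structuring element, and the function names are assumptions.

```python
def threshold(image, level):
    """Binary mask: 1 where the pixel is brighter than `level`."""
    return [[1 if p > level else 0 for p in row] for row in image]


def _neighbourhood(mask, r, c):
    """Yield the 3x3 neighbourhood of (r, c); outside the image counts as 0."""
    rows, cols = len(mask), len(mask[0])
    for dr in (-1, 0, 1):
        for dc in (-1, 0, 1):
            rr, cc = r + dr, c + dc
            yield mask[rr][cc] if 0 <= rr < rows and 0 <= cc < cols else 0


def erode(mask):
    """Keep a pixel only if its whole 3x3 neighbourhood is set."""
    return [
        [1 if all(_neighbourhood(mask, r, c)) else 0 for c in range(len(mask[0]))]
        for r in range(len(mask))
    ]


def dilate(mask):
    """Set a pixel if any pixel in its 3x3 neighbourhood is set."""
    return [
        [1 if any(_neighbourhood(mask, r, c)) else 0 for c in range(len(mask[0]))]
        for r in range(len(mask))
    ]


def opening(mask):
    """Erosion followed by dilation: removes specks smaller than the element."""
    return dilate(erode(mask))
```

The size and location of a detected region can then be read from the remaining mask, for example by counting set pixels and averaging their coordinates.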

Restoration of Degraded Gray Images Using Genetic Algorithm

By Dhirendra Pal Singh Ashish Khare

DOI: https://doi.org/10.5815/ijigsp.2016.03.04, Pub. Date: 8 Mar. 2016

Image deblurring aims to eliminate or reduce the degradations that occurred while the image was being acquired. In this paper, we propose a unified framework for the restoration process that produces enhanced and better-quantified deblurred images with the help of a Genetic Algorithm. The developed method uses an iterative procedure with evolutionary criteria and produces better images with most of the frequency content restored. We have compared the proposed method with the Lucy-Richardson restoration method, the method proposed by W. Dong [34], and the inverse filter restoration method, and demonstrated that the proposed method is more accurate, achieving high-quality restored images in terms of various statistical quality measures.

Fast Encryption Scheme for Secure Transmission of e-Healthcare Images

By Devisha Tiwari Bhaskar Mondal Anil Singh

DOI: https://doi.org/10.5815/ijigsp.2023.05.07, Pub. Date: 8 Oct. 2023

E-healthcare systems (EHSD) and Digital Imaging and Communications in Medicine (DICOM) have gained popularity over the past decade as they have become the top contenders for interoperability and adoption as a global standard for transmitting and communicating medical data. Security is a growing issue as EHSD and DICOM have become usable on any-to-any devices. The goal of this research is to create a privacy-preserving encryption technique for rapid EHSD communication with minimal storage. A new 2D logistic-sine chaotic map (2DLSCM) is used to design the proposed encryption method, which has been developed specifically for peer-to-peer communications via unique keys. Fed with initial values from a 3D Lorenz map, the 2DLSCM provides a unique keyspace of 2^544 in each round of peer-to-peer paired transmission. A permutation-diffusion design is used in the encryption process, and the 2DLSCM together with the 3D Lorenz system is used to generate unique initial values for the keys. The approach can quickly encrypt any EHSD image or DICOM object without interfering with real-time medical transmission. To assess the method, five distinct EHSD images of different kinds, sizes, and quality are selected. The findings indicate strong protection, speed, and scalability compared with similar existing methods in the literature.
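The permutation-diffusion design mentioned above can be illustrated with a deliberately simplified sketch. Since the abstract does not define the 2DLSCM or how it is coupled to the Lorenz system, the example substitutes a plain 1-D logistic map as the chaotic source. That substitution, the parameter values, and all function names are assumptions, and the result is a teaching toy rather than a secure cipher.

```python
def logistic_sequence(seed, count, r=3.99, burn_in=100):
    """Chaotic sequence from the 1-D logistic map (stand-in for the 2DLSCM)."""
    x = seed
    for _ in range(burn_in):
        x = r * x * (1.0 - x)
    values = []
    for _ in range(count):
        x = r * x * (1.0 - x)
        values.append(x)
    return values


def _schedule(seed, n):
    """Derive a pixel permutation and a byte keystream from the chaotic sequence."""
    chaos = logistic_sequence(seed, 2 * n)
    permutation = sorted(range(n), key=lambda i: chaos[i])
    keystream = [int(v * 256) % 256 for v in chaos[n:]]
    return permutation, keystream


def encrypt(pixels, seed):
    """Permute pixel positions, then diffuse with chained XOR."""
    permutation, keystream = _schedule(seed, len(pixels))
    shuffled = [pixels[p] for p in permutation]
    cipher, previous = [], 0
    for value, key_byte in zip(shuffled, keystream):
        previous = value ^ key_byte ^ previous
        cipher.append(previous)
    return cipher


def decrypt(cipher, seed):
    """Undo the diffusion, then undo the permutation."""
    permutation, keystream = _schedule(seed, len(cipher))
    shuffled, previous = [], 0
    for value, key_byte in zip(cipher, keystream):
        shuffled.append(value ^ key_byte ^ previous)
        previous = value
    pixels = [0] * len(cipher)
    for position, source in enumerate(permutation):
        pixels[source] = shuffled[position]
    return pixels
```

Permutation hides where a pixel came from, and diffusion makes each cipher byte depend on all earlier ones. A scheme of this kind relies on the sender and receiver deriving identical chaotic sequences from the shared key.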

[...] Read more.
Edibility Detection of Mushroom Using Ensemble Methods

By Nusrat Jahan Pinky S.M. Mohidul Islam Rafia Sharmin Alice

DOI: https://doi.org/10.5815/ijigsp.2019.04.05, Pub. Date: 8 Apr. 2019

Mushrooms are the most familiar delicious food which is cholesterol free as well as rich in vitamins and minerals. Though nearly 45,000 species of mushrooms have been known throughout the world, most of them are poisonous and few are lethally poisonous. Identifying edible or poisonous mushroom through the naked eye is quite difficult. Even there is no easy rule for edibility identification using machine learning methods that work for all types of data. Our aim is to find a robust method for identifying mushrooms edibility with better performance than existing works. In this paper, three ensemble methods are used to detect the edibility of mushrooms: Bagging, Boosting, and random forest. By using the most significant features, five feature sets are made for making five base models of each ensemble method. The accuracy is measured for ensemble methods using five both fixed feature set-based models and randomly selected feature set based models, for two types of test sets. The result shows that better performance is obtained for methods made of fixed feature sets-based models than randomly selected feature set-based models. The highest accuracy is obtained for the proposed model-based random forest for both test sets.

Evolutionary Image Enhancement Using Multi-Objective Genetic Algorithm

By Dhirendra Pal Singh Ashish Khare

DOI: https://doi.org/10.5815/ijigsp.2014.01.09, Pub. Date: 8 Nov. 2013

Image processing is the art of examining, identifying, and judging the significance of images. Image enhancement refers to the accentuation, or sharpening, of image features such as edgels, boundaries, or contrast to make the processed image more useful for analysis. Image enhancement procedures use computers to provide improved images for study by human interpreters. In this paper we propose a novel method that uses a Genetic Algorithm with multi-objective criteria to find a more enhanced version of an image. The proposed method has been verified on benchmark images for image enhancement. A simple Genetic Algorithm may not explore enough to find a more enhanced image. The proposed method takes three objectives into consideration: intensity, entropy, and the number of edgels. By incorporating these objectives, the proposed algorithm achieves automatic image enhancement. We review some of the existing image enhancement techniques. We also compare the results of our algorithm with other Genetic Algorithm based techniques. We expect that further improvements can be achieved by incorporating a linear relationship with some other techniques.
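
The three objectives named in this abstract (intensity, entropy, and number of edgels) can be computed for a candidate image roughly as sketched below. The abstract does not define the measures or how they are combined, so the simple difference-based edge test, the `threshold` value, and the assumption that all three objectives are maximised are illustrative choices, not the authors' formulation.

```python
import math


def mean_intensity(image):
    pixels = [v for row in image for v in row]
    return sum(pixels) / len(pixels)


def entropy(image):
    """Shannon entropy (in bits) of the gray-level histogram."""
    hist, total = {}, 0
    for row in image:
        for v in row:
            hist[v] = hist.get(v, 0) + 1
            total += 1
    return -sum((c / total) * math.log2(c / total) for c in hist.values())


def edge_count(image, threshold=32):
    """Count pixels whose horizontal or vertical difference exceeds threshold."""
    h, w = len(image), len(image[0])
    count = 0
    for y in range(h - 1):
        for x in range(w - 1):
            gx = abs(image[y][x + 1] - image[y][x])
            gy = abs(image[y + 1][x] - image[y][x])
            if max(gx, gy) > threshold:
                count += 1
    return count


def objectives(image):
    """Objective vector used to compare candidate enhancements."""
    return (mean_intensity(image), entropy(image), edge_count(image))


def dominates(a, b):
    """Pareto dominance, assuming every objective is to be maximised."""
    return all(x >= y for x, y in zip(a, b)) and any(x > y for x, y in zip(a, b))


flat = [[100] * 4 for _ in range(4)]
checker = [[0, 255, 0, 255], [255, 0, 255, 0], [0, 255, 0, 255], [255, 0, 255, 0]]
print(dominates(objectives(checker), objectives(flat)))
```

A multi-objective Genetic Algorithm would keep the candidates that no other candidate dominates, rather than collapsing the three scores into a single weighted sum.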

Image Denoising based on Enhanced Wavelet Global Thresholding Using Intelligent Signal Processing Algorithm

By Joseph Isabona Agbotiname Lucky Imoize Stephen Ojo

DOI: https://doi.org/10.5815/ijigsp.2023.05.01, Pub. Date: 8 Oct. 2023

Denoising is a vital part of image preprocessing, used to remove noise from an image and restore its proper characteristics and clarity. Noise often degrades the quality of valuable images, making them useless for practical applications. Several methods have been deployed to address this problem, but the quality of the recovered images still requires improvement for practical use. In this paper, we propose a wavelet-based universal thresholding technique that can denoise highly degraded noisy images with both uniform and non-uniform variations in illumination and contrast. The proposed method, referred to here as modified wavelet-based universal thresholding (MWUT), was used to denoise five noisy images and compared with three state-of-the-art denoising techniques. To appraise the quality of the resulting images, eight performance indicators were employed: Root Mean Square Error (RMSE), Mean Absolute Error (MAE), Structural Content (SC), Peak Signal to Noise Ratio (PSNR), Structural Similarity Index Measure (SSIM), Signal-to-Reconstruction-Error Ratio (SRER), Natural Image Quality Evaluator (NIQE), and Blind/Referenceless Image Spatial Quality Evaluator (BRISQUE). The first six indicators (RMSE, MAE, SC, PSNR, SSIM, and SRER) are reference indicators, while the remaining two (NIQE and BRISQUE) are referenceless. Superior performance corresponds to higher SC, PSNR, SSIM, and SRER and to lower NIQE, BRISQUE, RMSE, and MAE. In the final results, the higher PSNR, SSIM, and SRER values show the superior performance of the proposed MWUT technique over the compared techniques, and the lower NIQE, BRISQUE, RMSE, and MAE values likewise indicate better image quality than the existing schemes. The MWUT technique would find practical applications in digital image processing and enhancement.
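
For orientation, the classical universal threshold that wavelet methods of this kind start from can be sketched as below. This shows the standard rule T = sigma * sqrt(2 ln N) with soft thresholding on a one-level Haar transform of a 1D signal; it is not the authors' modified MWUT, whose details are not given in the abstract. The function names and the sample signal are assumptions, and the input length is assumed to be even.

```python
import math


def haar_forward(signal):
    """One-level orthonormal Haar transform (even-length input assumed)."""
    s = 1 / math.sqrt(2)
    approx = [(signal[i] + signal[i + 1]) * s for i in range(0, len(signal), 2)]
    detail = [(signal[i] - signal[i + 1]) * s for i in range(0, len(signal), 2)]
    return approx, detail


def haar_inverse(approx, detail):
    s = 1 / math.sqrt(2)
    out = []
    for a, d in zip(approx, detail):
        out.extend([(a + d) * s, (a - d) * s])
    return out


def median(values):
    v = sorted(values)
    mid = len(v) // 2
    return v[mid] if len(v) % 2 else (v[mid - 1] + v[mid]) / 2


def universal_threshold(detail, n):
    """T = sigma * sqrt(2 ln n), with sigma estimated as MAD / 0.6745."""
    sigma = median([abs(d) for d in detail]) / 0.6745
    return sigma * math.sqrt(2 * math.log(n))


def soft(x, t):
    """Soft thresholding: shrink x toward zero by t."""
    return math.copysign(max(abs(x) - t, 0.0), x)


def denoise(signal):
    approx, detail = haar_forward(signal)
    t = universal_threshold(detail, len(signal))
    return haar_inverse(approx, [soft(d, t) for d in detail])


noisy = [10.0, 10.2, 9.9, 10.1, 20.0, 19.8, 20.1, 20.0]
print(denoise(noisy))
```

Only the detail coefficients are thresholded, so the coarse structure carried by the approximation coefficients passes through unchanged.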

A Review of Self-supervised Learning Methods in the Field of Medical Image Analysis

By Jiashu Xu

DOI: https://doi.org/10.5815/ijigsp.2021.04.03, Pub. Date: 8 Aug. 2021

In the field of medical image analysis, supervised deep learning strategies have achieved significant progress, but these methods rely on large labeled datasets. Self-supervised learning (SSL) provides a new strategy for pre-training a neural network with unlabeled data. It is a new unsupervised learning paradigm that has achieved significant breakthroughs in recent years. As a result, more and more researchers are trying to use SSL methods for medical image analysis to meet the challenge of assembling large medical datasets. To our knowledge, there is still a shortage of reviews of self-supervised learning methods in the field of medical image analysis; this article aims to fill that gap and comprehensively review the application of self-supervised learning in the medical field. It provides a recent and detailed overview of self-supervised learning in the medical field and promotes the development of unsupervised learning in medical imaging. We divide the methods into three categories (context-based, generation-based, and contrast-based), show the pros and cons of each category, and evaluate their performance in downstream tasks. Finally, we conclude with the limitations of current methods and discuss future directions.

Breast Cancer Classification from Ultrasound Images using VGG16 Model based Transfer Learning

By A. B. M. Aowlad Hossain Jannatul Kamrun Nisha Fatematuj Johora

DOI: https://doi.org/10.5815/ijigsp.2023.01.02, Pub. Date: 8 Feb. 2023

Ultrasound-based breast screening has recently been gaining attention, especially for dense breasts. Technological advancement, cancer awareness, and cost, safety, and availability benefits have led to a rapid rise of the breast ultrasound market. The irregular shape, intensity variation, and additional blood vessels of malignant cancer are distinguishable from the benign phase in ultrasound images. However, classifying breast cancer from ultrasound images is difficult owing to speckle noise and the complex textures of the breast. In this paper, a breast cancer classification method is presented that uses a VGG16 model based transfer learning approach. We use a median filter to despeckle the images. The convolutional layers of the pretrained VGG16 model, along with the max-pooling layers, are used as the feature extractor, and a proposed two-layer fully connected deep neural network is designed as the classifier. The Adam optimizer is used with a learning rate of 0.001, and binary cross-entropy is chosen as the loss function for model optimization. Dropout in the hidden layers is used to avoid overfitting. Breast ultrasound images from two databases (897 images in total) have been combined to train, validate, and test the performance and generalization strength of the classifier. Experimental results showed a training accuracy of 98.2% and a testing accuracy of 91% on blind testing data, with reduced computational complexity. The gradient class activation mapping (Grad-CAM) technique has been used to visualize and check the localization of targeted regions at the final convolutional layer, and the results were found to be noteworthy. The outcomes of this work might be useful for clinical applications of breast cancer diagnosis.
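
The median-filter despeckling step mentioned in this abstract can be sketched in a few lines. The abstract does not state the window size or border handling, so the 3x3 window and edge replication used here are assumptions, as is the representation of the image as a list of rows.

```python
def median_filter3(image):
    """3x3 median filter with edge replication (image is a list of rows)."""
    h, w = len(image), len(image[0])
    out = [row[:] for row in image]
    for y in range(h):
        for x in range(w):
            window = []
            for dy in (-1, 0, 1):
                for dx in (-1, 0, 1):
                    yy = min(max(y + dy, 0), h - 1)   # clamp to the border
                    xx = min(max(x + dx, 0), w - 1)
                    window.append(image[yy][xx])
            window.sort()
            out[y][x] = window[4]                     # middle of 9 values
    return out


speckled = [
    [10, 10, 10],
    [10, 255, 10],   # a single bright speckle
    [10, 10, 10],
]
print(median_filter3(speckled))
```

Because the median ignores isolated outliers, the single bright pixel is replaced by the surrounding value, while a mean filter would smear it across its neighbours.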

Text Region Extraction: A Morphological Based Image Analysis Using Genetic Algorithm

By Dhirendra Pal Singh Ashish Khare

DOI: https://doi.org/10.5815/ijigsp.2015.02.06, Pub. Date: 8 Jan. 2015

Image analysis belongs to the area of computer vision and pattern recognition. These areas are also part of digital image processing, where researchers pay great attention to content-based information retrieval from images with complex, low-contrast, or multi-spectral backgrounds. This content may take any form, such as texture data, shapes, and objects. Extracting text regions as content from an image is a class of problems in digital image processing that aims to provide information widely used in many fields, such as medical imaging, pattern recognition, robotics, and artificial intelligent transport systems. Extracting text information has become a challenging task. Because text extraction is very useful for identifying and analysing the overall information in an image, in this paper we propose a unified framework that combines morphological operations and Genetic Algorithms to extract and analyse text regions embedded in an image, where the text may vary in font, size, skew angle, distortion by slant and tilt, the shape of the object it appears on, etc. We have evaluated the proposed method on gray-level image sets, made qualitative and quantitative comparisons with other existing methods, and concluded that the proposed method performs better than the others.
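
As an illustration of the kind of morphological operation such a framework relies on, the sketch below applies a binary closing (dilation followed by erosion) to merge nearby strokes into one candidate text region. The abstract does not say which operations or structuring elements the authors use, so the 3x3 square neighbourhood and the function names are assumptions.

```python
def _neighbourhood(img, y, x):
    """Values in the 3x3 window around (y, x), restricted to the image."""
    h, w = len(img), len(img[0])
    return [
        img[yy][xx]
        for yy in range(max(y - 1, 0), min(y + 2, h))
        for xx in range(max(x - 1, 0), min(x + 2, w))
    ]


def dilate(img):
    """A pixel becomes 1 if any pixel in its 3x3 neighbourhood is 1."""
    return [[max(_neighbourhood(img, y, x)) for x in range(len(img[0]))]
            for y in range(len(img))]


def erode(img):
    """A pixel stays 1 only if every pixel in its 3x3 neighbourhood is 1."""
    return [[min(_neighbourhood(img, y, x)) for x in range(len(img[0]))]
            for y in range(len(img))]


def close(img):
    """Morphological closing: dilation followed by erosion."""
    return erode(dilate(img))


# Two isolated stroke pixels separated by a one-pixel gap.
strokes = [
    [0, 0, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0, 0],
    [0, 0, 1, 0, 1, 0, 0],
    [0, 0, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0, 0],
]
print(close(strokes)[2])
```

Closing fills the gap between the two strokes without growing the region outward, which is why it is a common first step before grouping characters into text lines.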

Retinal Image Segmentation for Diabetic Retinopathy Detection using U-Net Architecture

By Swapnil V. Deshmukh Apash Roy Pratik Agrawal

DOI: https://doi.org/10.5815/ijigsp.2023.01.07, Pub. Date: 8 Feb. 2023

Diabetic retinopathy is one of the most serious eye diseases and can lead to permanent blindness if not diagnosed early. Its main cause is diabetes. Not every diabetic will develop diabetic retinopathy, but the risk of developing it is undeniable, which makes early diagnosis of diabetic retinopathy essential. Segmentation is one approach that is useful for detecting the blood vessels in a retinal image. This paper proposes three models based on a deep learning approach for recognizing blood vessels in retinal images using region-based segmentation techniques. The proposed approach consists of four steps: preprocessing, augmentation, model training, and performance measurement. The augmented retinal images are fed to the three models for training, which finally produce the segmented image. The three proposed models are applied to the publicly available DRIVE, STARE, and HRF datasets. It is observed that model-3 segments more thin blood vessels in the retinal images of the HRF dataset. The performance of the three proposed models is compared with other state-of-the-art blood vessel segmentation methods on the DRIVE, STARE, and HRF datasets.

Mobile-Based Skin Disease Diagnosis System Using Convolutional Neural Networks (CNN)

By M.W.P Maduranga Dilshan Nandasena

DOI: https://doi.org/10.5815/ijigsp.2022.03.05, Pub. Date: 8 Jun. 2022

This paper presents the design and development of an Artificial Intelligence (AI) based mobile application to detect the type of skin disease. Skin diseases are a serious hazard to everyone throughout the world. However, it is difficult to diagnose skin diseases accurately. In this work, a deep learning algorithm, the Convolutional Neural Network (CNN), is proposed to classify skin diseases on the HAM10000 dataset. An extensive review of research articles on object identification methods, with a comparison of their relative qualities, was carried out to find a method that would work well for detecting skin diseases. The CNN-based technique was recognized as the best method for identifying skin diseases. A mobile application, in turn, is built for quick and accurate action. By examining an image of the afflicted area at the onset of a skin illness, it assists patients and dermatologists in determining the kind of disease present. It detects the affected region considerably faster, with nearly 2x fewer computations than the standard MobileNet model, which results in low computational effort. This study revealed that MobileNet with transfer learning, yielding an accuracy of about 85%, is the most suitable model for automatic skin disease identification. According to these findings, the suggested approach can assist general practitioners in quickly and accurately diagnosing skin diseases using a smartphone.

A Review on Image Reconstruction through MRI k-Space Data

By Tanuj Kumar Jhamb Vinith Rejathalal V.K. Govindan

DOI: https://doi.org/10.5815/ijigsp.2015.07.06, Pub. Date: 8 Jun. 2015

Image reconstruction is the process of generating an image of an object from the signals captured by the scanning machine. Medical imaging is an interdisciplinary field combining physics, biology, mathematics, and computational sciences. This paper provides a complete overview of the image reconstruction process in MRI (Magnetic Resonance Imaging). It reviews the computational aspects of medical image reconstruction. MRI is one of the commonly used medical imaging techniques. The data collected by an MRI scanner for image reconstruction is called the k-space data. Various algorithms exist for reconstructing an image from k-space data, such as the Homodyne algorithm, the Zero Filling method, Dictionary Learning, and the Projection onto Convex Sets method. The characteristics of k-space data and the MRI data collection technique are reviewed in detail. The algorithms used for image reconstruction are discussed in detail along with their pros and cons. Various modern magnetic resonance imaging techniques, such as functional MRI and diffusion MRI, are also introduced. The concepts of classical techniques such as Expectation Maximization, Sensitivity Encoding, and the Level Set Method, and of recent techniques such as Alternating Minimization, Signal Modeling, and the Sphere Shaped Support Vector Machine, are also reviewed. It is observed that most of these techniques enhance the gradient encoding and reduce the scanning time. Classical algorithms produce an undesirable blurring effect when the degree of phase variation is high in partial k-space. Modern reconstruction algorithms such as Dictionary Learning work well even with high phase variation, as they are iterative procedures.
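
The Zero Filling method named in this abstract can be illustrated on a toy 1D example: k-space samples that were not acquired are set to zero before the inverse Fourier transform. Real MRI data is two-dimensional and complex-valued, and the naive O(n^2) DFT, the sample object, and the choice of acquired indices below are assumptions made to keep the sketch self-contained.

```python
import cmath


def dft(x, inverse=False):
    """Naive discrete Fourier transform (the inverse is scaled by 1/n)."""
    n = len(x)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        acc = sum(x[t] * cmath.exp(sign * 2j * cmath.pi * k * t / n)
                  for t in range(n))
        out.append(acc / n if inverse else acc)
    return out


def zero_fill_reconstruct(kspace, acquired):
    """Set unacquired k-space samples to zero, then invert and take magnitudes."""
    filled = [kspace[k] if k in acquired else 0j for k in range(len(kspace))]
    return [abs(v) for v in dft(filled, inverse=True)]


obj = [0.0, 1.0, 3.0, 1.0, 0.0, 0.0, 2.0, 0.0]   # toy 1D "object"
kspace = dft(obj)                                 # fully sampled k-space

full = zero_fill_reconstruct(kspace, acquired=set(range(8)))
partial = zero_fill_reconstruct(kspace, acquired={0, 1, 2, 3, 4, 5})
print([round(v, 3) for v in full])
print([round(v, 3) for v in partial])
```

With every sample acquired the object is recovered up to rounding error, while the partial acquisition yields a degraded result; this is the kind of artefact the abstract attributes to classical algorithms on partial k-space.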

Real-Time Video based Human Suspicious Activity Recognition with Transfer Learning for Deep Learning

By Indhumathi .J Balasubramanian .M Balasaigayathri .B

DOI: https://doi.org/10.5815/ijigsp.2023.01.05, Pub. Date: 8 Feb. 2023

Nowadays, a primary concern of any society is keeping individuals safe. It is very hard to recognize human behaviour and identify whether it is suspicious or normal. Deep learning approaches have paved the way for various developments in machine learning and artificial intelligence. The proposed system detects real-time human activity using a convolutional neural network. The objective of the study is to develop a real-time application for activity recognition with and without transfer learning. The proposed system considers criminal, suspicious, and normal categories of activities. Videos of different suspicious behaviours are collected from different people (men and women). The proposed system is used to detect the suspicious activities of a person. The novel 2D-CNN, the pre-trained VGG16, and ResNet50 are trained on video frames of human activities such as normal and suspicious behaviour. Similarly, VGG16 and ResNet50 with transfer learning are trained using human suspicious activity datasets. The results show that the novel 2D-CNN, VGG16, and ResNet50 without transfer learning achieve accuracies of 98.96%, 97.84%, and 99.03%, respectively. On Kaggle and real-time video, the proposed system employing the 2D-CNN outperforms the pre-trained VGG16 model. The trained model is used to classify the activity in real-time captured video. ResNet50 with transfer learning achieves an accuracy of 99.18%, which is higher than the 98.36% obtained by VGG16 with transfer learning.
