Serhii Hatsan

Work place: Department of Information Systems and Technologies of the National Technical University of Ukraine “Igor Sikorsky Kyiv Polytechnic Institute,” Kyiv, 03056, Ukraine

E-mail: serhiihatcan@gmail.com

Website: https://orcid.org/0009-0004-5011-0627

Research Interests: Computer Vision, Intelligent Systems, IoT

Biography

Serhii Hatsan is B.Sc. in Information Systems and Technologies and is currently pursuing the M.Sc. degree in Igor Sikorsky Kyiv Polytechnic Institute, Ukraine. Currently working as Network and Security Specialist at SPAN, Ukraine. His research interests include IOT, Embeded Systems, AI, Computer Vision.

Author Articles
Mask-Aware Localized Inpainting Method for CPU-Based Inference

By Volodymyr Oliinyk Serhii Hatsan

DOI: https://doi.org/10.5815/ijem.2026.03.03, Pub. Date: 8 Jun. 2026

Image generation methods, including inpainting, are evolving rapidly; however, high memory requirements continue to limit their practical deployment. As a result, the efficient utilization of Latent Diffusion Models on edge devices has become increasingly important. This work explores techniques for reducing memory usage in Latent Diffusion Models while preserving their generative capabilities.

We propose a resource-efficient inpainting method optimized for CPU-based inference, based on a combination of VAE tiling, attention slicing, and dynamic region-of-interest slicing. Experimental results demonstrate that the model's memory footprint can be significantly reduced while maintaining output quality, without substantial increases in computation time, enabling execution on systems with as little as 4 GB of memory and only two processing cores. While the introduced optimizations, particularly those based on localized image processing, introduce an inherent trade-off between memory usage and computational cost, resulting in longer inference times compared to GPU-accelerated solutions, they demonstrate strong potential for deployment in memory-limited environments.

Additionally, we provide analysis of key deployment bottlenecks, including model compilation for cold-start overhead mitigation, proper runtime configuration and scheduler selection. These findings confirm the feasibility of effectively deploying Latent Diffusion Models for inpainting tasks on CPU-only, resource-constrained platforms, thereby broadening their applicability to edge computing scenarios.

[...] Read more.
Other Articles