Work place: Department of Information Systems and Technologies of the National Technical University of Ukraine “Igor Sikorsky Kyiv Polytechnic Institute,” Kyiv, 03056, Ukraine
E-mail: serhiihatcan@gmail.com
Website: https://orcid.org/0009-0004-5011-0627
Research Interests: Computer Vision, Intelligent Systems, IoT
Biography
Serhii Hatsan is B.Sc. in Information Systems and Technologies and is currently pursuing the M.Sc. degree in Igor Sikorsky Kyiv Polytechnic Institute, Ukraine. Currently working as Network and Security Specialist at SPAN, Ukraine. His research interests include IOT, Embeded Systems, AI, Computer Vision.
By Volodymyr Oliinyk Serhii Hatsan
DOI: https://doi.org/10.5815/ijem.2026.03.03, Pub. Date: 8 Jun. 2026
Image generation methods, including inpainting, are evolving rapidly; however, high memory requirements continue to limit their practical deployment. As a result, the efficient utilization of Latent Diffusion Models on edge devices has become increasingly important. This work explores techniques for reducing memory usage in Latent Diffusion Models while preserving their generative capabilities.
We propose a resource-efficient inpainting method optimized for CPU-based inference, based on a combination of VAE tiling, attention slicing, and dynamic region-of-interest slicing. Experimental results demonstrate that the model's memory footprint can be significantly reduced while maintaining output quality, without substantial increases in computation time, enabling execution on systems with as little as 4 GB of memory and only two processing cores. While the introduced optimizations, particularly those based on localized image processing, introduce an inherent trade-off between memory usage and computational cost, resulting in longer inference times compared to GPU-accelerated solutions, they demonstrate strong potential for deployment in memory-limited environments.
Additionally, we provide analysis of key deployment bottlenecks, including model compilation for cold-start overhead mitigation, proper runtime configuration and scheduler selection. These findings confirm the feasibility of effectively deploying Latent Diffusion Models for inpainting tasks on CPU-only, resource-constrained platforms, thereby broadening their applicability to edge computing scenarios.
[...] Read more.Subscribe to receive issue release notifications and newsletters from MECS Press journals