Volodymyr Oliinyk

Work place: Department of Information Systems and Technologies of the National Technical University of Ukraine “Igor Sikorsky Kyiv Polytechnic Institute,” Kyiv, 03056, Ukraine

E-mail: oliinyk.volodymyr@edu.kpi.ua

Website: https://orcid.org/0000-0002-4647-2658

Research Interests:

Biography

Volodymyr Oliinyk received his PhD in 2011 from Igor Sikorsky Kyiv Polytechnic Institute, Ukraine. He is currently working as Associate Professor in the Department of Information Systems and Technologies at Igor Sikorsky Kyiv Polytechnic Institute, Ukraine.

His research interests include generative AI and deep learning, computer vision and natural language processing, agentic AI. 

Author Articles
Mask-Aware Localized Inpainting Method for CPU-Based Inference

By Volodymyr Oliinyk Serhii Hatsan

DOI: https://doi.org/10.5815/ijem.2026.03.03, Pub. Date: 8 Jun. 2026

Image generation methods, including inpainting, are evolving rapidly; however, high memory requirements continue to limit their practical deployment. As a result, the efficient utilization of Latent Diffusion Models on edge devices has become increasingly important. This work explores techniques for reducing memory usage in Latent Diffusion Models while preserving their generative capabilities.

We propose a resource-efficient inpainting method optimized for CPU-based inference, based on a combination of VAE tiling, attention slicing, and dynamic region-of-interest slicing. Experimental results demonstrate that the model's memory footprint can be significantly reduced while maintaining output quality, without substantial increases in computation time, enabling execution on systems with as little as 4 GB of memory and only two processing cores. While the introduced optimizations, particularly those based on localized image processing, introduce an inherent trade-off between memory usage and computational cost, resulting in longer inference times compared to GPU-accelerated solutions, they demonstrate strong potential for deployment in memory-limited environments.

Additionally, we provide analysis of key deployment bottlenecks, including model compilation for cold-start overhead mitigation, proper runtime configuration and scheduler selection. These findings confirm the feasibility of effectively deploying Latent Diffusion Models for inpainting tasks on CPU-only, resource-constrained platforms, thereby broadening their applicability to edge computing scenarios.

[...] Read more.
Other Articles