GSWO : a programming model for GPU-enabled parallelization of sliding window operations in image processing.

Yang, Po and Clapworthy, Gordon and Dong, Feng and Codreanu, Valeriu and Williams, David and Liu, Baoquan and Roerdink, Jos B. T. M. and Deng, Zhikun (2016) GSWO : a programming model for GPU-enabled parallelization of sliding window operations in image processing. Signal Processing: Image Communication, 47. pp. 332-345. ISSN 0923-5965 (https://doi.org/10.1016/j.image.2016.05.003)

[thumbnail of Yang-etal-SPIC2016-GSWO-a-programming-model-GPU-enabled-parallelization-sliding-window-operations-image-processing]
Preview
Text. Filename: Yang_etal_SPIC2016_GSWO_a_programming_model_GPU_enabled_parallelization_sliding_window_operations_image_processing.pdf
Accepted Author Manuscript
License: Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 logo

Download (1MB)| Preview

Abstract

Sliding Window Operations (SWOs) are widely used in image processing applications. They often have to be performed repeatedly across the target image, which can demand significant computing resources when processing large images with large windows. In applications in which real-time performance is essential, running these filters on a CPU often fails to deliver results within an acceptable timeframe. The emergence of sophisticated graphic processing units (GPUs) presents an opportunity to address this challenge. However, GPU programming requires a steep learning curve and is error-prone for novices, so the availability of a tool that can produce a GPU implementation automatically from the original CPU source code can provide an attractive means by which the GPU power can be harnessed effectively. This paper presents a GPU-enabled programming model, called GSWO, which can assist GPU novices by converting their SWO-based image processing applications from the original C/C++ source code to CUDA code in a highly automated manner. This model includes a new set of simple SWO pragmas to generate GPU kernels and to support effective GPU memory management. We have implemented this programming model based on a CPU-to-GPU translator (C2GPU). Evaluations have been performed on a number of typical SWO image filters and applications. The experimental results show that the GSWO model is capable of efficiently accelerating these applications, with improved applicability and a speed-up of performance compared to several leading CPU-to-GPU source-to-source translators.