Convolution of large 3D images on GPU and its decomposition

Pavel Karas, David Svoboda
2011 EURASIP Journal on Advances in Signal Processing  
In this article, we propose a method for computing convolution of large 3D images. The convolution is performed in a frequency domain using a convolution theorem. The algorithm is accelerated on a graphic card by means of the CUDA parallel computing model. Convolution is decomposed in a frequency domain using the decimation in frequency algorithm. We pay attention to keeping our approach efficient in terms of both time and memory consumption and also in terms of memory transfers between CPU and
more » ... GPU which have a significant inuence on overall computational time. We also study the implementation on multiple GPUs and compare the results between the multi-GPU and multi-CPU implementations.
doi:10.1186/1687-6180-2011-120 fatcat:y3hbriwikjaljnlklbw74awcry