Skip to main content

Showing 1–1 of 1 results for author: Dimoudi, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:1910.01972  [pdf, ps, other

    cs.MS cs.DC cs.PF

    GPU Fast Convolution via the Overlap-and-Save Method in Shared Memory

    Authors: Karel Adámek, Sofia Dimoudi, Mike Giles, Wesley Armour

    Abstract: We present an implementation of the overlap-and-save method, a method for the convolution of very long signals with short response functions, which is tailored to GPUs. We have implemented several FFT algorithms (using the CUDA programming language) which exploit GPU shared memory, allowing for GPU accelerated convolution. We compare our implementation with an implementation of the overlap-and-sav… ▽ More

    Submitted 10 April, 2020; v1 submitted 4 October, 2019; originally announced October 2019.

    Comments: accepted to ACM TACO

    Journal ref: ACM Trans. Archit. Code Optim. 17, 3, Article 18 (September 2020)