RUS  ENG
Full version
JOURNALS // Program Systems: Theory and Applications // Archive

Program Systems: Theory and Applications, 2017 Volume 8, Issue 1, Pages 47–62 (Mi ps248)

This article is cited in 1 paper

Hardware, software and distributed supercomputer systems

Effective computation of two-dimensional FFT on a homogeneous or heterogeneous cluster

D. Yu. Knyazkov

Ishlinsky Institute for Problems in Mechanics of the Russian Academy of Sciences

Abstract: The paper considers performing two-dimensional FFT on a supercomputer. It investigates a dependance of FFT computation time from a matrix size for MVS-100K, MVS-10P and HybriLIT supercomputers. A method of CPU-GPU load balance for a heterogeneous cluster is proposed. For a TESLA K40 card it is shown, that two-dimensional FFT computation time is almost equal to data transferring time. The computation itself is 48 times faster when using GPU comparing to two-processors node. (In Russian).

Key words and phrases: HPC, supercomputer computations, fast Fourier transform, FFT, GPU computations.

UDC: 519.68

DOI: 10.25209/2079-3316-2017-8-1-47-62



© Steklov Math. Inst. of RAS, 2024