RUS  ENG
Full version
JOURNALS // Vestnik Yuzhno-Ural'skogo Gosudarstvennogo Universiteta. Seriya "Vychislitelnaya Matematika i Informatika" // Archive

Vestn. YuUrGU. Ser. Vych. Matem. Inform., 2013 Volume 2, Issue 4, Pages 28–43 (Mi vyurv107)

Computer Science, Engineering and Control

KernelGen — a prototype of LLVM-based auto-parallelizing C/Fortran compiler for NVIDIA GRUs

N. N. Likhogruda, D. N. Mikushinb

a Lomonosov Moscow State University (Moscow, Russian Federation)
b Universit`a della Svizzera italiana (Lugano, Switzerland)

Abstract: The KernelGen project (http://kernelgen.org/) aims to develop Fortran and C compilers based on the state-of-art open-source technologies for automatic GPU kernels generation from unmodified CPU source code, significantly improving the code porting experiences. Parallelism detection is based on LLVM/Polly and CLooG, extended with mapping of loops onto GPU compute grid, and assisted with runtime alias analysis. PTX assembly code is generated with NVPTX backend. Thanks to integration with GCC frontend by means of DragonEgg plugin, and customized linker, KernelGen features full GCC compatibility, and is able to compile complex applications into hybrid binaries containing both CPU and GPU-enabled executables. In addition to more robust parallelism detection, test kernels produced by KernelGen are up to 60 % faster than generated by PGI compiler for kernels source with manually inserted OpenACC directives.

Keywords: GPU, LLVM, OpenACC, JIT-compilation, polyhedral analysis.

UDC: 004.4’422, 004.432.2, 004.4’418, 51-37

Received: 08.06.2013

DOI: 10.14529/cmse130403



© Steklov Math. Inst. of RAS, 2024