RUS  ENG
Full version
JOURNALS // Vestnik Yuzhno-Ural'skogo Gosudarstvennogo Universiteta. Seriya "Vychislitelnaya Matematika i Informatika" // Archive

Vestn. YuUrGU. Ser. Vych. Matem. Inform., 2023 Volume 12, Issue 2, Pages 78–92 (Mi vyurv298)

Solving grid equations using the alternating-triangular method on a graphics accelerator

A. I. Sukhinova, V. N. Litvinovab, A. E. Chistyakova, A. V. Nikitinaac, N. N. Grachevaab, N. B. Rudenkoab

a Don State Technical University (Gagarin Sq. 1, Rostov-on-Don, 344003 Russia)
b Azov-Black Sea Engineering Institute of Don State Agrarian University (Lenina 21, Zernograd, 347740 Russia)
c Southern Federal University (Bolshaya Sadovaya 105/42, Rostov-on-Don, 344006 Russia)

Abstract: The paper describes a parallel-pipeline implementation of solving grid equations using the modified alternating-triangular iterative method (MATM), obtained by numerically solving the equations of mathematical physics. The greatest computational costs at using this method are on the stages of solving a system of linear algebraic equations (SLAE) with lower triangular and upper non-triangular matrices. An algorithm for solving the SLAE with a lower triangular matrix on a graphics accelerator using NVIDIA CUDA technology is presented. To implement the parallel-pipeline method, a three-dimensional decomposition of the computational domain was used. It is divided into blocks along the $y$ coordinate, the number of which corresponds to the number of GPU streaming multiprocessors involved in the calculations. In turn, the blocks are divided into fragments according to two spatial coordinates — $x$ and $z$. The presented graph model describes the relationship between adjacent fragments of the computational grid and the pipeline calculation process. Based on the results of computational experiments, a regression model was obtained that describes the dependence of the time for calculation one MATM step on the GPU, the acceleration and efficiency for SLAE solution with a lower triangular matrix by the parallel-pipeline method on the GPU were calculated using the different number of streaming multiprocessors.

Keywords: mathematical modeling, parallel algorithm, graphics accelerator.

UDC: 519.6

Received: 15.03.2023

Language: English

DOI: 10.14529/cmse230204



© Steklov Math. Inst. of RAS, 2024