A. I. Sukhinov, V. N. Litvinov, A. E. Chistyakov, A. V. Nikitina, N. N. Gracheva, N. B. Rudenko, “Solving grid equations using the alternating-triangular method on a graphics accelerator”, Vestn. YuUrGU. Ser. Vych. Matem. Inform., 2023, Volume 12, Issue 2,Pages <nobr>78

Solving grid equations using the alternating-triangular method on a graphics accelerator

A. I. Sukhinov^a, V. N. Litvinov^ab, A. E. Chistyakov^a, A. V. Nikitina^ac, N. N. Gracheva^ab, N. B. Rudenko^ab

^a Don State Technical University (Gagarin Sq. 1, Rostov-on-Don, 344003 Russia)
^b Azov-Black Sea Engineering Institute of Don State Agrarian University (Lenina 21, Zernograd, 347740 Russia)
^c Southern Federal University (Bolshaya Sadovaya 105/42, Rostov-on-Don, 344006 Russia)

Abstract: The paper describes a parallel-pipeline implementation of solving grid equations using the modified alternating-triangular iterative method (MATM), obtained by numerically solving the equations of mathematical physics. The greatest computational costs at using this method are on the stages of solving a system of linear algebraic equations (SLAE) with lower triangular and upper non-triangular matrices. An algorithm for solving the SLAE with a lower triangular matrix on a graphics accelerator using NVIDIA CUDA technology is presented. To implement the parallel-pipeline method, a three-dimensional decomposition of the computational domain was used. It is divided into blocks along the $y$ coordinate, the number of which corresponds to the number of GPU streaming multiprocessors involved in the calculations. In turn, the blocks are divided into fragments according to two spatial coordinates — $x$ and $z$. The presented graph model describes the relationship between adjacent fragments of the computational grid and the pipeline calculation process. Based on the results of computational experiments, a regression model was obtained that describes the dependence of the time for calculation one MATM step on the GPU, the acceleration and efficiency for SLAE solution with a lower triangular matrix by the parallel-pipeline method on the GPU were calculated using the different number of streaming multiprocessors.

Keywords: mathematical modeling, parallel algorithm, graphics accelerator.

UDC: 519.6

Received: 15.03.2023

Language: English

DOI: 10.14529/cmse230204