RUS  ENG
Full version
JOURNALS // Numerical methods and programming // Archive

Num. Meth. Prog., 2025 Volume 26, Issue 2, Pages 140–149 (Mi vmp1155)

An asynchronous sparse Cholesky solver for workstations with NUMA architecture

A. S. Maslov, M. M. Makarov, N. N. Potravkin, S. O. Proskurnia

LLC “TS INTEGRATION”

Abstract: A parallel Cholesky factorization algorithm for sparse matrices has been implemented, based on the asynchronous task paradigm and accounting for the specifics of NUMA architecture. The execution of the numerical factorization stage and forward/backward substitution is represented as a directed acyclic graph (DAG), which removes synchronization barriers and enhances data access locality to improve the utilization efficiency of the computational device's memory hierarchy. Performance evaluation demonstrates good scalability compared to the highly optimized commercial package Intel MKL PARDISO, confirming the effectiveness of the proposed approach.

Keywords: Cholesky factorization, NUMA architecture, asynchronous task paradigm, directed acyclic graph, hwloc library.

UDC: 519.612.2

Received: 13.02.2025
Accepted: 17.03.2025

DOI: 10.26089/NumMet.v26r210



© Steklov Math. Inst. of RAS, 2025