Abstract:
An approach to optimize computing applications for multiprocessor systems
with nonuniform memory access (the so-called NUMA systems) is considered.
This approach allows one to make the most use of system computing resources
with minimal changes in application codes and can be applied in hybrid MPI-threaded
programs on modern cluster systems. Some results of numerical experiments on
a large number of realistic problems are discussed.
Keywords:high-performance computing; hybrid MPI-threaded programs; NUMA systems.