Abstract:
The article analyzes the effectiveness of the implementation of NAS benchmarks from NPB 3.3.1 package (EP, MG, BT, SP, LU) on cluster nodes with different architectures using multi-core processors, NVidia graphics accelerators and Intel coprocessors. Characteristics of tests de-veloped in high-level Fortran-DVMH language (hereafter referred to as FDVMH), and their im-plementation in other languages are compared. We research the effect of different optimization methods for FDVMH NAS benchmarks necessary for their effective work on Intel Xeon Phi co-processor. The results of the simultaneous using of all cores of CPU, GPU and Intel Xeon Phi co-processor are presented.