JSC "OKBM Afrikantov", department of IT development and implementation
Abstract:
The paper presents the problem of high-performance computer clustering to operate the current CAE systems. The main issues being the matter of IT services when solving this problem as well as the methods of their solution are given. The principles of HPC clusters designing in view of the specific character of the problems to be solved are considered. The main subsystems of the compute cluster are shown. The overview of the tools for complex server infrastructure operation computerization, support, and maintenance is given.
Keywords:CAE, HPC-cluster, system administration, monitoring, distributed resource management.