RUS  ENG
Full version
JOURNALS // Numerical methods and programming // Archive

Num. Meth. Prog., 2011 Volume 12, Issue 4, Pages 90–93 (Mi vmp223)

Программирование

An approach to cluster system task flow monitoring, analysis, and visualization

A. V. Adinetz, P. A. Bryzgalov, Vad. V. Voevodin, S. A. Zhumatii, D. A. Nikitenko

M.V. Lomonosov Moscow State University, Research Computing Center

Abstract: Big cluster systems are spreading wide, so that the efficiency of use of such systems is a very actual task for now. In order to solve this task, it is needed to identify the efficiency problems appearing during the task execution, to notify users about appeared problems, and to suggest possible ways to resolve them. This can be achieved by the continuous monitoring of running tasks and by data analysis. This paper discusses an approach to solve these tasks and describes a working prototype.

Keywords: parallel computing; monitoring; tasks flow; computing cluster.

UDC: 004.432



© Steklov Math. Inst. of RAS, 2024