RUS  ENG
Full version
JOURNALS // Upravlenie Bol'shimi Sistemami // Archive

UBS, 2017 Issue 70, Pages 58–86 (Mi ubs935)

Network-based models in Control

On topological fault-tolerance of scalable computing systems

V. A. Melent'ev

Rzhanov Institute of Semiconductor Physics Siberian Branch of RAS, Novosibirsk

Abstract: Problems of the analysis of topological fault tolerance of the scalable computing system and ensuring its sustainability to fault of the given multiplicity are considered. The measure of topological fault tolerance is offered, which connects the computing system topology with its potential parallelism for the given fault multiplicity. The relationship between the functions of topological scalability and topological fault tolerance is defined. The dependence of the minimum of a topological fault tolerance by the girth of the system graph is shown. Model of parallel computings, and functions of the topological fault tolerance and scalability are adapted to the existence of unique nodes in information topology of the solved task. A method for configuring fault-tolerant subsystems for a deficient topological fault tolerance of a computing system is proposed, while providing the preassigned fault multiplicity for the solved task is achieved by duplicating subsystems which are configured for less, than the preassigned, fault multiplicity.

Keywords: scalable computing systems, their topological fault-tolerance.

UDC: 021.8 + 025.1
BBK: 78.34

Received: September 20, 2016
Published: November 30, 2017



Bibliographic databases:


© Steklov Math. Inst. of RAS, 2025