RUS  ENG
Full version
JOURNALS // Sistemy i Sredstva Informatiki [Systems and Means of Informatics] // Archive

Sistemy i Sredstva Inform., 2006 Issue 16, Pages 374–385 (Mi ssi20)

Архитектура, системные решения и программное обеспечение вычислительных комплексов и сетей новых поколений

Методы реализации отказоустойчивости приложений с недетерминированным поведением

V. S. Dolgopolov, V. N. Zakharov, L. M. Kozlova, V. A. Kozmidiady, O. L. Obukhova


Abstract: The authors consider two methods of transparent fault tolerance implementation for application servers with non-deterministic behavior and provide their comparison. The first method — (snapshot/restore) — is based on the well-known mechanism of checkpoints (snapshots), which is supplemented with logging of events happened with resources and having influence on determinism of behavior (resource histories). The behavior after failure provides recovery of application states and controlled execution, using resource histories. The second method — (lock-step) — uses only the events logging which is accompanied by the permanent controlled execution on the reserve node of the application server. The arguments in favor of “snapshot/restore” method are presented.

UDC: 004.2



© Steklov Math. Inst. of RAS, 2024