RUS  ENG
Full version
JOURNALS // Sistemy i Sredstva Informatiki [Systems and Means of Informatics] // Archive

Sistemy i Sredstva Inform., 2024 Volume 34, Issue 3, Pages 14–22 (Mi ssi942)

To the problem of identifying failures in the information technology infrastructure by monitoring and analyzing indirect data

D. V. Smirnova, A. A. Grushob, M. I. Zabezhailob

a Sberbank of Russia, 19 Vavilov Str., Moscow 117999, Russian Federation
b Federal Research Center "Computer Science and Control" of the Russian Academy of Sciences, 44-2 Vavilov Str., Moscow 119133, Russian Federation

Abstract: Research aspects of the problem of identifying failures in complex information technology (IT) systems are discussed. A failure is understood as an abnormal mode of operation of the IT infrastructure, in which the specified functionality of the business processes supported by it is not provided but the existing means of monitoring the functioning of the IT infrastructure do not raise alarms. In such situations, the conclusion about the failure can be formed only by indirect data, in particular, by the reaction of users contacting the support service, etc. The tasks of building identification systems for such abnormal situations, the so-called downdetectors, are considered in the context of some research problems of modern artificial intelligence: intellectual analysis of natural language texts, identification of cause-and-effect relationships in the analyzed data, training on precedents in open subject areas, etc. The paper proposes directions of scientific research and formulation of tasks, the solution of which is necessary to significantly increase the efficiency of detecting failures using downdetector methods.

Keywords: information infrastructure, indirect data monitoring, downdetectors, cause-and-effect, artificial intelligence.

Received: 02.08.2024

DOI: 10.14357/08696527240302



© Steklov Math. Inst. of RAS, 2025