RUS  ENG
Full version
JOURNALS // Proceedings of the Institute for System Programming of the RAS // Archive

Proceedings of ISP RAS, 2022 Volume 34, Issue 1, Pages 123–140 (Mi tisp669)

Redundancy and uncertainty-based algorithms for computation planning

A. G. Feoktistova, R. O. Kostromina, S. A. Gorskya, I. V. Bychkova, A. N. Tchernykhbcd, O. Yu. Basharinaae

a Matrosov Institute for System Dynamics and Control Theory of Siberian Branch of RAS
b South Ural State University
c Ivannikov Institute for System Programming of the RAS
d Centro de Investigación Científica y de Educación Superior
e Irkutsk State University

Abstract: Nowadays, the development and use of workflow-based applications (distributed applied software packages) are some of the key challenges in terms of preparing and carrying out large-scale scientific experiments in distributed environments with heterogeneous computing resources. The environment resources can be represented by clusters of personal computers, supercomputers, and private or public cloud platforms and differ in their computational characteristics. Moreover, the composition and characteristics of resources change in dynamics. Therefore, computations planning and resource allocation in the considered environments are important problems. In this regard, we propose new algorithms for computation planning taking into account redundancy and uncertainty in such distributed applied software packages. Compared to other algorithms of a similar purpose, the proposed algorithms use evaluations of workflow execution makespan obtained in the process of continuous integration, delivery, and deployment of applied software. The proposed algorithms provide the construction of redundant problem-solving schemes that allow us to adapt them to the dynamic characteristics of computational resources and improve distributed computing reliability. The algorithms are based on a theory of conceptual modeling computational processes. We demonstrate the process of constructing problem-solving schemes on model examples. In addition, we show the utility in using redundancy for increasing the distributed computing reliability In comparison with some traditional meta-schedulers.

Keywords: distributed computing, scientific applications, workflow, computation planning, redundancy, uncertainty.

DOI: 10.15514/ISPRAS-2022-34(1)-9



© Steklov Math. Inst. of RAS, 2024