RUS  ENG
Full version
JOURNALS // Doklady Rossijskoj Akademii Nauk. Mathematika, Informatika, Processy Upravlenia // Archive

Dokl. RAN. Math. Inf. Proc. Upr., 2024 Volume 520, Number 2, Pages 295–312 (Mi danma608)

SPECIAL ISSUE: ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING TECHNOLOGIES

Zero order algorithm for decentralised optimization problems

A. S. Veprikovabc, E. D. Petrova, G. V. Evseeva, A. N. Beznosikovacd

a Moscow Institute of Physics and Technology, Moscow, Russia
b Ivannikov Institute for System Programming of the RAS, Moscow, Russia
c Sber, AI Laboratory, Moscow, Russia
d Innopolis University, Innopolis, Tatarstan, Russia

Abstract: In this paper we consider a distributed optimisation problem in the black-box formulation. This means that the target function $f$ is decomposed into the sum of $n$ functions $f$, where $n$ is the number of workers, where each worker is assumed to have access only to the zero-order noisy oracle, i.e., only to the values of $f (x)$ with added noise. In this paper, we propose a new ZO-MARINA method based on the state-of-the-art MARINA distributed optimization algorithm. In particular, the following modifications are made to solve the problem in the black-box formulation: i) its approximation is used instead of the honest gradient, ii) the difference of two approximated gradients at some coordinates is used instead of the compression operator. In this paper, a theoretical convergence analysis is provided for non-convex functions and functions satisfying the Polak–Lojasiewicz condition. The speed of the proposed algorithm is correlated with the results for an algorithm that uses a first-order oracle. The theoretical results are validated in computational experiments to find optimal hyperparameters for the Resnet-18 neural network when trained on the CIFAR-10 dataset and the SVM model on the LibSVM library dataset and on the Mnist-784 dataset.

Keywords: distributed optimization, zero-order methods, gradient-free methods, machine learning.

UDC: 004.8

Received: 27.09.2024
Accepted: 02.10.2024

DOI: 10.31857/S2686954324700656


 English version:
Doklady Mathematics, 2024, 110:suppl. 1, 261–S277

Bibliographic databases:


© Steklov Math. Inst. of RAS, 2025