Abstract:
In this paper we consider a distributed optimization problem in the black-box setting. This means that the target function $f$ is decomposed into the sum of $n$ functions $f_i$, where $n$ is the number of workers, and each worker is assumed to have access only to a noisy zeroth-order oracle, i.e., only to the values $f_i(x)$ with added noise. We propose a new method, ZO-MARINA, based on the state-of-the-art distributed optimization algorithm MARINA. In particular, the following modifications are made to solve the problem in the black-box setting: i) a zeroth-order approximation is used instead of the exact gradient, ii) instead of the compression operator, the difference of two approximated gradients computed at a subset of coordinates is used. A theoretical convergence analysis is provided for non-convex functions and for functions satisfying the Polyak–Łojasiewicz condition. The convergence rate of the proposed algorithm is consistent with the results for an algorithm that uses a first-order oracle. The theoretical results are validated in computational experiments: finding optimal hyperparameters for the ResNet-18 neural network trained on the CIFAR-10 dataset, and for the SVM model on a dataset from the LibSVM library and on the MNIST-784 dataset.