RUS  ENG
Full version
JOURNALS // Zhurnal Vychislitel'noi Matematiki i Matematicheskoi Fiziki // Archive

Zh. Vychisl. Mat. Mat. Fiz., 2015 Volume 55, Number 4, Pages 582–598 (Mi zvmmf10186)

This article is cited in 16 papers

On the efficiency of a randomized mirror descent algorithm in online optimization problems

A. V. Gasnikovabc, Yu. E. Nesterovbca, V. G. Spokoinyacb

a Moscow Institute of Physics and Technology, Institutskii per. 9, Dolgoprudnyi, Moscow oblast, 141700, Russia
b National Research University Higher School of Economics, Myasnitskaya ul. 20, Moscow, 101000, Russia
c Institute for Information Transmission Problems, Russian Academy of Sciences, Bolshoi Karetnyi per. 19, Moscow, 127051, Russia

Abstract: A randomized online version of the mirror descent method is proposed. It differs from the existing versions by the randomization method. Randomization is performed at the stage of the projection of a subgradient of the function being optimized onto the unit simplex rather than at the stage of the computation of a subgradient, which is common practice. As a result, a componentwise subgradient descent with a randomly chosen component is obtained, which admits an online interpretation. This observation, for example, has made it possible to uniformly interpret results on weighting expert decisions and propose the most efficient method for searching for an equilibrium in a zero-sum two-person matrix game with sparse matrix.

Key words: mirror descent method, dual averaging method, online optimization, exponential weighting, multi-armed bandits, weighting of experts, stochastic optimization, randomization.

UDC: 519.658

MSC: Primary 90C15; Secondary 68W20

Received: 03.09.2014
Revised: 29.10.2014

DOI: 10.7868/S0044466915040043


 English version:
Computational Mathematics and Mathematical Physics, 2015, 55:4, 580–596

Bibliographic databases:


© Steklov Math. Inst. of RAS, 2024