RUS  ENG
Full version
JOURNALS // Problemy Peredachi Informatsii // Archive

Probl. Peredachi Inf., 2015 Volume 51, Issue 2, Pages 99–113 (Mi ppi2173)

This article is cited in 8 papers

Large Systems

One-armed bandit problem for parallel data processing systems

A. V. Kolnogorov

Applied Mathematics and Information Science Department, Yaroslav-the-Wise Novgorod State University, Yaroslav, Russia

Abstract: We consider the minimax setting for the one-armed bandit problem, i.e., for the two-armed bandit problem with a known distribution function of incomes corresponding to the first action. Incomes that correspond to the second action have normal distribution functions with unit variance and an unknown mathematical expectation. According to the main theorem of game theory, the minimax strategy and minimax risk are sought for as Bayesian, corresponding to the worst-case prior distribution. Results can be applied to parallel data processing systems if there are two processing methods available with an a priori known efficiency of the first.

UDC: 621.391.1+503.5

Received: 02.09.2014
Revised: 25.02.2015


 English version:
Problems of Information Transmission, 2015, 51:2, 177–191

Bibliographic databases:


© Steklov Math. Inst. of RAS, 2024