A. V. Kolnogorov, “One-armed bandit problem for parallel data processing systems”, Probl. Peredachi Inf., 2015, Volume 51, Issue 2,Pages <nobr>99

This article is cited in 11 papers

Large Systems

One-armed bandit problem for parallel data processing systems

A. V. Kolnogorov

Applied Mathematics and Information Science Department, Yaroslav-the-Wise Novgorod State University, Yaroslav, Russia

Abstract: We consider the minimax setting for the one-armed bandit problem, i.e., for the two-armed bandit problem with a known distribution function of incomes corresponding to the first action. Incomes that correspond to the second action have normal distribution functions with unit variance and an unknown mathematical expectation. According to the main theorem of game theory, the minimax strategy and minimax risk are sought for as Bayesian, corresponding to the worst-case prior distribution. Results can be applied to parallel data processing systems if there are two processing methods available with an a priori known efficiency of the first.

UDC: 621.391.1+503.5

Received: 02.09.2014
Revised: 25.02.2015