A. V. Kolnogorov, “Two-armed bandit problem for parallel data processing systems”, Probl. Peredachi Inf., 2012, Volume 48, Issue 1,Pages <nobr>83

This article is cited in 6 papers

Large Systems

Two-armed bandit problem for parallel data processing systems

A. V. Kolnogorov

Applied Mathematics and Information Science Chair, Yaroslav-the-Wise Novgorod State University

Abstract: We consider application of the two-armed bandit problem to processing a large number $N$ of data where two alternative processing methods can be used. We propose a strategy which at the first stages, whose number is at most $r-1$, compares the methods, and at the final stage applies only the best one obtained from the comparison. We find asymptotically optimal parameters of the strategy and observe that the minimax risk is of the order of $N^\alpha$, where $\alpha=2^{r-1}/(2^r-1)$. Under parallel processing, the total operation time is determined by the number $r$ of stages but not by the number $N$ of data.

UDC: 621.391.1+503.5

Received: 22.03.2011
Revised: 19.09.2011