RUS  ENG
Full version
JOURNALS // Matematicheskaya Teoriya Igr i Ee Prilozheniya // Archive

Mat. Teor. Igr Pril., 2017 Volume 9, Issue 4, Pages 69–87 (Mi mgta209)

This article is cited in 3 papers

Decision-making model under presence of experts as a modified multi-armed bandit problem

Dmitriy S. Smirnov, Ekaterina V. Gromova

Saint-Petersburg State University, Faculty of Applied Mathematics and Control Process

Abstract: The modified multi-armed bandit problem is formulated in the paper which allows the player to use so-called expert hints in the decision making process. As a player in this problem is meant some automated system that uses a certain strategy (algorithm) for making a decision under conditions of uncertainty. The approach is developed for the case of $m$ experts. A modification of the well-known UCB1 algorithm is proposed to solve the multi-armed bandit problem. The results of a numerical experiment are given in order to show influence of expert hints on the player's payoff.

Keywords: multi-armed bandit problem, decision making, optimization methods, machine learning algorithms.

UDC: 519.81, 004.021, 004.942
BBK: 22.18



© Steklov Math. Inst. of RAS, 2024