Dmitriy S. Smirnov, Ekaterina V. Gromova, “Decision-making model under presence of experts as a modified multi-armed bandit problem”, Mat. Teor. Igr Pril., 2017, Volume 9, Issue 4,Pages <nobr>69

This article is cited in 3 papers

Decision-making model under presence of experts as a modified multi-armed bandit problem

Dmitriy S. Smirnov, Ekaterina V. Gromova

Saint-Petersburg State University, Faculty of Applied Mathematics and Control Process

Abstract: The modified multi-armed bandit problem is formulated in the paper which allows the player to use so-called expert hints in the decision making process. As a player in this problem is meant some automated system that uses a certain strategy (algorithm) for making a decision under conditions of uncertainty. The approach is developed for the case of $m$ experts. A modification of the well-known UCB1 algorithm is proposed to solve the multi-armed bandit problem. The results of a numerical experiment are given in order to show influence of expert hints on the player's payoff.

Keywords: multi-armed bandit problem, decision making, optimization methods, machine learning algorithms.

UDC: 519.81, 004.021, 004.942
BBK: 22.18