S. A. Knyazyatov, G. G. Malinetskiy, “The solution of the problem of bluff detection in the game «I-doubt-it» based on reinforcement learning”, Keldysh Institute preprints, 2018, 170, 21 pp.

This article is cited in 2 papers

The solution of the problem of bluff detection in the game «I-doubt-it» based on reinforcement learning

S. A. Knyazyatov, G. G. Malinetskiy

Abstract: In this paper we consider the construction of a reinforcement learning algorithm for recognizing and using a bluff, taking the card game «I-doubt-it» as an example. The constructed algorithm has the 'intellectual ability' to restructure its behavior strategy and to evaluate possible moves based on previous experience. This class of algorithms is used to make decisions in rapidly changing environments. We present a method for comparing algorithms with one another, the results of that comparison, and the results of games played by the best algorithms against a real opponent. An 'overfitting' effect is detected: in some cases, increasing the number of training batches does not improve but worsens the quality of the algorithm.

Keywords: reinforcement learning, mathematical modeling, $Q$-learning, SARSA($\lambda$) method, bluff detection algorithm, bluff imitation, neural networks, high-speed decision making.

DOI: 10.20948/prepr-2018-170