RUS  ENG
Full version
JOURNALS // Preprints of the Keldysh Institute of Applied Mathematics // Archive

Keldysh Institute preprints, 2018 170, 21 pp. (Mi ipmp2529)

This article is cited in 2 papers

The solution of the problem of bluff detection in the game «I-doubt-it» based on reinforcement learning

S. A. Knyazyatov, G. G. Malinetskiy


Abstract: In this paper we consider the construction of an algorithm based on reinforcement learning for the problem of recognizing and using a bluff on the example of a card game «I-doubt-it». The constructed algorithm has the 'intellectual ability' to restructure its behavior strategy and to evaluate possible moves based on previous experience.This class of algorithms used to make decisions in rapidly changing environments. The method and results of comparing algorithms among themselves, the results of games of the best algorithms with a real opponent are obtained. The effect of 'overfitting' is detected, increasing the number of training batches, in some cases, does not improve, but worsens the quality of the algorithm.

Keywords: reinforcement learning, mathematical modeling, $Q$-learning, SARSA($\lambda$) method, bluff detection algorithm, bluff imitation, neural networks, high-speed decision making.

DOI: 10.20948/prepr-2018-170



Bibliographic databases:


© Steklov Math. Inst. of RAS, 2024