RUS  ENG
Full version
JOURNALS // Teoriya Veroyatnostei i ee Primeneniya // Archive

Teor. Veroyatnost. i Primenen., 1978 Volume 23, Issue 2, Pages 313–330 (Mi tvp3039)

This article is cited in 10 papers

The existence of a stationary $\varepsilon$-optimal policy for a finite Markov chain

E. A. Faĭnberg

Moscow State University of Railway Communications

Abstract: The existence of a stationary average reward $\varepsilon$-optimal policy is proved for discrete time Markov decision chains with finitely many states, compact sets of actions, continuous transition functions and upper semicontinuous reward functions.

Received: 02.03.1976


 English version:
Theory of Probability and its Applications, 1979, 23:2, 297–313

Bibliographic databases:


© Steklov Math. Inst. of RAS, 2024