E. A. Faĭnberg, “The existence of a stationary <nobr>$\varepsilon$</nobr>-optimal policy for a finite Markov chain”, Teor. Veroyatnost. i Primenen., 1978, Volume 23, Issue 2,Pages <nobr>313

This article is cited in 10 papers

The existence of a stationary $\varepsilon$-optimal policy for a finite Markov chain

E. A. Faĭnberg

Moscow State University of Railway Communications

Abstract: The existence of a stationary average reward $\varepsilon$-optimal policy is proved for discrete time Markov decision chains with finitely many states, compact sets of actions, continuous transition functions and upper semicontinuous reward functions.

Received: 02.03.1976