Probabilistic criterion-based optimal retention of trajectories of a discrete-time stochastic system in a given tube: bilateral estimation of the Bellman function
Abstract:
This paper examines an optimal control problem with a probabilistic criterion: maximizing the probability that the trajectories of a discrete-time stochastic system remain in given sets. The dynamic programming method is employed to obtain the isobells of levels 1 and 0 of the Bellman function, two-sided estimates for the right-hand side of the dynamic programming equation, two-sided estimates for the Bellman function, and the optimal value of the probabilistic criterion. These results are then used to derive an approximate formula for the optimal control. As an illustrative example, the problem of keeping an inverted pendulum in a neighborhood of its unstable equilibrium is considered.
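To make the setting concrete, the dynamic programming recursion for the probability-of-retention criterion can be sketched numerically. The sketch below is an illustration under assumed data, not the paper's construction: a hypothetical scalar linear system x_{k+1} = a x_k + b u_k + w_k with Gaussian noise, a fixed tube X_k = [-1, 1], and bounded controls. The Bellman function V_k(x) here is the maximal probability of keeping the trajectory in the tube from step k onward, computed backward on a state grid with the expectation over the noise taken by Gauss-Hermite quadrature.

```python
import numpy as np

# Hypothetical data: scalar system x_{k+1} = a*x + b*u + w, w ~ N(0, sigma^2);
# tube X_k = [-1, 1] at every step; admissible controls |u| <= u_max.
a, b, sigma, u_max = 1.1, 0.5, 0.3, 1.0
N = 5                                   # horizon length
xs = np.linspace(-1.0, 1.0, 201)        # state grid covering the tube
us = np.linspace(-u_max, u_max, 41)     # control grid
# Gauss-Hermite (probabilists') nodes/weights for E[.] over Gaussian noise
nodes, weights = np.polynomial.hermite_e.hermegauss(15)
weights = weights / weights.sum()       # normalize so weights sum to 1

V = np.ones_like(xs)                    # V_N(x) = indicator of the tube (1 on grid)
policy = []
for k in range(N - 1, -1, -1):
    Q = np.empty((xs.size, us.size))
    for j, u in enumerate(us):
        # next states for every grid point and every noise node
        xn = a * xs[:, None] + b * u + sigma * nodes[None, :]
        # V_{k+1} vanishes outside the tube; interpolate linearly inside
        Vn = np.interp(xn, xs, V, left=0.0, right=0.0)
        Q[:, j] = Vn @ weights          # expectation over the noise w
    policy.append(us[np.argmax(Q, axis=1)])  # approximate optimal feedback
    V = Q.max(axis=1)                   # V_k(x) = max_u E[V_{k+1}(x_{k+1})]
# V now approximates the probability of retaining the trajectory in
# [-1, 1] over all N steps under the optimal feedback control.
```

The recursion mirrors the structure of the dynamic programming equation for the probabilistic criterion: the terminal Bellman function is the indicator of the terminal set, and each backward step maximizes the conditional retention probability over the admissible controls.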