Abstract:
We consider a stimulation model with Markov dynamics and discounted optimality criteria in case of discrete time and infinite planning horizon. In this model, the regulator has an economic impact on the executor, choosing a stimulating function that depends on the system state and the actions of the executor, who employs positional control strategies. System dynamics, revenues of the regulator and costs of the executor depend on the system state and the executor’s actions. We show that finding an approximate solution of the (inverse) Stackelberg game reduces to solving the optimal control problem with criterion equal to the difference between the revenue of the regulator and the costs of the executor. Here the $\varepsilon$-optimal strategy of the regulator is to economically motivate the executor to follow this optimal control strategy.