RUS  ENG
Full version
JOURNALS // Informatika i Ee Primeneniya [Informatics and its Applications] // Archive

Inform. Primen., 2021 Volume 15, Issue 3, Pages 41–50 (Mi ia742)

This article is cited in 2 papers

Routing jobs to heterogeneous parallel queues using distributed policy gradient algorithm

M. G. Konovalov, R. V. Razumchik

Federal Research Center “Computer Science and Control” of the Russian Academy of Sciences, 44-2 Vavilov Str., Moscow 119333, Russian Federation

Abstract: The problem of dispatching to heterogeneous servers, operating independently in parallel, is considered. Each server has a single processor and a dedicated FIFO (first in, first out) queue of infinite capacity. Homogeneous jobs (without preceding constraints) arrive one by one to the dispatcher which immediately makes a routing decision. Both jobs interarrival times and their sizes are assumed to be independent and identically distributed random variables with general distributions. Upon making a decision, full information about the current system's state, including the arriving job size, is available to the dispatcher. The problem is to minimize the long-run system's mean response time. A new sample-path-based policy gradient algorithm is proposed which allows one to construct such a policy. Its main ingredients are the dynamically changing discretization of the continuous state space and individual policy gradient algorithms acting in each cell. Simple numerical examples are given which demonstrate that the new algorithm can outperform best known solutions and is applicable in quite general cases.

Keywords: heterogeneous parallel queues, Markov chains with continuous state space, sojourn time optimization.

Received: 13.07.2021

DOI: 10.14357/19922264210306



© Steklov Math. Inst. of RAS, 2024