I. A. Salenek, Ya. A. Seliverstov, S. A. Seliverstov, E. A. Sofronova, “Improving the quality of route generation in SUMO based on data from detectors using reinforcement learning”, Computer Research and Modeling, 2024, Volume 16, Issue 1,Pages <nobr>137

SPECIAL ISSUE

Improving the quality of route generation in SUMO based on data from detectors using reinforcement learning

I. A. Salenek^a, Ya. A. Seliverstov^b, S. A. Seliverstov^b, E. A. Sofronova^c

^a St. Petersburg Institute of Informatics and Automation of the Russian Academy of Sciences, 39 14-th Line VO, St. Petersburg, 199178, Russia
^b Solomenko Institute of Transport Problems of the Russian Academy of Sciences, 13 12-th Line VO, St. Petersburg, 199178, Russia
^c Federal Research Center “Computer Science and Control” of the Russian Academy of Sciences, 44/2 Vavilova st., Moscow, 119333, Russia

Abstract: This work provides a new approach for constructing high-precision routes based on data from transport detectors inside the SUMO traffic modeling package. Existing tools such as flowrouter and routeSampler have a number of disadvantages, such as the lack of interaction with the network in the process of building routes. Our rlRouter uses multi-agent reinforcement learning (MARL), where the agents are incoming lanes and the environment is the road network. By performing actions to launch vehicles, agents receive a reward for matching data from transport detectors. Parameter Sharing DQN with the LSTM backbone of the Q-function was used as an algorithm for multi-agent reinforcement learning.
Since the rlRouter is trained inside the SUMO simulation, it can restore routes better by taking into account the interaction of vehicles within the network with each other and with the network infrastructure. We have modeled diverse traffic situations on three different junctions in order to compare the performance of SUMO’s routers with the rlRouter. We used Mean Absoluter Error (MAE) as the measure of the deviation from both cumulative detectors and routes data. The rlRouter achieved the highest compliance with the data from the detectors. We also found that by maximizing the reward for matching detectors, the resulting routes also get closer to the real ones. Despite the fact that the routes recovered using rlRouter are superior to the routes obtained using SUMO tools, they do not fully correspond to the real ones, due to the natural limitations of induction-loop detectors. To achieve more plausible routes, it is necessary to equip junctions with other types of transport counters, for example, camera detectors.

Keywords: transport modeling, multi-agent reinforcement learning, intelligent transport systems

UDC: 519.876.5, 519.179.2

Received: 21.11.2023
Accepted: 21.12.2023

Language: English

DOI: 10.20537/2076-7633-2024-16-1-137-146