N. V. Nor, “Reinforcement Learning in the Task of Spherical Robot Motion Control”, Rus. J. Nonlin. Dyn., 2024, том 20, номер 2,страницы 295

Nonlinear engineering and robotics

Reinforcement Learning in the Task of Spherical Robot Motion Control

N. V. Nor

Lomonosov Moscow State University, Leninsikie gory 1, Moscow, 119991 Russia

Аннотация: This article discusses one of the DDPG (Deep Deterministic Policy Gradient) reinforcement learning algorithms applied to the problem of motion control of a spherical robot. Inside the spherical robot shell there is a platform with a wheel, and the robot is simulated in the MuJoCo physical simulation environment.
The goal is to teach the robot to move along an arbitrary closed curve with minimal error.
The output control algorithm is a pair of trained neural networks — actor and critic, where the actor-network is used to obtain the control torques applied to the robot wheel and the criticnetwork is only involved in the learning process. The results of the training are shown below, namely how the robot performs the motion along ten arbitrary trajectories, where the main quality functional is the average error magnitude over the trajectory length scale. The algorithm is implemented using the PyTorch machine learning library.

Ключевые слова: control, control of a mechanical system, spherical robot, mechanics, artificial intelligence, reinforcement learning, Q-learning, DDPG, actor-critic, multilayer neural network, MuJoCo, PyTorch

MSC: 70Q05, 68T40, 70E55, 68T05, 93E35

Поступила в редакцию: 28.11.2022
Принята в печать: 19.03.2024

Язык публикации: английский

DOI: 10.20537/nd240501