RUS
ENG
Full version
JOURNALS
// Proceedings of Machine Learning Research (PMLR)
// Archive
Proc. Mach. Learn. Res. (PMLR), 2024, Volume 247,
Pages
4511–4547
(Mi pmlr3)
Improved high-probability bounds for the temporal difference learning algorithm via exponential stability
Sergey Samsonov
a
,
Daniil Tiapkin
bc
,
Alexey Naumov
ad
,
Eric Moulines
b
a
HSE University, Moscow, Russia
b
Centre de Mathématiques Appliquées – CNRS – École polytechnique – Institut Polytechnique de Paris, France
c
Université Paris-Saclay, CNRS, Laboratoire de mathématiques d'Orsay, France
d
Steklov Mathematical Institute of Russian Academy of Sciences, Moscow, Russia
Language:
English
©
Steklov Math. Inst. of RAS
, 2025