RUS  ENG
Full version
JOURNALS // Intelligent systems. Theory and applications // Archive

Intelligent systems. Theory and applications, 2022 Volume 26, Issue 3, Pages 64–73 (Mi ista481)

Part 2. Special Issues in Intellectual Systems Theory

Residual network with recurrent structures

L. Jianga, Zh. Cuib, Z. Wanga

a Lomonosov Moscow State University, Faculty of Mechanics and Mathematics
b Lomonosov Moscow State University, Faculty of Computational Mathematics and Cybernetics

Abstract: We introduce a recurrent structure (spatially) on residual networks, which can improve the performance of the network while saving parameters. We investigate the behaviour of recurrent structures in residual networks based on Riemannian manifolds, introducing curvature as a metric for neural networks. We also experimentally verify that the gain due to the recurrent structure is related to the curvature, and demonstrate the generality of the recurrent structure as a method to improve the performance of the network.

Keywords: Neural Netrowks, Riemannian geometry, Recurrent structures, Manifold, Transformers.



© Steklov Math. Inst. of RAS, 2024