RUS  ENG
Full version
JOURNALS // Doklady Rossijskoj Akademii Nauk. Mathematika, Informatika, Processy Upravlenia // Archive

Dokl. RAN. Math. Inf. Proc. Upr., 2025 Volume 527, Pages 495–522 (Mi danma704)

SPECIAL ISSUE: ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING TECHNOLOGIES

Loss barcode: a topological measure of escapability in loss landscapes

S. A. Barannikovab, D. S. Voronkovaac, A. Mironenkoa, I. Trofimova, A. Korotinac, G. Sotnikova, E. V. Burnaevac

a Skolkovo Institute of Science and Technology
b CNRS, IMJ, Paris City University, Paris, France
c Artificial Intelligence Research Institute, Moscow

Abstract: Neural network training is commonly based on SGD. However, the understanding of SGD's ability to converge to good local minima, given the non-convex nature of loss functions and the intricate geometric characteristics of loss landscapes, remains limited. In this paper, we apply topological data analysis methods to loss landscapes to gain insights into the learning process and generalization properties of deep neural networks. We use the loss function topology to relate the local behavior of gradient descent trajectories with the global properties of the loss surface. For this purpose, we define the neural network's Topological Obstructions score (“TO-score”) with the help of robust topological invariants, barcodes of the loss function, which quantify the escapability of local minima for gradient-based optimization. Our two principal observations are: 1) the loss barcode of the neural network decreases with increasing depth and width, therefore the topological obstructions to learning diminish; 2) in certain situations there is a connection between the length of minima segments in the loss barcode and the minima's generalization errors. Our statements are based on extensive experiments with fully connected, convolutional, and transformer architectures and several datasets including MNIST, FMNIST, CIFAR10, CIFAR100, SVHN, and multilingual OSCAR text dataset.

Keywords: topological data analysis, deep neural networks, loss surface, SGD, persistence barcodes.

UDC: 519.6

Received: 21.08.2025
Accepted: 28.09.2025

DOI: 10.7868/S2686954325070422



Bibliographic databases:


© Steklov Math. Inst. of RAS, 2025