RUS  ENG
Full version
JOURNALS // Zhurnal Vychislitel'noi Matematiki i Matematicheskoi Fiziki // Archive

Zh. Vychisl. Mat. Mat. Fiz., 2025 Volume 65, Number 3, Pages 275–293 (Mi zvmmf11936)

Optimal control

Sparse and transferable universal singular vectors attack

K. Kuvshinovaab, O. Tsymboiac, I. Oseledetsdc

a Sber AI Lab, Moscow, Russia
b Skolkovo Institute of Science and Technology, Moscow, Russia
c Artificial Intelligence Research Institute (AIRI), Moscow, Russia
d Moscow Institute of Physics and Technology, Moscow, Russia

Abstract: Mounting concerns about neural networks’ safety and robustness call for a deeper understanding of models’ vulnerability and research in adversarial attacks. Motivated by this, we propose a novel universal attack that is highly efficient in terms of transferability. In contrast to the existing $(p,q)$-singular vectors approach, we focus on finding sparse singular vectors of Jacobian matrices of the hidden layers by employing the truncated power iteration method. We discovered that using resulting vectors as adversarial perturbations can effectively attack the original model and models with entirely different architectures, highlighting the importance of sparsity constraint for attack transferability. Moreover, we achieve results comparable to dense baselines while damaging less than 1% of pixels and utilizing only 256 samples for perturbation fitting. Our algorithm also admits higher attack magnitude without affecting the human ability to solve the task, and damaging 5% of pixels attains more than a 50% fooling rate on average across models. Finally, our findings demonstrate the vulnerability of state-of-the-art models to universal sparse attacks and highlight the importance of developing robust machine learning systems.

Key words: computer vision, adversarial attacks.

UDC: 519.853

Received: 06.11.2024
Accepted: 06.11.2024

Language: English

DOI: 10.31857/S0044466925030044


 English version:
Computational Mathematics and Mathematical Physics, 2025, 65:3, 503–521

Bibliographic databases:


© Steklov Math. Inst. of RAS, 2025