Zap. Nauchn. Sem. POMI, 2023 Volume 530, Pages 96–112 (Mi znsl7435)

Translate your gibberish: black-box adversarial attack on machine translation systems

A. Chertkov (a, b), O. Tsymboi (c, d), M. Pautov (a), I. Oseledets (a, e, b)

a Skolkovo Institute of Science and Technology, Moscow, Russia
b Institute of Numerical Mathematics, Russian Academy of Sciences
c Moscow Institute of Physics and Technology, Moscow, Russia
d Sber AI Lab, Moscow, Russia
e AIRI, Moscow, Russia

Abstract: Neural networks are widely deployed in natural language processing tasks at industrial scale, and perhaps most often they are used as components of automatic machine translation systems. In this work, we present a simple approach to fool state-of-the-art machine translation tools in the task of translation from Russian to English and vice versa. Using a novel black-box gradient-free tensor-based optimizer, we show that many online translation tools, such as Google, DeepL, and Yandex, may both produce wrong or offensive translations for nonsensical adversarial input queries and refuse to translate seemingly benign input phrases. This vulnerability may interfere with understanding a new language and worsen the user's experience of machine translation systems; hence, these tools require further improvement to achieve better translation quality.

Key words and phrases: natural language processing, machine translation, adversarial attack, black-box optimization.

UDC: 81.322.4

Received: 06.09.2023

Language: English



© Steklov Math. Inst. of RAS, 2024