A. Yu. Egorova, I. M. Zatsman, V. V. Kosarik, V. A. Nuriev, “Instability of neural machine translation”, Sistemy i Sredstva Inform., 2020, Volume 30, Issue 2,Pages <nobr>124

This article is cited in 5 papers

Instability of neural machine translation

A. Yu. Egorova, I. M. Zatsman, V. V. Kosarik, V. A. Nuriev

Institute of Informatics Problems, Federal Research Center "Computer Science and Control" of the Russian Academy of Sciences, 44-2 Vavilov Str., Moscow 119333, Russian Federation

Abstract: The paper describes an experiment focused on studying the instability of neural machine translation (NMT). In the course of a year, an array of text fragments in Russian was repeatedly translated into French. The time step was one month. To produce translations, the Google's NMT system was used. The experiment helps reveal the instability of NMT, i. e., it shows that translations of a given text fragment tend to change with time but not always improving the quality. The generated translations were linguistically annotated, which led to uncovering several different types of the NMT instability. While annotating, a previously designed classification of machine translation errors was employed. It was altered to meet the objectives of the experiment, the ultimate goal of which was to obtain a frequency distribution of different types of the NMT instability. Yet, the first step of the experiment limited itself to only categorizing the NMT instability, and it is this very step that the paper describes. As the empirical data, the experiment uses Russian–French annotations generated in a supracorpora database. Each annotation contains a fragment of the source Russian text, its translation into French, and the description of translation errors occurring there.

Keywords: machine translation, instability, translation monitoring, linguistic annotation, instability types.

Received: 03.03.2020

DOI: 10.14357/08696527200212