RUS  ENG
Full version
JOURNALS // Sistemy i Sredstva Informatiki [Systems and Means of Informatics] // Archive

Sistemy i Sredstva Inform., 2021 Volume 31, Issue 2, Pages 139–151 (Mi ssi772)

This article is cited in 2 papers

Indicator-based evaluation of machine translation instability

A. Yu. Egorova, I. M. Zatsman, M. G. Kruzhkov, V. A. Nuriev

Institute of Informatics Problems, Federal Research Center "Computer Science and Control" of the Russian Academy of Sciences, 44-2 Vavilov Str., Moscow 119133, Russian Federation

Abstract: The paper presents data collected by tracking performance of a neural machine translation (NMT) engine and results of translation errors analysis. Indicator-based evaluation of NMT instability was carried out as a part of an experiment that involved 250 Russian text fragments. Each month for the duration of one year, these fragments were translated into French using the Google Translate NMT engine. The translations were recorded and annotated in a supracorpora database; the annotations include types of translation errors found in the translations by language experts. This procedure resulted in a series of 12 annotated translations for each of the 250 Russian fragments. The annotations include not only the types of errors found in the translations but also the types of NMT instability which indicate dynamics of translation quality or lack thereof. The paper aims to provide comparative analysis of Google Translate performance that takes into account the temporal variation aspect.

Keywords: neural machine translation (NMT), instability of machine translation, supracorpora database, indicator-based evaluation, linguistic annotation, NMT instability types.

Received: 09.03.2021

DOI: 10.14357/08696527210213



© Steklov Math. Inst. of RAS, 2024