Sistemy i Sredstva Informatiki [Systems and Means of Informatics], 2020, Volume 30, Issue 4, Pages 124–137 (Mi ssi741)

This article is cited in 3 papers

Machine translation: Indicator-based evaluation of training progress in neural processing

A. Yu. Egorova, I. M. Zatsman, M. G. Kruzhkov, V. A. Nuriev

Institute of Informatics Problems, Federal Research Center "Computer Science and Control" of the Russian Academy of Sciences, 44-2 Vavilov Str., Moscow 119333, Russian Federation

Abstract: The paper presents data collected while observing the training progress of a neural machine translation (NMT) engine. The observed progress was evaluated qualitatively using a set of indicators. The experimental material consisted of 250 text fragments in Russian. Every month for one year, these fragments were translated into French using Google's publicly available NMT engine. Language experts recorded and annotated the resulting translations in a supracorpora database, yielding a series of 12 annotated translations for each of the 250 Russian fragments. The annotations include labels of translation errors, which enable researchers to determine the types of NMT instability according to whether and how translation quality changes over time. The goal of the paper is to describe the newly developed indicator-based approach and to provide an example of its application to evaluating the training progress of a neural network.

Keywords: neural machine translation, instability of machine translation, indicator-based evaluation, linguistic annotation, instability types.

Received: 14.09.2020

DOI: 10.14357/08696527200412




