Abstract:
The paper discusses an information technology that supports expertise of machine translations. The technology has been developed to meet the following conditions: ($i$) there are connectives in all translated contexts; ($ii$) the connectives may be both one-word (khotya ‘although,’ a ‘and’) and multiword (da esche ‘and beside this,’ no zato ‘but instead’); and ($iii$) between words making up a given connective, there may be a space (esli (space) tak ‘if (space) then’). With this technology, expertise of machine translations develops through three main stages: ($i$) linguistic annotation of machine translations in a supracorpora database; ($ii$) quantitative processing of annotations; and ($iii$) linguistic analysis of annotations and quantitative data. The paper describes technological aspects of the first two stages. The examples given are only those with multiword connectives. Source sentences chosen for machine translation have been collected from literary texts.
Keywords:supracorpora database, machine translation, classification of errors, technology supporting expertise, linguistic annotation, corpus linguistics, connectives.