Sistemy i Sredstva Informatiki [Systems and Means of Informatics], 2014, Volume 24, Issue 2, Pages 131–142 (Mi ssi349)

This article is cited in 1 paper

Method for extracting single-word translation correspondences from parallel texts using distributional semantics models

Yu. I. Morozova, E. B. Kozerenko, M. M. Sharnin

Institute of Informatics Problems, Russian Academy of Sciences, 44-2 Vavilov Str., Moscow 119333, Russian Federation

Abstract: The paper addresses corpus-based research on linguistic units. It defines the task of extracting translation correspondences from a parallel corpus and reviews existing approaches to this task, focusing on an approach based on distributional semantics models. The authors describe the theoretical model they developed and its software implementation. A test parallel corpus of French and English patent texts was compiled for this research, and the paper reports the results of an experiment on extracting single-word translation correspondences from that corpus.

Keywords: extracting translation correspondences; alignment; parallel texts; parallel corpus; distributional semantics; vector space model.
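
The abstract does not spell out the authors' model, so the following is only a generic illustration of the vector space idea named in the keywords, not the method from the paper. It assumes a sentence-aligned parallel corpus, represents each word by the sentence pairs it occurs in, and picks for each source word the target word with the highest cosine similarity. The function names and the toy French and English sentences are hypothetical.

```python
import math
from collections import defaultdict


def occurrence_vectors(sentences):
    """Map each word to a sparse vector {sentence_index: count}."""
    vectors = defaultdict(lambda: defaultdict(int))
    for index, sentence in enumerate(sentences):
        for word in sentence.lower().split():
            vectors[word][index] += 1
    return vectors


def cosine(u, v):
    """Cosine similarity of two sparse vectors stored as dicts."""
    dot = sum(u[i] * v[i] for i in u if i in v)
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def best_correspondences(source_sentences, target_sentences):
    """For each source word, return the most similar target word.

    Assumes source_sentences[i] is a translation of target_sentences[i].
    Ties are broken by first occurrence in the target corpus.
    """
    source = occurrence_vectors(source_sentences)
    target = occurrence_vectors(target_sentences)
    return {
        word: max(target, key=lambda t: cosine(vector, target[t]))
        for word, vector in source.items()
    }


# Hypothetical toy corpus; real patent corpora are far larger and noisier.
french = [
    "le brevet décrit un dispositif",
    "le dispositif comprend un moteur",
    "un moteur puissant",
    "le brevet",
]
english = [
    "the patent describes a device",
    "the device comprises a motor",
    "a powerful motor",
    "the patent",
]

pairs = best_correspondences(french, english)
print(pairs["brevet"], pairs["dispositif"], pairs["moteur"])
```

On this toy corpus the sketch pairs "brevet" with "patent", "dispositif" with "device", and "moteur" with "motor". Words that occur in exactly the same sentence pairs are indistinguishable to it, which is one reason practical systems add frequency weighting, filtering, or larger context windows.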

Received: 27.03.2014

DOI: 10.14357/08696527140209


