Informatika i Ee Primeneniya [Informatics and its Applications]

Inform. Primen., 2013, Volume 7, Issue 1, Pages 82–89 (Mi ia247)

This article is cited in 1 paper

Parallel texts alignment strategies: the semantic aspects

E. B. Kozerenko

IPI RAN

Abstract: The paper addresses the design and development of linguistically motivated mechanisms for aligning parallel texts and extracting grammatical (functional-semantic) matches. These matches are used to build statistical language portraits, which are then incorporated into hybrid models of machine translation. Hybrid models are those that employ both statistical and rule-based mechanisms within a single framework for natural language processing. The approach starts from an initial (“seed”) grammar, which is then enriched with the matches discovered in parallel texts. The seed grammar captures the cognitive and functional characteristics of phrase structures. The grammar formalism employed is the Cognitive Transfer Grammar, which is based on transfemes (bilingual phrase structures). The transformations that translators introduce in parallel texts are of primary importance to this research.

Keywords: alignment; parallel texts; syntax; semantics; phrase structures; hybrid models; machine translation.


