RUS  ENG
Full version
JOURNALS // Sistemy i Sredstva Informatiki [Systems and Means of Informatics] // Archive

Sistemy i Sredstva Inform., 2019 Volume 29, Issue 1, Pages 180–193 (Mi ssi632)

This article is cited in 11 papers

Information transformations of parallel texts in knowledge extraction

A. A. Goncharov, I. M. Zatsman

Institute of Informatics Problems, Federal Research Center "Computer Science and Control" of the Russian Academy of Sciences, 44-2 Vavilov Str., Moscow 119133, Russian Federation

Abstract: The paper examines the task of goal-oriented discovery and filling of lacunas in linguistic typologies considered as forms of knowledge representation. The process of solving this task includes several repeated stages which collectively form one iteration of the proposed solution to the task of goal-oriented knowledge discovery in parallel texts required to fill the lacunas. Parallel texts as an information resource are transformed in the process of solving this task. The purpose of the paper is to describe the types of information transformations of parallel texts that are used during early stages of the process of knowledge*discovery and filling of lacunas in linguistic typologies. As a part of knowledge discovery, first, the parallel texts are fragmented into objects of interpretation and then, the search for potential sources of knowledge capable to fill the lacunas is performed. This paper considers this fragmentation process as one of the information transformation types of parallel texts.

Keywords: discovery of lacunas, filling of lacunas, linguistic typology, knowledge extraction from parallel texts, corpus linguistics, objects of interpretation.

Received: 15.02.2019

DOI: 10.14357/08696527190115



Bibliographic databases:


© Steklov Math. Inst. of RAS, 2024