RUS  ENG
Full version
JOURNALS // Informatika i Ee Primeneniya [Informatics and its Applications] // Archive

Inform. Primen., 2020 Volume 14, Issue 4, Pages 108–116 (Mi ia704)

This article is cited in 7 papers

Evolution of classifications in supracorpora databases

A. A. Goncharov, I. M. Zatsman, M. G. Kruzhkov

Institute of Informatics Problems, Federal Research Center “Computer Science and Control” of the Russian Academy of Sciences, 44-2 Vavilov Str., Moscow 119333, Russian Federation

Abstract: The paper examines the task of recording changes to descriptions of meanings of German modal verbs in the process of annotating parallel German-Russian texts within a supracorpora database. This task was used as a case study to analyze the specifics of using dynamic classification systems (DCS) in information systems. The distinctive feature of a DCS is that semantic content of its concepts may change in the process of annotation which often entails the need to reclassify previously annotated data according to the changes made. This paper aims to answer the following questions: ($i$) What factors may have an impact on the need to edit and/or reclassify the annotations created prior to the concept changes? and ($ii$) What kind of operations may be used to represent the changes to concepts in the DCS? The paper describes seven types of possible changes and enumerates the corresponding operations applied to the DCS concepts in the process of annotation. The operations are grouped in three categories depending on how they affect the need to reclassify the previously created annotations.

Keywords: dynamic classification, faceted classification, reclassification, supracorpora databases.

Received: 05.10.2020

DOI: 10.14357/19922264200415



© Steklov Math. Inst. of RAS, 2024