RUS  ENG
Full version
JOURNALS // Sistemy i Sredstva Informatiki [Systems and Means of Informatics] // Archive

Sistemy i Sredstva Inform., 2018 Volume 28, Issue 4, Pages 156–167 (Mi ssi615)

This article is cited in 12 papers

Supracorpora database of connectives: Design-oriented evolution of the term system

I. M. Zatsman, M. G. Kruzhkov

Institute of Informatics Problems, Federal Research Center "Computer Science and Control" of the Russian Academy of Sciences, 44-2 Vavilov Str., Moscow 119333, Russian Federation

Abstract: This article examines the process of design-oriented evolution of the term system for supracorpora databases (SCDB) which represent a new category of information resources in linguistics. The SCDB is based on parallel texts, i. e., texts placed alongside their translations and aligned with them at the sentence level. Although SCDBs are designed for annotation of a wide variety of linguistic items and their correspondences, this article specifically considers annotation of connectives. The annotation-centered design of SCDBs has led to emergence of new entities and notions in computer linguistics, and in the beginning of 2017, a custom term system was proposed for them. On one hand, the proposed terms are used by linguists in order to describe new knowledge generated as a result of annotation and investigation of linguistic units. On the other hand, these terms serve as a basis for design of the SCDB architecture and the associated dataware, lingware, and software. Since the first description of the terminology, the range of tasks accomplished with SCDBs has expanded significantly; hence, there is the need to further develop the initial design-oriented term system.

Keywords: supracorpora databases, term systems, annotation of linguistic units, parallel texts, corpus linguistics, connectives.

Received: 07.09.2018

DOI: 10.14357/08696527180415



Bibliographic databases:


© Steklov Math. Inst. of RAS, 2024