Informatika i Ee Primeneniya [Informatics and its Applications]

Inform. Primen., 2017, Volume 11, Issue 1, Pages 100–108 (Mi ia463)

This article is cited in 22 papers

Supracorpora database on connectives: term system development

Anna A. Zaliznyak (a, b), I. M. Zatsman (b), O. Yu. Inkova (c)

(a) Institute of Linguistics, Russian Academy of Sciences, 1-1 Bolshoy Kislovskiy Per., Moscow 125009, Russian Federation
(b) Institute of Informatics Problems, Federal Research Center “Computer Science and Control” of the Russian Academy of Sciences, 44-2 Vavilov Str., Moscow 119333, Russian Federation
(c) University of Geneva, 22 Bd des Philosophes, CH-1205 Geneva 4, Switzerland

Abstract: The article considers the supracorpora database (SCDB), a new type of linguistic information resource. The SCDB contains parallel texts in which source-language sentences are aligned with their target-language counterparts. One distinctive feature of the SCDB is that it supports annotation of the linguistic items under examination (in this case, connectives). Another important feature is that cross-linguistic annotation makes it possible to reveal a wide spectrum of new entities and concepts in both informatics and linguistics. To describe these entities and concepts, a new multidisciplinary term system is proposed. On the one hand, linguists use the proposed terms to describe the new basic knowledge generated by contrastive analysis of Russian connectives. On the other hand, the architecture and functional subsystems of the SCDB are designed around these terms, which are also used in developing the corresponding information, linguistic, and software tools. Finally, the term system is needed to compare the outcomes of the project with similar results from other projects.

Keywords: supracorpora database; term system; connectives; linguistic annotation; parallel texts; corpus linguistics; chronotypical faceted classification.

Received: 17.01.2017

DOI: 10.14357/19922264170109


