RUS  ENG
Full version
JOURNALS // Informatika i Ee Primeneniya [Informatics and its Applications] // Archive

Inform. Primen., 2017 Volume 11, Issue 3, Pages 123–131 (Mi ia493)

This article is cited in 12 papers

Statistical data as information source for linguistic analysis of Russian connectors

O.  Inkova, N. Popkova

Institute of Informatics Problems, Federal Research Center “Computer Science and Control” of the Russian Academy of Sciences, 44-2 Vavilov Str., Moscow 119333

Abstract: The aim of this paper is to describe statistical data gathered from the supracorpora database (SCDB) of connectors for further analysis of their formal and functional properties. Until now, these properties have usually been described applying semantic analysis, while corpus data, if used at all, have not been subject to statistical processing. It is automatically generated and verifiable information, collected from texts corpora that can be one of the most reliable tools in the analysis of linguistic units, including connectors. The paper shows what statistics one may obtain from the SCDB and how to use it in the linguistic analysis in case of tol'ko, a polyfunctional linguistic unit that can be a part of multicomponent and two-place connectors.

Keywords: annotation of connectors; corpus linguistics; supracorpora databases; parallel texts; statistical data.

Received: 11.07.2017

Language: English

DOI: 10.14357/19922264170314



Bibliographic databases:


© Steklov Math. Inst. of RAS, 2024