Informatika i Ee Primeneniya [Informatics and its Applications]

Inform. Primen., 2020 Volume 14, Issue 1, Pages 113–120 (Mi ia652)

This article is cited in 1 paper

Analytical textology in intelligent processing systems for unstructured data

E. B. Kozerenko (a), M. Yu. Mikheev (b), N. V. Somin (a), L. I. Ehrlich (b), K. I. Kuznetsov (a)

a Institute of Informatics Problems, Federal Research Center “Computer Science and Control” of the Russian Academy of Sciences, 44-2 Vavilov Str., Moscow 119133, Russian Federation
b Research Computing Center, Lomonosov Moscow State University, 1, bld. 4 Leninskie Gory, Moscow, GSP-1, 119991, Russian Federation

Abstract: The paper presents a new field of research at the intersection of linguistics, computer science, and philology. It applies logical and statistical methods to unstructured data in the form of natural language texts in order to solve a number of tasks: extracting explicit and implicit knowledge from texts with a semantics-oriented linguistic processor; forming lexical statistical representations of texts; building analytical conclusions; discovering the author's idiostyle and the textual similarity of literary works through the analysis of service (function) words and other microtext elements; identifying the sentiment of texts; and building a full profile of an author's text by superposing these methods. The textological analysis of the “Blue Book” of the “Petersburg Diary” by Zinaida Hippius is considered as an example.
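
Illustration (not taken from the paper): the idea of comparing texts by their service-word usage can be sketched as a relative-frequency profile over a fixed word list, compared by cosine similarity. The word list, the tokenizer, and the names `FUNCTION_WORDS`, `profile`, and `cosine` are all assumptions made for this example; the abstract does not say which features or which similarity measure the authors use.

```python
from collections import Counter
import math
import re

# Hypothetical, very small English function-word list, for illustration only.
FUNCTION_WORDS = ("the", "and", "of", "in", "to", "a", "but", "not", "with", "that")


def profile(text):
    """Return the relative frequency of each function word in the text."""
    tokens = re.findall(r"[a-z']+", text.lower())
    counts = Counter(tokens)
    total = len(tokens) or 1  # avoid division by zero on empty input
    return [counts[w] / total for w in FUNCTION_WORDS]


def cosine(u, v):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


if __name__ == "__main__":
    text_a = "The diary of the poet and the city in winter."
    text_b = "A letter to the editor, but not with that tone."
    print(cosine(profile(text_a), profile(text_b)))
```

In a realistic setting the word list would be much larger and language-specific (Russian for the Hippius diary), and the profiles would be computed over long text samples rather than single sentences.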

Keywords: natural language processing, statistical methods, cognitive technology, lexical semantic analysis, knowledge extraction from texts, analytical systems.

Received: 15.01.2020

DOI: 10.14357/19922264200115


