RUS  ENG
Full version
JOURNALS // Informatika i Ee Primeneniya [Informatics and its Applications] // Archive

Inform. Primen., 2014 Volume 8, Issue 4, Pages 70–77 (Mi ia345)

This article is cited in 2 papers

False texts: classification and methods of identification of text documents with imitations and substitution of authorship

M. Yu. Mikheevab, N. V. Somina, I. V. Galinaa, O. V. Zolotaryevc, E. B. Kozerenkoa, Yu. I. Morozovaa, M. M. Charninea

a Institute of Informatics Problems, Russian Academy of Sciences, 44-2 Vavilov Str., Moscow 119333, Russian Federation
b Research Computer Center, M. V. Lomonosov Moscow State Uviversity (MGU NIVC), 1-52 Leninskiye Gory, GSP-1, Moscow 119991, Russian Federation
c Russian New University, 22 Radio Str., Moscow 105005, Russian Federation

Abstract: Modern textual space, including the Internet, is enormous and is constantly updated with new texts. All text documents can be divided into two large groups: “good texts” and that might be called “false texts”. So far, the industry of false texts flow production has become so massive that there is an urgent need to study this phenomenon and to develop effective methods of detection of such text documents. The purpose of the paper is to give an adequate description of the concept of false text as information and linguistic phenomenon and suggest some approaches to the identification of such texts.

Keywords: text generation; natural language processing; statistical analysis of language objects; plagiarism; typology of false texts.

Received: 01.11.2014

DOI: 10.14357/19922264140409



Bibliographic databases:


© Steklov Math. Inst. of RAS, 2024