RUS  ENG
Full version
JOURNALS // Informatsionnye Tekhnologii i Vychslitel'nye Sistemy // Archive

Informatsionnye Tekhnologii i Vychslitel'nye Sistemy, 2010 Issue 2, Pages 42–49 (Mi itvs12)

DATA PROCESSING

The task of clusterization of text documents

M. V. Khachumov

Peoples' Friendship University of Russia, Moscow

Abstract: Improvement of text documents clusterization technology based on number clusters optimization and their initial allocation, and also a choice of the most adequate metrics are considered. The results received during experiments confirm efficiency of the offered approach.

Keywords: text, data clustering, class, vector, metrics, centre of cluster, heading, experiment.



© Steklov Math. Inst. of RAS, 2024