Abstract:
Improvement of text documents clusterization technology based on number clusters optimization and their initial allocation, and also a choice of the most adequate metrics are considered. The results received during experiments confirm efficiency of the offered approach.
Keywords:text, data clustering, class, vector, metrics, centre of cluster, heading, experiment.