Abstract:
A scheme for solving the problem of binary clustering of semistructured data is proposed. Different ways of representing the input data of the clustering problem are considered. Methods of consecutive reduction and consecutive association of clusters are examined, together with a model for the initial placement of the clusters. Estimates of the number of clusters for solving the clustering problem are given. A method for binary clustering of points located on a circle is proposed.