RUS  ENG
Full version
JOURNALS // Diskretnyi Analiz i Issledovanie Operatsii // Archive

Diskretn. Anal. Issled. Oper., 2023 Volume 30, Issue 3, Pages 96–110 (Mi da1329)

Ptas for problems of vector choice and clustering with different centers

A. V. Pyatkin

Sobolev Institute of Mathematics, 4 Akad. Koptyug Avenue, 630090 Novosibirsk, Russia

Abstract: Two close in statements problems are considered. The first one is clustering, i. e. partitioning the set of $d$-dimensional Euclidean vectors into the given number of clusters with different types of centers so that the total dispersion would be minimum. By dispersion here we mean the sum of squared distances between the elements of the clusters and their centers. There are three types of centers: an arbitrary point (clearly, the centroid is the best choice), a point of the initial set (so-called medoid) or a fixed point of the space given in advance. The sizes of the clusters are also given as a part of the input. The second problem is the vector subset choice problem, which is finding a subset of vectors of fixed cardinality having the minimum sum of squared distances between its elements and the centroid. For each of these problems a PTAS is constructed. Bibliogr. 23.

Keywords: clustering, centroid, medoid, approximation, PTAS, MSSC.

UDC: 519.8+518.25

Received: 07.02.2023
Revised: 24.04.2023
Accepted: 25.04.2023

DOI: 10.33048/daio.2023.30.763



© Steklov Math. Inst. of RAS, 2025