RUS  ENG
Full version
JOURNALS // Diskretnyi Analiz i Issledovanie Operatsii // Archive

Diskretn. Anal. Issled. Oper., 2020 Volume 27, Issue 2, Pages 5–16 (Mi da948)

This article is cited in 2 papers

Computational complexity of the problem of choosing typical representatives in a 2-clustering of a finite set of points in a metric space

I. A. Borisovaab

a Novosibirsk State University, 2 Pirogov Street, 630090 Novosibirsk, Russia
b Sobolev Institute of Mathematics, 4 Acad. Koptyug Avenue, 630090 Novosibirsk, Russia

Abstract: We consider the computational complexity of one extremal problem of choosing a subset of $p$ points from some given $2$-clustering of a finite set in a metric space. The chosen subset of points has to describe the given clusters in the best way from the viewpoint of some geometric criterion. This is a formalization of an applied problem of data mining which consists in finding a subset of typical representatives of a dataset composed of two classes based on the function of rival similarity. The problem is proved to be NP-hard. To this end, we polynomially reduce to the problem one of the well-known problems NP-hard in the strong sense, the $p$-median problem. Bibliogr. 15.

Keywords: NP-hard problem, typical representative, rival similarity, $p$-median problem, data mining.

UDC: 519.254

Received: 10.09.2018
Revised: 24.12.2019
Accepted: 19.02.2020

DOI: 10.33048/daio.2020.27.631


 English version:
Journal of Applied and Industrial Mathematics, 2020, 14:2, 242–248

Bibliographic databases:


© Steklov Math. Inst. of RAS, 2024