RUS  ENG
Full version
JOURNALS // Vestnik of Astrakhan State Technical University. Series: Management, Computer Sciences and Informatics // Archive

Vestn. Astrakhan State Technical Univ. Ser. Management, Computer Sciences and Informatics, 2019 Number 4, Pages 106–114 (Mi vagtu605)

This article is cited in 1 paper

COMPUTER SOFTWARE AND COMPUTING EQUIPMENT

Increasing quality of classifying objects using new metrics of clustering

R. Yu. Deminaa, I. M. Azmukhamedovb

a Astrakhan State Technical University, Astrakhan, Russian Federation
b Astrakhan State University, Astrakhan, Russian Federation

Abstract: The article touches upon one of the main problems of machine learning — clustering objects. It has been widely used in various subject areas: marketing, sociology, psychology, etc. Clusterization algorithms, as a rule, are based on a metric that reflects the distance between objects. However, in some cases it is not practical to use the distance between objects. In certain situations, it is possible to say that one object is similar to the other, the latter being not similar to the former. The original picture and its copy may serve as an example. For such cases, a measure of object similarity is proposed in the work, which shows how many features of one object are contained in another one. A similarity matrix is built on this measure, the analysis of which allows revealing clusters of mutually similar objects. When testing the proposed clustering method, the Rand index (the proportion of correctly connected or unrelated objects) made 0.93. There has been proposed an algorithm that allows to form a set of objects absolutely different from each other. A set of objects formed in this way can later become a learning set for classifiers and increase their fidelity in recognition.

Keywords: clustering, metric, comparison, degree of likeliness, training set, object’s features, Rand index.

UDC: 004.855

Received: 19.09.2019

DOI: 10.24143/2072-9502-2019-4-106-114



© Steklov Math. Inst. of RAS, 2024