RUS  ENG
Full version
JOURNALS // Informatika i Ee Primeneniya [Informatics and its Applications] // Archive

Inform. Primen., 2025 Volume 19, Issue 3, Pages 67–72 (Mi ia955)

Classification of small sets of data of large dimension

A. A. Grushoa, N. A. Grushoa, M. I. Zabezhailoa, V. V. Kulchenkovb, E. E. Timoninaa

a Federal Research Center “Computer Science and Control” of the Russian Academy of Sciences, 44-2 Vavilov Str., Moscow 119133, Russian Federation
b VTB Bank, 43-1 Vorontsovskaya Str., Moscow 109147, Russian Federation

Abstract: The problem of classifying of data of very large dimension is considered, while only a limited set of training samples of such data is used. Under these conditions, the possibility of using cause-and-effect relationships in solving classification problems of the specified type is checked. Problem solving is based on the existence of cause-and-effect relationships of unknown causes with the observed partially determined effects of these causes in incoming new data. Training on small set of data is used. The problems are solved in conditions when the size of the data and the number of possible data properties tend to infinity. Asymptotic conditions for unambiguous classification of new data were found. In a particular case, the classification problem was investigated in the presence of random distortions of deterministic effects in the data. The conditions for the possibility of training without a teacher are formulated. The work shows the fundamental possibilities of applying cause-and-effect relationships in the tasks of medical diagnostics, identifying fraudulent schemes in the financial sector, and assessing situational awareness in cybersecurity.

Keywords: classification of data of large dimension, artificial intelligence, cause-and-effect relationships.

Received: 19.05.2025
Accepted: 15.08.2025

DOI: 10.14357/19922264250308



© Steklov Math. Inst. of RAS, 2025