RUS  ENG
Full version
JOURNALS // Avtomatika i Telemekhanika // Archive

Avtomat. i Telemekh., 2021 Issue 4, Pages 140–160 (Mi at15527)

Intellectual Control Systems, Data Analysis

Entropy-randomized method for the reconstruction of missing data

Yu. A. Dubnovabc, V. Yu. Polishchukde, Yu. S. Popkovafg, Yu. M. Polishchukh, A. V. Mel'nikovh, E. S. Sokolh

a Federal Research Center “Computer Science and Control,” Russian Academy of Sciences, Moscow, 119333 Russia
b National Research University Higher School of Economics, Moscow, 101000 Russia
c Moscow Institute of Physics and Technology, Dolgoprudnyi, Moscow oblast, 141700 Russia
d Tomsk Polytechnic University, Tomsk, 634050 Russia
e Institute of Monitoring of Climatic and Ecological Systems of the Siberian Branch of the Russian Academy of Sciences, Tomsk, 634055 Russia
f ORT Braude Academic College of Engineering, University of Haifa, Karmiel, 2161002 Israel
g Trapeznikov Institute of Control Sciences, Russian Academy of Sciences, Moscow, 117997 Russia
h Ugra Research Institute of Information Technologies, Khanty-Mansiisk, 628011 Russia

Abstract: The article deals with the problem of reconstructing missing data in data collections for machine learning problems. We propose a new randomized method for missing data reconstruction based on the technology of entropy-robust estimation and generation of ensembles of random variables. The method is similar to the use of an auxiliary regression to reconstruct missing values, but unlike the latter, no additional constraints are imposed on the likelihood function of errors in the sample in the case of entropy estimation and small amounts of data are permissible; this becomes extremely relevant in problems where the amount of data for training is limited and the omissions are not systematic. The proposed method is used to reconstruct missing data on the areas of thermokarst lakes in the Arctic zone of the Russian Federation as measured from satellite images.

Keywords: missing data reconstruction, entropy-based estimation, randomized machine learning, thermokarst lake, Arctic.

Presented by the member of Editorial Board: A. I. Mikhal'skii

Received: 24.07.2020
Revised: 27.10.2020
Accepted: 08.12.2020

DOI: 10.31857/S0005231021040061


 English version:
Automation and Remote Control, 2021, 82:4, 670–686

Bibliographic databases:


© Steklov Math. Inst. of RAS, 2024