RUS  ENG
Full version
JOURNALS // Avtomatika i Telemekhanika // Archive

Avtomat. i Telemekh., 2018 Issue 8, Pages 129–147 (Mi at14742)

This article is cited in 3 papers

Optimization, System Analysis, and Operations Research

Deep learning model selection of suboptimal complexity

O. Yu. Bakhteeva, V. V. Strijovb

a Moscow Institute of Physics and Technology, Moscow, Russia
b Dorodnicyn Computing Centre, Russian Academy of Sciences, Moscow, Russia

Abstract: We consider the problem of model selection for deep learning models of suboptimal complexity. The complexity of a model is understood as the minimum description length of the combination of the sample and the classification or regression model. Suboptimal complexity is understood as an approximate estimate of the minimum description length, obtained with Bayesian inference and variational methods. We introduce probabilistic assumptions about the distribution of parameters. Based on Bayesian inference, we propose the likelihood function of the model. To obtain an estimate for the likelihood, we apply variational methods with gradient optimization algorithms. We perform a computational experiment on several samples.

Keywords: classification, regression, deep learning, model selection, Bayesian inference, variational inference, complexity.

Presented by the member of Editorial Board: F. T. Aleskerov

Received: 02.04.2017


 English version:
Automation and Remote Control, 2018, 79:8, 1474–1488

Bibliographic databases:


© Steklov Math. Inst. of RAS, 2024