RUS  ENG
Полная версия
ЖУРНАЛЫ // Наносистемы: физика, химия, математика // Архив

Наносистемы: физика, химия, математика, 2023, том 14, выпуск 6, страницы 613–625 (Mi nano1228)

MATHEMATICS

Toward nanomagnetic implementation of energy-based machine learning

Igor S. Lobanov

Faculty of Physics, ITMO University, Lomonosova Str. 9, Saint Petersburg, 191002 Russia

Аннотация: Some approaches to machine learning (ML) such as Boltzmann machines (BM) can be reformulated as energy based models, which are famous for being trained by minimization of free energy. In the standard contrastive divergence (CD) learning the model parameters optimization is driven by competition of relaxation forces appearing in the target system and the model one. It is tempting to implement a physical device having natural relaxation dynamics matching minimization of the loss function of the ML model. In the article, we propose a general approach for the design of such devices. We systematically reduce the BM, the restricted BM and BM for classification problems to energy based models. For each model we describe a device capable of learning model parameters by relaxation. We compare simulated dynamics of the models using CD, Monte-Carlo method and Langevin dynamics. Benchmarks of the proposed devices on generation and classification of hand-written digits from MNIST dataset are provided.

Ключевые слова: machine learning, Boltzmann machine, energy based model, dissipative training.

Поступила в редакцию: 10.10.2023
Исправленный вариант: 11.11.2023
Принята в печать: 07.12.2023

DOI: 10.17586/2220-8054-2023-14-6-613-625



Реферативные базы данных:


© МИАН, 2024