RUS  ENG
Full version
JOURNALS // Doklady Rossijskoj Akademii Nauk. Mathematika, Informatika, Processy Upravlenia // Archive

Dokl. RAN. Math. Inf. Proc. Upr., 2022 Volume 508, Pages 106–108 (Mi danma346)

ADVANCED STUDIES IN ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING

Incremental learning of topic models for finding trend topics in scientific publications

N. A. Gerasimenkoa, A. S. Chernyavskya, M. A. Nikiforovaa, M. D. Nikitina, K. V. Vorontsovb

a Sberbank, Moscow, Russia
b Federal Research Center "Computer Science and Control" of Russian Academy of Sciences, Moscow, Russia

Abstract: With a soaring number of scientific publications and rapid emergence of new directions and approaches, the scientific community faces the task of timely identification of trends. By a trend, we mean a semantically homogeneous topic characterized by a steady lexical kernel and a sharp, often exponential increase in the number of publications [1]. Examples of trends in machine learning are “LSTM”, “deep learning”, “word2vec”, “BERT”, and “fake news detection”. For real-time detection of trend topics from a stream of scientific publications, we use incremental methods of probabilistic topic modeling. An ARTM-based approach to early trend detection has been shown to outperform popular classical and neural network approaches to this task. A dataset of 91 trends for performance evaluation has been manually collected and made available for public use.

Keywords: incremental topic modeling, detection of research trends, ARTM.

UDC: 004.8

Presented: V. B. Betelin
Received: 28.10.2022
Revised: 28.10.2022
Accepted: 01.11.2022

DOI: 10.31857/S2686954322070086


 English version:
Doklady Mathematics, 2022, 106:suppl. 1, S97–S98

Bibliographic databases:


© Steklov Math. Inst. of RAS, 2024