Abstract:
With a soaring number of scientific publications and rapid emergence of new directions and approaches, the scientific community faces the task of timely identification of trends. By a trend, we mean a semantically homogeneous topic characterized by a steady lexical kernel and a sharp, often exponential increase in the number of publications [1]. Examples of trends in machine learning are “LSTM”, “deep learning”, “word2vec”, “BERT”, and “fake news detection”. For real-time detection of trend topics from a stream of scientific publications, we use incremental methods of probabilistic topic modeling. An ARTM-based approach to early trend detection has been shown to outperform popular classical and neural network approaches to this task. A dataset of 91 trends for performance evaluation has been manually collected and made available for public use.
Keywords:incremental topic modeling, detection of research trends, ARTM.
UDC:
004.8
Presented:V. B. Betelin Received: 28.10.2022 Revised: 28.10.2022 Accepted: 01.11.2022