Doklady Rossijskoj Akademii Nauk. Mathematika, Informatika, Processy Upravlenia

Dokl. RAN. Math. Inf. Proc. Upr., 2023, Volume 514, Number 2, Pages 385–394 (Mi danma482)

SPECIAL ISSUE: ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING TECHNOLOGIES

Do we benefit from the categorization of the news flow in the stock price prediction problem?

T. D. Kulikova (a), E. Yu. Kovtun (b), S. A. Budennyy (c)

(a) Faculty of Computer Science, National Research University "Higher School of Economics", Moscow, Russian Federation
(b) Sber AI Lab, Moscow, Russian Federation
(c) Artificial Intelligence Research Institute, Moscow, Russian Federation

Abstract: Machine learning is widely used for company stock price prediction. Building a more accurate predictive model requires combining historical stock prices with relevant external information. The sentiment of financial news about the company is one such source of valuable knowledge. However, financial news covers different topics, such as Macro, Markets, or Product news, and market research usually leaves this categorization out of scope. In this work, we aim to close this gap and explore the effect of accounting for news topic differentiation in the stock price prediction problem. First, we classify the financial news stream into 20 pre-defined topics with a pre-trained model. Then, we obtain sentiments and explore sentiment labeling within news topic groups. We also conduct experiments with several well-established time series forecasting models, including the Temporal Convolutional Network (TCN), D-Linear, the Transformer, and the Temporal Fusion Transformer (TFT). Our results show that using information from separate topic groups improves the performance of deep learning models compared with the approach that considers all news sentiments without any division.

Keywords: financial news, stock market, BERT, topic classification, sentiment analysis, time-series forecasting, deep learning, external data.

UDC: 517.54

Presented: A. A. Shananin
Received: 04.09.2023
Revised: 08.09.2023
Accepted: 18.10.2023

DOI: 10.31857/S2686954323601926


 English version:
Doklady Mathematics, 2023, 108:suppl. 2, S503–S510
