RUS  ENG
Full version
JOURNALS // Vestnik of Astrakhan State Technical University. Series: Management, Computer Sciences and Informatics // Archive

Vestn. Astrakhan State Technical Univ. Ser. Management, Computer Sciences and Informatics, 2023 Number 1, Pages 25–35 (Mi vagtu737)

COMPUTER SOFTWARE AND COMPUTING EQUIPMENT

Analyzing algorithms and solutions for automatic generation of news article leads in social networks by using artificial intelligence

A. I. Egunova, R. S. Komarov, Yu. S. Vechkanova, O. I. Egunova, D. P. Sidorov, S. D. Shibaikin, V. V. Nikulin

National Research Ogarev Mordovia State University, Saransk, Russia

Abstract: The article highlights the approaches to automatic abstracting the articles. When publishing articles on social networks, the editors of news portals need to create a short abstract of each article spending a minimum of time. Prompt and simultaneous placement of the publications on all registered resources is facilitated by automatic generation of leads. There is proposed to apply the artificial intelligence algorithms trained on corpora of the Russian texts. There are three approaches to text abstracting for the automated formation of article leads: extractive, abstract, and combined. There is carried out comparative analysis of the methods of extractive and abstract approaches in the frames of solving the problem by using neural network models of machine learning. Different stages of extractive abstracting are analyzed using both simple and more complex methods of LexRank, TextRank and on top of Deep Learning. The compared abstract models were selected as the most suitable ones for abstracting the news articles on top of the BERT model. More complex generating texts process the data in parallel, which speeds up processing, but requires training on large corpora of news documents. When using the abstract models Pointer General Network and MBART the information processing time is reduced and work efficiency increases.

Keywords: summarization, abstracting, vector, token, encoding, decoding, generating.

UDC: 004.912

Received: 19.09.2022
Accepted: 12.01.2023

DOI: 10.24143/2072-9502-2023-1-25-35



© Steklov Math. Inst. of RAS, 2024