RUS  ENG
Full version
JOURNALS // Modelirovanie i Analiz Informatsionnykh Sistem // Archive

Model. Anal. Inform. Sist., 2023 Volume 30, Number 4, Pages 394–417 (Mi mais811)

Artificial intelligence

Semantic rule-based sentiment detection algorithm for Russian publicism sentences

A. Yu. Poletaev, I. V. Paramonov, E. I. Boychuk

P.G. Demidov Yaroslavl State University, 14, Sovetskaya str., Yaroslavl, Yaroslavl Region, 150003, Russia

Abstract: The article is devoted to the task of sentiment detecton of Russian sentences, which is understood as the author's attitude on the sentence topic expressed through linguistic expression features. Today most studies on this subject utilize texts of colloquial style, limiting the applicability of their results to other styles of speech, particularly to the publicism.
To fill the gap, the authors developed a novel publisism sentences oriented sentiment detection algorithm. The algorithm recursively applies appropriate rules to sentence parts represented as constituency trees. Most of the rules were proposed by a philology expert, based on knowledge on the expression features from Russian philology, and algorithmized using constituency trees generated by the algorithm. A decision tree and a sentiment vocabulary are also used in the work. The article contains the results of evaluation of the algorithm on the publicism sentences corpus OpenSentimentCorpus, F-measure is 0.80. The results of errors analysis are also presented.

Keywords: sentiment analysis, sentiment detection, semantic rules, publicism, constituency tree.

UDC: 004.912+10.02.21

MSC: 68T50

Received: 06.11.2023
Revised: 24.11.2023
Accepted: 29.11.2023

DOI: 10.18255/1818-1015-2023-4-394-417



© Steklov Math. Inst. of RAS, 2024