RUS  ENG
Full version
JOURNALS // Modelirovanie i Analiz Informatsionnykh Sistem // Archive

Model. Anal. Inform. Sist., 2013 Volume 20, Number 2, Pages 70–79 (Mi mais298)

Construction of a Model for the Cross-Domain Opinion Word Extraction

N. V. Loukachevitcha, I. I. Chetviorkinb

a Lomonosov Moscow State University, Leninskiye Gory, 1, Build. 4, Research Computing Center, Moscow, GSP-1, 119991, Russia
b Lomonosov Moscow State University, Leninskiye Gory 1, Build. 52, Faculty of Computational Mathematics and Cybernetics, Moscow, GSP-1, 119991, Russia

Abstract: In this paper we consider a new approach for domain-specific opinion word extraction in the Russian language. We propose a set of statistical features and an algorithm combination that can extract opinion words in a particular domain. The extraction model was trained in the movie domain and then applied to four other domains. The quality of the obtained sentiment lexicons was evaluated intrinsically on the base of an expert markup and remained on the high level during the model transfer to various domains. Finally, our method is adapted to the movie domain in English and it demonstrated good results.

Keywords: sentiment analysis, opinion words, domain adaptation.

UDC: 004.853

Received: 26.11.2012



© Steklov Math. Inst. of RAS, 2024