RUS  ENG
Full version
JOURNALS // Sistemy i Sredstva Informatiki [Systems and Means of Informatics] // Archive

Sistemy i Sredstva Inform., 2015 Volume 25, Issue 1, Pages 20–33 (Mi ssi391)

This article is cited in 2 papers

Combining corpus and thesaurus information for extracting sentiment words

N. V. Loukachevitch, I. I. Chetviorkin

Research Computing Center, M. V. Lomonosov Moscow State University, 4 Leninskie Gory, Moscow 119991, Russian Federation

Abstract: The paper describes a combined approach to extraction of a domain-specific sentiment lexicon. At first, an initial version of a domain-specific lexicon is obtained by application of a supervised model. At the second stage, the ordered list of sentiment words is refined using the thesaurus information. This combined model is applied to several domains and at last, the domain-specific sentiment lexicons are united to create an improved version of the Russian sentiment lexicon in the generalized domain of products.

Keywords: sentiment analysis; domain adaptation; natural language processing; thesaurus.

Received: 20.01.2015

DOI: 10.14357/08696527150102



Bibliographic databases:


© Steklov Math. Inst. of RAS, 2024