N. V. Loukachevitch, I. I. Chetviorkin, “Combining corpus and thesaurus information for extracting sentiment words”, Sistemy i Sredstva Inform., 2015, Volume 25, Issue 1, Pages 20

This article is cited in 2 papers

Combining corpus and thesaurus information for extracting sentiment words

N. V. Loukachevitch, I. I. Chetviorkin

Research Computing Center, M. V. Lomonosov Moscow State University, 4 Leninskie Gory, Moscow 119991, Russian Federation

Abstract: The paper describes a combined approach to extracting a domain-specific sentiment lexicon. In the first stage, an initial version of the domain-specific lexicon is obtained by applying a supervised model. In the second stage, the resulting ranked list of sentiment words is refined using thesaurus information. The combined model is applied to several domains, and the resulting domain-specific sentiment lexicons are then merged to create an improved version of the Russian sentiment lexicon for the generalized product domain.

Keywords: sentiment analysis; domain adaptation; natural language processing; thesaurus.

Received: 20.01.2015

DOI: 10.14357/08696527150102