V. V. Petrochenkov, A. O. Kazennikov, “A statistical tagger for morphological tagging of Russian language texts”, Avtomat. i Telemekh., 2013, Issue 10,Pages <nobr>154

This article is cited in 2 papers

Topical issue

A statistical tagger for morphological tagging of Russian language texts

V. V. Petrochenkov^a, A. O. Kazennikov^b

^a Institute for Information Transmission Problems (Kharkevich Institute), Russian Academy of Sciences, Moscow, Russia
^b Moscow State Institute of Radiotechnics, Electronics, and Automation, Moscow, Russia

Abstract: We consider a method of constructing a statistical tagger for automated morphological tagging for Russian language texts. In this method, each word is assigned with a tag that contains information about the part of speech and a full set of the word's morphological characteristics. We employ the set of morphological characteristics used in the SynTagRus corpus whose material has been used to train the tagger. The tagger is based on the SVM (Support Vector Machine) approach. The developed tagger has proven to be efficient and has shown high tagging quality.

Presented by the member of Editorial Board: A. V. Bernshtein

Received: 11.03.2013