F. N. Solovyev, A. M. Chepovskii, “An extension of the short text language identification model”, Artificial Intelligence and Decision Making, 2017, Issue 1, Pages 21

Natural language processing

An extension of the short text language identification model

F. N. Solovyev^a, A. M. Chepovskii^b

^a Moscow Polytechnic University
^b HSE University, Moscow

Abstract: We address the problem of identifying the natural language of short texts. A Bayesian classifier is employed. We propose an extension of the language identification model that incorporates additional Cyrillic-script languages of the small nations of Russia.

Keywords: statistical language model, natural language identification, languages of the small nations of Russia.
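
The abstract names a Bayesian classifier but does not spell out its form. As a rough illustration only, the sketch below shows one common way such a classifier can be built for short texts: a naive Bayes model over character trigrams with add-one smoothing and a uniform prior over languages. The class name, the trigram features, the smoothing scheme, and the toy training sentences are all assumptions made for this example and are not taken from the paper.

```python
import math
from collections import Counter


def char_ngrams(text, n=3):
    """Split lowercased text into overlapping character n-grams, padded with spaces."""
    padded = " " + text.lower() + " "
    return [padded[i:i + n] for i in range(len(padded) - n + 1)]


class NgramBayesIdentifier:
    """Hypothetical naive Bayes language identifier (illustrative, not the authors' model)."""

    def __init__(self, n=3):
        self.n = n
        self.counts = {}    # language -> Counter of n-gram frequencies
        self.totals = {}    # language -> total number of n-grams seen
        self.vocab = set()  # all n-grams seen in any language

    def train(self, language, texts):
        counter = self.counts.setdefault(language, Counter())
        for text in texts:
            counter.update(char_ngrams(text, self.n))
        self.totals[language] = sum(counter.values())
        self.vocab.update(counter)

    def log_score(self, language, text):
        # Log-likelihood of the text under one language with add-one smoothing.
        # A uniform prior over languages is assumed, so it is omitted.
        counter = self.counts[language]
        denom = self.totals[language] + len(self.vocab)
        return sum(math.log((counter[g] + 1) / denom)
                   for g in char_ngrams(text, self.n))

    def identify(self, text):
        # Pick the language with the highest log-likelihood.
        return max(self.counts, key=lambda lang: self.log_score(lang, text))


# Toy training data, chosen only to make the sketch runnable.
model = NgramBayesIdentifier(n=3)
model.train("ru", ["привет как дела", "я люблю читать книги"])
model.train("uk", ["привіт як справи", "я люблю читати книжки"])
guess = model.identify("як справи")
```

Under this sketch, extending the model to a further language amounts to one more `train` call with sample texts in that language; how the authors actually extend their model is described in the paper itself.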