T. V. Batura, F. A. Mursin, D. F. Semich, S. K. Sagnayeva, S. Zh. Tazhibayeva, M. N. Bakiev, A. S. Yerimbetova, A. M. Bakiyeva, “Using the link grammar parser in the study of Turkic languages”, Eurasian Journal of Mathematical and Computer Applications, 2016, том 4, выпуск 2,страницы 14

Using the link grammar parser in the study of Turkic languages

T. V. Batura^a, F. A. Mursin^a, D. F. Semich^a, S. K. Sagnayeva^b, S. Zh. Tazhibayeva^b, M. N. Bakiev^b, A. S. Yerimbetova^b, A. M. Bakiyeva^a

^a A.P. Ershov Institute of Informatics Systems, Russian Academy of Sciences, Siberian Branch, Novosibirsk State University, 6, Acad. Lavrentjev pr., Novosibirsk 630090, Russia
^b L.N. Gumilyov Eurasian National University, 2, Satpayev St., Astana 010008, Kazakhstan

Аннотация: Growing amount of information on the Internet and rapid development of social networks make the task of text processing increasingly actual. In this paper we propose an algorithm for the comparison of sentences and introduce certain measures of the closeness (similarity) between the sentences. The estimation of the relevance of documents should be based on the context of a search query and should not be limited only by keywords, their similarity or frequency. So proposed measures take into account lexical, syntactic and semantic relations between words. One of the problems we solve in the current time is the development of a parser like Link Grammar Parser for Turkic languages most frequent in the Internet, such as Kazakh, Uzbek (Cyrillic and Roman alphabets), and Turkish. The results of our research are planned to be used in different information retrieval systems.

Ключевые слова: natural language processing, syntactic analysis, Link Grammar Parser, relevance, Turkic languages.

MSC: 68T50, 68P20, 68Q42

Поступила в редакцию: 08.06.2016

Язык публикации: английский