JOURNALS // Proceedings of the Institute for System Programming of the RAS // Archive

Proceedings of ISP RAS, 2025 Volume 37, Issue 2, Pages 237–246 (Mi tisp978)

What status of the Vasyugan Khanty vernacular do calculations on the LingvoDoc platform support from the point of view of systemic morphological characteristics?

V. V. Vorobeva (a, b), I. V. Novitskaya (c)

a Tomsk Polytechnic University
b Ivannikov Institute for System Programming of the RAS
c Tomsk State University

Abstract: This study examines Vasyugan Khanty, whose status remains disputed in Khanty studies. To clarify whether the Vasyugan idiom is a separate dialect or a subdialect of the Vakh-Vasyugan dialect, we applied modern methods of language data analysis. Using corpus data for two varieties of the Khanty language, Vakh Khanty and Vasyugan Khanty, available on the LingvoDoc platform, we calculated their morphological proximity with the online virtual laboratory tool. The results show that the morphological systems of the Vakh and Vasyugan Khanty vernaculars coincide by 98%, which confirms their morphological unity and their affiliation with one and the same dialect continuum. Machine analysis of the morphological dictionaries, cognate groups, and transcriptions identified only three autonomous affixes in each idiom. Because the corpus data for the two varieties are unequal in volume, the unique autonomous affixes found in each idiom should be regarded as a tentative argument subject to correction.

Keywords: Khanty language, Vasyugan dialect, field data, text corpora, LingvoDoc, data analysis, language documentation.

DOI: 10.15514/ISPRAS-2025-37(2)-17


