
Proceedings of ISP RAS, 2019 Volume 31, Issue 5, Pages 153–164 (Mi tisp461)

Usage of i-vectors for automated determination of a similarity level between languages

A. A. Bērziņš

University of Latvia

Abstract: The article describes the results of applying i-vector-based speech identification methods (both LID and SID) to define a kind of distance between languages, where "language" is understood in a broad sense that includes dialects and any other forms of spoken language. The input to the method is spontaneous speech recordings from a sufficiently large number of speakers of each language. The experiments were carried out on recordings of Latvian and Latgalian dialects, but the method is applicable to any other idioms. Cosine similarity, the Euclidean metric, the standardized Euclidean metric, the Jordan (or Chebyshev) metric and the city block (or L1) metric were tried. Cosine similarity worked well for SID i-vectors but, for unknown reasons, gave meaningless results for LID i-vectors. The Jordan metric worked well for LID i-vectors but was not good enough for SID i-vectors. Standardization of the Euclidean metric did not give any improvement. Thus, the conclusions are: 1) both SID and LID i-vectors of full-length recordings of spontaneous speech characterize and represent languages well enough to be used for determining a distance between languages; 2) the best metrics for such tasks are the Euclidean and L1 metrics, applied to arithmetic mean vectors computed coordinate by coordinate from the i-vectors of all informants.
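The comparison described in the abstract can be illustrated with a minimal Python sketch. It is not the author's implementation: the function name `language_distances`, the array layout (one row per informant) and the use of pooled per-coordinate variance for the standardized Euclidean metric are all assumptions made for illustration, and the i-vectors in the demo are random placeholders rather than real data.

```python
import numpy as np


def language_distances(ivecs_a, ivecs_b):
    """Compare two idioms via the mean i-vectors of their informants.

    ivecs_a, ivecs_b: arrays of shape (n_informants, dim), one i-vector
    per informant (hypothetical layout, assumed for this sketch).
    """
    a = np.asarray(ivecs_a, dtype=float)
    b = np.asarray(ivecs_b, dtype=float)

    # Arithmetic mean vectors, computed coordinate by coordinate.
    u = a.mean(axis=0)
    v = b.mean(axis=0)
    diff = u - v

    # Assumption: per-coordinate sample variance pooled over both idioms.
    # Coordinates with zero variance would need special handling.
    var = np.vstack([a, b]).var(axis=0, ddof=1)

    return {
        "cosine_similarity": float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v))),
        "euclidean": float(np.linalg.norm(diff)),
        "standardized_euclidean": float(np.sqrt(np.sum(diff ** 2 / var))),
        "chebyshev": float(np.max(np.abs(diff))),  # Jordan metric in the abstract
        "city_block": float(np.sum(np.abs(diff))),  # L1 metric
    }


if __name__ == "__main__":
    # Placeholder data: 20 informants per idiom, 400-dimensional vectors.
    rng = np.random.default_rng(0)
    idiom_a = rng.normal(0.0, 1.0, size=(20, 400))
    idiom_b = rng.normal(0.1, 1.0, size=(20, 400))
    for name, value in language_distances(idiom_a, idiom_b).items():
        print(f"{name}: {value:.4f}")
```

Note that cosine similarity is a similarity (larger means closer), whereas the other four values are distances (smaller means closer), so they cannot be ranked on the same scale without conversion.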

Keywords: speech, idiom, language, dialect, i-vector, LID, SID, recording, proximity of languages, distance between languages.

DOI: 10.15514/ISPRAS-2019-31(5)-12


