RUS  ENG
Full version
JOURNALS // Uchenye Zapiski Kazanskogo Universiteta. Seriya Fiziko-Matematicheskie Nauki // Archive

Kazan. Gos. Univ. Uchen. Zap. Ser. Fiz.-Mat. Nauki, 2009 Volume 151, Book 3, Pages 229–239 (Mi uzku801)

Transformation of metrics used in clusterization methods for building the phylogenetic language trees

V. D. Solovyeva, R. F. Fashutdinovb

a Kazan State University
b Institute of Problems of Information of Academy of Sciences of Republic of Tatarstan, Kazan

Abstract: As large typological databases appeared a few years ago, the problem of data mining (as clusterization of languages) arose. Usually phylogenetic algorithms based on Hamming-distance are used for these purposes. But it was found out in cluster analysis that some other metrics give better results. In the paper two new metrics are proposed and it is shown on a great number of linguistic examples that phylogenetic algorithms based on these metrics give better results.

Keywords: linguistic database, metrics, phylogenetic algorithms.

UDC: 81+004.9

Received: 12.05.2009



© Steklov Math. Inst. of RAS, 2025