V. D. Solovyev, R. F. Fashutdinov, “Transformation of metrics used in clusterization methods for building the phylogenetic language trees”, Kazan. Gos. Univ. Uchen. Zap. Ser. Fiz.-Mat. Nauki, 2009, Volume 151, Book 3, Pages 229

Transformation of metrics used in clusterization methods for building the phylogenetic language trees

V. D. Solovyev^a, R. F. Fashutdinov^b

^a Kazan State University
^b Institute of Problems of Information of Academy of Sciences of Republic of Tatarstan, Kazan

Abstract: With the appearance of large typological databases a few years ago, the problem of mining these data, in particular of clustering languages, arose. Phylogenetic algorithms based on the Hamming distance are usually used for this purpose. However, it has been found in cluster analysis that some other metrics give better results. This paper proposes two new metrics and shows, on a large number of linguistic examples, that phylogenetic algorithms based on these metrics give better results.
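
For orientation, the Hamming-distance baseline mentioned in the abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: the language names, the binary feature vectors, and the helper functions `hamming_distance`, `pairwise_distances`, and `closest_pair` are all hypothetical, and the two new metrics proposed in the paper are not reproduced here because the abstract does not define them.

```python
from itertools import combinations

# Hypothetical binary typological features (1 = feature present, 0 = absent).
# The language names and values are invented for illustration only.
FEATURES = {
    "lang_a": [1, 0, 1, 1, 0, 0],
    "lang_b": [1, 0, 1, 0, 0, 0],
    "lang_c": [0, 1, 0, 0, 1, 1],
}


def hamming_distance(u, v):
    """Count the positions at which two equal-length feature vectors differ."""
    if len(u) != len(v):
        raise ValueError("feature vectors must have the same length")
    return sum(a != b for a, b in zip(u, v))


def pairwise_distances(features):
    """Return {(name1, name2): distance} for every unordered pair of languages."""
    return {
        (p, q): hamming_distance(features[p], features[q])
        for p, q in combinations(sorted(features), 2)
    }


def closest_pair(features):
    """Return the pair of languages with the smallest Hamming distance."""
    distances = pairwise_distances(features)
    return min(distances, key=distances.get)


if __name__ == "__main__":
    for pair, dist in pairwise_distances(FEATURES).items():
        print(pair, dist)
    print("closest:", closest_pair(FEATURES))
```

A distance table of this kind is the typical input to a distance-based tree-building procedure, which repeatedly merges the closest items; replacing `hamming_distance` with another metric changes which languages are grouped first.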

Keywords: linguistic database, metrics, phylogenetic algorithms.

UDC: 81+004.9

Received: 12.05.2009