RUS  ENG
Full version
JOURNALS // Informatika i Ee Primeneniya [Informatics and its Applications] // Archive

Inform. Primen., 2010 Volume 4, Issue 1, Pages 12–17 (Mi ia13)

This article is cited in 4 papers

Asymptotic distributions of basic statistics in geometric representation for high-dimensional data and their error bounds

Yu. Kawaguchia, V. V. Ulyanovb, Ya. Fujikoshia

a Department of Mathematics, Graduate School of Science and Engineering, Chuo University, Tokyo, Japan
b M. V. Lomonosov Moscow State University, Faculty of Computational Mathematics and Cybernetics

Abstract: In geometric representation of n observations on p variables, it is necessary to examine asymptotic behaviors of the three statistics; the length of a p-dimensional observation vector, the distance between two independent observation vectors, and the angle between these observation vectors. Hall et al. (2005) found the asymptotic values of these three statistics in high-dimensional framework when the dimension p tends to infinity, while the sample size n is fixed. In this paper, their results are extended by deriving asymptotic expansions of the distributions of the three statistics. Further, computable error bounds for the limiting distributions of the length and the distance were obtained. These results will be useful to obtain statistical insights in middle — as well as in high-dimensional data sets.

Keywords: asymptotic expansions; error bounds; high-dimensional data; geometric representation.



© Steklov Math. Inst. of RAS, 2025