RUS  ENG
Full version
JOURNALS // Bulletin of Irkutsk State University. Series Mathematics // Archive

Bulletin of Irkutsk State University. Series Mathematics, 2018 Volume 25, Pages 63–78 (Mi iigum346)

This article is cited in 1 paper

Graph clustering based on modularity variation estimations

N. N. Martynova, O. V. Khandarovab, F. V. Khandarova

a Buryat State University, Ulan-Ude, Russian Federation
b Institute for Mongolian, Buddhist and Tibetan Studies SB RAS, Ulan-Ude, Russian Federation

Abstract: Graph clustering is one of the constantly actual data analysis problems. There are various statements of this problem. In this paper we consider the statement of search for a partition of a graph vertices set into disjoint subsets. In such a way, that the density of connections between the vertices of one subset was higher than that between the vertices of different subsets.
There is a lot of various approaches, and many of them use such as an a posteriori estimate of clustering quality, as modularity. The modularity functional, taking the value from $[-1, 1]$, allows us to formally evaluate the quality of partitioning into subsets. This paper deals with an approach, instead of calculating the modularity, using a less computationally complex estimate of modularity change in the operation of clusters union.
Four theorems for different graph types are formulated, presenting the possibility of application of suggested estimate, instead of direct modularity calculations. A greedy algorithmic scheme and also AMVE (Algorithm based on Modularity Variation Estimation) — simple greedy algorithm based on the scheme are proposed.
An attempt of comparative analysis on the test problemet of AMVE with heuristic clustering algorithms implemented in actual data analysis software is described. It is shown the comparative advantage of AMVE, both in terms of speed and quality of clustering.
Also, the cases on the use of developed algorithm and its implementation for data analysis in sociology and history and criticism of literature are mentioned. In these cases, investigated graphs based on social networks data (the ratio of "friendship" in the social network between users used as the graph edges). It is demonstrated a slight superiority of AMVE in clustering quality compared to the known algorithms Louvain and Walktrap.

Keywords: graph clustering, modularity, community detection, social network analysis.

UDC: 519.176:519.178

MSC: 05C70

Received: 08.08.2018

DOI: 10.26516/1997-7670.2018.25.63



Bibliographic databases:


© Steklov Math. Inst. of RAS, 2024