Top-Down Stochastic Block Partitioning: Turning Graph Clustering Upside Down
dc.contributor.author | Wanye, Frank | en |
dc.contributor.author | Gleyzer, Vitaliy | en |
dc.contributor.author | Kao, Edward | en |
dc.contributor.author | Feng, Wu-chun | en |
dc.date.accessioned | 2025-10-01T17:56:31Z | en |
dc.date.available | 2025-10-01T17:56:31Z | en |
dc.date.issued | 2025-07-20 | en |
dc.date.updated | 2025-10-01T07:46:12Z | en |
dc.description.abstract | Stochastic block partitioning (SBP) is a statistical inference-based algorithm for clustering vertices within a graph. It has been shown to be statistically robust and highly accurate even on graphs with a complex structure, but its poor scalability limits its usability to smaller-sized graphs. In this manuscript we argue that one reason for its poor scalability is the agglomerative, or bottom-up, nature of SBP’s algorithmic design; the agglomerative computations cause high memory usage and create a large search space that slows down statistical inference, particularly in the algorithm’s initial iterations. To address this bottleneck, we propose Top-Down SBP, a novel algorithm that replaces the agglomerative (bottom-up) block merges in SBP with a block-splitting operation. This enables the algorithm to start with all vertices in one cluster and subdivide them over time into smaller clusters. We show that Top-Down SBP is up to 7.7× faster than Bottom-Up SBP without sacrificing accuracy and can process larger graphs than Bottom-Up SBP on the same hardware due to an up to 4.1× decrease in memory usage. Additionally, we adapt existing methods for accelerating Bottom- Up SBP to the Top-Down approach, leading to up to 13.2× speedup over accelerated Bottom-Up SBP and up to 403× speedup over sequential Bottom-Up SBP on 64 compute nodes. Thus, Top-Down SBP represents substantial improvements to the scalability of SBP, enabling the analysis of larger datasets on the same hardware. | en |
dc.description.version | Published version | en |
dc.format.mimetype | application/pdf | en |
dc.identifier.doi | https://doi.org/10.1145/3731545.3731589 | en |
dc.identifier.uri | https://hdl.handle.net/10919/137886 | en |
dc.language.iso | en | en |
dc.publisher | ACM | en |
dc.rights | Creative Commons Attribution 4.0 International | en |
dc.rights.holder | The author(s) | en |
dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | en |
dc.title | Top-Down Stochastic Block Partitioning: Turning Graph Clustering Upside Down | en |
dc.type | Article - Refereed | en |
dc.type.dcmitype | Text | en |