An Integrated Approach for Accelerating Stochastic Block Partitioning

dc.contributor.authorWanye, Franken
dc.contributor.authorGleyzer, Vitaliyen
dc.contributor.authorKao, Edwarden
dc.contributor.authorFeng, Wu-chunen
dc.date.accessioned2024-03-04T15:47:51Zen
dc.date.available2024-03-04T15:47:51Zen
dc.date.issued2023-01-01en
dc.description.abstractCommunity detection, or graph partitioning, is a fundamental problem in graph analytics with applications in a wide range of domains including bioinformatics, social media analysis, and anomaly detection. Stochastic block partitioning (SBP) is a community detection algorithm based on sequential Bayesian inference. SBP is highly accurate even on graphs with a complex community structure. However, it does not scale well to large real-world graphs that can contain upwards of a million vertices due to its sequential nature. Approximate methods that break computational dependencies improve the scalability of SBP via parallelization and data reduction. However, these relaxations can lead to low accuracy on graphs with complex community structure. In this paper, we introduce additional synchronization steps through vertex-level data batching to improve the accuracy of such methods. We then leverage batching to develop a high-performance parallel approach that improves the scalability of SBP while maintaining accuracy. Our approach is the first to integrate data reduction, shared-memory parallelization, and distributed computation, thus efficiently utilizing distributed computing resources to accelerate SBP. On a one-million vertex graph processed on 64 compute nodes with 128 cores each, our approach delivers a speedup of 322x over the sequential baseline and 6.8x over the distributed-only implementation. To the best of our knowledge, this Graph Challenge submission is the highest-performing SBP implementation to date and the first to process the one-million vertex graph using SBP.en
dc.description.versionAccepted versionen
dc.format.extentPages 1-7en
dc.identifier.doihttps://doi.org/10.1109/HPEC58863.2023.10363599en
dc.identifier.isbn9798350308600en
dc.identifier.orcidFeng, Wu-chun [0000-0002-6015-0727]en
dc.identifier.urihttps://hdl.handle.net/10919/118255en
dc.publisherIEEEen
dc.rightsIn Copyrighten
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/en
dc.titleAn Integrated Approach for Accelerating Stochastic Block Partitioningen
dc.title.serial2023 IEEE High Performance Extreme Computing Conference, HPEC 2023en
dc.typeConference proceedingen
dc.type.otherConference Proceedingen
pubs.finish-date2023-09-29en
pubs.organisational-group/Virginia Techen
pubs.organisational-group/Virginia Tech/Engineeringen
pubs.organisational-group/Virginia Tech/Engineering/Computer Scienceen
pubs.organisational-group/Virginia Tech/Faculty of Health Sciencesen
pubs.organisational-group/Virginia Tech/All T&R Facultyen
pubs.organisational-group/Virginia Tech/Engineering/COE T&R Facultyen
pubs.start-date2023-09-25en

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Integrated-SBP-HPEC_2023.pdf
Size:
420.71 KB
Format:
Adobe Portable Document Format
Description:
Accepted version
License bundle
Now showing 1 - 1 of 1
Name:
license.txt
Size:
1.5 KB
Format:
Plain Text
Description: