Partitioning Trillion-edge Graphs in Minutes [article]

George M Slota, Sivasankaran Rajamanickam, Karen Devine, Kamesh Madduri
2016 arXiv   pre-print
We introduce XtraPuLP, a new distributed-memory graph partitioner designed to process trillion-edge graphs. XtraPuLP is based on the scalable label propagation community detection technique, which has been demonstrated as a viable means to produce high quality partitions with minimal computation time. On a collection of large sparse graphs, we show that XtraPuLP partitioning quality is comparable to state-of-the-art partitioning methods. We also demonstrate that XtraPuLP can produce partitions
more » ... f real-world graphs with billion+ vertices in minutes. Further, we show that using XtraPuLP partitions for distributed-memory graph analytics leads to significant end-to-end execution time reduction.
arXiv:1610.07220v1 fatcat:f6dfzvx32zhqjd7c2muqykla3q