Abstract
We investigate how graph partitioning adversely affects the performance of graph analytics. We demonstrate that graph partitioning induces extra work during graph traversal and that graph partitions have markedly different connectivity than the original graph. By consequence, increasing the number of partitions reaches a tipping point after which overheads quickly dominate performance gains. Moreover, we show that the heuristic to balance CPU load between graph partitions by balancing the number of edges is inappropriate for a range of graph analyses. However, even when it is appropriate, it is sub-optimal due to the skewed degree distribution of social networks. Based on these observations, we propose GraphGrind, a new graph analytics system that addresses the limitations incurred by graph partitioning. We moreover propose a NUMA-aware extension to the Cilk programming language and obtain a scale-free yet NUMA-aware parallel programming environment which underpins NUMA-aware scheduling in GraphGrind. We demonstrate that Graph-Grind outperforms state-of-the-art graph analytics systems for shared memory including Ligra, Polymer and Galois.
Original language | English |
---|---|
Title of host publication | Proceedings of the International Conference on Supercomputing. ACM, 2017 |
Place of Publication | Chicago, Illinois |
Publisher | ACM |
Pages | 16:1-16:10 |
Number of pages | 10 |
ISBN (Print) | 978-1-4503-5020-4 |
DOIs | |
Publication status | Published - 14 Jun 2017 |
Event | International Conference on Supercomputing 2017 - Chicago, United States Duration: 14 Jun 2017 → 16 Jun 2017 http://press3.mcs.anl.gov/ics2017/cfp/ |
Conference
Conference | International Conference on Supercomputing 2017 |
---|---|
Abbreviated title | ICS 2017 |
Country/Territory | United States |
City | Chicago |
Period | 14/06/17 → 16/06/17 |
Internet address |