Automating vectorized distributed graph computation

Wenyue Zhao, Yang Cao, Peter Buneman, Jia Li, Nikos Ntarmos

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Multi-instance graph algorithms interleave the evaluation of multiple instances of the same algorithm with different inputs over the same graph. They have been shown to be significantly faster than traditional serial and batch evaluation, by sharing computation across instances. However, writing correct multi-instance algorithms is challenging; and in this work, we describe AutoMI, a framework for automatically converting vertex-centric graph algorithms into their vectorized multi-instance versions. We also develop an algebraic characterization of algorithms that can benefit best from multi-instance computation with simpler and faster streamlined vectorization. This allows users to decide when to use such optimization and instruct AutoMI to make the best use of SIMD vectorization. Using 6 real-life graphs, we show that AutoMI-converted multi-instance algorithms are 9.6 to 29.5 times faster than serial evaluation, 7.1 to 26.4 times faster than batch evaluation, and are even 2.6 to 4.6 times faster than existing highly optimized handcrafted multi-instance algorithms without vectorization.
Original languageEnglish
Title of host publicationSIGMOD '25
Subtitle of host publicationProceedings of the 2025 International Conference on Management of Data
PublisherACM
Publication statusAccepted/In press - 2 Aug 2024
EventThe 2025 ACM SIGMOD/PODS International Conference on Management of Data - Intercontinental Berlin, Berlin, Germany
Duration: 22 Jun 202527 Jun 2025
https://2025.sigmod.org/

Publication series

NameProceedings of the ACM-SIGMOD International Conference on Management of Data
PublisherACM
ISSN (Electronic)0730-8078

Conference

ConferenceThe 2025 ACM SIGMOD/PODS International Conference on Management of Data
Abbreviated titleSIGMOD/PODS 2025
Country/TerritoryGermany
CityBerlin
Period22/06/2527/06/25
Internet address

Keywords / Materials (for Non-textual outputs)

  • graph computation
  • SIMD vectorization
  • auto vectorization
  • algebaric characterization

Fingerprint

Dive into the research topics of 'Automating vectorized distributed graph computation'. Together they form a unique fingerprint.

Cite this