Limits of Region-based Dynamic Binary Parallelization

Tobias J.K. Edler von Koch, Björn Franke

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

Efficiently executing sequential legacy binaries on chip multi-processors (CMPs) composed of many small cores is one of today's most pressing problems. Single-threaded execution is a suboptimal option due to CMPs' lower single-core performance, while multi-threaded execution relies on prior parallelization, which is severely hampered by the low-level binary representation of applications compiled and optimized for a single-core target. A recent technology to address this problem is Dynamic Binary Parallelization (DBP), which creates a Virtual Execution Environment (VEE) taking advantage of the underlying multicore host to transparently parallelize the sequential binary executable. While still in its infancy, DBP has received broad interest within the research community. The combined use of DBP and thread-level speculation (TLS) has been proposed as a technique to accelerate legacy uniprocessor code on modern CMPs. In this paper, we investigate the limits of DBP and seek to gain an understanding of the factors contributing to these limits and the costs and overheads of its implementation. We have performed an extensive evaluation using a parameterizable DBP system targeting a CMP with light-weight architectural TLS support. We demonstrate that there is room for a significant reduction of up to 54% in the number of instructions on the critical paths of legacy SPEC CPU2006 benchmarks. However, we show that it is much harder to translate these savings into actual performance improvements, with a realistic hardware-supported implementation achieving a speedup of 1.09 on average.
Original language: English
Title of host publication: VEE '13: Proceedings of the 9th ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments
Publisher: ACM
Pages: 13-22
Number of pages: 10
ISBN (Print): 978-1-4503-1266-0
Publication status: Published - 1 Mar 2013

Keywords

  • automatic parallelization, dynamic binary parallelization, runtime systems, thread-level speculation, transactional memory
