VectorPU: A Generic and Efficient Data-container and Component Model for Transparent Data Transfer on GPU-based Heterogeneous Systems

Lu Li, Christoph Kessler

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

We present VectorPU, a C++ based programming framework providing high-level and efficient unified memory access on heterogeneous systems, in particular GPU-based systems. VectorPU consists of a light-weight runtime library providing a generic, "smart" data-container abstraction for transparent software caching of array operands with programmable memory coherence, and a light-weight component model realized by macro-based data access annotations. VectorPU thereby enables a flexible unified memory view with data transfer and device memory management abstracted away from programmers, while keeping the efficiency of expert-written code with manual data movement and memory management. We provide a prototype of VectorPU for (CUDA) GPU-based systems, and show that it can achieve 1.40× to 13.29× speedup over good quality code using Nvidia's Unified Memory by experiments on several machines ranging from laptops to supercomputer nodes, with Kepler and Maxwell GPUs. We also show the expressiveness and wide applicability of VectorPU, and its low overhead and equal efficiency compared to expert-written code.
Original languageEnglish
Title of host publicationProceedings of the 8th Workshop and 6th Workshop on Parallel Programming and Run-Time Management Techniques for Many-core Architectures and Design Tools and Architectures for Multicore Embedded Computing Platforms
Place of PublicationNew York, NY, USA
PublisherACM
Pages7-12
Number of pages6
ISBN (Print)978-1-4503-4877-5
DOIs
Publication statusPublished - 25 Jan 2017
Event8th Workshop on Parallel Programming and Run-Time Management Techniques for Many-core Architectures and 6th Workshop on Design Tools and Architectures for Multicore Embedded Computing Platforms - Stockholm, Sweden
Duration: 28 Jan 2017 → …
http://conferences.microlab.ntua.gr/ditam-parma2017/

Publication series

NamePARMA-DITAM '17
PublisherACM

Conference

Conference8th Workshop on Parallel Programming and Run-Time Management Techniques for Many-core Architectures and 6th Workshop on Design Tools and Architectures for Multicore Embedded Computing Platforms
Abbreviated titlePARMA-DITAM 2017
Country/TerritorySweden
CityStockholm
Period28/01/17 → …
Internet address

Keywords / Materials (for Non-textual outputs)

  • Coherence Management, Flow Signature, GPGPU, Programming Framework, Run-time System, VectorPU

Fingerprint

Dive into the research topics of 'VectorPU: A Generic and Efficient Data-container and Component Model for Transparent Data Transfer on GPU-based Heterogeneous Systems'. Together they form a unique fingerprint.

Cite this