OpenCL Task Partitioning in the Presence of GPU Contention

Dominik Grewe, Zheng Wang, Michael O'Boyle

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract / Description of output

Heterogeneous multi- and many-core systems are increasingly prevalent in the desktop and mobile domains. On these systems it is common for programs to compete with co-running programs for resources. While multi-task scheduling for CPUs is a well-studied area, how to partitioning and map computing tasks onto the heterogeneous system in the presence of GPU contention (i.e. multiple programs compete for the GPU) remains an outstanding problem.

In this paper we consider the problem of partitioning OpenCL kernels on a CPU-GPU based system in the presence of contention on the GPU. We propose a machine learning-based approach that predicts the optimal partitioning of OpenCL kernels, explicitly taking GPU contention into account. Our predictive model achieves a speed-up of 1.92 over a scheme that always uses the GPU. When compared to two state-of-the-art dynamic approaches our model achieves speed-ups of 1.54 and 2.56 respectively.
Original languageEnglish
Title of host publicationLanguages and Compilers for Parallel Computing
Subtitle of host publication26th International Workshop, LCPC 2013, San Jose, CA, USA, September 25--27, 2013. Revised Selected Papers
EditorsCălin Cașcaval, Pablo Montesinos
PublisherSpringer International Publishing
Number of pages15
ISBN (Electronic)978-3-319-09967-5
ISBN (Print)978-3-319-09966-8
Publication statusPublished - Oct 2014

Publication series

NameLecture Notes in Computer Science
PublisherSpringer International Publishing
ISSN (Print)0302-9743


Dive into the research topics of 'OpenCL Task Partitioning in the Presence of GPU Contention'. Together they form a unique fingerprint.

Cite this