Edinburgh Research Explorer

Automatic Exploration of Datacenter Performance Regimes

Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • Peter Bodik
  • Rean Griffith
  • Charles Sutton
  • Armando Fox
  • Michael I. Jordan
  • David A. Patterson

Related Edinburgh Organisations

Original languageEnglish
Title of host publicationProceedings of the 1st Workshop on Automated Control for Datacenters and Clouds (ACDC 2009)
Place of PublicationNew York, NY, USA
PublisherACM
Pages1-6
Number of pages6
ISBN (Print)978-1-60558-585-7
DOIs
Publication statusPublished - 2009

Abstract

Horizontally scalable Internet services present an opportunity to use automatic resource allocation strategies for system management in the datacenter. In most of the previous work, a controller employs a performance model of the system to make decisions about the optimal allocation of resources. However, these models are usually trained offline or on a small-scale deployment and will not accurately capture the performance of the controlled application. To achieve accurate control of the web application, the models need to be trained directly on the production system and adapted to changes in workload and performance of the application. In this paper we propose to train the performance model using an exploration policy that quickly collects data from different performance regimes of the application. The goal of our approach for managing the exploration process is to strike a balance between not violating the performance SLAs and the need to collect sufficient data to train an accurate performance model, which requires pushing the system close to its capacity. We show that by using our exploration policy, we can train a performance model of a Web 2.0 application in less than an hour and then immediately use the model in a resource allocation controller.

    Research areas

  • automatic control

ID: 11254159