Maximizing VMs' IO performance on overcommitted CPUs with fairness

Tong Xing, Cong Xiong, Chuan Ye, Qi Wei, Javier Picorel, Antonio Barbalace

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

To improve resource utilization and reduce costs many Cloud providers adopt virtual machines (VMs) overcommitment. While effective, this strategy may lead to adverse outcomes, significantly affecting a VM IO performance when one virtual CPU (vCPU) is preempted by another vCPU within the same runqueue of the VM scheduler -- i.e., same physical CPU (pCPU). Additionally, the responsiveness of a VM is reduced during the inactive time of the vCPU, and it necessitates an extra schedule timeslice to react to any IO event. While such problems have been studied in academia and industry, no previous solution has been deployed in production. This is because for example certain solutions require modifications of the guest VM, which is in contrast with industry requirements.

We propose Anubis, a new IO-aware VM scheduler targeting Linux KVM, the most popular VMM in today's Clouds, without requiring any guest VM modifications. Anubis shortens the IO event pending time by lightweight monitoring IO events including interrupt delivery and KVM exit. For the vCPU running the IO activity, Anubis provides an accurate boost, which is exclusively active only during the period when the vCPU has IO activity. While the IO performance is maximized, Anubis still guarantees fairness among VMs. The vCPU that doesn't have IO activity and belongs to the same VM will voluntarily yield the computing resources to counterbalance the unfairness created by the vCPU that has been given a performance boost. Overall, Anubis is a practical solution that provides close-to-non-overcommit performance for IO workloads in VM overcommitted scenarios.
Original languageEnglish
Title of host publicationProceedings of the 2023 ACM Symposium on Cloud Computing
Place of PublicationNew York, NY, USA
PublisherAssociation for Computing Machinery
Pages93–108
Number of pages16
ISBN (Electronic)9798400703874
DOIs
Publication statusPublished - 31 Oct 2023
EventThe 14th Symposium on Cloud Computing - Chaminade Resort & Spa, Santa Cruz, United States
Duration: 30 Oct 20231 Nov 2023
Conference number: 14
https://acmsocc.org/2023/

Symposium

SymposiumThe 14th Symposium on Cloud Computing
Abbreviated titleSoCC'23
Country/TerritoryUnited States
CitySanta Cruz
Period30/10/231/11/23
Internet address

Keywords / Materials (for Non-textual outputs)

  • IO performance
  • KVM
  • Linux
  • overcommit
  • compute resources
  • fair scheduling
  • low-latency
  • virtualization

Fingerprint

Dive into the research topics of 'Maximizing VMs' IO performance on overcommitted CPUs with fairness'. Together they form a unique fingerprint.

Cite this