Invalidate or Update? Revisiting Coherence for Tomorrow’s Cache Hierarchies

Mingcan Zhu, Amna Shahab, Antonis Katsarakis, Boris Grot

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Shared on-chip last-level caches (LLCs) play a key role in capturing the large working sets of today’s data-intensive workloads. However, they pose a fundamental scalability challenge in the transistor-limited post-Moore regime. Recent work has argued for Next-Generation LLCs (NG-LLC) based on private caches in die-stacked DRAM, which can provide hundreds of MBs of per-core LLC capacity at similar access latency to today’s shared LLCs. While NG-LLCs offer a number of advantages, their private design exposes long-latency inter-core reads for read/write shared data, which hurt performance in parallel workloads. One way to eliminate the long latency of reads to read/write shared data is through the use of updating coherence protocols that eagerly push updates from a writer core into caches of recent readers. Alas, these protocols are known to generate excess cache and interconnect traffic that can be detrimental to overall performance. While hybrid protocols that try to alleviate the problem by combining invalidating and updating protocols have been proposed, we find their performance benefit to be small for NG-LLCs.
This work observes that the number of writes to a read/write shared cache block is likely to be stable over several consecutive write/read iterations. Based on this insight, we propose the 1-Updateprotocol that records the number of writes without an intervening read by a sharer, and subsequently uses there corded value to send at most one update after that number of writes has taken place. We have formally verified 1-Update and show that it achieves high efficacy in covering remote misses for read/write shared cache blocks while minimizing excess cache and interconnect traffic.
Original languageEnglish
Title of host publication2021 30th International Conference on Parallel Architectures and Compilation Techniques (PACT)
PublisherIEEE
Pages226-241
Number of pages16
ISBN (Electronic)978-1-6654-4278-7
ISBN (Print)978-1-6654-4279-4
DOIs
Publication statusPublished - 18 Oct 2021
Event30th International Conference on Parallel Architectures and Compilation Techniques - Online
Duration: 26 Sep 202129 Sep 2021
http://pact21.snu.ac.kr/

Conference

Conference30th International Conference on Parallel Architectures and Compilation Techniques
Abbreviated titlePACT 2021
Period26/09/2129/09/21
Internet address

Keywords

  • cache coherence
  • LLC
  • write invalidate/update

Fingerprint

Dive into the research topics of 'Invalidate or Update? Revisiting Coherence for Tomorrow’s Cache Hierarchies'. Together they form a unique fingerprint.

Cite this