Aggressive GPU cache bypassing with monolithic 3D-based NoC

Cong Thuan Do, Cheol Hong Kim, Sung Woo Chung

Research output: Contribution to journalArticlepeer-review


Cache bypassing is widely employed to alleviate cache contention and pollution in GPUs. However, cache bypassing often puts more pressure on the network-on-chip (NoC) since the bypassed requests need to traverse the NoC to reach the lower-level memories, thus worsening the NoC congestion. In this paper, we propose an aggressive GPU cache bypassing technique (called SC-Table) to alleviate cache contention and pollution. The SC-Table relies on 2-bit saturating counters (SCs) to store the bypass history of warps. Memory requests issued by a warp are allowed to bypass the L1D when the corresponding SC’s value reaches the bypass threshold. In addition, we adopt the monolithic 3D-based NoC (M3D NoC) to provide better NoC throughput and latency. The combination of the SC-Table and the M3D NoC improves GPU performance by 34.6%, on average, over the baseline where there is no cache bypassing and the traditional 2D NoC is adopted.

Original languageEnglish
JournalJournal of Supercomputing
Publication statusAccepted/In press - 2022


  • 3D network-on-chip
  • Cache bypassing

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Software
  • Information Systems
  • Hardware and Architecture


Dive into the research topics of 'Aggressive GPU cache bypassing with monolithic 3D-based NoC'. Together they form a unique fingerprint.

Cite this