GL2#

On gfx115x, GL2 Cache (RDNA naming for what Instinct documentation refers to as L2/TCC) is the last-level GFX on-chip cache for most clients before traffic reaches the memory system.

After GL2, traffic proceeds toward DRAM through GCEA and related interfaces; those panels are documented under GCEA.

For Instinct L2 (TCC) coherence, channel hashing, and fabric metrics on CDNA architecture across MI-series GPUs, see L2 cache (TCC) under CDNA-CDNA4.

Note

The GL2 Cache is also referred to as GL2C in some contexts. Hardware counter names (for example, GL2C_HIT_sum) retain the GL2C prefix.

GL2 Cache panels#

GL2 Cache performance#

Metric

Description

Unit

GL2 Cache Hit Rate

Percentage of GL2 cache requests serviced from cache without forwarding through GCEA (Graphics Core Efficiency Arbiter). GL2 cache is the GPU’s final on-chip cache before accessing system memory. Higher hit rates significantly reduce memory bandwidth pressure and improve performance.

Percent

GL2 Cache Hits

Number of GL2 cache requests that were serviced from cache. High hit counts relative to total requests indicate effective caching. Low hit counts suggest memory-bound workloads or poor data locality.

Count per Normalization Unit

GL2 Cache Misses

Number of GL2 cache requests that missed and required fetching through GCEA (Graphics Core Efficiency Arbiter). High miss counts indicate heavy memory traffic. Consider improving data locality or reducing working set size.

Count per Normalization Unit

GL2 Cache request statistics#

Metric

Description

Unit

Total Requests

Total number of requests received by the GL2 cache from all GL1 caches and other clients. This represents the aggregate memory traffic at the GPU’s final on-chip cache.

Count per Normalization Unit

Read Requests

Number of read requests to the GL2 cache. High read counts indicate memory-intensive load operations. Compare with hit rate to assess GL2 cache effectiveness for reads.

Count per Normalization Unit

Write Requests

Number of write requests to the GL2 cache. Write traffic includes stores and cache writebacks from GL1. High write counts indicate write-intensive workloads.

Count per Normalization Unit

Atomic Requests

Number of atomic memory operations processed by the GL2 cache. Atomics provide thread-safe memory access but may serialize at contended addresses. High counts may impact performance.

Count per Normalization Unit

EA Read Requests (DRAM)

Number of read requests forwarded from GL2 cache through GCEA (Graphics Core Efficiency Arbiter). These requests go toward system memory, though they may be served by system-level caches before reaching DRAM. High counts indicate memory-bound workloads.

Count per Normalization Unit

EA Write Requests (DRAM)

Number of write requests forwarded from GL2 cache through GCEA (Graphics Core Efficiency Arbiter). These include evictions and writebacks going toward system memory. High counts indicate write-intensive workloads consuming memory bandwidth.

Count per Normalization Unit

GL2 Cache bandwidth#

Metric

Description

Unit

GL2 Cache Read BW

Achieved read bandwidth at the GL2 cache level. This represents total read traffic from GL1 caches to GL2. High bandwidth indicates memory-intensive read patterns.

Bytes/s

GL2 Cache Write BW

Achieved write bandwidth at the GL2 cache level. This represents total write traffic from GL1 caches to GL2. High bandwidth indicates memory-intensive write patterns.

Bytes/s

Memory chart: GL2 cache#

The following Memory Chart table aligns with the on-screen flow through GL2.

Memory chart - GL2 cache#

Metric

Description

Unit

GL2 Cache Hit Rate

Percentage of L2 cache requests that hit in cache. GL2 cache is the GPU’s final on-chip cache before accessing system memory via GCEA. High hit rates reduce memory bandwidth pressure. Low hit rates cause traffic to system memory, increasing latency.

Percent