GL2#

On gfx1151, GL2C (RDNA naming for what Instinct documentation refers to as L2/TCC) is the last-level on-chip cache for most clients.

Traffic leaving GL2 heads toward GCEA and DRAM through the DRAM read/write, SARB, and return interfaces in the panel YAMLs.

For Instinct L2 (TCC) coherence, channel hashing, and fabric metrics on CDNA architecture across MI-series GPUs, see L2 cache (TCC) under CDNA-CDNA4.

GL2C panels#

GL2C cache performance#

Metric

Description

Unit

GL2C Hit Rate

Percentage of GL2C requests that hit in the cache. GL2C is the last-level cache before system memory.

Percent

GL2C Hits

Number of requests that hit in the GL2C cache.

Count per Normalization Unit

GL2C Misses

Number of requests that missed in the GL2C cache and went to memory.

Count per Normalization Unit

GL2C request statistics#

Metric

Description

Unit

Total Requests

Number of requests of all types to the GL2C cache (counter GL2C_REQ). This is measured at the tag block (request path into GL2C tag / lookup). Not guaranteed to equal Read + Write + Atomic; those use separate perf events and may omit or overlap traffic relative to this aggregate.

Count per Normalization Unit

Read Requests

Number of read requests to the GL2C cache (GL2C_READ).

Count per Normalization Unit

Write Requests

Number of write requests to the GL2C cache (GL2C_WRITE).

Count per Normalization Unit

Atomic Requests

Number of atomic requests at the GL2C cache (GL2C_ATOMIC).

Count per Normalization Unit

EA Read Requests (DRAM)

Number of GL2C/EA read requests destined for DRAM (memory controller). gl2c_perf_sel_ea_rdreq_dram (GL2C_EA_RDREQ_DRAM). Reported values are summed over GL2C instances.

Count per Normalization Unit

EA Write Requests (DRAM)

Number of GL2C/EA write requests destined for DRAM (memory controller). gl2c_perf_sel_ea_wrreq_dram (GL2C_EA_WRREQ_DRAM). Reported values are summed over GL2C instances.

Count per Normalization Unit

GL2C bandwidth#

Metric

Description

Unit

GL2C Read BW

Read bandwidth from GL2C_READ_{32,64,128}_REQ byte accounting plus GL2C_COMPRESSED_READ_96_REQ (96 B compressed reads). EA 96 B reads use GL2C_EA_RDREQ_96B.

Bytes/s

GL2C Write BW

Achieved write bandwidth at the GL2C level.

Bytes/s

GCEA and DRAM interfaces#

DRAM read interface#

Metric

Description

Unit

DRAM Read Requests

Number of read requests sent to system memory (DRAM) per client group. RDNA3.5 uses DDR5/LPDDR5 system memory instead of HBM.

Count per Normalization Unit

DRAM Read Chained Requests

Number of chained (sequential) read request chains sent to DRAM. Chained requests improve memory efficiency for sequential access patterns.

Count per Normalization Unit

DRAM Read Banks Active

Number of DRAM banks with pending read requests (saturates at 16). More active banks indicate better memory parallelism.

Count per Normalization Unit

DRAM Read Size (32B increments)

Total size of DRAM read requests in 32-byte increments.

Count per Normalization Unit

DRAM write interface#

Metric

Description

Unit

DRAM Write Requests

Number of write requests sent to system memory (DRAM) per client group.

Count per Normalization Unit

DRAM Write Chained Requests

Number of chained (sequential) write request chains sent to DRAM.

Count per Normalization Unit

DRAM Write Banks Active

Number of DRAM banks with pending write requests (saturates at 16).

Count per Normalization Unit

DRAM Write Size (32B increments)

Total size of DRAM write requests in 32-byte increments.

Count per Normalization Unit

System Arbiter (SARB)#

Metric

Description

Unit

SARB Busy

Percentage of cycles the System Arbiter was actively processing requests. Per sample: 100 * GCEA_SARB_BUSY / GCEA_ALWAYS_COUNT (when GCEA_ALWAYS_COUNT != 0). GCEA_ALWAYS_COUNT is the cycle-count denominator for these percentages.

Percent

SARB Stalled

Percentage of cycles the System Arbiter was stalled (has valid request but downstream not ready). Per sample: 100 * GCEA_SARB_STALLED / GCEA_ALWAYS_COUNT (when GCEA_ALWAYS_COUNT != 0).

Percent

SARB Starving

Percentage of cycles the System Arbiter was starving (no valid requests but downstream ready). Per sample: 100 * GCEA_SARB_STARVING / GCEA_ALWAYS_COUNT (when GCEA_ALWAYS_COUNT != 0).

Percent

SARB Total Requests

Total requests per virtual channel through the System Arbiter.

Count per Normalization Unit

SARB DRAM Requests

DRAM-bound requests per virtual channel through the System Arbiter.

Count per Normalization Unit

Return interface#

Metric

Description

Unit

Read Returns

Number of read data returns from the external memory interface.

Count per Normalization Unit

Write Returns

Number of write acknowledgements from the external memory interface.

Count per Normalization Unit

Probe Requests

Number of probe-invalidate requests (cache coherency operations).

Count per Normalization Unit