GCEA#

GCEA names the stage where traffic leaves the on-chip cache hierarchy toward DRAM, covering memory-channel interfaces, the system arbiter (SARB), and return paths. Use these panels when analyzing bandwidth limits or memory-controller behavior after GL2.

The sections below list RDNA3.5 (gfx1151) GCEA / DRAM interface metric descriptions.

For on-chip GL2 cache tables, see GL2.

Note

Panel YAMLs label DRAM read/write, SARB, and return-interface metrics together under this memory hierarchy stage.

DRAM read interface#

Metric

Description

Unit

DRAM Read Requests

Number of read requests sent to system memory. RDNA 3.5 APUs use DDR5/LPDDR5 system memory. High read request counts indicate memory-intensive workloads that may be bandwidth limited.

Count per Normalization Unit

DRAM Read Chained Requests

Number of sequential read request chains to system memory. Chained requests combine multiple sequential accesses for better memory efficiency. High chain counts indicate good memory access patterns.

Count per Normalization Unit

DRAM Read Banks Active

Number of system memory banks with pending read requests. More active banks enable better memory-level parallelism. Low bank utilization may indicate suboptimal access patterns limiting bandwidth.

Count per Normalization Unit

DRAM Read Size (32B increments)

Total volume of read data transferred from system memory. This metric helps quantify memory read bandwidth consumption. Compare with theoretical bandwidth to assess memory utilization.

Count per Normalization Unit

DRAM write interface#

Metric

Description

Unit

DRAM Write Requests

Number of write requests sent to system memory. High write request counts indicate write-intensive workloads. Combined with read requests, this shows total memory traffic.

Count per Normalization Unit

DRAM Write Chained Requests

Number of sequential write request chains to system memory. Chained writes combine sequential stores for better efficiency. High chain counts indicate good write access patterns.

Count per Normalization Unit

DRAM Write Banks Active

Number of system memory banks with pending write requests. Higher bank utilization enables better write bandwidth. Low values may indicate access pattern issues.

Count per Normalization Unit

DRAM Write Size (32B increments)

Total volume of write data transferred to system memory. This quantifies memory write bandwidth consumption. High values indicate write-intensive workloads.

Count per Normalization Unit

System Arbiter (SARB)#

Metric

Description

Unit

SARB Busy

Percentage of cycles the System Arbiter is actively processing memory requests. High busy percentage indicates significant off-chip memory traffic. This is a key indicator of memory-bound behavior.

Percent

SARB Stalled

Percentage of cycles the System Arbiter has requests but cannot make progress due to downstream backpressure. High stall rates indicate memory bandwidth saturation or system memory contention.

Percent

SARB Starving

Percentage of cycles the System Arbiter has no requests to process. High starvation indicates compute-bound workloads or effective caching reducing memory traffic.

Percent

SARB Total Requests

Total memory requests processed by the System Arbiter. This represents all traffic between the GPU caches and system memory interface, including reads, writes, and atomics.

Count per Normalization Unit

SARB DRAM Requests

Memory requests specifically destined for system memory (DRAM). This excludes requests serviced by other system components. High counts indicate memory-intensive workloads.

Count per Normalization Unit

Return interface#

Metric

Description

Unit

Read Returns

Number of completed read operations returning data from system memory. Each return delivers requested data to the GPU caches. High counts indicate successful memory read traffic.

Count per Normalization Unit

Write Returns

Number of completed write acknowledgements from system memory. Each return confirms successful data write. High counts indicate write-intensive workloads completing successfully.

Count per Normalization Unit

Probe Requests

Number of cache coherency probe operations. These occur when external agents or system components require cache synchronization. High counts may indicate coherency traffic impacting performance.

Count per Normalization Unit

Memory chart: GCEA to system memory#

The following Memory Chart table aligns with the on-screen flow from GCEA out to system memory.

Memory chart - GCEA to system memory#

Metric

Description

Unit

SARB Utilization

Percentage of cycles the System Arbiter is actively processing memory requests. The arbiter handles traffic between the GPU caches and system memory interface. High utilization indicates significant off-chip memory activity.

Percent

DRAM Read Bandwidth

Achieved read bandwidth to system memory. RDNA 3.5 APUs use DDR5/LPDDR5 system memory. High read bandwidth indicates memory-intensive read patterns. This is a key metric for identifying memory-bound workloads.

Bytes/s

DRAM Write Bandwidth

Achieved write bandwidth to system memory. High write bandwidth indicates write-intensive workloads. Combined with read bandwidth, this shows total memory traffic between the GPU and system memory.

Bytes/s

SARB Stall Rate

Percent of GCEA sample cycles the System Arbiter (SARB) was stalled. Formula: 100 * GCEA_SARB_STALLED_sum / GCEA_ALWAYS_COUNT_sum.

Percent

DRAM Read Requests

DRAM read requests per client group from the GCEA memory path (GCEA_RDRAM_REQ_PER_CLIGRP_sum / $denom).

Requests per Normalization Unit

DRAM Write Requests

DRAM write requests per client group from the GCEA memory path (GCEA_WDRAM_REQ_PER_CLIGRP_sum / $denom).

Requests per Normalization Unit

Read Returns

Valid read-return events on the GCEA return path per normalization unit (GCEA_RRET_VLD_sum / $denom).

Count per Normalization Unit

Write Returns

Valid write-return events on the GCEA return path per normalization unit (GCEA_WRET_VLD_sum / $denom).

Count per Normalization Unit