Local data share (LDS)

Local data share (LDS)#

LDS Speed-of-Light#

Warning

The theoretical maximum throughput for some metrics in this section are currently computed with the maximum achievable clock frequency, as reported by rocminfo, for an accelerator. This may not be realistic for all workloads.

The LDS speed-of-light chart shows a number of key metrics for the LDS as a comparison with the peak achievable values of those metrics.

CDNA

Metric	Description	Unit
Utilization	Indicates what percent of the kernel’s duration the LDS was actively executing instructions (including, but not limited to, load, store, atomic and HIP’s `__shfl` operations). Calculated as the ratio of the total number of cycles LDS was active over the total CU cycles.	Percent
Access Rate	Indicates the percentage of SIMDs in the VALU [1] actively issuing LDS instructions, averaged over the lifetime of the kernel. Calculated as the ratio of the total number of cycles spent by the scheduler issuing LDS instructions over the total CU cycles.	Percent
Theoretical Bandwidth Utilization	Indicates the maximum amount of bytes that could have been loaded from, stored to, or atomically updated in the LDS divided as percentage of theoretical peak. Does not take into account the execution mask of the wavefront when the instruction was executed. See the LDS bandwidth example for more detail.	Percent
Bank Conflict Rate	Indicates the percentage of active LDS cycles that were spent servicing bank conflicts. Calculated as the ratio of LDS cycles spent servicing bank conflicts over the number of LDS cycles that would have been required to move the same amount of data in an uncontended access. [2]	Percent

CDNA 2

Metric	Description	Unit
Utilization	Indicates what percent of the kernel’s duration the LDS was actively executing instructions (including, but not limited to, load, store, atomic and HIP’s `__shfl` operations). Calculated as the ratio of the total number of cycles LDS was active over the total CU cycles.	Percent
Access Rate	Indicates the percentage of SIMDs in the VALU [1] actively issuing LDS instructions, averaged over the lifetime of the kernel. Calculated as the ratio of the total number of cycles spent by the scheduler issuing LDS instructions over the total CU cycles.	Percent
Theoretical Bandwidth Utilization	Indicates the maximum amount of bytes that could have been loaded from, stored to, or atomically updated in the LDS divided as percentage of theoretical peak. Does not take into account the execution mask of the wavefront when the instruction was executed. See the LDS bandwidth example for more detail.	Percent
Bank Conflict Rate	Indicates the percentage of active LDS cycles that were spent servicing bank conflicts. Calculated as the ratio of LDS cycles spent servicing bank conflicts over the number of LDS cycles that would have been required to move the same amount of data in an uncontended access. [2]	Percent

CDNA 3

Metric	Description	Unit
Utilization	Indicates what percent of the kernel’s duration the LDS was actively executing instructions (including, but not limited to, load, store, atomic and HIP’s `__shfl` operations). Calculated as the ratio of the total number of cycles LDS was active over the total CU cycles.	Percent
Access Rate	Indicates the percentage of SIMDs in the VALU [1] actively issuing LDS instructions, averaged over the lifetime of the kernel. Calculated as the ratio of the total number of cycles spent by the scheduler issuing LDS instructions over the total CU cycles.	Percent
Theoretical Bandwidth Utilization	Indicates the maximum amount of bytes that could have been loaded from, stored to, or atomically updated in the LDS divided as percentage of theoretical peak. Does not take into account the execution mask of the wavefront when the instruction was executed. See the LDS bandwidth example for more detail.	Percent
Bank Conflict Rate	Indicates the percentage of active LDS cycles that were spent servicing bank conflicts. Calculated as the ratio of LDS cycles spent servicing bank conflicts over the number of LDS cycles that would have been required to move the same amount of data in an uncontended access. [2]	Percent

CDNA 4

Metric	Description	Unit
Utilization	Indicates what percent of the kernel’s duration the LDS was actively executing instructions (including, but not limited to, load, store, atomic and HIP’s `__shfl` operations). Calculated as the ratio of the total number of cycles LDS was active over the total CU cycles.	Percent
Access Rate	Indicates the percentage of SIMDs in the VALU [1] actively issuing LDS instructions, averaged over the lifetime of the kernel. Calculated as the ratio of the total number of cycles spent by the scheduler issuing LDS instructions over the total CU cycles.	Percent
Theoretical Bandwidth Utilization	Indicates the maximum amount of bytes that could have been loaded from, stored to, or atomically updated in the LDS divided as percentage of theoretical peak. Does not take into account the execution mask of the wavefront when the instruction was executed. See the LDS bandwidth example for more detail.	Percent
Bank Conflict Rate	Indicates the percentage of active LDS cycles that were spent servicing bank conflicts. Calculated as the ratio of LDS cycles spent servicing bank conflicts over the number of LDS cycles that would have been required to move the same amount of data in an uncontended access. [2]	Percent

Footnotes

Statistics#

The LDS statistics panel gives a more detailed view of the hardware: