hipBLASLt environment variables
This section describes the important hipBLASLt environment variables, which are grouped by functionality.
Logging and debugging
The logging and debugging environment variables for hipBLASLt are collected in the following table. For more information, see Use logging and heuristics.
| Environment variable | Value |
|---|---|
| HIPBLASLT_LOG_LEVEL<br>Controls the verbosity level of hipBLASLt logging output. | 0: Off (logging disabled, default)<br>1: Error (only errors are logged)<br>2: Trace (API calls that launch kernels log their parameters)<br>3: Hints (performance improvement suggestions)<br>4: Info (general library execution information)<br>5: API trace (detailed API call parameters) |
| HIPBLASLT_LOG_MASK<br>Controls logging output using bit mask flags, which can be combined. | 0: Off<br>1: Error<br>2: Trace<br>4: Hints<br>8: Info<br>16: API trace<br>32: Bench<br>64: Profile<br>128: Extended profile |
| HIPBLASLT_LOG_FILE<br>Specifies the path to the log file. The path can contain %i, which is replaced with the process ID. | Path to log file (for example, logfile_%i.log)<br>If not defined: log messages are printed to stdout |
| HIPBLASLT_ENABLE_MARKER<br>Enables marker trace for ROCProfiler profiling. | 0 or unset: Disable marker trace<br>1: Enable marker trace |
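Because the HIPBLASLT_LOG_MASK flags are powers of two, several categories can be enabled at once by adding (OR-ing) their values. The following Python sketch shows one way to compute a combined mask and prepare an environment for a child process. The `LogMask` class and the `logging_env` helper are hypothetical names introduced for illustration only; they are not part of hipBLASLt.

```python
import os
from enum import IntFlag


class LogMask(IntFlag):
    """Bit values from the HIPBLASLT_LOG_MASK table (illustrative helper)."""
    ERROR = 1
    TRACE = 2
    HINTS = 4
    INFO = 8
    API_TRACE = 16
    BENCH = 32
    PROFILE = 64
    EXTENDED_PROFILE = 128


def logging_env(mask, log_file=None, base=None):
    """Return a copy of `base` (default: os.environ) with logging variables set."""
    env = dict(os.environ if base is None else base)
    env["HIPBLASLT_LOG_MASK"] = str(int(mask))
    if log_file is not None:
        # "%i" is left as-is; the table states it is replaced with the process ID.
        env["HIPBLASLT_LOG_FILE"] = log_file
    return env


# Enable the Error, Info, and Bench categories: 1 + 8 + 32 = 41.
env = logging_env(LogMask.ERROR | LogMask.INFO | LogMask.BENCH,
                  log_file="logfile_%i.log", base={})
print(env["HIPBLASLT_LOG_MASK"])  # prints 41
```

The resulting dictionary can be passed as the `env` argument of `subprocess.run` when launching an application that uses hipBLASLt.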
Offline tuning
The offline tuning environment variables for hipBLASLt are collected in the following table. For more information, see Use hipBLASLt offline tuning.
| Environment variable | Value |
|---|---|
| HIPBLASLT_TUNING_FILE<br>Specifies the file in which to store tuning results, with the best solution indices for GEMM problems. | Path to tuning file (for example, tuning.txt)<br>File stores optimal kernel indices for reuse |
| HIPBLASLT_TUNING_OVERRIDE_FILE<br>Specifies the file from which to load tuning results and override the default kernel selection. | Path to tuning file (for example, tuning.txt)<br>Loads previously saved optimal kernel choices |
| HIPBLASLT_TUNING_USER_MAX_WORKSPACE<br>Sets the maximum workspace size constraint during the tuning stage. | Integer value in bytes (default: 128 * 1024 * 1024)<br>Limits workspace size for solution selection |
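The table suggests a two-stage workflow: one run that writes tuning results, and later runs that load them. The Python sketch below builds the variables for each stage under that assumption. The helper names `tuning_env` and `override_env` are hypothetical, and rejecting a negative workspace size is a choice made for this example rather than documented library behavior.

```python
DEFAULT_MAX_WORKSPACE = 128 * 1024 * 1024  # default from the table, in bytes


def tuning_env(tuning_file, max_workspace_bytes=None):
    """Variables for the tuning stage: results are stored in `tuning_file`."""
    env = {"HIPBLASLT_TUNING_FILE": tuning_file}
    if max_workspace_bytes is not None:
        if max_workspace_bytes < 0:
            raise ValueError("workspace size must be non-negative")
        env["HIPBLASLT_TUNING_USER_MAX_WORKSPACE"] = str(max_workspace_bytes)
    return env


def override_env(tuning_file):
    """Variables for later runs: saved kernel choices are loaded from `tuning_file`."""
    return {"HIPBLASLT_TUNING_OVERRIDE_FILE": tuning_file}


# Tune with a 64 MiB workspace cap, then reuse the results in a later run.
stage_one = tuning_env("tuning.txt", max_workspace_bytes=64 * 1024 * 1024)
stage_two = override_env("tuning.txt")
print(stage_one)
print(stage_two)
```

Either dictionary can be merged into a copy of `os.environ` before launching the application for the corresponding stage.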
Stream-K configuration
The Stream-K configuration environment variables for hipBLASLt are collected in the following table. For more information, see Use Stream-K with hipBLASLt.
| Environment variable | Value |
|---|---|
| TENSILE_SOLUTION_SELECTION_METHOD<br>Controls the hipBLASLt kernel selection strategy for GEMM operations. | 0: Default (standard tuned libraries, no Stream-K)<br>2: Stream-K (enables Stream-K library for consistent performance)<br>This variable has no effect on the AMD Instinct™ MI350 series. Stream-K is always used. |
| TENSILE_STREAMK_DYNAMIC_GRID<br>Controls Stream-K dynamic grid size selection behavior. | 0: Disable dynamic grid (use all available compute units)<br>3: Default (automatically pick optimal workgroup count) |
| TENSILE_STREAMK_FIXED_GRID<br>Overrides the default grid size with the specified number of workgroups for Stream-K kernels. | Integer value specifying number of workgroups<br>Example: 64 (limits GEMM kernels to 64 workgroups) |
| TENSILE_STREAMK_MAX_CUS<br>Sets the maximum number of compute units for Stream-K kernels. | Integer value specifying maximum compute units<br>Example: 32 (limits GEMM kernels to 32 compute units)<br>Default: All available compute units |
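As an illustration of how these settings combine, the Python sketch below selects the Stream-K library and optionally constrains the grid. The `streamk_env` helper is a hypothetical name, and the check that grid and compute-unit values are positive is an assumption made for this example. How the library resolves TENSILE_STREAMK_FIXED_GRID and TENSILE_STREAMK_DYNAMIC_GRID when both are set is not stated in the table, so the sketch does not assume any precedence.

```python
def streamk_env(dynamic_grid=None, fixed_grid=None, max_cus=None):
    """Build Stream-K variables; settings left as None are not emitted."""
    # 2 selects the Stream-K library, per the table.
    env = {"TENSILE_SOLUTION_SELECTION_METHOD": "2"}
    if dynamic_grid is not None:
        # The table documents 0 (disabled) and 3 (default).
        env["TENSILE_STREAMK_DYNAMIC_GRID"] = str(dynamic_grid)
    for name, value in (("TENSILE_STREAMK_FIXED_GRID", fixed_grid),
                        ("TENSILE_STREAMK_MAX_CUS", max_cus)):
        if value is not None:
            if value <= 0:  # assumption: only positive counts are meaningful
                raise ValueError(f"{name} must be a positive integer")
            env[name] = str(value)
    return env


# Limit Stream-K GEMM kernels to 32 compute units, as in the table's example.
env = streamk_env(max_cus=32)
print(env)
```

The returned dictionary can be merged into a copy of `os.environ` and passed to `subprocess.run` through its `env` argument when starting the application.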