- --fill0 [std::vector<std::string>]
Fill the given parameters with 0s
- --fill1 [std::vector<std::string>]
Fill the given parameters with 1s
- --gpu
Compile on the GPU
- --cpu
Compile on the CPU
- --ref
Compile on the reference implementation
- --enable-offload-copy
Enable implicit offload copying
- --disable-fast-math
Disable fast math optimization
- --exhaustive-tune
Perform an exhaustive search to find the fastest version of the generated kernels for the selected backend
- --fp16
Quantize for fp16
- --int8
Quantize for int8
- --fp8
Quantize for the Float8E4M3FNUZ type
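
As a rough illustration of how several of these options might be combined, the sketch below builds one command line and prints it. It assumes the options are passed to the `compile` subcommand of `migraphx-driver`. The model file `model.onnx` and the parameter name `input` are hypothetical placeholders, not names taken from this reference. The command is executed only if the driver is installed and the model file exists.

```shell
#!/bin/sh
# Hypothetical placeholders: substitute your own model file and parameter name.
MODEL="model.onnx"
PARAM="input"

# Assumption: these options are accepted by `migraphx-driver compile`.
# Target the GPU, quantize for fp16, tune exhaustively, and fill one parameter with 1s.
CMD="migraphx-driver compile $MODEL --gpu --fp16 --exhaustive-tune --fill1 $PARAM"

# Always show the command that would be run.
echo "$CMD"

# Run it only when the driver is on PATH and the model file is present.
if command -v migraphx-driver >/dev/null 2>&1 && [ -f "$MODEL" ]; then
    $CMD
fi
```

Swapping `--gpu` for `--cpu` or `--ref` selects a different compilation target, and `--fp16` can be replaced with `--int8` or `--fp8` to try another quantization mode.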