- --fill0 [std::vector<std::string>]#
 
Fill parameter with 0s
- --fill1 [std::vector<std::string>]#
 
Fill parameter with 1s
- --gpu#
 
Compile on the gpu
- --cpu#
 
Compile on the cpu
- --ref#
 
Compile on the reference implementation
- --enable-offload-copy#
 
Enable implicit offload copying
- --disable-fast-math#
 
Disable fast math optimization
- --exhaustive-tune#
 
Perform an exhaustive search to find the fastest version of generated kernels for selected backend
- --fp16#
 
Quantize for fp16
- --int8#
 
Quantize for int8
- --fp8#
 
Quantize for Float8E4M3FNUZ type