
--fill0 [std::vector<std::string>]

Fill parameter with 0s

--fill1 [std::vector<std::string>]

Fill parameter with 1s


Compile on the GPU


Compile on the CPU


Compile on the reference implementation


Enable implicit offload copying


Disable fast math optimization


Perform an exhaustive search to find the fastest version of the generated kernels for the selected backend


Quantize for fp16


Quantize for int8