Tensile documentation#
Tensile is a tool for creating a benchmark-driven backend library for GEMMs, GEMM-like problems (such as batched GEMM), N-dimensional tensor contractions, and anything else that multiplies two multi-dimensional objects together on AMD GPU.
The code is open source and hosted at: ROCm/Tensile
Getting Started
Concepts
Reference