include/ck_tile/core/arch/mma/wmma/wmma_gfx11.hpp Source File
// already been completed. (e.g. option 2 above). These expect duplicated inputs and pre-shuffled
Definition: amdgcn_mma.hpp:10
WmmaCtrlFlags
Common wmma control flags for gfx11 and gfx12.
Definition: wmma.hpp:13
typename impl::ext_vector< T, N >::type ext_vector_t
Definition: vector_type.hpp:84
Meta-tag for the WMMA operation. This will be used in the MmaOp struct to identify the operation as a...
Definition: wmma_gfx11.hpp:47
constexpr static WmmaCtrlFlags InputSignA
Definition: wmma_gfx11.hpp:50
constexpr static WmmaCtrlFlags InputSignB
Definition: wmma_gfx11.hpp:52
constexpr static WmmaCtrlFlags AccumBits
Definition: wmma_gfx11.hpp:59
constexpr static WmmaCtrlFlags AccumSign
Definition: wmma_gfx11.hpp:54
ext_vector_t< fp16_t, 16 > BVecType
Definition: wmma_gfx11.hpp:87
ext_vector_t< fp32_t, 8 > CVecType
Definition: wmma_gfx11.hpp:88
ext_vector_t< fp16_t, 16 > AVecType
Definition: wmma_gfx11.hpp:86
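The per-lane vector widths listed above can be sanity-checked with a little arithmetic. The sketch below is not part of wmma_gfx11.hpp. It assumes a 16x16x16 tile and a 32-lane wavefront, neither of which is stated in this listing, and the names `kM`, `kN`, `kK`, `kWaveSize`, `c_elems_per_lane` and `input_copies` are hypothetical. Under those assumptions, the 8-element CVecType covers the output tile exactly once, while the 16-element AVecType and BVecType hold each input tile twice across the wavefront, which would be consistent with the "duplicated inputs" mentioned in the source comment.

```cpp
#include <cstddef>

// Assumed tile shape and wavefront size (not taken from the listing).
constexpr std::size_t kM = 16, kN = 16, kK = 16;
constexpr std::size_t kWaveSize = 32;

// Per-lane vector widths as shown in the listing.
constexpr std::size_t kAVecSize = 16; // AVecType = ext_vector_t<fp16_t, 16>
constexpr std::size_t kBVecSize = 16; // BVecType = ext_vector_t<fp16_t, 16>
constexpr std::size_t kCVecSize = 8;  // CVecType = ext_vector_t<fp32_t, 8>

// Accumulator elements each lane must hold to cover an M x N tile once.
constexpr std::size_t c_elems_per_lane() { return kM * kN / kWaveSize; }

// How many copies of a (rows x K) input tile the whole wavefront holds.
constexpr std::size_t input_copies(std::size_t vec_size, std::size_t rows)
{
    return vec_size * kWaveSize / (rows * kK);
}

static_assert(c_elems_per_lane() == kCVecSize, "C tile is covered exactly once");
static_assert(input_copies(kAVecSize, kM) == 2, "A tile is held twice");
static_assert(input_copies(kBVecSize, kN) == 2, "B tile is held twice");
```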
static CK_TILE_DEVICE auto exec(AVecType const &aVec, BVecType const &bVec, CVecType const &cVec) -> CVecType
Definition: wmma_gfx11.hpp:103
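For orientation, the signature of `exec` (two input vectors, an accumulator, and a result of the accumulator type) suggests a matrix multiply-accumulate. The host-side model below is only a sketch of that math under stated assumptions: it is not the device implementation, the 16x16x16 tile shape is assumed, `float` stands in for both `fp16_t` and `fp32_t`, the row-major layout ignores the per-lane register layout entirely, and `Tile` and `reference_mma` are hypothetical names.

```cpp
#include <array>
#include <cstddef>

// Assumed square tile edge (not taken from the listing).
constexpr std::size_t kTile = 16;

// Row-major tile; float stands in for fp16_t inputs and fp32_t accumulators.
using Tile = std::array<float, kTile * kTile>;

// Returns A * B + C for a single tile.
inline Tile reference_mma(const Tile& a, const Tile& b, const Tile& c)
{
    Tile d = c; // start from the accumulator
    for(std::size_t m = 0; m < kTile; ++m)
        for(std::size_t n = 0; n < kTile; ++n)
            for(std::size_t k = 0; k < kTile; ++k)
                d[m * kTile + n] += a[m * kTile + k] * b[k * kTile + n];
    return d;
}
```

A model like this can serve as a reference when checking device results, provided the per-lane vectors are first unpacked into whole tiles.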
This is the default MmaOp policy. Instances of this class are to be used as MmaOp policies....
Definition: amdgcn_mma.hpp:82
static constexpr index_t kCNLane
Definition: amdgcn_mma.hpp:101
static constexpr index_t kAMBlock
Definition: amdgcn_mma.hpp:92
static constexpr index_t kAMLane
Definition: amdgcn_mma.hpp:95
static constexpr index_t kCM1PerLane
Definition: amdgcn_mma.hpp:103
static constexpr index_t kBNBlock
Definition: amdgcn_mma.hpp:93
static constexpr index_t kCM0PerLane
Definition: amdgcn_mma.hpp:102
static constexpr index_t kABKLane
Definition: amdgcn_mma.hpp:97
static constexpr index_t kBNLane
Definition: amdgcn_mma.hpp:96
static constexpr index_t kABKPerLane
Definition: amdgcn_mma.hpp:98
static constexpr index_t kCMLane
Definition: amdgcn_mma.hpp:100