Getting Started
Python API
Gluon
Triton MLIR Dialects
Programming Guide
Computes matrix-multiplication of a * b + acc using AMD WMMA instruction.
a (tensor) – The operand a to be multiplied.
b (tensor) – The operand b to be multiplied.
acc (tensor) – The accumulator tensor.