triton.experimental.gluon.language.NVMMASharedLayout
- class triton.experimental.gluon.language.NVMMASharedLayout(self, swizzle_byte_width: int, element_bitwidth: int, rank: int = 2, transposed: bool = False, fp4_padded: bool = False, cga_layout: List[List[int]] = <factory>)
Represents a layout for shared memory suitable for NVIDIA MMA operations.
- Parameters:
swizzle_byte_width (int) – Width in bytes for swizzling.
element_bitwidth (int) – Bitwidth of element type.
rank (int) – Rank of the tensor.
transposed (bool) – Whether the layout is transposed.
fp4_padded (bool) – Whether FP4 padding is used.
cga_layout (Optional[List[List[int]]]) – Bases describing CTA tiling.
- __init__(self, swizzle_byte_width: int, element_bitwidth: int, rank: int = 2, transposed: bool = False, fp4_padded: bool = False, cga_layout: List[List[int]] = <factory>) → None
Methods
__init__(self, swizzle_byte_width, ...)format_hardware_view(self, shape)format_tensor_view(self, shape)mangle(self)Attributes
fp4_paddedget_default_forReturns an NVMMASharedLayout with default swizzling for a given shape.
ranktransposedtypeswizzle_byte_widthelement_bitwidthcga_layout