triton.experimental.gluon.language.BlockedLayout

class triton.experimental.gluon.language.BlockedLayout(self, size_per_thread: ~typing.List[int], threads_per_warp: ~typing.List[int], warps_per_cta: ~typing.List[int], order: ~typing.List[int], cga_layout: ~typing.List[~typing.List[int]] = <factory>)

Represents a blocked layout, partitioning a tensor across threads, warps, and CTAs.

Parameters:
  • size_per_thread (List[int]) – Number of elements per thread per dimension.

  • threads_per_warp (List[int]) – Number of threads per warp per dimension.

  • warps_per_cta (List[int]) – Number of warps per CTA per dimension.

  • order (List[int]) – The ordering of dimensions for partitioning.

  • cga_layout (Optional[List[List[int]]]) – Bases describing how CTAs tile each dimension.

__init__(self, size_per_thread: ~typing.List[int], threads_per_warp: ~typing.List[int], warps_per_cta: ~typing.List[int], order: ~typing.List[int], cga_layout: ~typing.List[~typing.List[int]] = <factory>) None

Methods

__init__(self, size_per_thread, ...)

format_hardware_view(self, shape)

format_tensor_view(self, shape)

mangle(self)

Attributes

rank

type

size_per_thread

threads_per_warp

warps_per_cta

order

cga_layout