triton.experimental.gluon.language.nvidia.blackwell.tma.async_copy_shared_to_global

triton.experimental.gluon.language.nvidia.blackwell.tma.async_copy_shared_to_global(tensor_desc, coord, src, _semantic=None)

Store data from shared memory to global memory using TMA.

Parameters:
  • tensor_desc (tensor_descriptor) – Tensor descriptor (tiled).

  • coord (Sequence[int | ttgl.constexpr | ttgl.tensor]) – Coordinates in the destination tensor.

  • src (ttgl.shared_memory_descriptor) – Source memory descriptor.