triton.experimental.gluon.language.nvidia.ampere.async_copy.async_load

triton.experimental.gluon.language.nvidia.ampere.async_copy.async_load(smem, pointer, mask=None, cache_modifier='', eviction_policy='', volatile=False, _semantic=None)

Asynchronously load elements from global memory to shared memory.

Parameters:
  • smem (shared_memory_descriptor) – Destination shared memory descriptor.

  • pointer (tensor) – Source pointer tensor.

  • mask (tensor, optional) – Mask tensor for predicated loads. Defaults to None.

  • cache_modifier (str) – Cache modifier specifier. Defaults to ''.

  • eviction_policy (str) – Eviction policy specifier. Defaults to ''.

  • volatile (bool) – Whether the load is volatile. Defaults to False.