triton.experimental.gluon.language.amd.cdna4.buffer_store

triton.experimental.gluon.language.amd.cdna4.buffer_store(stored_value, ptr, offsets, mask=None, cache=None, _semantic: GluonSemantic = None)

Stores a tensor directly to global memory with an AMD buffer store. Addresses are formed from a scalar base pointer plus a tensor of offsets, instead of from a tensor of pointers.

Parameters:
  • stored_value (tensor) – The tensor to store to global memory.

  • ptr (pointer to scalar) – Global memory scalar base pointer to store to.

  • offsets (tensor) – Tensor of offsets from ptr that selects where each element is stored.

  • mask (tensor, optional) – Mask tensor for predicated store. Defaults to None.

  • cache (str, optional) – Cache modifier specifier. Defaults to None.
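
The addressing described above can be modeled in plain Python. This is a minimal sketch of the semantics only, not Gluon code and not how the hardware instruction is implemented: `emulate_buffer_store` is a hypothetical helper, memory is modeled as a flat list, and the sketch assumes that offsets are counted in elements and that masked-off elements leave memory unchanged.

```python
def emulate_buffer_store(memory, base, stored_value, offsets, mask=None):
    """Hypothetical model of a masked store: memory[base + offsets[i]] = stored_value[i].

    `memory` is a flat Python list standing in for global memory, and `base`
    is an integer index standing in for the scalar base pointer.
    """
    if len(stored_value) != len(offsets):
        raise ValueError("stored_value and offsets must have the same length")
    if mask is None:
        # No mask: every element is stored.
        mask = [True] * len(offsets)
    if len(mask) != len(offsets):
        raise ValueError("mask and offsets must have the same length")

    for value, offset, keep in zip(stored_value, offsets, mask):
        if keep:
            # Assumption: offsets are in elements, relative to the base pointer.
            memory[base + offset] = value
    return memory


memory = [0] * 8
emulate_buffer_store(
    memory,
    base=2,
    stored_value=[10, 20, 30, 40],
    offsets=[0, 1, 2, 3],
    mask=[True, False, True, True],
)
print(memory)  # [0, 0, 10, 0, 30, 40, 0, 0]
```

The second element is skipped because its mask entry is False, so the slot at index 3 keeps its previous value.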