triton.experimental.gluon.language.nvidia.blackwell.mbarrier
Functions
Helper function to allocate an mbarrier.
Arrive at an mbarrier with a specified count.
Expect a specific number of bytes being copied.
Fence that makes prior mbarrier initialization visible across the CTA cluster.
Initialize an mbarrier with a specified count.
Invalidate an mbarrier, resetting its state.
Wait until the mbarrier object completes its current phase.
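The summaries above describe a phase-based barrier: it is initialized with an arrival count, arrivals and expected bytes are tracked per phase, and a waiter proceeds once the current phase completes. The following is a minimal host-side sketch of that bookkeeping in plain Python. It is not the Gluon API and does not run on a GPU. The class `ToyMBarrier`, its method names, and the `complete_tx` helper are hypothetical, chosen only to mirror the summaries. The sketch assumes a phase completes when both the outstanding arrivals and the outstanding bytes reach zero, and that a wait on a phase parity succeeds once the barrier has moved past that parity.

```python
class ToyMBarrier:
    """Toy model of mbarrier phase bookkeeping (hypothetical, not Gluon)."""

    def __init__(self):
        self.valid = False

    def init(self, count):
        # Arrivals required to complete each phase.
        self.expected = count
        # Arrivals still outstanding in the current phase.
        self.pending = count
        # Bytes still outstanding in the current phase.
        self.tx_bytes = 0
        # Parity (0 or 1) of the phase currently in progress.
        self.phase = 0
        self.valid = True

    def expect(self, nbytes):
        # Register bytes that an asynchronous copy will deliver later.
        self.tx_bytes += nbytes

    def complete_tx(self, nbytes):
        # Hypothetical helper: models a copy reporting delivered bytes.
        self.tx_bytes -= nbytes
        self._maybe_complete_phase()

    def arrive(self, count=1):
        self.pending -= count
        self._maybe_complete_phase()

    def _maybe_complete_phase(self):
        # Assumption: a phase completes when no arrivals or bytes remain.
        if self.pending == 0 and self.tx_bytes == 0:
            self.phase ^= 1
            self.pending = self.expected

    def try_wait(self, phase):
        # True once the phase with the given parity has completed.
        return self.phase != phase

    def invalidate(self):
        self.valid = False


bar = ToyMBarrier()
bar.init(count=1)
bar.expect(128)              # 128 bytes are expected in this phase
bar.arrive()                 # the single required arrival
before_copy = bar.try_wait(0)
bar.complete_tx(128)         # the copy lands, so phase 0 completes
after_copy = bar.try_wait(0)
```

In this sketch `before_copy` is false and `after_copy` is true, because the arrival alone does not complete the phase while bytes are still outstanding. A real kernel would block in the wait call rather than poll, and the hardware details are simplified here.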
Classes
Layout for mbarrier synchronization in Ampere and later architectures.