triton.language.extra.cuda
Programmatic Dependent Launch
GDC wait is a blocking instruction that waits for all instructions in a prior kernel to complete before continuing. |
|
This operation when launched with programmatic dependent launch signals that the next program may launch once all programs in the current kernel call this function or complete. |