triton.language.extra.cuda

Programmatic Dependent Launch

gdc_wait

GDC wait is a blocking instruction that waits for all instructions in a prior kernel to complete before continuing.

gdc_launch_dependents

This operation when launched with programmatic dependent launch signals that the next program may launch once all programs in the current kernel call this function or complete.