NVIDIA Hopper
Issue a fence to complete asynchronous shared memory operations. |
|
Perform warpgroup MMA (Tensor Core) operations. |
|
Wait until num_outstanding or less warpgroup MMA operations are in-flight. |
Issue a fence to complete asynchronous shared memory operations. |
|
Perform warpgroup MMA (Tensor Core) operations. |
|
Wait until num_outstanding or less warpgroup MMA operations are in-flight. |