NVIDIA Blackwell
Cluster Launch Control (CLC) for Blackwell (SM100+) dynamic persistent kernels. |
|
Allocate tensor memory. |
|
Issue a fence to complete asynchronous shared memory operations. |
|
Represents a tensor memory descriptor handle for Tensor Core Gen5 operations. |
|
Describes the layout for tensor memory in Blackwell architecture. |
|
Describes the layout for tensor memory scales in Blackwell architecture. |