My task is to port a cache latency measurement program written in CUDA to HIP, and for that I need to query the characteristics of the GPU.
I'm using HIP 4.5.0, and the problem is that some of the fields in CUDA's deviceInfo have no (direct) analogues in HIP.
They are:
- (Max) Shared memory per multiprocessor
In CUDA, sharedMemPerMultiprocessor exists. In HIP, I found maxSharedMemoryPerMultiprocessor. Are they the same? Or is the first always less than or equal to the second, so that one can't stand in for the other?
- Registers per multiprocessor
In CUDA: regsPerMultiprocessor. In HIP: non-existent (according to this page). Is there a workaround for getting the number of registers per multiprocessor in this case?
Thank you for your answers!