I am trying to use local memory in my OpenCL kernel.
The relevant information is listed below.
Device info
GPU: Qualcomm Adreno 420
local memory size: 32768 bytes (32 KB)
max work group size: 1024
Kernel info A (without local memory usage)
CL_KERNEL_WORK_GROUP_SIZE=1024
CL_KERNEL_LOCAL_MEM_SIZE=0 Bytes
Kernel info B (with local memory usage)
CL_KERNEL_WORK_GROUP_SIZE=224
CL_KERNEL_LOCAL_MEM_SIZE=2048 Bytes
The only difference between kernels A and B is the use of local memory. What causes CL_KERNEL_WORK_GROUP_SIZE to drop from 1024 to 224?
If this were a register problem, then CL_KERNEL_WORK_GROUP_SIZE
of kernel A should also be lower than 1024.
I would really like to check the register usage, but I can't.
I believe that I have enough local memory and global memory.
Please help.