I'm new to CUDA programming.
I have to profile my application on the GPU using nvprof.
I found the event l2_subp0_write_sector_misses, which is described as the number of write requests sent to DRAM from slice 0 of the L2 cache.
But I don't know what a slice is.
My guess was that a slice corresponds to an SM, because the GTX 750 has 4 SMs and there are 4 slices in the events (l2_subp0_write_sector_misses, l2_subp1_write_sector_misses, l2_subp2_write_sector_misses, l2_subp3_write_sector_misses).
However, when I tested an application that uses a single thread, all of the write_misses events had nonzero values.
If a slice were an SM, I would expect only one write_misses event to have a value and the others to be 0.
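For reference, here is a minimal sketch of the kind of single-thread test I mean. It is not my actual application: the kernel name `write_kernel`, the buffer `d_out`, and the size `n` are made up for illustration, and I am assuming a simple loop of global-memory writes is enough to produce L2 write traffic.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical kernel: a single thread writes n integers to global memory.
__global__ void write_kernel(int *out, int n)
{
    for (int i = 0; i < n; ++i) {
        out[i] = i;
    }
}

int main()
{
    const int n = 1024;  // illustrative size, not taken from my application
    int *d_out = nullptr;

    if (cudaMalloc(&d_out, n * sizeof(int)) != cudaSuccess) {
        fprintf(stderr, "cudaMalloc failed\n");
        return 1;
    }

    // Launch one block containing one thread.
    write_kernel<<<1, 1>>>(d_out, n);

    // Wait for the kernel to finish and report any error.
    cudaError_t err = cudaDeviceSynchronize();
    if (err != cudaSuccess) {
        fprintf(stderr, "kernel failed: %s\n", cudaGetErrorString(err));
        cudaFree(d_out);
        return 1;
    }

    cudaFree(d_out);
    return 0;
}
```

Assuming the binary is built as `./single_thread` (a placeholder name), the events can be collected with something like `nvprof --events l2_subp0_write_sector_misses,l2_subp1_write_sector_misses ./single_thread`.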
What is a slice?
And is it wrong to assume that 1 thread uses 1 SM?