Can NVIDIA Visual Profiler display concurrent kernel execution?

Question

935 views. Asked by shadow on 07 August 2012 at 12:33

I have read on many forums that NVIDIA Visual Profiler serializes the program in order to collect timing information.

However, the Context tab of the Visual Profiler offers advice such as "There is no time overlap between memory copies and kernels on GPU", and when memory copies and kernel execution do overlap, it displays the overlap time. Also, slide 6 of the following webinar shows an output trace of overlapping kernels.

I want to know whether the profiler can display information about concurrent kernel execution (i.e., if we run 3 kernels in parallel using 3 different streams, can the profiler show whether this is actually happening on the GPU?). If so, where in the Visual Profiler can I find this information?

There is 1 answer

**Eugene** · Answer 1 · 2012-08-07T16:17:59+00:00

Yes.

Both nvprof and Visual Profiler (nvvp) in CUDA Toolkit 5.0 (available as a preview release to registered CUDA developers) support concurrent kernel execution.
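To try this out, here is a minimal sketch of the scenario from the question: three kernels launched into three different streams. It assumes a GPU of compute capability 2.0 or later (needed for concurrent kernels and for `clock64()`). The kernel name `spin` and the cycle count are made up for illustration; they are not from the original post.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical kernel: busy-waits for a given number of clock cycles so
// that each launch is long enough to be visible on a profiler timeline.
__global__ void spin(long long cycles)
{
    long long start = clock64();
    while (clock64() - start < cycles) { }
}

int main()
{
    const int kNumStreams = 3;
    cudaStream_t streams[kNumStreams];

    for (int i = 0; i < kNumStreams; ++i)
        cudaStreamCreate(&streams[i]);

    // One tiny block per launch, each in its own non-default stream, so
    // the device has spare resources to run the kernels side by side.
    for (int i = 0; i < kNumStreams; ++i)
        spin<<<1, 1, 0, streams[i]>>>(100000000LL);  // arbitrary cycle count

    cudaError_t err = cudaDeviceSynchronize();
    printf("status: %s\n", cudaGetErrorString(err));

    for (int i = 0; i < kNumStreams; ++i)
        cudaStreamDestroy(streams[i]);

    // Resetting the device before exit helps ensure profile data is flushed.
    cudaDeviceReset();
    return err == cudaSuccess ? 0 : 1;
}
```

Assuming the file is saved as `concurrent.cu` (a hypothetical name), it can be built with `nvcc -o concurrent concurrent.cu` and run under `nvprof --print-gpu-trace ./concurrent`. The GPU trace lists a start time, duration, and stream for each kernel launch, so overlapping time intervals on different streams indicate that the kernels ran concurrently. In the Visual Profiler, the same information should appear on the timeline, where each stream gets its own row. Whether the kernels actually overlap depends on the device and on the resources each kernel uses.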
