Non-blocking synchronization of streams in CUDA?


Is it possible to synchronize two CUDA streams without blocking the host? I know there's cudaStreamWaitEvent, which is non-blocking. But what about the creation and destruction of events using cudaEventCreate and cudaEventDestroy?

The documentation for cudaEventDestroy says:

In case event has been recorded but has not yet been completed when cudaEventDestroy() is called, the function will return immediately and the resources associated with event will be released automatically once the device has completed event.

What I don't understand here is the difference between a recorded event and a completed event. Also, this seems to imply that the call is blocking if the event has not yet been recorded.

Can anyone shed some light on this?

1 Answer

jefflarkin (accepted answer):

You're on the right track with cudaStreamWaitEvent. Creating events does carry some cost, but you can create them during application start-up so that the creation time doesn't add overhead inside your GPU routines.
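For example, here is a minimal sketch of that pattern. The application-wide event g_syncEvent and the appStartup/appShutdown hooks are hypothetical names for illustration; only the CUDA calls themselves are real API.

```
#include <cuda_runtime.h>

// Hypothetical application-wide event, created once at start-up.
static cudaEvent_t g_syncEvent;

void appStartup()
{
    // cudaEventDisableTiming skips timestamp bookkeeping, which is all you
    // need for an event used purely for synchronization.
    cudaEventCreateWithFlags(&g_syncEvent, cudaEventDisableTiming);
}

void appShutdown()
{
    // Per the documentation quoted above, this returns immediately even if
    // the event is still pending; resources are freed once the device
    // completes the event.
    cudaEventDestroy(g_syncEvent);
}
```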

An event is recorded when you put it into a stream with cudaEventRecord. It is completed once all work that was enqueued in that stream before the event has finished. Recording the event essentially places a marker in the stream, and that marker is what lets cudaStreamWaitEvent hold back forward progress on another stream until the event has completed.
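Putting it together, a sketch of the record/wait pattern might look like the following. The producer/consumer kernels, buffer size, and stream names are placeholders, and error checking is omitted for brevity; note that none of the record/wait calls block the host.

```
#include <cuda_runtime.h>

__global__ void producer(float *d) { d[threadIdx.x] = (float)threadIdx.x; }
__global__ void consumer(float *d) { d[threadIdx.x] *= 2.0f; }

int main()
{
    cudaStream_t streamA, streamB;
    cudaEvent_t  ev;
    float *d;

    cudaStreamCreate(&streamA);
    cudaStreamCreate(&streamB);
    cudaEventCreateWithFlags(&ev, cudaEventDisableTiming);
    cudaMalloc(&d, 256 * sizeof(float));

    producer<<<1, 256, 0, streamA>>>(d);   // work in stream A
    cudaEventRecord(ev, streamA);          // marker: "stream A reached here"
    cudaStreamWaitEvent(streamB, ev, 0);   // stream B waits for the marker;
                                           // the host is not blocked
    consumer<<<1, 256, 0, streamB>>>(d);   // runs only after producer finishes

    cudaStreamSynchronize(streamB);        // the only call here that blocks
                                           // the host, for the final check
    cudaFree(d);
    cudaEventDestroy(ev);
    cudaStreamDestroy(streamA);
    cudaStreamDestroy(streamB);
    return 0;
}
```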