List Question
20 TechQA 2011-02-23T23:30:41.140000CUDA Warp Synchronization Problem
3k views
Asked by dmc
On today's GPUs, can warps be recombined dynamically?
47 views
Asked by Armin Rigo
Removing __syncthreads() in CUDA warp-level reduction
3.9k views
Asked by small_potato
Why bother to know about CUDA Warps?
11.2k views
Asked by Alex
Compute per-warp histogram without shared memory
159 views
Asked by pem
CUDA Warp Divergence
662 views
Asked by Melenor
Is CUDA warp scheduling deterministic?
655 views
Asked by NothingMore
CUDA kernel with single branch runs 1.5x faster than kernel without branch
711 views
Asked by Stepan Tezyunichev
Some intrinsics named with `_sync()` appended in CUDA 9; semantics same?
645 views
Asked by einpoklum
Pre 8.x equivalent of __reduce_max_sync() in CUDA
259 views
Asked by Serge Rogatch
Do modern nVIDIA GPUs perform sub-warp scheduling of work?
734 views
Asked by einpoklum
OpenGL compute shader mapping to nVidia warps
1k views
Asked by Danol
Questions of resident warps of CUDA
456 views
Asked by Falofter
CUDA coalesced access of FP64 data
342 views
Asked by AstrOne
How do nVIDIA CC 2.1 GPU warp schedulers issue 2 instructions at a time for a warp?
5.4k views
Asked by Ashwin Nanjappa
Do the threads in a CUDA warp execute in parallel on a multiprocessor?
4.7k views
Asked by kar
CUDA warp / block finalization
203 views
Asked by bubnikv
How does a GPU group threads into warps/wavefronts?
2.3k views
Asked by user0002128
Branch based on the WARP ID
437 views
Asked by AmirC