List Question
20 TechQA 2017-08-30T09:45:49.213000NVCC register usage report in __device__ function
2.5k views
Asked by Christopher23
CUDA: Erroneous lmem statistics displayed for sm_20?
812 views
Asked by Ashwin Nanjappa
encountering ptxas.exe warning while trying to run tensorflow with GPU
145 views
Asked by Keren Ben-yehuda
NVCC separate compilation with PTX output
4.1k views
Asked by jozxyqk
What does the --abi-compile=yes option of CUDA ptxas do (which costs registers)?
871 views
Asked by einpoklum
OpenCL including header causes ptxas fatal: Unresolved extern function
127 views
Asked by TheINCGI
How to overcome Stack size warning?
310 views
Asked by unegare
How can I disable the ptxas warning about indeterminable stack size?
823 views
Asked by einpoklum
Why does nvcc refuse to link this simple cooperative-groups program?
178 views
Asked by einpoklum
How to update ptxas (nvidia toolchain) on google compute engine
1.6k views
Asked by Gary Aitken
Setting 32 bit address size in inline PTX
780 views
Asked by Roger Dahl
Debugging inline PTX in Parallel Nsight
726 views
Asked by Roger Dahl
Avoiding unnecessary mov operations in inline PTX
494 views
Asked by Roger Dahl
Interpreting the verbose output of ptxas, part I
8k views
Asked by curiousexplorer
What is the correct way to support `__shfl()` and `__shfl_sync()` instructions?
1.1k views
Asked by Blizzard
How can I implement a custom atomic function involving several variables?
3.6k views
Asked by Doug
compiling ptx code on NVIDIA GPU?
1k views
Asked by Zk1001
Extra register usage with if
202 views
Asked by gmemon
CUDA ptxas Error "function uses too much shared data"
6.6k views
Asked by Geru
Function properties for __internal_trig_reduction_slowpathd
303 views
Asked by raspiede