List Question
20 TechQA 2024-03-28T02:21:36.570000Bank Conflict Issue in CUDA Shared Memory Access
56 views
Asked by Sunjnn
Nsight Compute Range Replay mode usage
40 views
Asked by 최보열
Nsight Compute can not non-interactive Profiler in Windows
46 views
Asked by Ether
How do I analyze register spills with Nsight Compute?
90 views
Asked by Marko Grdinić
use NCU with tensorRT, but got No kernels were profiled
173 views
Asked by wenxin li
CUDA math function register usage
209 views
Asked by Chris Uchytil
Roofline Model with CUDA Manual vs. Nsight Compute
356 views
Asked by Cherry Toska
Unbalanced Memory Read & Write in CUDA
119 views
Asked by Alex Chen
L2 Fabric cache hit rate of CUDA kernels on A100
187 views
Asked by Shulai
With the NSight Compute profiler, can I check cache hit rates for a specific region of memory?
306 views
Asked by einpoklum
Why is the Compute Throughput’s value different from the actual Performance / Peak Performance?
824 views
Asked by TherLF
Can I skip ahead to profile a specific invocation of a specific kernel?
380 views
Asked by einpoklum
ncu-ui won't run: Could not load the Qt platform plugin "xcb" in "" even though it was found
2.4k views
Asked by einpoklum
Nsight Compute says: "Profiling is not supported on this device" - why?
2.6k views
Asked by einpoklum
Filter on partial kernel name with Nsight Compute
870 views
Asked by John
Using ncu to profile pagefault in unified memory
656 views
Asked by Daniel
Which GPU execution dependencies have fixed latency (causing 'Wait' stalls)?
795 views
Asked by einpoklum
Shared memory loads not registered when using Tensor Cores
507 views
Asked by rm95
When does MIO Throttle stall happen?
1.5k views
Asked by rm95
What are the "long" and "short" scoreboards w.r.t. MIO/L1TEX?
3.7k views
Asked by einpoklum