Can I tell nvcc to apply #pragma unroll to all loops in a function?

Question

Can I tell nvcc to apply #pragma unroll to all loops in a function?

5.8k views Asked by einpoklum At 18 December 2013 at 10:14

I have a CUDA kernel with a bunch of loops I want to unroll. Right now I do:

void mykernel(int* in, int* out, int baz) {    
    #pragma unroll
    for(int i = 0; i < 4; i++) {
        foo();
    }
    /* ... */
    #pragma unroll
    for(int i = 0; i < 6; i++) {
        bar();
    }
}

et cetera. I want to tell (hint at) my C/C++ compiler to unroll all of these loops, without needing a separate hint for each loop. However, I don't want to unroll all loops in all code in the file, just in this function.

If this were GCC, I could do:

__attribute__((optimize("unroll-loops")))
void mykernel(int* in, int* out, int baz) {    
    for(int i = 0; i < 4; i++) {
        foo();
    }
    /* ... */
    for(int i = 0; i < 6; i++) {
        bar();
    }
}

Or use option pushing-and-popping. Is there something equivalent I can do with CUDA?

Original Q&A

There are 1 answers

**Roger Dahl** · Accepted Answer · 2013-12-18T16:05:17+00:00

#pragma unroll is the only mechanism for requesting unrolling that is documented in the CUDA C Programming Guide 5.5, and it must be specified before each loop. But the compiler unrolls all "small loops with a known trip count" by default, so you may not need the unroll directives in your first example.

I don't think controlling unrolling at the function level would be all that useful. You should probably initially rely on the compiler to select the best amount of unrolling and then tweak each loop separately if profiling indicates that it could help.

TechQA.

Can I tell nvcc to apply #pragma unroll to all loops in a function?

There are 1 answers

Related Questions in C++

Related Questions in OPTIMIZATION

Related Questions in CUDA

Related Questions in COMPILER-DIRECTIVES

Related Questions in LOOP-UNROLLING

Popular Questions

Trending Questions