How to reduce CUDA context size (Multi-Process Service)


I followed Robert Crovella's example on how to use Nvidia's Multi-Process Service. According to docs:

2.1.2. Reduced on-GPU context storage

Without MPS, each CUDA process using a GPU allocates separate storage and scheduling resources on the GPU. In contrast, the MPS server allocates one copy of GPU storage and scheduling resources shared by all its clients.

I understood this to mean that each process's context size shrinks because the storage is shared. This would free up GPU memory and thus enable running more processes in parallel.
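For context, a minimal sketch of how the MPS server in the example is started and stopped (these are the standard `nvidia-cuda-mps-control` commands; the GPU index and any pipe-directory settings are assumptions that may differ on your system):

```shell
# Select the GPU(s) the MPS server will manage (index 0 assumed here)
export CUDA_VISIBLE_DEVICES=0

# Start the MPS control daemon in the background;
# CUDA processes launched afterwards connect to it as MPS clients
nvidia-cuda-mps-control -d

# ... run your CUDA processes here ...

# Shut the control daemon (and the MPS server) down when finished
echo quit | nvidia-cuda-mps-control
```

Note that the daemon typically needs to be started by the user owning the GPU (or root), and clients started before the daemon will not be MPS clients.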

Now, back to the example. Without MPS:

[screenshot: nvidia-smi output, MPS disabled]

And with MPS:

[screenshot: nvidia-smi output, MPS enabled]

Unfortunately, each process still takes virtually the same amount of memory (~300 MB). Doesn't this contradict the docs? Is there a way to decrease per-process memory consumption?


There are 2 answers

alex (best answer)

Oops, I asked too eagerly before checking the memory usage on the other (pre-Volta) card, and yes, there actually is a difference. Let me post it here for future reference in case anyone else stumbles on this problem:

MPS off:

[screenshot: nvidia-smi output, MPS disabled]

MPS on:

[screenshot: nvidia-smi output, MPS enabled]

Raz Rotenberg

Indeed, on the Volta architecture the client processes communicate directly with the GPU, without the MPS server in the middle. As the docs put it:

Volta MPS clients submit work directly to the GPU without passing through the MPS server.

This is visible in your first screenshot, where the t1034 processes themselves are listed as using the GPU.

By contrast, on pre-Volta architectures the client processes communicate with the GPU through the MPS server. That is why only the MPS server process appears as communicating with the GPU in the latter screenshot.
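You can check this distinction without reading the full nvidia-smi table by querying only the compute processes (a sketch using real `nvidia-smi` query flags; it requires an NVIDIA driver to run, and the exact process names shown are, of course, system-dependent):

```shell
# List the processes nvidia-smi attributes GPU compute usage to.
# On a pre-Volta GPU with MPS enabled, expect to see only the
# nvidia-cuda-mps-server process here; on Volta and later, each
# client PID is listed individually.
nvidia-smi --query-compute-apps=pid,process_name,used_memory --format=csv
```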