Fine grained Kernel scheduling with MPS

Question


Asked by Ubaid Ullah Hafeez on 21 October 2021 at 13:48 · 212 views

I am using the NVIDIA CUDA Multi-Process Service (MPS) to run multiple TensorFlow inference jobs on the same GPU. In my use case, when the GPU is shared by more than one process, I sometimes need to prioritize the execution of kernels from one process over those of another. Is this supported?

To explain the problem in more detail, consider an example in which two processes, p1 and p2 (each with just one kernel execution stream), share a GPU.

Scenario: there are one or more kernels in the ready queue for both p1 and p2.

Default MPS behavior (my understanding):

If there are enough resources, execute multiple kernels from both p1 and p2 at the same time.

Desired behavior: the ability to decide, based on priority, whether to:

Execute the kernel of p1 first, then that of p2.
Execute the kernel of p2 first, then that of p1.
If there are enough resources, execute multiple kernels from both p1 and p2 at the same time.
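
To make the desired policy concrete, here is a minimal sketch of it as a toy model in plain Python. It is not an MPS or CUDA API and does not launch anything on a GPU. The names `Kernel`, `schedule`, `demand`, and `capacity` are hypothetical, and the convention that a lower `priority` value means more urgent is an assumption made for illustration only.

```python
from dataclasses import dataclass


@dataclass
class Kernel:
    process: str   # owning process, e.g. "p1" or "p2"
    priority: int  # assumed convention: lower value = more urgent
    demand: int    # hypothetical resource units the kernel needs


def schedule(ready, capacity):
    """Group ready kernels into launch waves.

    Kernels in the same wave would run concurrently; waves run one
    after another. A kernel never overtakes a more urgent one.
    """
    # sorted() is stable, so kernels with equal priority keep arrival order.
    pending = sorted(ready, key=lambda k: k.priority)
    waves = []
    while pending:
        # The most urgent kernel always launches, even if it is oversized.
        wave = [pending.pop(0)]
        free = capacity - wave[0].demand
        # Co-schedule the next kernels only while they still fit.
        while pending and pending[0].demand <= free:
            k = pending.pop(0)
            free -= k.demand
            wave.append(k)
        waves.append(wave)
    return waves


# Example: p1 is more urgent and the two kernels do not fit together.
ready = [Kernel("p2", 1, 50), Kernel("p1", 0, 80)]
for wave in schedule(ready, capacity=100):
    print([k.process for k in wave])
```

With these inputs the p1 kernel goes first and the p2 kernel follows. If both kernels fit within `capacity`, they land in the same wave, which corresponds to the third case above.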

If this kind of customized scheduling is not supported, it would be great if someone could explain what code changes would be needed to make it work.

Original Q&A

There is 1 answer

**talonmies** · Accepted Answer · 2021-10-21T14:14:46+00:00


> I sometimes need to prioritize execution of kernels of one process over the other. Is this supported?

No, it is not.
