I have a single dataloader feeding data to 4 models, each with a different hyperparameter, loaded on a separate GPU. I want to reduce the bottleneck caused by data loading, so I intend to load the same batch prepared by the dataloader onto all GPUs so that each model can compute and perform a backprop step on it. I already cache the data into RAM to avoid disk bottlenecks when the dataloader is instantiated.
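For reference, a minimal placeholder version of the setup (the model, data, and hyperparameters below are stand-ins, not my real code):

```python
import torch
import torch.nn as nn
from torch.utils.data import DataLoader, TensorDataset

N_GPUS = 4
learning_rates = [1e-3, 3e-4, 1e-4, 3e-5]   # one hyperparameter per model

def make_model():
    # Placeholder architecture; the real model is irrelevant to the question.
    return nn.Sequential(nn.Linear(128, 64), nn.ReLU(), nn.Linear(64, 10))

models = [make_model().to(f"cuda:{i}") for i in range(N_GPUS)]
optimizers = [torch.optim.SGD(m.parameters(), lr=lr)
              for m, lr in zip(models, learning_rates)]

# Data is already cached in RAM as tensors, so the loader is not disk-bound.
x_cached = torch.randn(10_000, 128)
y_cached = torch.randint(0, 10, (10_000,))
loader = DataLoader(TensorDataset(x_cached, y_cached),
                    batch_size=256, num_workers=4, pin_memory=True)
```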
I am trying to:
- Send/broadcast the same batch of data to N GPUs. I assume this is only possible if we can sync/wait for all GPUs to finish their ops on one batch before proceeding to the next one (see the sketch after this list).
- Bonus: prefetching the next batch as soon as one batch is ready (up to P batches ahead) could help ensure a continuous flow of data to the GPUs and avoid the wait.
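Concretely, the per-batch loop I have in mind looks something like the sketch below (the per-device copy streams are my assumption about how to overlap the copies; `models`, `optimizers`, and `loader` are the placeholders from the setup above). The same pinned CPU batch is copied to every GPU with `non_blocking=True`, each model trains on its own copy, and `torch.cuda.synchronize` acts as the per-batch barrier:

```python
import torch
import torch.nn.functional as F

# One side stream per device, used only for host-to-device copies.
copy_streams = [torch.cuda.Stream(device=i) for i in range(N_GPUS)]

for x_cpu, y_cpu in loader:          # loader uses pin_memory=True
    # 1. Broadcast: enqueue the same CPU batch onto every GPU.
    batches = []
    for i, stream in enumerate(copy_streams):
        with torch.cuda.stream(stream):
            batches.append((x_cpu.to(f"cuda:{i}", non_blocking=True),
                            y_cpu.to(f"cuda:{i}", non_blocking=True)))

    # 2. Each model trains on its own copy of the batch. Kernels are launched
    #    sequentially from this one Python thread but run asynchronously,
    #    so the GPUs should overlap.
    for i, (model, opt) in enumerate(zip(models, optimizers)):
        torch.cuda.current_stream(i).wait_stream(copy_streams[i])
        x, y = batches[i]
        opt.zero_grad()
        loss = F.cross_entropy(model(x), y)
        loss.backward()
        opt.step()

    # 3. Sync: wait for every GPU to finish this batch before the next one.
    for i in range(N_GPUS):
        torch.cuda.synchronize(i)
```

For the bonus point, I assume the DataLoader's own `num_workers`/`prefetch_factor` already keeps a few batches in flight on the CPU side, but I'm not sure that is enough when all the copies and the four training steps are issued from a single process.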
I am not trying to achieve:
- Data Parallelism - split a large batch into N parts and compute each part on one GPU
- Model Parallelism - split the computation of a large model (that won't fit on one GPU) into N (or fewer) parts and place each part on one GPU
Similar questions:
- This one is about making a Conv2D operation span across multiple GPUs
- This one is about executing different GPU computations in parallel, but I don't know if my problem can be solved with torch.cuda.Stream()
- This one is about loading different models, but it does not deal with sharing the same batch.
- This one is exactly about what I'm asking, but it's CUDA/PCIe and from 7 years ago.
Update:
I found a very similar question on the PyTorch discuss forum with a small example at the end that runs the forward prop via multiprocessing, but I'm wondering how to scale this approach to torch dataloaders.
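To make the question more concrete, this is the kind of scaling I am imagining but have not verified: one process owns the DataLoader and pushes every CPU batch into a per-GPU queue (whose `maxsize` would play the role of the prefetch depth P), while each worker process owns one model/GPU and trains on every batch it receives. The queue-based hand-off and all names below are my assumptions, not taken from the linked example:

```python
import torch
import torch.multiprocessing as mp
import torch.nn as nn
import torch.nn.functional as F
from torch.utils.data import DataLoader, TensorDataset

def worker(rank, lr, queue):
    device = f"cuda:{rank}"
    model = nn.Sequential(nn.Linear(128, 64), nn.ReLU(), nn.Linear(64, 10)).to(device)
    opt = torch.optim.SGD(model.parameters(), lr=lr)
    while True:
        item = queue.get()
        if item is None:                      # sentinel: no more batches
            break
        x, y = item
        x, y = x.to(device), y.to(device)
        opt.zero_grad()
        loss = F.cross_entropy(model(x), y)
        loss.backward()
        opt.step()

if __name__ == "__main__":
    mp.set_start_method("spawn")
    n_gpus = 4
    lrs = [1e-3, 3e-4, 1e-4, 3e-5]
    # maxsize acts as the prefetch depth P per GPU.
    queues = [mp.Queue(maxsize=4) for _ in range(n_gpus)]
    procs = [mp.Process(target=worker, args=(i, lrs[i], queues[i]))
             for i in range(n_gpus)]
    for p in procs:
        p.start()

    x_cached = torch.randn(10_000, 128)
    y_cached = torch.randint(0, 10, (10_000,))
    loader = DataLoader(TensorDataset(x_cached, y_cached),
                        batch_size=256, num_workers=2)

    for batch in loader:
        for q in queues:                      # every GPU gets the same batch
            q.put(batch)
    for q in queues:
        q.put(None)
    for p in procs:
        p.join()
```

Is something along these lines the right way to combine a single torch DataLoader with N independent training processes, or is there a more idiomatic mechanism for broadcasting the same batch to several GPUs?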