How to find the optimal batch size for ASR model inference on GPU


I trained a Whisper ASR model (763,857,920 parameters in total). My GPU has 8 GB of memory, and the model alone takes 4 GB of it. New data arrives at around 200k items per hour, so I need transcription to run as fast as possible, which means working out the maximum batch size that still fits in the remaining GPU memory.
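Something like the doubling probe below is what I have in mind for finding the largest batch that fits (just a sketch: `make_batch` is a hypothetical helper that builds a batch of input features, not real code from my pipeline, and catching `torch.cuda.OutOfMemoryError` needs PyTorch 1.13+):

```python
import torch

def find_max_batch_size(model, make_batch, limit=256):
    """Double the batch size until inference hits CUDA OOM, return the last size that fit."""
    best, n = 0, 1
    while n <= limit:
        try:
            with torch.inference_mode():   # inference only, no gradient buffers
                model(make_batch(n))
            best = n                       # this size fits, try double
            n *= 2
        except torch.cuda.OutOfMemoryError:
            torch.cuda.empty_cache()       # release the failed allocation
            break
    return best
```

Since the probe doubles, the true maximum may lie between `best` and `2 * best`; a binary search over that range would tighten it.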

I tried running a batch of 10 audio inputs and got a CUDA out-of-memory error. The data looked like this:

Chunk size       1.38 MB
Chunk length     9
GPU free         3777.94 MB
GPU utilization  1700.00 %
GPU memory       200.00 %
Error message    Tried to allocate 20.00 MiB
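For reference, the GPU-free figure is the kind of value `torch.cuda.mem_get_info` reports in PyTorch; a minimal sketch of such a check (not my exact logging code):

```python
import torch

# mem_get_info returns (free, total) in bytes for the current CUDA device
free_b, total_b = torch.cuda.mem_get_info()
print(f"GPU free: {free_b / 2**20:.2f} MB / total: {total_b / 2**20:.2f} MB")
```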