Higher error with smaller batch size in PyTorch


In stochastic gradient descent, within a single epoch, a smaller batch size should give a lower error, since a smaller batch size means more parameter updates per epoch. However, in a recent experiment using PyTorch, when I decreased the batch size from 512 to 32, the loss after the first epoch increased. Is this possible? What does it say about the training process? Is gradient descent diverging, or is something else going on?
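For concreteness, here is a minimal sketch of the kind of comparison described, using an assumed small MLP on random data with plain SGD and a fixed learning rate; the model, data, and hyperparameters are illustrative placeholders, not the actual experiment:

```python
import torch
import torch.nn as nn
from torch.utils.data import DataLoader, TensorDataset

# Hypothetical data and model, only to illustrate the comparison.
torch.manual_seed(0)
X = torch.randn(10_000, 20)
y = torch.randn(10_000, 1)
dataset = TensorDataset(X, y)

def first_epoch_loss(batch_size):
    torch.manual_seed(0)  # same initialization for a fair comparison
    model = nn.Sequential(nn.Linear(20, 64), nn.ReLU(), nn.Linear(64, 1))
    optimizer = torch.optim.SGD(model.parameters(), lr=0.01)
    loss_fn = nn.MSELoss()
    loader = DataLoader(dataset, batch_size=batch_size, shuffle=True)

    total, n = 0.0, 0
    for xb, yb in loader:  # one pass over the data = one epoch
        optimizer.zero_grad()
        loss = loss_fn(model(xb), yb)
        loss.backward()
        optimizer.step()
        total += loss.item() * xb.size(0)  # accumulate sample-weighted loss
        n += xb.size(0)
    return total / n  # average training loss over the first epoch

for bs in (512, 32):
    print(f"batch_size={bs}: epoch-1 avg loss {first_epoch_loss(bs):.4f}")
```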
