In stochastic gradient descent, in a single epoch, smaller batch sizes should give smaller errors since smaller batch size means more updates to the variables. However, in one of my recent experiments using pyTorch when I decrease the batch size (from 512 to 32) the loss after the first epoch increased. Is this possible? What does it say about the training process? Is gradient descent diverging? Or anything else?
Related Questions in PYTORCH
- Influence of Unused FFN on Model Accuracy in PyTorch
- Conda CMAKE CXX Compiler error while compiling Pytorch
- Which library can replace causal_conv1d in machine learning programming?
- yolo v5 export to torchscript: how to generate constants.pkl
- Pytorch distribute process across nodes and gpu
- My ICNN doesn't seem to work for any n_hidden
- a problem for save and load a pytorch model
- The meaning of an out_channel in nn.Conv2d pytorch
- config QConfig in pytorch QAT
- Can't load the saved model in PyTorch
- How can I convert a flax.linen.Module to a torch.nn.Module?
- Snuffle in PyTorch Dataloader
- Cuda out of Memory but I have no free space
- Can not load scripted model using torch::jit::load
- Should I train my model with a set of pictures as one input data or I need to crop to small one using Pytorch
Related Questions in GRADIENT-DESCENT
- Batch Gradient Descent algorithm in python is returning huge values
- Best way of finding KKT points for a Sympy polynomial
- Higher error with smaller batch size in pyTorch
- In Gradient Descent algorithm, how to induce -2*wx
- How to implement gradient op for a custom tensorflow op, for which the it is hard to derive a mathematical closed form formula for gradient?
- The return type of Dual Annealing OptimiseResult 'fun' is float64 and sometimes it is ndarray with single float64
- Gradient descent weights keep getting larger
- Problem with gradient descent least squares code
- How to Implement Full Batch Gradient Descent with Nesterov Momentum in PyTorch?
- Gradients not changing in co-ordinate descent for logistic regression
- Unable to find out the feature importance list from histgradientboosting classifier
- AttributeError: module 'tensorflow_addons.image' has no attribute 'gradients'
- solving Freudenstein and Roth test function using gradient armijo method
- Backpropagation and gradient descent with python
- finding the maximum of a function using jax
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
Popular Tags
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)