Too much fluctuation in F1 Score curve during meta training with MAML

Question

Too much fluctuation in F1 Score curve during meta training with MAML

139 views Asked by The Exile At 27 December 2024 at 19:23

I am training VGG11 on a custom image dataset for 3-way 5-shot image classification using MAML from learn2learn. I am encapsulating the whole VGG11 model with MAML, i.e., not just the classification head. My hyperparameters are as follows:

Meta LR: 0.001
Fast LR: 0.5
Adaptation steps: 1
First order: False
Meta Batch Size: 5
Optimizer: AdamW

During the training, I noticed that after taking the first outer-loop optimization step, i.e., AdamW.step(), loss skyrockets to very large values, like ten thousands. Is this normal? Also, I am measuring the micro F1 score as accuracy metric of which curve for meta training/validation is as follows:

It is fluctuating too much in my opinion, is this normal? What could be the reason of this? Thanks

Original Q&A

There are 1 answers

**The Exile** · Accepted Answer · 2022-12-27T10:06:17+00:00

The Exile On 27 December 2022 at 10:06 BEST ANSWER

I figured it out. I was using VGG11 with vanilla BatchNorm layers from PyTorch which was not working properly in meta training setup. I removed BatchNorm layers and now it works as expected.

TechQA.

Too much fluctuation in F1 Score curve during meta training with MAML

There are 1 answers

Related Questions in DEEP-LEARNING

Related Questions in PYTORCH

Related Questions in META-LEARNING

Related Questions in FEW-SHOT-LEARNING

Related Questions in LEARN2LEARN

Popular Questions

Popular Tags

Trending Questions