Loss function negative log likelihood giving loss despite perfect accuracy

Question

Loss function negative log likelihood giving loss despite perfect accuracy

746 views Asked by headache666 At 14 July 2020 at 01:18

I am debugging a sequence-to-sequence model and purposely tried to perfectly overfit a small dataset of ~200 samples (sentence pairs of length between 5-50). I am using negative log-likelihood loss in pytorch. I get low loss (~1e^-5), but the accuracy on the same dataset is only 33%.

I trained the model on 3 samples as well and obtained 100% accuracy, yet during training I had loss. I was under the impression that negative log-likelihood only gives loss (loss is in the same region of ~1e^-5) if there is a mismatch between predicted and target label?

Is a bug in my code likely?

Original Q&A

There are 1 answers

**Shai** · Answer 1 · 2020-07-14T04:53:09+00:00

There is no bug in your code.
The way things usually work in deep nets is that the networks predicts the logits (i.e., log-likelihoods). These logits are then transformed to probability using soft-max (or a sigmoid function). Cross-entropy is finally evaluated based on the predicted probabilities.
The advantage of this approach is that is numerically stable, and easy to train with. On the other side, because of the soft-max you can never have "perfect" 0/1 probabilities for your predictions: That is, even when your network has perfect accuracy it will never assign probability 1 to the correct prediction, but "close to one". As a result, the loss will always be positive (albeit small).

TechQA.

Loss function negative log likelihood giving loss despite perfect accuracy

There are 1 answers

Related Questions in NLP

Related Questions in PYTORCH

Related Questions in LOSS

Related Questions in LOG-LIKELIHOOD

Related Questions in SEQUENCE-TO-SEQUENCE

Popular Questions

Popular Tags

Trending Questions