Bug report nltk.translate.bleu_score stopped working on tokens less than or equal to 3

24 views Asked by ShoutOutAndCalculate At 19 November 2023 at 21:21

Plat form Windows 11 Anaconda

import nltk as nltk
nltk.__version__
'3.8.1'

The sentence_bleu ought to return 1 for identical translation

from nltk.translate.bleu_score import sentence_bleu, SmoothingFunction
sentence_bleu([["hi", "hello", "world"]], ["hi", "hello", "world"])
1.2213386697554703e-77

and even with a smooth function it could not help much

smoother = SmoothingFunction()
sentence_bleu([["hi", "hello", "world"]], ["hi", "hello", "world"], smoothing_function=smoother.method4)
0.5757197301274735

However,

sentence_bleu([["hi", "hello", "world", "how"]], ["hi", "hello", "world", "how"])
1.0

This appeared to be a bug in case handling or the summation index.

Original Q&A

TechQA.

Bug report nltk.translate.bleu_score stopped working on tokens less than or equal to 3

There are 0 answers

Related Questions in NLTK

Related Questions in BUG-REPORTING

Related Questions in BLEU

Popular Questions

Popular Tags

Trending Questions