How Can I evaluate WER (Word Error Rate) in ASR ( Automatic Speech Recognition)?
For Example, If I have (Human Ref. Translation) for the sentences and Output of ASR.
I know the equation but I do not know how to calculate it. Do I enter punctuation marks such as a comma and full stop and so on in the calculation of WER?
As well as for sub, ins, and del of words. Is there a specific weight? Each of them when calculated in the equation?
Would anyone who knows about how can we calculate WER for ASR.. please give me an example so I can calculate WER for ASR in multiple sentences that I have in my app
You strip punctuation before calculation and convert everything to lowercase.
You can use this Python package:
https://pypi.org/project/jiwer/
if you need other languages, let us know which ones.