Tesseract identifies a 0 as a Q

Question

497 views · Asked by Vincent Roye on 19 December 2013 at 08:23

I am using Tesseract OCR to extract an exclusively numeric string from a PDF file. The PDF contains: 66600O3377.pdf, but Tesseract recognizes: 66600Q3377.pdf

The input is a TIFF file, and the quality is good enough (see the screenshot).

Is there a way to improve Tesseract's accuracy? I could always replace the Q with a 0, but I'm afraid of further unexpected mistakes.

[Screenshot of the input TIFF]

There is 1 answer

**mvp** · Accepted Answer · 2013-12-19T08:37:11+00:00

This is in the Tesseract FAQ:

Run a tesseract command like this to permit only digits in the input image:

tesseract imagename outputbase digits
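
For anyone scripting this, here is a minimal Python sketch that wraps the command above and also checks the result, since the question mentions worrying about other unexpected mistakes. The helper names (`build_digits_command`, `is_all_digits`, `ocr_digits`) are made up for illustration. It assumes the `tesseract` executable is on the PATH, that the `digits` config file ships with your install, that the image holds a single numeric string, and that the output lands in `outputbase.txt`.

```python
import shutil
import subprocess
from pathlib import Path


def build_digits_command(image_path, output_base):
    """Return the argv for the FAQ command: tesseract imagename outputbase digits."""
    return ["tesseract", image_path, output_base, "digits"]


def is_all_digits(text):
    """True only if the text, ignoring surrounding whitespace, is made of ASCII digits 0-9."""
    stripped = text.strip()
    return stripped.isascii() and stripped.isdigit()


def ocr_digits(image_path, output_base):
    """Run tesseract in digits-only mode and return the recognized string.

    Hypothetical helper: raises if tesseract is missing or if the output
    still contains anything other than digits.
    """
    if shutil.which("tesseract") is None:
        raise RuntimeError("tesseract executable not found on PATH")
    subprocess.run(build_digits_command(image_path, output_base), check=True)
    # Assumption: tesseract writes its text output to <outputbase>.txt
    text = Path(output_base + ".txt").read_text(encoding="utf-8").strip()
    if not is_all_digits(text):
        raise ValueError("unexpected non-digit characters in OCR output: %r" % text)
    return text
```

Calling `ocr_digits("scan.tif", "out")` would then either return the numeric string or fail loudly instead of silently passing along a stray letter. The file names here are placeholders.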
