I have been working on extracting text from images, specifically focusing on seven-segment fonts, using .NET. Unfortunately, my attempts with popular libraries like Tesseract, IronOcr and many more have been unsuccessful, as they seem to excel with normal English fonts.
Here's a brief overview of my tries so far:
- Tesseract (Limited to normal English fonts, unable to recognize seven-segment characters)
- IronOcr (Similar limitations, not suitable for seven-segment fonts)
- Leadtools
- pretrained models
- custom trained models
- some matlab and python projects from internet
- some free OCR Api's
Despite these efforts, I'm facing challenges in accurately extracting text from images with seven-segment fonts.
Link to Images which is to be recognised: https://drive.google.com/drive/folders/1b4S-UQbxaXZPbDfOkTiC1m0qTIt6fdb7
Additionally, I've experimented with image processing techniques, including:
Cropping and zooming to the text region. Applying gray, black and white, and binarization filters.