Extracting total price from a shopping bill

Question

Extracting total price from a shopping bill

426 views Asked by Krishna Jayachandran At 19 December 2016 at 03:15

I am working on an application where I need to get the net price displayed in any shopping bill from its picture. I have already retrieved the editable text from the bill images using "tesseract ocr" API. Now I need to print only the "grand total amount" from the text. How do I extract only that part( total price) from a whole bill having the item name, quantity and price?

Original Q&A

There are 1 answers

**Pang Ho Ming** · Answer 1 · 2016-12-19T03:31:01+00:00

Short answer, I don't think there is a quick/handy method you can call directly.

You need to look into the .hocr file returned from Tesseract(You can google hocr for more info first). The .hocr includes all the bounding box of the text(x, y, width, height, language etc.) then make use of these values, you can determine if words are on the same line (The word 'Total' and the total amount are very likely printed on the same line).

From here you can shortlist the words, add some logical operations (maybe remove all characters/words), then you can get the total value.

ps: My company is working on a similar stuff, but we decided not to use Tesseract, as it is kind of slow and not easy to train (we're dealing with receipts in several languages). We are using Google Vision API.

Hope my answer helps :D

TechQA.

Extracting total price from a shopping bill

There are 1 answers

Related Questions in ALGORITHM

Related Questions in OCR

Related Questions in TESSERACT

Related Questions in IMAGE-RECOGNITION

Popular Questions

Popular Tags

Trending Questions