With the iOS Vision framework, I am able to perform OCR and fetch the recognized text from an image using VNRecognizedTextObservation.
Now let's say I have an image that contains a paragraph of text along with a table. The table has many columns, each with its associated rows (refer to the image below). Is it possible to recognize a particular column's key and values from the table using Vision?
For example, I want to fetch only the 2014 Retail Sales numbers from the image below using Vision. How can I do this? Can we use both Vision and CoreML for this?
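Yes, this is possible using Vision.
You can detect the bounding box of the column's key (its header text) and extend the height of that bounding box downward so it covers the rest of the column. The text observations that fall inside the extended box are that column's values.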
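The idea above can be sketched roughly as follows. This is only a minimal sketch, not a tested solution for your image: the function names columnRegion(below:) and columnValues(in:headerText:) are made up for illustration, and it assumes Vision returns the header and each cell as separate observations (it may merge neighbouring cells on the same row into one observation, in which case the filter needs tuning). Vision bounding boxes are normalized to 0...1 with the origin at the lower-left corner, so "below the header" means smaller y values.

```swift
import Vision
import CoreGraphics

/// Hypothetical helper: stretches a header's bounding box down to the bottom
/// edge of the image. Coordinates are Vision's normalized ones (lower-left origin).
func columnRegion(below header: CGRect) -> CGRect {
    return CGRect(x: header.minX, y: 0, width: header.width, height: header.minY)
}

/// Hypothetical helper: returns the strings recognized underneath the first
/// observation whose text contains `headerText`, ordered top to bottom.
func columnValues(in cgImage: CGImage, headerText: String) throws -> [String] {
    let request = VNRecognizeTextRequest()
    request.recognitionLevel = .accurate

    let handler = VNImageRequestHandler(cgImage: cgImage, options: [:])
    try handler.perform([request])

    let observations = (request.results as? [VNRecognizedTextObservation]) ?? []

    // Locate the column's key, e.g. "2014".
    guard let header = observations.first(where: {
        $0.topCandidates(1).first?.string.contains(headerText) == true
    }) else {
        return []
    }

    // Extend the header's box downward and keep whatever overlaps it.
    let region = columnRegion(below: header.boundingBox)
    return observations
        .filter { $0 !== header && $0.boundingBox.intersects(region) }
        .sorted { $0.boundingBox.midY > $1.boundingBox.midY } // larger y is higher up
        .compactMap { $0.topCandidates(1).first?.string }
}
```

You would call it with something like try columnValues(in: image, headerText: "2014"). If the table is followed by more text further down the page, you would probably want to cap the region's lower edge instead of extending it all the way to y = 0.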