Is it possible to split a PDF with pdfminer based on straight horizontal lines?


I have a PDF with multiple tables, and I haven't been able to extract them with Tabula.

I noticed that every table has a straight horizontal line at the top and another at the bottom, so I'd like to extract the content between each pair of these lines. Later I can split that content up and organize it into the tables I need.

Is this possible?
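
To make the idea concrete, here is a rough sketch of what I have in mind, not a tested solution. It assumes pdfminer.six, and it assumes the table borders are stored as vector graphics (`LTLine`, `LTRect` or `LTCurve` objects) at the top level of each page rather than as part of an image. The names `split_by_rules` and `page_bands` and the thresholds `max_thickness` and `min_width` are made up for illustration.

```python
from bisect import bisect_left


def split_by_rules(rule_ys, items):
    """Group (y, text) items into the bands between consecutive horizontal rules.

    rule_ys: y coordinates of the horizontal lines found on one page.
    items:   (y, text) pairs, where y is the vertical position of a text box.
    PDF coordinates grow upward, so a larger y is higher on the page.
    """
    ys = sorted(set(rule_ys))
    bands = [[] for _ in range(max(len(ys) - 1, 0))]
    for y, text in sorted(items, key=lambda it: -it[0]):  # top to bottom
        i = bisect_left(ys, y)
        if 0 < i < len(ys):  # text lies between two rules; otherwise skip it
            bands[i - 1].append(text)
    return bands[::-1]  # topmost band first


def page_bands(pdf_path, max_thickness=1.0, min_width=50.0):
    """Yield, for each page, the text grouped by the bands between rules.

    max_thickness and min_width are guessed thresholds (in PDF points) for
    deciding that a drawn shape is a horizontal rule; tune them per document.
    """
    # Imported here so split_by_rules can be used without pdfminer.six installed.
    from pdfminer.high_level import extract_pages
    from pdfminer.layout import LTCurve, LTTextContainer

    for page in extract_pages(pdf_path):
        rule_ys, items = [], []
        for el in page:
            if isinstance(el, LTCurve):  # LTLine and LTRect subclass LTCurve
                if el.height <= max_thickness and el.width >= min_width:
                    rule_ys.append((el.y0 + el.y1) / 2)
            elif isinstance(el, LTTextContainer):
                items.append(((el.y0 + el.y1) / 2, el.get_text().strip()))
        yield split_by_rules(rule_ys, items)
```

With this, `for bands in page_bands("my.pdf"): ...` would give one list of text chunks per band on each page (`"my.pdf"` is a placeholder). Every gap between two consecutive lines counts as a band, so the text between the bottom line of one table and the top line of the next would show up as its own band and would have to be filtered out afterwards. Lines nested inside an `LTFigure` would not be detected by this loop.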

By the way, Tabula raises an error saying it can't read my document: with 'utf-8', 'latin1' and 'ISO' it fails to read some character.
