Extracting both tables and normal text from a PDF file


I want to extract text from a PDF that contains both normal text and tables, so that the tables are extracted by Camelot and the normal text is extracted by PyPDF2.

I tried extracting the tables with Camelot, but sometimes it also picks up text that lies outside the table and treats it as a table with a single column.

So, how can I restrict Camelot to extracting only the tables, and PyPDF2 to extracting only the text outside the tables?
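To make the goal concrete, here is a rough sketch of the kind of split I have in mind. It is not a verified solution. It rests on several assumptions: that single-column results from Camelot can be discarded as false positives, that Camelot's private `_bbox` attribute holds the table's bounding box in PDF points, that PyPDF2 is recent enough to accept the `visitor_text` argument of `extract_text`, and that the text matrix values `tm[4]` and `tm[5]` give usable page coordinates (which may not hold for PDFs with transformed content). The helper names `looks_like_table`, `inside_any` and `split_page` are made up for illustration.

```python
from typing import Iterable, Tuple

# (x1, y1, x2, y2) in PDF points, origin at the bottom-left of the page
BBox = Tuple[float, float, float, float]


def looks_like_table(n_rows: int, n_cols: int, min_cols: int = 2) -> bool:
    """Heuristic (assumption): a one-column 'table' is probably plain text."""
    return n_rows >= 1 and n_cols >= min_cols


def inside_any(x: float, y: float, boxes: Iterable[BBox]) -> bool:
    """True if the point (x, y) lies inside at least one bounding box."""
    return any(x1 <= x <= x2 and y1 <= y <= y2 for x1, y1, x2, y2 in boxes)


def split_page(pdf_path: str, page_number: int = 1):
    """Return (list of table DataFrames, text outside those tables) for one page."""
    # Imported here so the helpers above work without the libraries installed.
    import camelot
    from PyPDF2 import PdfReader

    tables = camelot.read_pdf(pdf_path, pages=str(page_number))
    kept = [t for t in tables if looks_like_table(*t.df.shape)]

    # _bbox is a private, undocumented attribute; treat it as an assumption.
    boxes = [tuple(t._bbox) for t in kept]

    parts = []

    def visitor(text, cm, tm, font_dict, font_size):
        # tm[4], tm[5] are taken as the x, y position of the text fragment.
        if text and not inside_any(tm[4], tm[5], boxes):
            parts.append(text)

    page = PdfReader(pdf_path).pages[page_number - 1]
    page.extract_text(visitor_text=visitor)
    return [t.df for t in kept], "".join(parts)
```

Calling `split_page("report.pdf", 1)` (the file name is a placeholder) would return the kept tables and the remaining text for page 1. The column-count filter could also be swapped for a threshold on `table.parsing_report["accuracy"]`, but I do not know which is more reliable.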
