Tabula missing last columns - any idea?

70 views Asked by At

any help would be appreciated;

I'm reading the PDF, exporting to csv and loading using Pandas. I could not read the correct way using pdf instead.

However, it is missing the last column

"Quantidade"

Create csv file

tabula.convert_into("https://up2dataweb.blob.core.windows.net/bdi/BDI_03-2_20231124.pdf", "outputopcoes.csv", \
                    output_format="csv", pages=291)

reading csv

df = pd.read_csv('outputopcoes.csv')

df = df.drop(columns=['Unnamed: 10']).rename(columns=df.iloc[0]).iloc[2:]

df

Any idea? Thanks!!

0

There are 0 answers