Is there a way to maintain leading zeros in cells while still using the tabula-py convert_into function? Perhaps by passing something into the 'options' parameter to read them as strings? The documentation didn't seem very clear on what could be used there, but maybe I missed something (Source docs)
# Convert PDF file to csv doc
convert_into(source_path, csv_path, output_format="csv", pages='1-2', stream=True)
If I open the csv in something other than Excel, and actually see the leading zeros still there. The solution that seems to work is to update the pandas.read_csv() function by passing dtype=str like so: