Is it possible to use wildcard in Synapse Notebook?

103 views Asked by At

I want to read parquet files in Synapse Notebook. I tried it using wildcard but the "FileNotFoundError" occurred. The folder structure I want to read is like this. test/year={yyyy}/month={MM}/day={dd}/*.parquet And the code executed like this. df = pd.read_parquet('abfss://[email protected]/test/*/*/*/*.parquet', storage_options = '')

enter image description here

Any answer would be helped. Thank you.

1

There are 1 answers

0
Aswin On

The wildcard character (*) is not supported in the path of the pd.read_parquet() function. It takes * as the absolute filepath and tries to read the file. That is the reason for File not found error. To read all the files under specified folder, you can use spark.read.parquet function.

Code:

df = spark.read.parquet('abfss://[email protected]/test/*/*/*/*')
df.show()

enter image description here