I'm currently implementing an ETL pipeline using Databricks Delta Live Tables. I specified the storage location as a folder in ADLS. When I run the pipeline and look at the files, the .snappy.parquet files that get saved to ADLS appear to contain garbled Unicode characters when I open them. I am using very small CSV files (around 5 rows each) that don't have any null values or special characters. Has anyone run into this issue / does anyone know how to solve this?
What I've tried:
Saving to a different ADLS location
- This still resulted in corrupt files in ADLS
Reading the Delta Live Table into a spark dataframe, then writing to ADLS
- This still resulted in corrupt files in ADLS
Changing cluster configuration
- This resulted in an Azure quota exceeded error
When I tried to view the underlying files of the Delta table directly, I encountered the same issue: the contents appear as unreadable Unicode characters.
The data appears as Unicode characters because of how it is stored. According to this, the underlying data of a Delta table is stored in the compressed Parquet file format, i.e., as .snappy.parquet files.
As per this, Parquet is a binary (rather than text-based) file format optimized for machines, so Parquet files aren't directly human-readable. That is likely why the data appears as Unicode characters above. So, if we want to view the data of a .snappy.parquet file, we can read it in Databricks using the code below:
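A minimal PySpark sketch, assuming a Databricks notebook where `spark` is already defined; the abfss:// path and file name are placeholders, so substitute your own container, storage account, and part file:

```python
# Read a single .snappy.parquet part file from ADLS with PySpark.
# Placeholder path: replace <container>, <storage_account>, and the file name.
parquet_path = (
    "abfss://<container>@<storage_account>.dfs.core.windows.net/"
    "dlt/tables/my_table/part-00000-xxxx.snappy.parquet"
)

df = spark.read.parquet(parquet_path)
df.show(truncate=False)  # rows are readable once Spark decodes the binary Parquet format
```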
Then we can view the data of the Delta table as shown below:
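A hedged example of reading the whole Delta table from the pipeline's storage location (again with a placeholder abfss:// path); Spark resolves the Delta transaction log and decodes the Parquet files for you:

```python
# Read the Delta table from its storage location.
# Point the placeholder path at the table folder, not at a single parquet file.
delta_path = (
    "abfss://<container>@<storage_account>.dfs.core.windows.net/dlt/tables/my_table"
)

delta_df = spark.read.format("delta").load(delta_path)
delta_df.show(truncate=False)

# Or, if the DLT pipeline registered the table in the metastore (table name is a placeholder):
# spark.sql("SELECT * FROM my_catalog.my_schema.my_table").show()
```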
Alternatively, read the file using Parquet reading tools or upload it to an online Parquet viewer as shown below:
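For a quick local check outside Databricks, a small sketch using pandas (which delegates to pyarrow or fastparquet for decoding); the file name is a placeholder for a part file downloaded from ADLS:

```python
import pandas as pd

# Placeholder file name: download one of the .snappy.parquet part files from ADLS first.
local_file = "part-00000-xxxx.snappy.parquet"

# pandas uses a Parquet engine (pyarrow/fastparquet) to decode the binary format.
df = pd.read_parquet(local_file)
print(df.head())
```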