Partition data while writing to delta sink

Question

Partition data while writing to delta sink

886 views Asked by Shadman Sadekeen At 10 October 2021 at 07:06

In Azure mapping dataflow we now have option to save files in delta format. But that is only available when we select inline dataset (without data bricks subscription). And when the sink dataset is inline dataset, it does not allow to set partition based on any column.

I can write pyspark code to rewrite the delta table with required partition. But that would incur additional cost.

What could be work arounds for getting good performance on delta data?

Original Q&A

There are 1 answers

**Satya V** · Answer 1 · 2021-10-28T03:18:59+00:00

There was a UI issue that was recently fixed by the engineering team. Until this reflects at your end.

You could do the following as a workaround :

Option 1 :

You can change the type of sink to something else, like a delimited text sink, and you should then see the key columns in Key partitioning. Then, switch the Sink type back to Delta.

Reference : https://learn.microsoft.com/en-us/answers/questions/599075/index.html

Option 2: You could enable the partitioning at the source end.

The partitioned data was flowing as a stream. I was able to achieve the partitioned data as a result

TechQA.

Partition data while writing to delta sink

There are 1 answers

Related Questions in AZURE-SYNAPSE

Related Questions in DATA-PARTITIONING

Related Questions in DELTA-INDEX

Popular Questions

Popular Tags

Trending Questions