Flink Stream processing handling partial failures and avoid reprocessing

Question


82 views Asked by javafan At 03 November 2023 at 00:24

I have a Flink stream processing application that reads a stream of messages from a Pulsar topic, processes them, and stores the output as a file in S3. It performs the following operations:

1. Read messages from the Pulsar topic every 30 seconds with a TumblingWindow.
2. KeyBy to divide the stream processing based on key.
3. Process the messages and store the result in S3.
4. Notify the downstream application.

The happy path works very well. The problems start with partial failures and recovery.

Step 2 can create multiple different streams, because the stream contains different keys.

Checkpoint 1 is triggered.
Stream 1 (key 1): processing succeeds.
Stream 2 (key 2): processing fails for some reason at step 3 or 4 above.
Checkpoint 2 completes.

If I throw an exception when stream 2 fails, it fails the whole job, which then reprocesses from checkpoint 1. In that case stream 1 is reprocessed as well, which should not happen.

Is there a way in Flink to manually avoid acknowledging only the failed messages on the Pulsar topic, or to process only the failed records after a restart? My requirement is to avoid duplicate processing and to reprocess only the failed records.

I read that savepoints can be one of the solutions, but I did not find any concrete example.

Appreciate your help!!


There are 2 answers

**kkrugler** · Answer 1 · 2023-11-04T16:21:26+00:00


The short answer is "no". Flink tracks source offsets and sink transactions (plus operator state) in order to support efficient exactly-once processing.
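To make the replay behaviour concrete, here is a toy model in plain Java. It is not Flink API: the class name, the `run` method and its parameters are all made up for illustration. It only assumes what is described above, namely that the source offset is stored with each completed checkpoint and that a failure rewinds the whole job to that offset.

```java
import java.util.ArrayList;
import java.util.List;

// Toy model of checkpoint-based recovery. Hypothetical names, not Flink API.
class CheckpointReplaySketch {

    // Feeds the records to a processor that fails once at failAtIndex and then
    // restarts from the last checkpointed offset. Returns every record the
    // processor handled, including replays.
    static List<String> run(List<String> records, int checkpointEvery, int failAtIndex) {
        List<String> seen = new ArrayList<>();
        int checkpointedOffset = 0;    // source offset stored in the last completed checkpoint
        boolean alreadyFailed = false; // the simulated failure happens only once
        int i = 0;
        while (i < records.size()) {
            if (i == failAtIndex && !alreadyFailed) {
                alreadyFailed = true;
                i = checkpointedOffset; // global restart: the source is rewound
                continue;
            }
            seen.add(records.get(i));
            i++;
            if (i % checkpointEvery == 0) {
                checkpointedOffset = i; // a checkpoint completes here
            }
        }
        return seen;
    }

    public static void main(String[] args) {
        List<String> records = List.of("k1-a", "k2-a", "k1-b", "k2-b");
        // The failure hits "k2-b"; the last checkpoint was taken after "k2-a".
        System.out.println(run(records, 2, 3));
        // prints [k1-a, k2-a, k1-b, k1-b, k2-b]
    }
}
```

In this model "k1-b" is handled twice even though only the key 2 record failed. Inside Flink the operator state is rolled back together with the offset, so the replay is not visible in Flink-managed state, but side effects on external systems are repeated unless the sink is transactional or idempotent.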

**David Anderson** · Answer 2 · 2023-11-06T00:48:55+00:00


Flink only allows for partial restarts in cases where doing so doesn't compromise correctness -- i.e., in streaming pipelines without any repartitioning. The keyBy in your use case makes this unworkable.
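Neither answer proposes a workaround, so the following is only a sketch of one pattern that might fit, under the assumption that a failure at step 3 or 4 can be treated as a per-record problem rather than a job-level one. Instead of throwing, the processing step catches the failure and diverts the record to a separate "failed" collection that can be retried later, so the job never restarts and the successful keys are not replayed. The class and method names are hypothetical and the code is plain Java, not Flink API; in a Flink job the same idea would most likely be expressed with a side output (`OutputTag` in a `ProcessFunction`).

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Function;

// Hypothetical sketch: divert failed records instead of failing the whole job.
class FailureDiversionSketch {

    // Holds the outputs of successful records and the inputs that failed.
    static final class Result<I, O> {
        final List<O> succeeded = new ArrayList<>();
        final List<I> failed = new ArrayList<>();
    }

    // Applies the step to every input. A RuntimeException from the step does not
    // propagate; the offending input is recorded for a later retry instead.
    static <I, O> Result<I, O> processAll(List<I> inputs, Function<I, O> step) {
        Result<I, O> result = new Result<>();
        for (I input : inputs) {
            try {
                result.succeeded.add(step.apply(input));
            } catch (RuntimeException e) {
                result.failed.add(input);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        Result<String, String> result = processAll(List.of("key1", "key2"), key -> {
            if (key.equals("key2")) {
                throw new IllegalStateException("simulated failure while storing " + key);
            }
            return key + " stored";
        });
        System.out.println("succeeded: " + result.succeeded);
        System.out.println("to retry:  " + result.failed);
    }
}
```

The trade-off is that the failed records are acknowledged at the source along with everything else, so retrying them becomes the application's responsibility (for example by writing them to a separate topic) rather than something the checkpoint mechanism does.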
