Is it possible to trigger a Spark job via Oozie when a folder's size reaches a certain threshold?

Question


103 views · Asked by sanyi14ka on 29 August 2017 at 06:54

For instance, if a folder reaches 100 MB, a Spark job should be triggered. I read about the dirSize HDFS EL function in Oozie, but I'm not sure how to use it. Does it trigger the job as soon as the folder reaches 100 MB, or does the size have to be checked periodically, say every 2 minutes?


There is 1 answer

**Naveenchandra Patil** · Answer 1 · 2017-08-29T14:39:57+00:00


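One option is to run an Oozie coordinator periodically (say, every 2 minutes) that checks the folder size. The check does not fire on its own when the folder grows; each coordinator run has to evaluate the size, and if it has reached the specified limit, the workflow triggers the Spark job.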
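A minimal sketch of that polling setup, assuming a workflow parameter named inputDir and placeholder values for the Spark class, jar path, and the jobTracker, nameNode, and master properties (none of these names come from the question). The decision node uses the fs:dirSize EL function and the built-in MB constant; note that fs:dirSize is not recursive and returns -1 if the path is missing or is not a directory.

```xml
<!-- workflow.xml: run the Spark job only if the folder has reached 100 MB -->
<workflow-app name="size-triggered-spark" xmlns="uri:oozie:workflow:0.5">
    <start to="check-size"/>

    <decision name="check-size">
        <switch>
            <!-- fs:dirSize sums the files directly under inputDir, in bytes -->
            <case to="run-spark">${fs:dirSize(inputDir) ge 100 * MB}</case>
            <!-- below the threshold: finish without doing anything -->
            <default to="end"/>
        </switch>
    </decision>

    <action name="run-spark">
        <spark xmlns="uri:oozie:spark-action:0.1">
            <job-tracker>${jobTracker}</job-tracker>
            <name-node>${nameNode}</name-node>
            <master>${master}</master>
            <name>size-triggered-job</name>
            <!-- hypothetical class and jar, replace with your own -->
            <class>com.example.MyJob</class>
            <jar>${nameNode}/apps/myjob/lib/myjob.jar</jar>
        </spark>
        <ok to="end"/>
        <error to="fail"/>
    </action>

    <kill name="fail">
        <message>Spark job failed: ${wf:errorMessage(wf:lastErrorNode())}</message>
    </kill>
    <end name="end"/>
</workflow-app>
```

The coordinator below simply runs that workflow on a timer. The startTime, endTime, and workflowAppPath properties are again assumed names supplied through job.properties.

```xml
<!-- coordinator.xml: poll on a fixed interval -->
<coordinator-app name="dir-size-poller" frequency="${coord:minutes(5)}"
                 start="${startTime}" end="${endTime}" timezone="UTC"
                 xmlns="uri:oozie:coordinator:0.4">
    <action>
        <workflow>
            <app-path>${workflowAppPath}</app-path>
            <configuration>
                <property>
                    <name>inputDir</name>
                    <value>${inputDir}</value>
                </property>
            </configuration>
        </workflow>
    </action>
</coordinator-app>
```

The sketch polls every 5 minutes rather than 2 because, as far as I know, Oozie by default rejects coordinator frequencies faster than 5 minutes unless oozie.service.coord.check.maximum.frequency is set to false on the server. Also keep in mind that the condition stays true on later runs, so the Spark job (or a follow-up step) should move or clear the processed files if it must not be triggered again for the same data.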
