What is oozie equivalent for Spark?

Question

What is oozie equivalent for Spark?

549 views Asked by Aravind Yarram At 24 November 2015 at 00:55

We have very complex pipelines which we need to compose and schedule. I see that Hadoop ecosystem has Oozie for this. What are the choices for Spark based jobs when I am running Spark on Mesos or Standalone and doesn't have a Hadoop cluster?

Original Q&A

There are 2 answers

Rakesh On 25 November 2015 at 12:58

Oozie can be used in case of Yarn, for spark there is no built in scheduler available, So you are free to choose any scheduler which works in the cluster mode.

For Mesos I feel Chronos would be the right choice, more info on Chronos

**srinath_perera** · Accepted Answer · 2015-11-26T04:08:58+00:00

Unlike with Hadoop, it is pretty easy to chains things with Spark. So writing a Spark Scala script might be enough. My first recommendation is tying that.

If you like to keep it SQL like, you can try SparkSQL.

If you have a really complex flow, it is worth looking at Google data flow https://github.com/GoogleCloudPlatform/DataflowJavaSDK.

TechQA.

What is oozie equivalent for Spark?

There are 2 answers

Related Questions in HADOOP

Related Questions in APACHE-SPARK

Related Questions in BIGDATA

Related Questions in APACHE-SPARK-1.5

Popular Questions

Trending Questions