I want to run my Spark executors only on the task nodes of my AWS EMR cluster, and YARN node labels are one way to achieve this, since I can specify a label expression during spark-submit. I want to achieve the following:
- Add a custom label during the cluster start-up.
- Associate this label to any node joining my cluster during auto-scaling.
I want to do this so that I can reduce the cost of my cluster by ensuring all executors run on Spot instances.
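For context, the moving parts can be sketched roughly as follows. This is a minimal sketch, not the full solution: the label name `spotlabel`, the node hostname, and `my_job.py` are placeholders, and it assumes YARN node labels are already enabled on the cluster (`yarn.node-labels.enabled=true`).

```shell
# Sketch only — "spotlabel", the hostname, and my_job.py are hypothetical.

# 1) Register the custom label with the ResourceManager at cluster start-up
#    (e.g. from a step or script run on the master node).
yarn rmadmin -addToClusterNodeLabels "spotlabel(exclusive=false)"

# 2) Attach the label to a node as it joins the cluster (run for each task
#    node, e.g. from a bootstrap action or a script triggered on scale-out).
yarn rmadmin -replaceLabelsOnNode "ip-10-0-0-42.ec2.internal:8041=spotlabel"

# 3) Point Spark executors at the labeled nodes during spark-submit.
spark-submit \
  --master yarn \
  --conf spark.yarn.executor.nodeLabelExpression=spotlabel \
  my_job.py
```

Making `exclusive=false` means unlabeled capacity can still be shared, so other applications are not starved of the labeled nodes.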
We achieved this through the process below.