This is something I haven't found anywhere.
I have a YARN cluster with some slaves. When a slave fails (chaos monkey, scale-down, etc.), the ResourceManager doesn't notice. Even rmadmin -refreshNodes doesn't fix it: the ResourceManager keeps listing the failed nodes as RUNNING. How do I get the ResourceManager to check the slaves' health and remove them when they fail?
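For reference, the closest setting I could find is the liveness expiry interval in yarn-site.xml, which I assume controls how long the ResourceManager waits before declaring a node LOST (the default is 10 minutes; the value below is just an example):

    <!-- yarn-site.xml: how long the ResourceManager waits without a
         NodeManager heartbeat before marking a node LOST.
         Default is 600000 ms (10 min); 60000 ms here is an example. -->
    <property>
      <name>yarn.nm.liveness-monitor.expiry-interval-ms</name>
      <value>60000</value>
    </property>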
Please look at the Hadoop Definitive Guide, Chapter 10, Maintenance: Commissioning and Decommissioning Nodes. It looks like you are trying to update only the jobtracker (the ResourceManager, in YARN terms) with the above command. A more elaborate process is described there, which also involves updating the namenode, verifying progress in the web UI, and then removing the nodes from the include file and the slaves file.
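Roughly, the sequence from that chapter looks like this; the file paths and hostname below are examples, so adjust them to your layout:

    # 1. Make sure the masters point at an exclude file
    #    (hdfs-site.xml: dfs.hosts.exclude,
    #     yarn-site.xml: yarn.resourcemanager.nodes.exclude-path,
    #     both set to e.g. /etc/hadoop/conf/excludes)

    # 2. Add the dead/decommissioned slave to the exclude file
    echo "slave-03.example.com" >> /etc/hadoop/conf/excludes

    # 3. Tell the namenode and the ResourceManager to re-read the host lists
    hdfs dfsadmin -refreshNodes
    yarn rmadmin -refreshNodes

    # 4. Verify in the web UI that the node shows as decommissioned, then
    #    remove it from the include file and the slaves file and refresh again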