Support Questions

Find answers, ask questions, and share your expertise
Announcements
Celebrating as our community reaches 100,000 members! Thank you!

How can we avoid Resource Manager to retry failed Application Master on the same NodeManager?

avatar
Explorer

Can we avoid Resource Manager to retry failed Application Master on the same NodeManager? My hive job tried to lauch application master in same NM after multiple fails, It leads whole job fail after 4 retries. 

1 ACCEPTED SOLUTION

avatar
Explorer

This response I got from the Cloudera support


"I can see you're running on CDH 5.3.3, and this was added as a feature in YARN-2005, which was included in CDH releases starting from CDH 5.5.0: "YARN-2005: Blacklisting support for scheduling AMs" https://issues.apache.org/jira/browse/YARN-2005,

Unfortunately this can't be backported to your version of CDH and you will have to resort to an upgrade "

 

Another ticket related black listing AMs is

https://issues.apache.org/jira/browse/YARN-4389

This fix releasing with Hadoop 2.8.0.

View solution in original post

1 REPLY 1

avatar
Explorer

This response I got from the Cloudera support


"I can see you're running on CDH 5.3.3, and this was added as a feature in YARN-2005, which was included in CDH releases starting from CDH 5.5.0: "YARN-2005: Blacklisting support for scheduling AMs" https://issues.apache.org/jira/browse/YARN-2005,

Unfortunately this can't be backported to your version of CDH and you will have to resort to an upgrade "

 

Another ticket related black listing AMs is

https://issues.apache.org/jira/browse/YARN-4389

This fix releasing with Hadoop 2.8.0.