Support Questions
Find answers, ask questions, and share your expertise
Announcements
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

How can we avoid Resource Manager to retry failed Application Master on the same NodeManager?

SOLVED Go to solution

How can we avoid Resource Manager to retry failed Application Master on the same NodeManager?

New Contributor

Can we avoid Resource Manager to retry failed Application Master on the same NodeManager? My hive job tried to lauch application master in same NM after multiple fails, It leads whole job fail after 4 retries. 

1 ACCEPTED SOLUTION

Accepted Solutions

Re: How can we avoid Resource Manager to retry failed Application Master on the same NodeManager?

New Contributor

This response I got from the Cloudera support


"I can see you're running on CDH 5.3.3, and this was added as a feature in YARN-2005, which was included in CDH releases starting from CDH 5.5.0: "YARN-2005: Blacklisting support for scheduling AMs" https://issues.apache.org/jira/browse/YARN-2005,

Unfortunately this can't be backported to your version of CDH and you will have to resort to an upgrade "

 

Another ticket related black listing AMs is

https://issues.apache.org/jira/browse/YARN-4389

This fix releasing with Hadoop 2.8.0.

1 REPLY 1

Re: How can we avoid Resource Manager to retry failed Application Master on the same NodeManager?

New Contributor

This response I got from the Cloudera support


"I can see you're running on CDH 5.3.3, and this was added as a feature in YARN-2005, which was included in CDH releases starting from CDH 5.5.0: "YARN-2005: Blacklisting support for scheduling AMs" https://issues.apache.org/jira/browse/YARN-2005,

Unfortunately this can't be backported to your version of CDH and you will have to resort to an upgrade "

 

Another ticket related black listing AMs is

https://issues.apache.org/jira/browse/YARN-4389

This fix releasing with Hadoop 2.8.0.