I've set up a new Ambari cluster, but I can't run any jobs on it using YARN because they all get stuck in the ACCEPTED state (the AM container waits for the RM). If I go to the allocated container, its state is RUNNING but there are no logs, only a message that it is currently in the LOCALIZING state.
The job eventually fails due to a timeout.
Ambari version is 18.104.22.168
@Jagadeesan A S I am using 22.214.171.124-1634, but I am still facing this issue. Is it resolved in 3.0.0 or 3.1.0?
The annoying part is that the issue is random: it can hit any job at any time. A job that normally runs fine suddenly fails with this issue, and a couple of days later some other job that had run successfully before fails in the same way, then runs properly again afterwards. No other jobs are running when the failures occur.
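Since the failure is intermittent, it may help to catch an application while it is still stuck. Below is a minimal sketch, not a fix, that polls the ResourceManager REST endpoint `/ws/v1/cluster/apps?states=ACCEPTED` and lists applications that have been sitting in ACCEPTED longer than a threshold. The RM address `http://rm-host:8088`, the function names, and the 5-minute threshold are assumptions for illustration; adjust them for your cluster.

```python
import json
from urllib.request import urlopen


def stuck_accepted_apps(payload, min_elapsed_ms=300_000):
    """Return (app id, elapsed ms) for apps in ACCEPTED longer than min_elapsed_ms.

    `payload` is the parsed JSON from the RM apps endpoint. The RM returns
    {"apps": null} when nothing matches, so missing values are handled.
    """
    apps = (payload.get("apps") or {}).get("app") or []
    return [
        (app["id"], app.get("elapsedTime", 0))
        for app in apps
        if app.get("state") == "ACCEPTED"
        and app.get("elapsedTime", 0) >= min_elapsed_ms
    ]


def fetch_accepted_apps(rm_url):
    """Fetch apps currently in ACCEPTED state from the ResourceManager."""
    # rm_url is a placeholder such as "http://rm-host:8088" (assumed address).
    with urlopen(f"{rm_url}/ws/v1/cluster/apps?states=ACCEPTED", timeout=10) as resp:
        return json.load(resp)


def report(rm_url):
    """Print every app that looks stuck in ACCEPTED."""
    for app_id, elapsed in stuck_accepted_apps(fetch_accepted_apps(rm_url)):
        print(f"{app_id} has been in ACCEPTED for {elapsed / 1000:.0f}s")
```

Calling `report("http://rm-host:8088")` from a cron job or a loop gives a timestamped record of which applications were stuck, which can then be matched against the NodeManager logs on the host where the AM container was localizing.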