When Ambari executes tasks like START/STOP/RESTART/... , it timeouts after waiting 20mins ( 1200 seconds).
For example Namenode START command waits for 20mins and then mark the service is Failed/Timeout - in larger clusters NameNodes takes much longer time - How to tell Ambari to wait for much longer?
Add below 2 configs and re-start the service - this change increases the timeout value to 1Hour.
agent.task.timeout=3600
server.task.timeout=3600
Note: This change may not take affect during the Upgrade If it is already scheduled.
Super helpful to know it can be done, but is this the ambari-server conf file you edit or the ambari-agent file? I assume it's ambari-server but it would be nice to have that as a clear call out in the article.