I am deploying a cluster using a Blueprint. Sometimes the components install on all nodes without problems, but sometimes the installation fails on a node with this error:
Caught an exception while executing custom service command: <type 'exceptions.Exception'>: Command requires configs with timestamp=1560248016781 but configs on agent have timestamp=1560248014508; Command requires configs with timestamp=1560248016781 but configs on agent have timestamp=1560248014508.
What could be causing this error? Can anyone help or guide me on this?
Also, after the installation fails, I check the failed components and reinstall them on the same host.
Sometimes that works and sometimes it fails.
Are you sure that all the cluster nodes where the Ambari Server and Agents are running are in time sync?
Are you running an NTPD service to make sure that the time is in sync across all these nodes?
The clocks of all the nodes in your cluster and the machine that runs the browser through which you access the Ambari Web interface must be able to synchronize with each other.
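As a first check on the clock question, the two values in the reported error look like epoch milliseconds, so the gap between them can be read off directly. Below is a minimal Python sketch that does this; the function names are hypothetical, and the script only measures the difference. It does not tell you whether the cause is clock skew or the agent simply holding an older copy of the configs.

```python
import re
from datetime import datetime, timezone

# Error text as reported in this thread.
ERROR = ("Command requires configs with timestamp=1560248016781 "
         "but configs on agent have timestamp=1560248014508")

PATTERN = re.compile(
    r"requires configs with timestamp=(\d+) "
    r"but configs on agent have timestamp=(\d+)"
)


def config_timestamp_gap_ms(message):
    """Return (required, on_agent, gap_ms) parsed from the error text."""
    match = PATTERN.search(message)
    if match is None:
        raise ValueError("message does not contain the expected timestamps")
    required = int(match.group(1))
    on_agent = int(match.group(2))
    return required, on_agent, required - on_agent


def as_utc(ms):
    """Interpret a value as epoch milliseconds (an assumption) in UTC."""
    return datetime.fromtimestamp(ms / 1000, tz=timezone.utc)


if __name__ == "__main__":
    required, on_agent, gap = config_timestamp_gap_ms(ERROR)
    print("required by command:", as_utc(required).isoformat())
    print("present on agent:   ", as_utc(on_agent).isoformat())
    print("gap in milliseconds:", gap)
```

For the message in this thread the gap is 2273 ms, so the configs on the agent are roughly 2.3 seconds older than the ones the command expects.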
What is the version of your Ambari Server?
Have you previously tried deleting and re-adding services on that failing node? (Maybe something got cached in the agent cache directory.)
I also see a similar error reported in the following JIRA, although with the currently available information it is hard to say whether you are facing the same issue: https://issues.apache.org/jira/browse/AMBARI-24176
I am using Ambari version 220.127.116.11. I am not deploying any service manually through my browser; I am automating the whole cluster deployment using a Blueprint.
Is there any way, or any parameter we can add in the Blueprint, so that it reinstalls a failed component on the node where it failed?
Also, is there any API for clearing the cache?
Is it compulsory to run the NTPD service on the hosts?
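On the reinstall question, nothing in this thread confirms a Blueprint parameter for it, but one possible approach is to retry the failed components from the automation script through the Ambari REST API. The sketch below is only an illustration under assumptions: the endpoint paths and the `INSTALL_FAILED` / `INSTALLED` states reflect my understanding of the Ambari REST API and should be verified against your Ambari version, and the function names, host names, and credentials are all hypothetical. It only builds the requests; sending them is left to the caller.

```python
import base64
import json
import urllib.request


def failed_components_url(base_url, cluster):
    """URL that lists host components in INSTALL_FAILED state (assumed API shape)."""
    return (
        f"{base_url}/api/v1/clusters/{cluster}/host_components"
        "?HostRoles/state=INSTALL_FAILED"
        "&fields=HostRoles/host_name,HostRoles/component_name"
    )


def reinstall_request(base_url, cluster, host, component, user, password):
    """Build a PUT request asking Ambari to bring one component to INSTALLED."""
    url = (
        f"{base_url}/api/v1/clusters/{cluster}"
        f"/hosts/{host}/host_components/{component}"
    )
    body = json.dumps({
        "RequestInfo": {"context": f"Reinstall {component} on {host}"},
        "Body": {"HostRoles": {"state": "INSTALLED"}},
    }).encode("utf-8")
    token = base64.b64encode(f"{user}:{password}".encode("utf-8")).decode("ascii")
    headers = {
        "Authorization": f"Basic {token}",
        # Ambari rejects modifying calls that lack this header.
        "X-Requested-By": "ambari",
        "Content-Type": "application/json",
    }
    return urllib.request.Request(url, data=body, method="PUT", headers=headers)


if __name__ == "__main__":
    # Hypothetical values; replace with your own server, cluster, and host.
    req = reinstall_request(
        "http://ambari.example.com:8080", "mycluster",
        "node3.example.com", "DATANODE", "admin", "admin",
    )
    print(req.get_method(), req.full_url)
    # To actually send it: urllib.request.urlopen(req)
```

A script could first fetch `failed_components_url(...)`, then issue one `reinstall_request(...)` per host and component pair it returns, polling the resulting request until it finishes.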