Hi,
One of the edge nodes in the cluster is generating alerts once or twice a week due to missed heartbeat from agent.
INFO 2018-01-29 07:04:38,554 logger.py:71 - call returned (0, '')
INFO 2018-01-29 07:06:04,226 logger.py:71 - call[['test', '-w', '/']] {'sudo': True, 'timeout': 5}
INFO 2018-01-29 07:06:04,233 logger.py:71 - call returned (0, '')
As you can see, there is no logging for 1.5 minutes and it is causing ambari alert for this edge node. How can i track if there was any connectivity issue between server and agent. Sometimes more than 1 heartbeat interval is missing.