
CM 6.1: hosts randomly and periodically show "Unknown Health"



After upgrading from CM/CDH 5.16.1 to 6.1, my hosts randomly and periodically show "Unknown Health" for a few seconds and then go back to green. I have not found any WARNING or ERROR in the logs of the hosts or of any service.

The entire cluster works without any issue, and I have run the host inspection and the network inspection without any problem. I have also synced the time/date a few times just in case, but I can still watch my hosts (and, because of the hosts, the services) go grey with "Unknown Health" and back to green at random, for a few seconds each time.


Cloudera Management Service runs on one server with 14 cores and 28G of memory. I have checked this server's activity and it is pretty idle; the cluster is not a busy one. Either way, these are the memory settings for the monitoring roles:

Java Heap Size of Activity Monitor in Bytes: 2GB
Java Heap Size of Alert Publisher in Bytes: 256MB
Java Heap Size of EventServer in Bytes: 1GB
Java Heap Size of Host Monitor in Bytes: 4GB
Java Heap Size of Service Monitor in Bytes: 4GB
Maximum Non-Java Memory of Host Monitor: 8GB
Maximum Non-Java Memory of Service Monitor: 12GB
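
For reference, here is a quick sum of the limits listed above against the server's 28G. This is only a rough sketch: it assumes "GB" and "28G" both mean GiB, it treats the settings as upper bounds rather than actual usage, and the variable names are just for illustration. Taken together the configured limits come out slightly above the physical memory, although the server looks idle, so I am not sure whether that is related.

```python
# Configured limits copied from the list above, in GiB.
# Assumption: "GB" in the settings and "28G" for the host both mean GiB.
limits_gib = {
    "Activity Monitor heap": 2.0,
    "Alert Publisher heap": 0.25,   # 256MB
    "EventServer heap": 1.0,
    "Host Monitor heap": 4.0,
    "Service Monitor heap": 4.0,
    "Host Monitor non-Java": 8.0,
    "Service Monitor non-Java": 12.0,
}
HOST_MEMORY_GIB = 28.0

# These are maxima, not measured usage, so this is a worst-case figure.
total_gib = sum(limits_gib.values())
headroom_gib = HOST_MEMORY_GIB - total_gib

for name, size in limits_gib.items():
    print(f"{name:26s} {size:6.2f} GiB")
print(f"{'Total of configured limits':26s} {total_gib:6.2f} GiB")
print(f"Headroom on a {HOST_MEMORY_GIB:.0f} GiB host: {headroom_gib:+.2f} GiB")
```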

Do you guys have any advice on how to diagnose the possible issue?



Many thanks.

Attachments: Screenshot 2019-01-17 16.33.37.png, Screenshot 2019-01-17 16.23.19.png, Screenshot 2019-01-17 16.22.52.png