I'm running a 6-node Hadoop cluster on HDP 3.1.4 and have run into a strange situation. Could anyone please help me out?
In the Ambari Web UI, the NodeManagers, which run on 3 of the nodes, all appear to be fine. However, when we submit an application to YARN, every application fails.
While investigating the log files, I found that the web UI of one NodeManager node was not responding and that the node was emitting the following log message:
WARN org.apache.hadoop.util.SysInfoLinux: Couldn't read /proc/meminfo: can't determine memory settings
Restarting the NodeManager service resolves the problem for a while, but it seems to recur.
Has anyone faced the same problem? Any comments are highly appreciated.
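In case it helps with diagnosis, here is a minimal sketch of a check that could be run on the affected node, ideally as the same user that runs the NodeManager. It only tests whether /proc/meminfo can be opened and parsed at that moment; it does not reproduce what SysInfoLinux does internally, and the function names parse_meminfo and check_meminfo are hypothetical helpers written for this post.

```python
import sys


def parse_meminfo(text):
    """Parse /proc/meminfo-style text into {field name: leading integer value}."""
    info = {}
    for line in text.splitlines():
        name, sep, rest = line.partition(":")
        if not sep:
            continue  # skip lines that are not "Name: value" pairs
        parts = rest.split()
        if parts and parts[0].isdigit():
            info[name.strip()] = int(parts[0])
    return info


def check_meminfo(path="/proc/meminfo"):
    """Return the parsed fields, or None if the file cannot be read."""
    try:
        with open(path, "r") as f:
            return parse_meminfo(f.read())
    except OSError as exc:
        # The error text (e.g. permission denied, too many open files)
        # may hint at why a process on this node cannot read the file.
        print(f"cannot read {path}: {exc}", file=sys.stderr)
        return None


if __name__ == "__main__":
    info = check_meminfo()
    if info is not None:
        print("MemTotal (kB):", info.get("MemTotal"))
```

If this check succeeds while the NodeManager still logs the warning, the cause is presumably specific to the NodeManager process rather than to the node as a whole, though that is only an assumption on my part.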
Thank you in advance.