<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Ambari-Agent high cpu &amp; Datanode without heartbeat in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Ambari-Agent-high-cpu-Datanode-without-heartbeat/m-p/204051#M68785</link>
    <description>&lt;P&gt;Problem solved!&lt;BR /&gt;In case anyone else encounters the following:&lt;BR /&gt;1. Servers losing heartbeat for no apparent reason&lt;BR /&gt;2. The Ambari agent constantly hogs 100% CPU&lt;BR /&gt;3. Running "yarn application -list" produces results, but more slowly than on other servers&lt;BR /&gt;4. The server is slow in general&lt;/P&gt;&lt;P&gt;The fix was to set the CPU profile in the BIOS to "maximum performance".&lt;BR /&gt;For some reason the server (a DELL in our case, but other vendors' BIOSes have a similar setting) was set to the default CPU profile, which trades CPU performance for lower power use.&lt;/P&gt;</description>
    <pubDate>Tue, 03 Oct 2017 19:08:39 GMT</pubDate>
    <dc:creator>Adija1</dc:creator>
    <dc:date>2017-10-03T19:08:39Z</dc:date>
    <item>
      <title>Ambari-Agent high cpu &amp; Datanode without heartbeat</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Ambari-Agent-high-cpu-Datanode-without-heartbeat/m-p/204049#M68783</link>
      <description>&lt;P&gt;Hello&lt;/P&gt;&lt;P&gt;We have a 9-DataNode CentOS 6 cluster running HDP 2.6.1 &amp;amp; Ambari 2.5.1.&lt;BR /&gt;Lately we've noticed that one DataNode lost its heartbeat in Ambari. We restarted the Ambari agent on that node, but it didn't help.&lt;BR /&gt;We then rebooted the entire DataNode server, but no luck.&lt;BR /&gt;&lt;U&gt;We did notice that:&lt;/U&gt;&lt;BR /&gt;- "/usr/bin/python /usr/lib/python2.6/site-packages/ambari_agent/main.py start" is always at 100% CPU (CentOS 6), while on all other nodes it barely uses 1%.&lt;BR /&gt;- ambari-agent.log shows no errors besides a timeout while checking the JCE. The agent runs the following command:&lt;BR /&gt;"/usr/jdk64/jdk1.8.0_77/bin/java -jar /var/lib/ambari-agent/tools/jcepolicyinfo.jar -tu" and it times out.&lt;BR /&gt;Running the command manually does return a result, but only after 6 seconds, versus about 1 second on the other DataNodes (hence the timeout, which is 5 seconds).&lt;/P&gt;&lt;P&gt;- The Ambari agent is 2.5.1.0-159.x86_64, same as on all other DNs&lt;BR /&gt;- The Java version is 1.8.0_77, same as on all other DNs&lt;/P&gt;&lt;P&gt;- NodeManager works without a problem on that node and runs applications fine. It's just the heartbeat/ambari-agent issue.&lt;BR /&gt;- Name resolution &amp;amp; networking between the DataNode &amp;amp; Ambari are working fine.&lt;/P&gt;&lt;P&gt;Any advice will be appreciated.&lt;/P&gt;&lt;P&gt;Adi&lt;/P&gt;</description>
      <pubDate>Sun, 01 Oct 2017 21:39:01 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Ambari-Agent-high-cpu-Datanode-without-heartbeat/m-p/204049#M68783</guid>
      <dc:creator>Adija1</dc:creator>
      <dc:date>2017-10-01T21:39:01Z</dc:date>
    </item>
    <item>
      <title>Re: Ambari-Agent high cpu &amp; Datanode without heartbeat</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Ambari-Agent-high-cpu-Datanode-without-heartbeat/m-p/204050#M68784</link>
      <description>&lt;P&gt;More info:&lt;/P&gt;&lt;P&gt;In ambari-server.log I can see that the server acknowledges the loss of heartbeat, but the agent and server do seem to communicate:&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;01 Oct 2017 18:22:39,870  WARN [ambari-hearbeat-monitor] HeartbeatMonitor:159 - &lt;STRONG&gt;Heartbeat lost from host&lt;/STRONG&gt; hdp-dn01-drp.hadoop.local
01 Oct 2017 18:22:39,872  INFO [ambari-hearbeat-monitor] TopologyManager:671 - Hearbeat for host hdp-dn01-drp.hadoop.local lost thus removing it from available hosts.
01 Oct 2017 18:22:39,872  WARN [ambari-hearbeat-monitor] HeartbeatMonitor:174 - Setting component state to UNKNOWN for component METRICS_MONITOR on hdp-dn01-drp.hadoop.local
01 Oct 2017 18:22:39,872  WARN [ambari-hearbeat-monitor] HeartbeatMonitor:174 - Setting component state to UNKNOWN for component DRUID_MIDDLEMANAGER on hdp-dn01-drp.hadoop.local
01 Oct 2017 18:22:39,872  WARN [ambari-hearbeat-monitor] HeartbeatMonitor:174 - Setting component state to UNKNOWN for component DRUID_HISTORICAL on hdp-dn01-drp.hadoop.local
01 Oct 2017 18:22:39,872  WARN [ambari-hearbeat-monitor] HeartbeatMonitor:174 - Setting component state to UNKNOWN for component DATANODE on hdp-dn01-drp.hadoop.local
01 Oct 2017 18:22:39,873  WARN [ambari-hearbeat-monitor] HeartbeatMonitor:174 - Setting component state to UNKNOWN for component NODEMANAGER on hdp-dn01-drp.hadoop.local
01 Oct 2017 18:23:23,042  WARN [qtp-ambari-agent-1563184] HeartBeatHandler:235 - &lt;STRONG&gt;Host is in HEARTBEAT_LOST state&lt;/STRONG&gt; - &lt;STRONG&gt;sending register command&lt;/STRONG&gt;
01 Oct 2017 18:23:30,029  INFO [qtp-ambari-agent-1563184] HeartBeatHandler:425 - agentOsType = centos6
01 Oct 2017 18:23:30,045  INFO [qtp-ambari-agent-1563184] HostImpl:329 - &lt;STRONG&gt;Received host registration&lt;/STRONG&gt;, host=[hostname=hdp-dn01-drp,fqdn=hdp-dn01-drp.hadoop.local,domain=hadoop.local,architecture=x86_64,processorcount=12,physicalprocessorcount=12,osname=centos,osversion=6.8,osfamily=redhat,memory=65901056,uptime_hours=0,mounts=(available=38553776,mountpoint=/,used=10299852,percent=22%,size=51475068,device=/dev/mapper/vg_system-LogVol00,type=ext4)(available=390059,mountpoint=/boot,used=71993,percent=16%,size=487652,device=/dev/sda2,type=ext4)(available=204304,mountpoint=/boot/efi,used=276,percent=1%,size=204580,device=/dev/sda1,type=vfat)(available=1654503504,mountpoint=/grid/0,used=110677756,percent=7%,size=1859652852,device=/dev/sda5,type=ext4)(available=1713507724,mountpoint=/grid/1,used=111041964,percent=7%,size=1922198324,device=/dev/sdb1,type=ext4)(available=1716047400,mountpoint=/grid/2,used=108502288,percent=6%,size=1922198324,device=/dev/sdc1,type=ext4)(available=1716667196,mountpoint=/grid/3,used=107882492,percent=6%,size=1922198324,device=/dev/sdd1,type=ext4)(available=1709492804,mountpoint=/grid/4,used=115056884,percent=7%,size=1922198324,device=/dev/sde1,type=ext4)(available=1710666700,mountpoint=/grid/5,used=113882988,percent=7%,size=1922198324,device=/dev/sdf1,type=ext4)(available=1709508880,mountpoint=/grid/6,used=115040808,percent=7%,size=1922198324,device=/dev/sdg1,type=ext4)(available=1705253584,mountpoint=/grid/7,used=119296104,percent=7%,size=1922198324,device=/dev/sdh1,type=ext4)(available=1708647680,mountpoint=/grid/8,used=115902008,percent=7%,size=1922198324,device=/dev/sdi1,type=ext4)(available=1713886116,mountpoint=/grid/9,used=110663572,percent=7%,size=1922198324,device=/dev/sdj1,type=ext4)(available=1711301604,mountpoint=/grid/10,used=113248084,percent=7%,size=1922198324,device=/dev/sdk1,type=ext4)(available=1712490508,mountpoint=/grid/11,used=112059180,percent=7%,size=1922198324,device=/dev/sdl1,type=ext4)]
, registrationTime=1506871410029, agentVersion=2.5.1.0
01 Oct 2017 18:23:30,045  INFO [qtp-ambari-agent-1563184] TopologyManager:592 - TopologyManager.onHostRegistered: Entering
01 Oct 2017 18:23:30,045  INFO [qtp-ambari-agent-1563184] TopologyManager:594 - TopologyManager.onHostRegistered: &lt;STRONG&gt;host = hdp-dn01-drp.hadoop.local is already associated with the cluster or is currently being processed&lt;/STRONG&gt;
01 Oct 2017 18:23:30,052  INFO [qtp-ambari-agent-1563184] HeartBeatHandler:504 - Recovery configuration set to RecoveryConfig{, type=AUTO_START, maxCount=6, windowInMinutes=60, retryGap=5, maxLifetimeCount=1024, components=null, recoveryTimestamp=1506871410051}&lt;/P&gt;</description>
      <pubDate>Sun, 01 Oct 2017 22:27:07 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Ambari-Agent-high-cpu-Datanode-without-heartbeat/m-p/204050#M68784</guid>
      <dc:creator>Adija1</dc:creator>
      <dc:date>2017-10-01T22:27:07Z</dc:date>
    </item>
    <item>
      <title>Re: Ambari-Agent high cpu &amp; Datanode without heartbeat</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Ambari-Agent-high-cpu-Datanode-without-heartbeat/m-p/204051#M68785</link>
      <description>&lt;P&gt;Problem solved!&lt;BR /&gt;In case anyone else encounters the following:&lt;BR /&gt;1. Servers losing heartbeat for no apparent reason&lt;BR /&gt;2. The Ambari agent constantly hogs 100% CPU&lt;BR /&gt;3. Running "yarn application -list" produces results, but more slowly than on other servers&lt;BR /&gt;4. The server is slow in general&lt;/P&gt;&lt;P&gt;The fix was to set the CPU profile in the BIOS to "maximum performance".&lt;BR /&gt;For some reason the server (a DELL in our case, but other vendors' BIOSes have a similar setting) was set to the default CPU profile, which trades CPU performance for lower power use.&lt;/P&gt;</description>
      <pubDate>Tue, 03 Oct 2017 19:08:39 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Ambari-Agent-high-cpu-Datanode-without-heartbeat/m-p/204051#M68785</guid>
      <dc:creator>Adija1</dc:creator>
      <dc:date>2017-10-03T19:08:39Z</dc:date>
    </item>
  </channel>
</rss>

