<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Agent Heartbeat problems on datanodes in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Agent-Heartbeat-problems-on-datanodes/m-p/388927#M246823</link>
    <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;since a few weeks, we have regular warnings on our datanodes :&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;"This role's host has been out of contact with Cloudera Manager for a concerning amount of time."&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;cm-agent is consuming alot of CPU , in particular&amp;nbsp;MonitorDaemon-R (cm-agent PID is 25598):&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;top -H -b -n1 -p 25598&lt;BR /&gt;top - 14:08:21 up 22 days, 6:17, 1 user, load average: 5,10, 5,07, 4,98&lt;BR /&gt;Threads: 35 total, 2 running, 33 sleeping, 0 stopped, 0 zombie&lt;BR /&gt;%Cpu(s): 20,1 us, 21,5 sy, 0,0 ni, 56,5 id, 0,0 wa, 0,0 hi, 1,9 si, 0,0 st&lt;BR /&gt;KiB Mem : 26370784+total, 7788424 free, 38057140 used, 21786227+buff/cache&lt;BR /&gt;KiB Swap: 31457276 total, 31349500 free, 107776 used. 22445827+avail Mem&lt;/P&gt;&lt;P&gt;PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND&lt;BR /&gt;25833 root 20 0 3085980 407988 10016 R 87,5 0,2 124:55.80 MonitorDaemon-R&lt;BR /&gt;25598 root 20 0 3085980 407988 10016 S 0,0 0,2 28:07.22 cmagent&lt;BR /&gt;25748 root 20 0 3085980 407988 10016 S 0,0 0,2 2:49.93 cmagent&lt;BR /&gt;25754 root 20 0 3085980 407988 10016 S 0,0 0,2 0:14.20 Audit-Plugin&lt;BR /&gt;25755 root 20 0 3085980 407988 10016 S 0,0 0,2 0:13.98 Metadata-Plugin&lt;BR /&gt;25756 root 20 0 3085980 407988 10016 S 0,0 0,2 0:14.35 Profile-Plugin&lt;BR /&gt;25800 root 20 0 3085980 407988 10016 S 0,0 0,2 0:00.15 _TimeoutMonitor&lt;BR /&gt;25801 root 20 0 3085980 407988 10016 S 0,0 0,2 0:10.39 HTTPServer _sta&lt;BR /&gt;25802 root 20 0 3085980 407988 10016 S 0,0 0,2 0:00.00 CP Server Worke&lt;BR /&gt;25803 root 20 0 3085980 407988 10016 S 0,0 0,2 0:03.12 CP Server Worke&lt;BR /&gt;25804 root 20 0 3085980 407988 10016 S 0,0 0,2 0:04.83 CP Server Worke&lt;BR /&gt;25805 root 20 0 3085980 407988 10016 S 0,0 0,2 0:00.03 CP Server Worke&lt;BR /&gt;25806 root 20 0 3085980 407988 10016 S 0,0 0,2 0:00.01 CP Server Worke&lt;BR /&gt;25807 root 20 0 3085980 407988 10016 S 0,0 0,2 0:00.00 CP Server Worke&lt;BR /&gt;25808 root 20 0 3085980 407988 10016 S 0,0 0,2 0:00.00 CP Server Worke&lt;BR /&gt;25809 root 20 0 3085980 407988 10016 S 0,0 0,2 0:00.00 CP Server Worke&lt;BR /&gt;25810 root 20 0 3085980 407988 10016 S 0,0 0,2 0:00.00 CP Server Worke&lt;BR /&gt;25811 root 20 0 3085980 407988 10016 S 0,0 0,2 0:00.00 CP Server Worke&lt;BR /&gt;25831 root 20 0 3085980 407988 10016 S 0,0 0,2 0:09.61 Monitor-HostMon&lt;BR /&gt;25832 root 20 0 3085980 407988 10016 S 0,0 0,2 0:27.20 DnsResolutionMo&lt;BR /&gt;25834 root 20 0 3085980 407988 10016 S 0,0 0,2 2:49.89 MonitorDaemon-S&lt;BR /&gt;26279 root 20 0 3085980 407988 10016 S 0,0 0,2 3:02.54 WorkerThread&lt;BR /&gt;26629 root 20 0 3085980 407988 10016 S 0,0 0,2 0:00.36 __run_queue&lt;BR /&gt;26630 root 20 0 3085980 407988 10016 S 0,0 0,2 0:00.39 __run_queue&lt;BR /&gt;26631 root 20 0 3085980 407988 10016 S 0,0 0,2 0:00.46 __run_queue&lt;BR /&gt;26632 root 20 0 3085980 407988 10016 S 0,0 0,2 0:00.80 __run_queue&lt;BR /&gt;26637 root 20 0 3085980 407988 10016 S 0,0 0,2 4:24.79 GM KUDU_TSERVER&lt;BR /&gt;26639 root 20 0 3085980 407988 10016 S 0,0 0,2 0:02.74 Monitor-SolrSer&lt;BR /&gt;26641 root 20 0 3085980 407988 10016 S 0,0 0,2 0:24.39 GM KAFKA_BROKER&lt;BR /&gt;26647 root 20 0 3085980 407988 10016 S 0,0 0,2 0:02.14 GM REGIONSERVER&lt;BR /&gt;26649 root 20 0 3085980 407988 10016 S 0,0 0,2 0:08.91 GM NODEMANAGER&lt;BR /&gt;26651 root 20 0 3085980 407988 10016 S 0,0 0,2 0:00.31 GM OZONE_DATANO&lt;BR /&gt;26653 root 20 0 3085980 407988 10016 S 0,0 0,2 0:08.82 GM DATANODE&lt;BR /&gt;26656 root 20 0 3085980 407988 10016 S 0,0 0,2 0:06.44 GM IMPALAD&lt;BR /&gt;26657 root 20 0 3085980 407988 10016 R 0,0 0,2 21:01.25 ImpalaDaemonQue&lt;/P&gt;&lt;P&gt;What are the next steps identifying the root cause of this issue ?&lt;/P&gt;&lt;P&gt;(CDP 7.1.6)&lt;/P&gt;&lt;P&gt;Thanks in advance for your help.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Tue, 21 Apr 2026 06:29:54 GMT</pubDate>
    <dc:creator>OlivierT</dc:creator>
    <dc:date>2026-04-21T06:29:54Z</dc:date>
    <item>
      <title>Agent Heartbeat problems on datanodes</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Agent-Heartbeat-problems-on-datanodes/m-p/388927#M246823</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;since a few weeks, we have regular warnings on our datanodes :&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;"This role's host has been out of contact with Cloudera Manager for a concerning amount of time."&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;cm-agent is consuming alot of CPU , in particular&amp;nbsp;MonitorDaemon-R (cm-agent PID is 25598):&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;top -H -b -n1 -p 25598&lt;BR /&gt;top - 14:08:21 up 22 days, 6:17, 1 user, load average: 5,10, 5,07, 4,98&lt;BR /&gt;Threads: 35 total, 2 running, 33 sleeping, 0 stopped, 0 zombie&lt;BR /&gt;%Cpu(s): 20,1 us, 21,5 sy, 0,0 ni, 56,5 id, 0,0 wa, 0,0 hi, 1,9 si, 0,0 st&lt;BR /&gt;KiB Mem : 26370784+total, 7788424 free, 38057140 used, 21786227+buff/cache&lt;BR /&gt;KiB Swap: 31457276 total, 31349500 free, 107776 used. 22445827+avail Mem&lt;/P&gt;&lt;P&gt;PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND&lt;BR /&gt;25833 root 20 0 3085980 407988 10016 R 87,5 0,2 124:55.80 MonitorDaemon-R&lt;BR /&gt;25598 root 20 0 3085980 407988 10016 S 0,0 0,2 28:07.22 cmagent&lt;BR /&gt;25748 root 20 0 3085980 407988 10016 S 0,0 0,2 2:49.93 cmagent&lt;BR /&gt;25754 root 20 0 3085980 407988 10016 S 0,0 0,2 0:14.20 Audit-Plugin&lt;BR /&gt;25755 root 20 0 3085980 407988 10016 S 0,0 0,2 0:13.98 Metadata-Plugin&lt;BR /&gt;25756 root 20 0 3085980 407988 10016 S 0,0 0,2 0:14.35 Profile-Plugin&lt;BR /&gt;25800 root 20 0 3085980 407988 10016 S 0,0 0,2 0:00.15 _TimeoutMonitor&lt;BR /&gt;25801 root 20 0 3085980 407988 10016 S 0,0 0,2 0:10.39 HTTPServer _sta&lt;BR /&gt;25802 root 20 0 3085980 407988 10016 S 0,0 0,2 0:00.00 CP Server Worke&lt;BR /&gt;25803 root 20 0 3085980 407988 10016 S 0,0 0,2 0:03.12 CP Server Worke&lt;BR /&gt;25804 root 20 0 3085980 407988 10016 S 0,0 0,2 0:04.83 CP Server Worke&lt;BR /&gt;25805 root 20 0 3085980 407988 10016 S 0,0 0,2 0:00.03 CP Server Worke&lt;BR /&gt;25806 root 20 0 3085980 407988 10016 S 0,0 0,2 0:00.01 CP Server Worke&lt;BR /&gt;25807 root 20 0 3085980 407988 10016 S 0,0 0,2 0:00.00 CP Server Worke&lt;BR /&gt;25808 root 20 0 3085980 407988 10016 S 0,0 0,2 0:00.00 CP Server Worke&lt;BR /&gt;25809 root 20 0 3085980 407988 10016 S 0,0 0,2 0:00.00 CP Server Worke&lt;BR /&gt;25810 root 20 0 3085980 407988 10016 S 0,0 0,2 0:00.00 CP Server Worke&lt;BR /&gt;25811 root 20 0 3085980 407988 10016 S 0,0 0,2 0:00.00 CP Server Worke&lt;BR /&gt;25831 root 20 0 3085980 407988 10016 S 0,0 0,2 0:09.61 Monitor-HostMon&lt;BR /&gt;25832 root 20 0 3085980 407988 10016 S 0,0 0,2 0:27.20 DnsResolutionMo&lt;BR /&gt;25834 root 20 0 3085980 407988 10016 S 0,0 0,2 2:49.89 MonitorDaemon-S&lt;BR /&gt;26279 root 20 0 3085980 407988 10016 S 0,0 0,2 3:02.54 WorkerThread&lt;BR /&gt;26629 root 20 0 3085980 407988 10016 S 0,0 0,2 0:00.36 __run_queue&lt;BR /&gt;26630 root 20 0 3085980 407988 10016 S 0,0 0,2 0:00.39 __run_queue&lt;BR /&gt;26631 root 20 0 3085980 407988 10016 S 0,0 0,2 0:00.46 __run_queue&lt;BR /&gt;26632 root 20 0 3085980 407988 10016 S 0,0 0,2 0:00.80 __run_queue&lt;BR /&gt;26637 root 20 0 3085980 407988 10016 S 0,0 0,2 4:24.79 GM KUDU_TSERVER&lt;BR /&gt;26639 root 20 0 3085980 407988 10016 S 0,0 0,2 0:02.74 Monitor-SolrSer&lt;BR /&gt;26641 root 20 0 3085980 407988 10016 S 0,0 0,2 0:24.39 GM KAFKA_BROKER&lt;BR /&gt;26647 root 20 0 3085980 407988 10016 S 0,0 0,2 0:02.14 GM REGIONSERVER&lt;BR /&gt;26649 root 20 0 3085980 407988 10016 S 0,0 0,2 0:08.91 GM NODEMANAGER&lt;BR /&gt;26651 root 20 0 3085980 407988 10016 S 0,0 0,2 0:00.31 GM OZONE_DATANO&lt;BR /&gt;26653 root 20 0 3085980 407988 10016 S 0,0 0,2 0:08.82 GM DATANODE&lt;BR /&gt;26656 root 20 0 3085980 407988 10016 S 0,0 0,2 0:06.44 GM IMPALAD&lt;BR /&gt;26657 root 20 0 3085980 407988 10016 R 0,0 0,2 21:01.25 ImpalaDaemonQue&lt;/P&gt;&lt;P&gt;What are the next steps identifying the root cause of this issue ?&lt;/P&gt;&lt;P&gt;(CDP 7.1.6)&lt;/P&gt;&lt;P&gt;Thanks in advance for your help.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 21 Apr 2026 06:29:54 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Agent-Heartbeat-problems-on-datanodes/m-p/388927#M246823</guid>
      <dc:creator>OlivierT</dc:creator>
      <dc:date>2026-04-21T06:29:54Z</dc:date>
    </item>
    <item>
      <title>Re: Agent Heartbeat problems on datanodes</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Agent-Heartbeat-problems-on-datanodes/m-p/389315#M246950</link>
      <description>&lt;P&gt;Hello&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/29384"&gt;@OlivierT&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thank you for reaching out&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Can you please share the output of the below to see what is creating this process?&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;# ps -ef | grep -i 25833&lt;/LI-CODE&gt;</description>
      <pubDate>Tue, 18 Jun 2024 04:13:38 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Agent-Heartbeat-problems-on-datanodes/m-p/389315#M246950</guid>
      <dc:creator>upadhyayk04</dc:creator>
      <dc:date>2024-06-18T04:13:38Z</dc:date>
    </item>
    <item>
      <title>Re: Agent Heartbeat problems on datanodes</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Agent-Heartbeat-problems-on-datanodes/m-p/389632#M247024</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;sorry for the late answer, I was off for a few days.&lt;/P&gt;&lt;P&gt;ps -ef | grep -i 25833 returnnothing&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 26 Jun 2024 14:24:59 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Agent-Heartbeat-problems-on-datanodes/m-p/389632#M247024</guid>
      <dc:creator>OlivierT</dc:creator>
      <dc:date>2024-06-26T14:24:59Z</dc:date>
    </item>
  </channel>
</rss>

