<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Ambari-agent heartbeat lost in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Ambari-agent-heartbeat-lost/m-p/237061#M198874</link>
    <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;My ambari agent has lost its heartbeat. I tried restarting the agent/server starting stopping etc but nothing worked.&lt;/P&gt;&lt;P&gt;Im really desperate and dont know what to do. Any help would be greatly appreciated. I included the ambari-agent logs below.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;ERROR 2019-08-02 10:51:04,424 script_alert.py:123 - [Alert][hive_webhcat_server_status] Failed with result CRITICAL: ['Connection failed to &lt;A href="http://&amp;lt;My" target="_blank"&gt;http://&amp;lt;My&lt;/A&gt; server id was here&amp;gt;/templeton/v1/status?user.name=ambari-qa + \nTraceback (most recent call last):\n File "/var/lib/ambari-agent/cache/common-services/HIVE/0.12.0.2.0/package/alerts/alert_webhcat_server.py", line 190, in execute\n url_response = urllib2.urlopen(query_url, timeout=connection_timeout)\n File "/usr/lib64/python2.7/urllib2.py", line 154, in urlopen\n return opener.open(url, data, timeout)\n File "/usr/lib64/python2.7/urllib2.py", line 431, in open\n response = self._open(req, data)\n File "/usr/lib64/python2.7/urllib2.py", line 449, in _open\n \'_open\', req)\n File "/usr/lib64/python2.7/urllib2.py", line 409, in _call_chain\n result = func(*args)\n File "/usr/lib64/python2.7/urllib2.py", line 1244, in http_open\n return self.do_open(httplib.HTTPConnection, req)\n File "/usr/lib64/python2.7/urllib2.py", line 1214, in do_open\n raise URLError(err)\nURLError: &amp;lt;urlopen error [Errno 111] Connection refused&amp;gt;\n']&lt;/P&gt;&lt;P&gt;INFO 2019-08-02 10:51:04,509 logger.py:75 - Execute['! beeline -u 'jdbc:hive2://&amp;lt;My server id was here&amp;gt;:10000/;transportMode=binary;auth=noSasl' -e '' 2&amp;gt;&amp;amp;1| awk '{print}'|grep -i -e 'Connection refused' -e 'Invalid URL''] {'path': ['/bin/', '/usr/bin/', '/usr/lib/hive/bin/', '/usr/sbin/'], 'user': 'ambari-qa', 'timeout': 60}&lt;/P&gt;&lt;P&gt;Result CRITICAL: ['Connection failed on host &amp;lt;My server id was here&amp;gt;:10000 (Traceback (most recent call last):\n File "/var/lib/ambari-agent/cache/common-services/HIVE/0.12.0.2.0/package/alerts/alert_hive_thrift_port.py", line 211, in execute\n ldap_password=ldap_password)\n File "/usr/lib/python2.6/site-packages/resource_management/libraries/functions/hive_check.py", line 79, in check_thrift_port_sasl\n timeout=check_command_timeout)\n File "/usr/lib/python2.6/site-packages/resource_management/core/base.py", line 155, in __init__\n self.env.run()\n File "/usr/lib/python2.6/site-packages/resource_management/core/environment.py", line 160, in run\n self.run_action(resource, action)\n File "/usr/lib/python2.6/site-packages/resource_management/core/environment.py", line 124, in run_action\n provider_action()\n File "/usr/lib/python2.6/site-packages/resource_management/core/providers/system.py", line 262, in action_run\n tries=self.resource.tries, try_sleep=self.resource.try_sleep)\n File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line 72, in inner\n result = function(command, **kwargs)\n File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line 102, in checked_call\n tries=tries, try_sleep=try_sleep, timeout_kill_strategy=timeout_kill_strategy)\n File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line 150, in _call_wrapper\n result = _call(command, **kwargs_copy)\n File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line 303, in _call\n raise ExecutionFailed(err_msg, code, out, err)\nExecutionFailed: Execution of \'! beeline -u \'jdbc:hive2://&amp;lt;My server id was here&amp;gt;:10000/;transportMode=binary;auth=noSasl\' -e \'\' 2&amp;gt;&amp;amp;1| awk \'{print}\'|grep -i -e \'Connection refused\' -e \'Invalid URL\'\' returned 1. Error: Could not open client transport with JDBC Uri: jdbc:hive2://&amp;lt;My server id was here&amp;gt;:10000/;transportMode=binary;auth=noSasl: java.net.ConnectException: Connection refused (Connection refused) (state=08S01,code=0)\nError: Could not open client transport with JDBC Uri: jdbc:hive2://&amp;lt;My server id was here&amp;gt;:10000/;transportMode=binary;auth=noSasl: java.net.ConnectException: Connection refused (Connection refused) (state=08S01,code=0)\n)']&lt;/P&gt;&lt;P&gt;INFO 2019-08-02 10:51:38,110 Controller.py:304 - Heartbeat (response id = 760) with server is running...&lt;/P&gt;&lt;P&gt;INFO 2019-08-02 10:51:38,111 Controller.py:311 - Building heartbeat message&lt;/P&gt;&lt;P&gt;INFO 2019-08-02 10:51:38,113 Heartbeat.py:90 - Adding host info/state to heartbeat message.&lt;/P&gt;&lt;P&gt;INFO 2019-08-02 10:51:38,197 logger.py:75 - Testing the JVM's JCE policy to see it if supports an unlimited key length.&lt;/P&gt;&lt;P&gt;INFO 2019-08-02 10:51:38,197 logger.py:75 - Testing the JVM's JCE policy to see it if supports an unlimited key length.&lt;/P&gt;&lt;P&gt;INFO 2019-08-02 10:51:38,455 Hardware.py:176 - Some mount points were ignored: /, /dev, /dev/shm, /run, /sys/fs/cgroup, /grid0, /run/user/1000&lt;/P&gt;&lt;P&gt;INFO 2019-08-02 10:51:38,459 Controller.py:320 - Sending Heartbeat (id = 760)&lt;/P&gt;&lt;P&gt;INFO 2019-08-02 10:51:38,464 Controller.py:332 - Heartbeat response received (id = 761)&lt;/P&gt;&lt;P&gt;INFO 2019-08-02 10:51:38,464 Controller.py:341 - Heartbeat interval is 10 seconds&lt;/P&gt;&lt;P&gt;INFO 2019-08-02 10:51:38,464 Controller.py:377 - Updating configurations from heartbeat&lt;/P&gt;&lt;P&gt;INFO 2019-08-02 10:51:38,465 Controller.py:386 - Adding cancel/execution commands&lt;/P&gt;&lt;P&gt;INFO 2019-08-02 10:51:38,465 Controller.py:403 - Adding recovery commands&lt;/P&gt;&lt;P&gt;INFO 2019-08-02 10:51:38,465 Controller.py:471 - Waiting 9.9 for next heartbeat&lt;/P&gt;&lt;P&gt;INFO 2019-08-02 10:51:48,365 Controller.py:478 - Wait for next heartbeat over&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;</description>
    <pubDate>Fri, 02 Aug 2019 21:25:46 GMT</pubDate>
    <dc:creator>matas_mockus</dc:creator>
    <dc:date>2019-08-02T21:25:46Z</dc:date>
    <item>
      <title>Ambari-agent heartbeat lost</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Ambari-agent-heartbeat-lost/m-p/237061#M198874</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;My ambari agent has lost its heartbeat. I tried restarting the agent/server starting stopping etc but nothing worked.&lt;/P&gt;&lt;P&gt;Im really desperate and dont know what to do. Any help would be greatly appreciated. I included the ambari-agent logs below.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;ERROR 2019-08-02 10:51:04,424 script_alert.py:123 - [Alert][hive_webhcat_server_status] Failed with result CRITICAL: ['Connection failed to &lt;A href="http://&amp;lt;My" target="_blank"&gt;http://&amp;lt;My&lt;/A&gt; server id was here&amp;gt;/templeton/v1/status?user.name=ambari-qa + \nTraceback (most recent call last):\n File "/var/lib/ambari-agent/cache/common-services/HIVE/0.12.0.2.0/package/alerts/alert_webhcat_server.py", line 190, in execute\n url_response = urllib2.urlopen(query_url, timeout=connection_timeout)\n File "/usr/lib64/python2.7/urllib2.py", line 154, in urlopen\n return opener.open(url, data, timeout)\n File "/usr/lib64/python2.7/urllib2.py", line 431, in open\n response = self._open(req, data)\n File "/usr/lib64/python2.7/urllib2.py", line 449, in _open\n \'_open\', req)\n File "/usr/lib64/python2.7/urllib2.py", line 409, in _call_chain\n result = func(*args)\n File "/usr/lib64/python2.7/urllib2.py", line 1244, in http_open\n return self.do_open(httplib.HTTPConnection, req)\n File "/usr/lib64/python2.7/urllib2.py", line 1214, in do_open\n raise URLError(err)\nURLError: &amp;lt;urlopen error [Errno 111] Connection refused&amp;gt;\n']&lt;/P&gt;&lt;P&gt;INFO 2019-08-02 10:51:04,509 logger.py:75 - Execute['! beeline -u 'jdbc:hive2://&amp;lt;My server id was here&amp;gt;:10000/;transportMode=binary;auth=noSasl' -e '' 2&amp;gt;&amp;amp;1| awk '{print}'|grep -i -e 'Connection refused' -e 'Invalid URL''] {'path': ['/bin/', '/usr/bin/', '/usr/lib/hive/bin/', '/usr/sbin/'], 'user': 'ambari-qa', 'timeout': 60}&lt;/P&gt;&lt;P&gt;Result CRITICAL: ['Connection failed on host &amp;lt;My server id was here&amp;gt;:10000 (Traceback (most recent call last):\n File "/var/lib/ambari-agent/cache/common-services/HIVE/0.12.0.2.0/package/alerts/alert_hive_thrift_port.py", line 211, in execute\n ldap_password=ldap_password)\n File "/usr/lib/python2.6/site-packages/resource_management/libraries/functions/hive_check.py", line 79, in check_thrift_port_sasl\n timeout=check_command_timeout)\n File "/usr/lib/python2.6/site-packages/resource_management/core/base.py", line 155, in __init__\n self.env.run()\n File "/usr/lib/python2.6/site-packages/resource_management/core/environment.py", line 160, in run\n self.run_action(resource, action)\n File "/usr/lib/python2.6/site-packages/resource_management/core/environment.py", line 124, in run_action\n provider_action()\n File "/usr/lib/python2.6/site-packages/resource_management/core/providers/system.py", line 262, in action_run\n tries=self.resource.tries, try_sleep=self.resource.try_sleep)\n File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line 72, in inner\n result = function(command, **kwargs)\n File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line 102, in checked_call\n tries=tries, try_sleep=try_sleep, timeout_kill_strategy=timeout_kill_strategy)\n File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line 150, in _call_wrapper\n result = _call(command, **kwargs_copy)\n File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line 303, in _call\n raise ExecutionFailed(err_msg, code, out, err)\nExecutionFailed: Execution of \'! beeline -u \'jdbc:hive2://&amp;lt;My server id was here&amp;gt;:10000/;transportMode=binary;auth=noSasl\' -e \'\' 2&amp;gt;&amp;amp;1| awk \'{print}\'|grep -i -e \'Connection refused\' -e \'Invalid URL\'\' returned 1. Error: Could not open client transport with JDBC Uri: jdbc:hive2://&amp;lt;My server id was here&amp;gt;:10000/;transportMode=binary;auth=noSasl: java.net.ConnectException: Connection refused (Connection refused) (state=08S01,code=0)\nError: Could not open client transport with JDBC Uri: jdbc:hive2://&amp;lt;My server id was here&amp;gt;:10000/;transportMode=binary;auth=noSasl: java.net.ConnectException: Connection refused (Connection refused) (state=08S01,code=0)\n)']&lt;/P&gt;&lt;P&gt;INFO 2019-08-02 10:51:38,110 Controller.py:304 - Heartbeat (response id = 760) with server is running...&lt;/P&gt;&lt;P&gt;INFO 2019-08-02 10:51:38,111 Controller.py:311 - Building heartbeat message&lt;/P&gt;&lt;P&gt;INFO 2019-08-02 10:51:38,113 Heartbeat.py:90 - Adding host info/state to heartbeat message.&lt;/P&gt;&lt;P&gt;INFO 2019-08-02 10:51:38,197 logger.py:75 - Testing the JVM's JCE policy to see it if supports an unlimited key length.&lt;/P&gt;&lt;P&gt;INFO 2019-08-02 10:51:38,197 logger.py:75 - Testing the JVM's JCE policy to see it if supports an unlimited key length.&lt;/P&gt;&lt;P&gt;INFO 2019-08-02 10:51:38,455 Hardware.py:176 - Some mount points were ignored: /, /dev, /dev/shm, /run, /sys/fs/cgroup, /grid0, /run/user/1000&lt;/P&gt;&lt;P&gt;INFO 2019-08-02 10:51:38,459 Controller.py:320 - Sending Heartbeat (id = 760)&lt;/P&gt;&lt;P&gt;INFO 2019-08-02 10:51:38,464 Controller.py:332 - Heartbeat response received (id = 761)&lt;/P&gt;&lt;P&gt;INFO 2019-08-02 10:51:38,464 Controller.py:341 - Heartbeat interval is 10 seconds&lt;/P&gt;&lt;P&gt;INFO 2019-08-02 10:51:38,464 Controller.py:377 - Updating configurations from heartbeat&lt;/P&gt;&lt;P&gt;INFO 2019-08-02 10:51:38,465 Controller.py:386 - Adding cancel/execution commands&lt;/P&gt;&lt;P&gt;INFO 2019-08-02 10:51:38,465 Controller.py:403 - Adding recovery commands&lt;/P&gt;&lt;P&gt;INFO 2019-08-02 10:51:38,465 Controller.py:471 - Waiting 9.9 for next heartbeat&lt;/P&gt;&lt;P&gt;INFO 2019-08-02 10:51:48,365 Controller.py:478 - Wait for next heartbeat over&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 02 Aug 2019 21:25:46 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Ambari-agent-heartbeat-lost/m-p/237061#M198874</guid>
      <dc:creator>matas_mockus</dc:creator>
      <dc:date>2019-08-02T21:25:46Z</dc:date>
    </item>
    <item>
      <title>Re: Ambari-agent heartbeat lost</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Ambari-agent-heartbeat-lost/m-p/237062#M198875</link>
      <description>&lt;P&gt;&lt;A rel="noopener noreferrer noopener noreferrer" href="http://@Matas%20Mockus" target="_blank"&gt;&lt;EM&gt;@Matas Mockus&lt;/EM&gt;&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;This is a duplicate posting  I responded to the initial thread.&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;&lt;A rel="noopener noreferrer noopener noreferrer" href="http://community.hortonworks.com/answers/249938/view.html" target="_blank"&gt;http://community.hortonworks.com/answers/249938/view.html&lt;/A&gt;&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;Did you check the response please either merge or delete this post as it will be difficult to follow the 2 threads!&lt;/EM&gt;&lt;/P&gt;</description>
      <pubDate>Sat, 03 Aug 2019 05:18:38 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Ambari-agent-heartbeat-lost/m-p/237062#M198875</guid>
      <dc:creator>Shelton</dc:creator>
      <dc:date>2019-08-03T05:18:38Z</dc:date>
    </item>
  </channel>
</rss>

