<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Heartbeat lost[Ambari-agent] in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Heartbeat-lost-Ambari-agent/m-p/289565#M214346</link>
    <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/70854"&gt;@TR7_BRYLE&lt;/a&gt;&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;The error is actually due to timeout (and not because of port access)&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;SSLError('The read operation timed out',)&lt;/LI-CODE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Above error indicates that communication further like reading a response is timing out. So we will have to first check why the "&lt;EM&gt;&lt;STRONG&gt;https&lt;/STRONG&gt;&lt;/EM&gt;" request is being timed out.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;We can try using the following kind of simple Python script to simulate what agent actually tries. Ambari agent is a python utility which tries to connect to ambari server a d tries to register itself and sends heartbeat messages to ambari server.&lt;/P&gt;&lt;P&gt;So we can test the following script from the agent host to see if it is able to connect or if that is also getting timed out. We are using 'httplib' to test the access and Https communication.&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;# cat /tmp/SSL/ssl_test.py

import httplib
import ssl
if __name__ == "__main__":
     ca_connection = httplib.HTTPSConnection('kerlatest1.example.com:8440', timeout=5, context=ssl._create_unverified_context())
     ca_connection.request("GET", '/connection_info')
     response = ca_connection.getresponse()
     print response.status
     data = response.read()
     print str(data)&lt;/LI-CODE&gt;&lt;P&gt;&lt;BR /&gt;Run it like following:&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;# export PYTHONPATH=/usr/lib/ambari-agent/lib:/usr/lib/ambari-agent/lib/ambari_agent:$PYTHONPATH
# python /tmp/SSL/ssl_test.py&lt;/LI-CODE&gt;&lt;P&gt;If above works fine and it returns 200 and returns result like following:&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;# python /tmp/SSL/ssl_test.py 
200
{"security.server.two_way_ssl":"false"}&lt;/LI-CODE&gt;&lt;P&gt;&lt;BR /&gt;If you notice any HTTPS communitation or certificat related error then you might want to refer to the following article and according to your Ambari version please check if you have following defined in your ambari-agent.ini file "[security]" section?&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;[security]
force_https_protocol=PROTOCOL_TLSv1_2&lt;/LI-CODE&gt;&lt;P&gt;&lt;BR /&gt;&lt;BR /&gt;- If you still face any issue then can you please share the "ambari-agent.log" freshly after restarting it ?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Reference Article:&lt;/STRONG&gt;&lt;BR /&gt;Java/Python Updates and Ambari Agent TLS Settings&lt;BR /&gt;&lt;A href="https://community.cloudera.com/t5/Community-Articles/Java-Python-Updates-and-Ambari-Agent-TLS-Settings/ta-p/248328" target="_blank"&gt;https://community.cloudera.com/t5/Community-Articles/Java-Python-Updates-and-Ambari-Agent-TLS-Settings/ta-p/248328&lt;/A&gt;&lt;/P&gt;&lt;P&gt;.&lt;/P&gt;&lt;P&gt;.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Wed, 12 Feb 2020 01:40:05 GMT</pubDate>
    <dc:creator>jsensharma</dc:creator>
    <dc:date>2020-02-12T01:40:05Z</dc:date>
    <item>
      <title>Heartbeat lost[Ambari-agent]</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Heartbeat-lost-Ambari-agent/m-p/289531#M214322</link>
      <description>&lt;P&gt;Hi,&lt;BR /&gt;Newbie here. Suddenly one of the nodes lost the heartbeat. Tried to restart ambari-agent and ambari-server. However, the error still persists. Here is the ambari-agent log.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;WARNING 2020-02-11 15:24:08,318 base_alert.py:138 - [Alert][ranger_admin_password_check] Unable to execute alert. argument of type 'NoneType' is not iterable&lt;BR /&gt;INFO 2020-02-11 15:24:14,721 security.py:141 - Encountered communication error. Details: SSLError('The read operation timed out',)&lt;BR /&gt;ERROR 2020-02-11 15:24:14,721 Controller.py:226 - Unable to connect to: &lt;A href="https://master1.com:8441/agent/v1/register/master2.com" target="_blank" rel="noopener"&gt;https://xxx1:8441/agent/v1/register/xxx2.com&lt;/A&gt;&lt;BR /&gt;Traceback (most recent call last):&lt;BR /&gt;File "/usr/lib/ambari-agent/lib/ambari_agent/Controller.py", line 175, in registerWithServer&lt;BR /&gt;ret = self.sendRequest(self.registerUrl, data)&lt;BR /&gt;File "/usr/lib/ambari-agent/lib/ambari_agent/Controller.py", line 549, in sendRequest&lt;BR /&gt;raise IOError('Request to {0} failed due to {1}'.format(url, str(exception)))&lt;BR /&gt;IOError: Request to &lt;A href="https://master1.com:8441/agent/v1/register/master2.com" target="_blank" rel="noopener"&gt;https://xxx1.com:8441/agent/v1/register/xxx2.com&lt;/A&gt; failed due to Error occured during connecting to the server: ('The read operation timed out',)&lt;BR /&gt;ERROR 2020-02-11 15:24:14,721 Controller.py:227 - Error:Request to &lt;A href="https://master1.com:8441/agent/v1/register/master2.com" target="_blank" rel="noopener"&gt;https://xxx1.com:8441/agent/v1/register/xxx2.com&lt;/A&gt; failed due to Error occurred during connecting to the server: ('The read operation timed out',)&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Note: Able to telnet manually port 8440 and 8441. All ports are listening also.&lt;/P&gt;
&lt;P&gt;Thanks in advance.&lt;/P&gt;</description>
      <pubDate>Tue, 11 Feb 2020 20:27:05 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Heartbeat-lost-Ambari-agent/m-p/289531#M214322</guid>
      <dc:creator>TR7_BRYLE</dc:creator>
      <dc:date>2020-02-11T20:27:05Z</dc:date>
    </item>
    <item>
      <title>Re: Heartbeat lost[Ambari-agent]</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Heartbeat-lost-Ambari-agent/m-p/289560#M214345</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/70854"&gt;@TR7_BRYLE&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;What is your Ambari version?&amp;nbsp;You may want to check this knowledge article:&lt;/P&gt;
&lt;P&gt;&lt;A href="https://my.cloudera.com/knowledge/ERROR-quot-Request-to-https-AMBARI-SERVER-8441-agent-v1?id=273271" target="_self"&gt;https://my.cloudera.com/knowledge/ERROR-quot-Request-to-https-AMBARI-SERVER-8441-agent-v1?id=273271&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;In case you can not access above, here are some details:&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Cause&lt;/STRONG&gt;:&lt;/P&gt;
&lt;P&gt;This issue occurs when ethernet card or the switch does not support Jumbo frame, but the Jumbo frame (MTUSIZE=9000) is set in the network configuration.&lt;/P&gt;
&lt;P&gt;To verify if the Jumbo frame is enabled, check the content of network interface configuration by running the following:&lt;/P&gt;
&lt;LI-CODE lang="markup"&gt;cat /etc/sysconfig/network-scripts/ifcfg-eth#&lt;/LI-CODE&gt;
&lt;P&gt;&lt;BR /&gt;The Jumbo frame is enabled, if the following content (in bold) is displayed: &lt;/P&gt;
&lt;LI-CODE lang="markup"&gt;TYPE=Ethernet 
DEVICE=eth0 
ONBOOT=yes 
BOOTPROTO=static 
IPADDR=xxx.xxx.xxx.xxx
NETMASK=xxx.xxx.xxx.xxx
MTUSIZE=9000&lt;/LI-CODE&gt;
&lt;P&gt;&lt;BR /&gt;&lt;STRONG&gt;Instructions:&lt;/STRONG&gt;&lt;BR /&gt;To resolve this issue, do the following for each node with the issue:&lt;/P&gt;
&lt;P&gt;1. From /etc/sysconfig/network-scripts/ifcfg-eth#1, remove the following:&lt;/P&gt;
&lt;LI-CODE lang="markup"&gt;MTUSIZE=9000&lt;/LI-CODE&gt;
&lt;P&gt;&lt;BR /&gt;2. Restart the network:&lt;/P&gt;
&lt;LI-CODE lang="markup"&gt;/etc/initd/network restart&lt;/LI-CODE&gt;
&lt;P&gt;&lt;BR /&gt;3. Restart the ambari-agent: &lt;/P&gt;
&lt;LI-CODE lang="markup"&gt;ambari-agent restart&lt;/LI-CODE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks and hope this helps!&lt;/P&gt;</description>
      <pubDate>Tue, 11 Feb 2020 23:14:08 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Heartbeat-lost-Ambari-agent/m-p/289560#M214345</guid>
      <dc:creator>lwang</dc:creator>
      <dc:date>2020-02-11T23:14:08Z</dc:date>
    </item>
    <item>
      <title>Re: Heartbeat lost[Ambari-agent]</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Heartbeat-lost-Ambari-agent/m-p/289565#M214346</link>
      <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/70854"&gt;@TR7_BRYLE&lt;/a&gt;&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;The error is actually due to timeout (and not because of port access)&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;SSLError('The read operation timed out',)&lt;/LI-CODE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Above error indicates that communication further like reading a response is timing out. So we will have to first check why the "&lt;EM&gt;&lt;STRONG&gt;https&lt;/STRONG&gt;&lt;/EM&gt;" request is being timed out.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;We can try using the following kind of simple Python script to simulate what agent actually tries. Ambari agent is a python utility which tries to connect to ambari server a d tries to register itself and sends heartbeat messages to ambari server.&lt;/P&gt;&lt;P&gt;So we can test the following script from the agent host to see if it is able to connect or if that is also getting timed out. We are using 'httplib' to test the access and Https communication.&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;# cat /tmp/SSL/ssl_test.py

import httplib
import ssl
if __name__ == "__main__":
     ca_connection = httplib.HTTPSConnection('kerlatest1.example.com:8440', timeout=5, context=ssl._create_unverified_context())
     ca_connection.request("GET", '/connection_info')
     response = ca_connection.getresponse()
     print response.status
     data = response.read()
     print str(data)&lt;/LI-CODE&gt;&lt;P&gt;&lt;BR /&gt;Run it like following:&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;# export PYTHONPATH=/usr/lib/ambari-agent/lib:/usr/lib/ambari-agent/lib/ambari_agent:$PYTHONPATH
# python /tmp/SSL/ssl_test.py&lt;/LI-CODE&gt;&lt;P&gt;If above works fine and it returns 200 and returns result like following:&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;# python /tmp/SSL/ssl_test.py 
200
{"security.server.two_way_ssl":"false"}&lt;/LI-CODE&gt;&lt;P&gt;&lt;BR /&gt;If you notice any HTTPS communitation or certificat related error then you might want to refer to the following article and according to your Ambari version please check if you have following defined in your ambari-agent.ini file "[security]" section?&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;[security]
force_https_protocol=PROTOCOL_TLSv1_2&lt;/LI-CODE&gt;&lt;P&gt;&lt;BR /&gt;&lt;BR /&gt;- If you still face any issue then can you please share the "ambari-agent.log" freshly after restarting it ?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Reference Article:&lt;/STRONG&gt;&lt;BR /&gt;Java/Python Updates and Ambari Agent TLS Settings&lt;BR /&gt;&lt;A href="https://community.cloudera.com/t5/Community-Articles/Java-Python-Updates-and-Ambari-Agent-TLS-Settings/ta-p/248328" target="_blank"&gt;https://community.cloudera.com/t5/Community-Articles/Java-Python-Updates-and-Ambari-Agent-TLS-Settings/ta-p/248328&lt;/A&gt;&lt;/P&gt;&lt;P&gt;.&lt;/P&gt;&lt;P&gt;.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 12 Feb 2020 01:40:05 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Heartbeat-lost-Ambari-agent/m-p/289565#M214346</guid>
      <dc:creator>jsensharma</dc:creator>
      <dc:date>2020-02-12T01:40:05Z</dc:date>
    </item>
    <item>
      <title>Re: Heartbeat lost[Ambari-agent]</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Heartbeat-lost-Ambari-agent/m-p/289569#M214350</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/5016"&gt;@lwang&lt;/a&gt;&amp;nbsp;.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thank you for this suggestion. I will perform this troubleshooting.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;</description>
      <pubDate>Wed, 12 Feb 2020 03:04:18 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Heartbeat-lost-Ambari-agent/m-p/289569#M214350</guid>
      <dc:creator>TR7_BRYLE</dc:creator>
      <dc:date>2020-02-12T03:04:18Z</dc:date>
    </item>
    <item>
      <title>Re: Heartbeat lost[Ambari-agent]</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Heartbeat-lost-Ambari-agent/m-p/289572#M214351</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/50614"&gt;@jsensharma&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thank you for this suggestion. I will have it try and test the python script.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;</description>
      <pubDate>Wed, 12 Feb 2020 03:08:06 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Heartbeat-lost-Ambari-agent/m-p/289572#M214351</guid>
      <dc:creator>TR7_BRYLE</dc:creator>
      <dc:date>2020-02-12T03:08:06Z</dc:date>
    </item>
    <item>
      <title>Re: Heartbeat lost[Ambari-agent]</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Heartbeat-lost-Ambari-agent/m-p/289573#M214352</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/50614"&gt;@jsensharma&lt;/a&gt;&amp;nbsp;.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Another thing, I have already declared this on my ambari.ini file.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;[security]
force_https_protocol=PROTOCOL_TLSv1_2&lt;/LI-CODE&gt;&lt;P&gt;&amp;nbsp; Thanks.&lt;/P&gt;</description>
      <pubDate>Wed, 12 Feb 2020 03:16:39 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Heartbeat-lost-Ambari-agent/m-p/289573#M214352</guid>
      <dc:creator>TR7_BRYLE</dc:creator>
      <dc:date>2020-02-12T03:16:39Z</dc:date>
    </item>
    <item>
      <title>Re: Heartbeat lost[Ambari-agent]</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Heartbeat-lost-Ambari-agent/m-p/289603#M214364</link>
      <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/70854"&gt;@TR7_BRYLE&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;As requested earlier&amp;nbsp;&lt;BR /&gt;&lt;SPAN&gt;- If you still face any issue then can you please share the "ambari-agent.log" freshly after restarting it ?&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 12 Feb 2020 07:39:26 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Heartbeat-lost-Ambari-agent/m-p/289603#M214364</guid>
      <dc:creator>jsensharma</dc:creator>
      <dc:date>2020-02-12T07:39:26Z</dc:date>
    </item>
    <item>
      <title>Re: Heartbeat lost[Ambari-agent]</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Heartbeat-lost-Ambari-agent/m-p/289671#M214403</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/5016"&gt;@lwang&lt;/a&gt;&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/50614"&gt;@jsensharma&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thank you for the useful information that you've provided.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;After doing some testing. I found&amp;nbsp; out that there an issue with one of the network interfaces on the servers. By testing the jumbo frame connectivity. We remove the defective module and heartbeat lost has been resolved. Thank you for your assistance guys!.&lt;/P&gt;</description>
      <pubDate>Thu, 13 Feb 2020 00:28:13 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Heartbeat-lost-Ambari-agent/m-p/289671#M214403</guid>
      <dc:creator>TR7_BRYLE</dc:creator>
      <dc:date>2020-02-13T00:28:13Z</dc:date>
    </item>
  </channel>
</rss>

