<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Getting Heartbeat lost with Exception in Ambari-2.4.2 in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Getting-Heartbeat-lost-with-Exception-in-Ambari-2-4-2/m-p/190206#M65288</link>
    <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/3418/jsensharma.html" nodeid="3418"&gt;@Jay SenSharma&lt;/A&gt;&lt;/P&gt;&lt;P&gt;I even tried setting this to 300, but no luck.&lt;/P&gt;&lt;P&gt;I will run ambari-agent in debug mode and check the stack.&lt;/P&gt;</description>
    <pubDate>Fri, 21 Jul 2017 18:55:58 GMT</pubDate>
    <dc:creator>nshelke</dc:creator>
    <dc:date>2017-07-21T18:55:58Z</dc:date>
    <item>
      <title>Getting Heartbeat lost with Exception in Ambari-2.4.2</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Getting-Heartbeat-lost-with-Exception-in-Ambari-2-4-2/m-p/190204#M65286</link>
      <description>&lt;P&gt;I am getting the following traceback in ambari-agent.log:&lt;/P&gt;&lt;PRE&gt;Traceback (most recent call last):
  File "/usr/lib/python2.6/site-packages/ambari_agent/Controller.py", line 165, in registerWithServer
    ret = self.sendRequest(self.registerUrl, data)
  File "/usr/lib/python2.6/site-packages/ambari_agent/Controller.py", line 496, in sendRequest
    raise IOError('Request to {0} failed due to {1}'.format(url, str(exception)))
IOError: Request to &lt;A href="https://lntpmn01.snapdot.com:8441/agent/v1/register/lntpdn03.snapdot.com" target="_blank"&gt;https://lntpmn01.snapdot.com:8441/agent/v1/register/lntpdn03.snapdot.com&lt;/A&gt; failed due to Error occured during connecting to the server: ('The read operation timed out',)
ERROR 2017-07-19 16:10:19,383 Controller.py:213 - Error:Request to &lt;A href="https://lntpmn01.snapdot.com:8441/agent/v1/register/lntpdn03.snapdot.com" target="_blank"&gt;https://lntpmn01.snapdot.com:8441/agent/v1/register/lntpdn03.snapdot.com&lt;/A&gt; failed due to Error occured during connecting to the server: ('The read operation timed out',)&lt;/PRE&gt;&lt;P&gt;I have tried increasing the timeout in the security.py script to 180 seconds, but the error persists.&lt;/P&gt;&lt;P&gt;Ambari is SSL-enabled. There is no firewall on any of the nodes, and all nodes can ping each other.&lt;/P&gt;&lt;PRE&gt;# telnet &amp;lt;ambari-server&amp;gt; 8441
successful !!!
# openssl s_client -connect &amp;lt;ambari-server&amp;gt;:8441
successful !!!
&lt;/PRE&gt;&lt;P&gt;Please help me out.&lt;/P&gt;</description>
      <pubDate>Fri, 21 Jul 2017 08:12:53 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Getting-Heartbeat-lost-with-Exception-in-Ambari-2-4-2/m-p/190204#M65286</guid>
      <dc:creator>nshelke</dc:creator>
      <dc:date>2017-07-21T08:12:53Z</dc:date>
    </item>
    <item>
      <title>Re: Getting Heartbeat lost with Exception in Ambari-2.4.2</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Getting-Heartbeat-lost-with-Exception-in-Ambari-2-4-2/m-p/190205#M65287</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/17944/nshelke.html" nodeid="17944"&gt;@nshelke&lt;/A&gt;&lt;/P&gt;&lt;P&gt;A similar issue was reported for Ambari 2.4. It was fixed in 2.4.0, but your stack trace looks similar: &lt;A href="https://issues.apache.org/jira/browse/AMBARI-17991" target="_blank"&gt;https://issues.apache.org/jira/browse/AMBARI-17991&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Can you check whether the workaround mentioned in that JIRA works for you? Edit the "/usr/lib/python2.6/site-packages/ambari_agent/security.py" file and increase the timeout to a larger value, such as 360 seconds:&lt;/P&gt;&lt;PRE&gt;def create_connection(self):
    if self.sock:
      self.sock.close()
    logger.info("SSL Connect being called.. connecting to the server")
    sock = socket.create_connection((self.host, self.port), 360)&lt;/PRE&gt;&lt;P&gt;Does it still time out with a value of 360 or more?&lt;/P&gt;&lt;P&gt;Is it happening with all the Ambari agents?&lt;/P&gt;&lt;P&gt;Have you tried running the Ambari agent in debug mode to extract more details?&lt;/P&gt;</description>
      <pubDate>Fri, 21 Jul 2017 08:25:31 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Getting-Heartbeat-lost-with-Exception-in-Ambari-2-4-2/m-p/190205#M65287</guid>
      <dc:creator>jsensharma</dc:creator>
      <dc:date>2017-07-21T08:25:31Z</dc:date>
    </item>
    <item>
      <title>Re: Getting Heartbeat lost with Exception in Ambari-2.4.2</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Getting-Heartbeat-lost-with-Exception-in-Ambari-2-4-2/m-p/190206#M65288</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/3418/jsensharma.html" nodeid="3418"&gt;@Jay SenSharma&lt;/A&gt;&lt;/P&gt;&lt;P&gt;I even tried setting this to 300, but no luck.&lt;/P&gt;&lt;P&gt;I will run ambari-agent in debug mode and check the stack.&lt;/P&gt;</description>
      <pubDate>Fri, 21 Jul 2017 18:55:58 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Getting-Heartbeat-lost-with-Exception-in-Ambari-2-4-2/m-p/190206#M65288</guid>
      <dc:creator>nshelke</dc:creator>
      <dc:date>2017-07-21T18:55:58Z</dc:date>
    </item>
  </channel>
</rss>

