<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: NiFi node disconnection | missable heartbeat window in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/NiFi-node-disconnection-missable-heartbeat-window/m-p/330750#M230741</link>
    <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/93672"&gt;@mt13&lt;/a&gt;&amp;nbsp; Looking at &lt;A href="https://github.com/apache/nifi/blob/rel/nifi-1.12.0/nifi-commons/nifi-properties/src/main/java/org/apache/nifi/util/NiFiProperties.java#L202" target="_self"&gt;this&lt;/A&gt;&lt;/P&gt;&lt;P&gt;It seems like the property&amp;nbsp;&lt;EM&gt;nifi.cluster.protocol.heartbeat.missable.max&amp;nbsp;&lt;/EM&gt;made configurable on NiFi 1.12, before this it is using the default value of 8. Which is the reason you are not able to get the expected timeframe results.&lt;/P&gt;&lt;P&gt;But instead of looking to tune the timeouts to higher values, you should look to tune the dataflow designs. The lack of heartbeat is mostly occurring due to high resource utilization either memory or CPU or network. The choice of processors and incoming data has to be analyzed for a more stable cluster.&lt;/P&gt;&lt;P&gt;In NiFi-1.12 a new feature was added to review the node status history from NIFi UI &amp;gt; Global Menu &amp;gt; Node Status History. It shows the resources utilization in graphical form for each node which would be a great indicator to analyze the load, incoming traffic, heap utilization, etc.&lt;/P&gt;</description>
    <pubDate>Mon, 22 Nov 2021 16:37:19 GMT</pubDate>
    <dc:creator>ashinde</dc:creator>
    <dc:date>2021-11-22T16:37:19Z</dc:date>
    <item>
      <title>NiFi node disconnection | missable heartbeat window</title>
      <link>https://community.cloudera.com/t5/Support-Questions/NiFi-node-disconnection-missable-heartbeat-window/m-p/330494#M230700</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;We are on&amp;nbsp;nifi-1.11.4, on a 3 node cluster, and we are observing frequent node disconnections.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN class="t a"&gt;&lt;SPAN class="t"&gt;..Disconnect&lt;/SPAN&gt;&lt;/SPAN&gt; &lt;SPAN class="t"&gt;Code=Lack&lt;/SPAN&gt; &lt;SPAN class="t"&gt;of&lt;/SPAN&gt; &lt;SPAN class="t"&gt;Heartbeat&lt;/SPAN&gt;&lt;SPAN&gt;, &lt;/SPAN&gt;&lt;SPAN class="t a"&gt;&lt;SPAN class="t"&gt;Disconnect&lt;/SPAN&gt;&lt;/SPAN&gt; &lt;SPAN class="t"&gt;Reason=Have&lt;/SPAN&gt; &lt;SPAN class="t"&gt;not&lt;/SPAN&gt; &lt;SPAN class="t"&gt;received&lt;/SPAN&gt; &lt;SPAN class="t"&gt;a&lt;/SPAN&gt; &lt;SPAN class="t"&gt;heartbeat&lt;/SPAN&gt; &lt;SPAN class="t"&gt;from&lt;/SPAN&gt; &lt;SPAN class="t"&gt;node&lt;/SPAN&gt; &lt;SPAN class="t"&gt;in&lt;/SPAN&gt; &lt;SPAN class="t"&gt;42&lt;/SPAN&gt; &lt;SPAN class="t"&gt;seconds&lt;/SPAN&gt;&lt;SPAN&gt;, &lt;/SPAN&gt;&lt;SPAN class="t"&gt;updateId=62]&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN class="t"&gt;While we investigate the issue for lack of heartbeat, we were trying to increase the disconnection threshold from the default 40 seconds [nifi.cluster.protocol.heartbeat.missable.max=8 X nifi.cluster.protocol.heartbeat.interval=5 sec] to 300 seconds. However, even when we applied the&amp;nbsp;nifi.cluster.protocol.heartbeat.missable.max=60 setting change on all nodes, we still observe the node disconnections occurring after 40 seconds. Any thoughts on why this setting is not getting picked up?&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;nifi.cluster.protocol.heartbeat.interval=5 sec&lt;BR /&gt;nifi.cluster.protocol.heartbeat.missable.max=60&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thank you&lt;/P&gt;&lt;P&gt;-MT13&lt;/P&gt;</description>
      <pubDate>Thu, 18 Nov 2021 17:14:17 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/NiFi-node-disconnection-missable-heartbeat-window/m-p/330494#M230700</guid>
      <dc:creator>mt13</dc:creator>
      <dc:date>2021-11-18T17:14:17Z</dc:date>
    </item>
    <item>
      <title>Re: NiFi node disconnection | missable heartbeat window</title>
      <link>https://community.cloudera.com/t5/Support-Questions/NiFi-node-disconnection-missable-heartbeat-window/m-p/330750#M230741</link>
      <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/93672"&gt;@mt13&lt;/a&gt;&amp;nbsp; Looking at &lt;A href="https://github.com/apache/nifi/blob/rel/nifi-1.12.0/nifi-commons/nifi-properties/src/main/java/org/apache/nifi/util/NiFiProperties.java#L202" target="_self"&gt;this&lt;/A&gt;&lt;/P&gt;&lt;P&gt;It seems like the property&amp;nbsp;&lt;EM&gt;nifi.cluster.protocol.heartbeat.missable.max&amp;nbsp;&lt;/EM&gt;made configurable on NiFi 1.12, before this it is using the default value of 8. Which is the reason you are not able to get the expected timeframe results.&lt;/P&gt;&lt;P&gt;But instead of looking to tune the timeouts to higher values, you should look to tune the dataflow designs. The lack of heartbeat is mostly occurring due to high resource utilization either memory or CPU or network. The choice of processors and incoming data has to be analyzed for a more stable cluster.&lt;/P&gt;&lt;P&gt;In NiFi-1.12 a new feature was added to review the node status history from NIFi UI &amp;gt; Global Menu &amp;gt; Node Status History. It shows the resources utilization in graphical form for each node which would be a great indicator to analyze the load, incoming traffic, heap utilization, etc.&lt;/P&gt;</description>
      <pubDate>Mon, 22 Nov 2021 16:37:19 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/NiFi-node-disconnection-missable-heartbeat-window/m-p/330750#M230741</guid>
      <dc:creator>ashinde</dc:creator>
      <dc:date>2021-11-22T16:37:19Z</dc:date>
    </item>
    <item>
      <title>Re: NiFi node disconnection | missable heartbeat window</title>
      <link>https://community.cloudera.com/t5/Support-Questions/NiFi-node-disconnection-missable-heartbeat-window/m-p/330756#M230746</link>
      <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/60285"&gt;@ashinde&lt;/a&gt;,&amp;nbsp;&lt;STRONG&gt;thank you&lt;/STRONG&gt; very much for the response. Makes sense why that setting is not working for us on 1.11.4. I will have to check when we can be on 1.12 version to use some of the additional features that'd help us with the troubleshooting.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Point taken about finding the actual RC for the heartbeat delay. CPU, and Memory (GC, heap usage) have not been an issue so far during our investigations. We are not really running any high volume processing here, and we are on a cluster with 3 nodes each with 24 cores, and 22 GB heap allocation. Network is one thing we need to check further, but so far basic analysis does not show any delay b/w the nodes or any packet loss.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;For now we have stopped the frequent node disconnections due to missing heartbeats by increasing &lt;SPAN&gt;nifi.cluster.protocol.heartbeat.interval=30 sec, while we investigate the issue further.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Best,&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;MT13&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 22 Nov 2021 17:26:44 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/NiFi-node-disconnection-missable-heartbeat-window/m-p/330756#M230746</guid>
      <dc:creator>mt13</dc:creator>
      <dc:date>2021-11-22T17:26:44Z</dc:date>
    </item>
  </channel>
</rss>

