<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: using nifi as a kafka streaming- real-time stream processing in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/using-nifi-as-a-kafka-streaming-real-time-stream-processing/m-p/412702#M253635</link>
    <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/107711"&gt;@Rohit1997jio&lt;/a&gt;&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;Yes it can be fast, but that depends on your dataflow(s). How many other dataflows is your NiFi also running?&amp;nbsp; All these dataflows share the same JVM resources and utilize server resources.&amp;nbsp; &amp;nbsp;Also concurrency is important so having a match between number of partitions in your source Topic A and the concurrency within your ConsumeKafkaRecord processor is important.&amp;nbsp; Concurrent tasks in the ConsumeKafkaRecord you be set using the following formula:&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;(num partitions in Topic A) / (num NiFi nodes in cluster) = (num concurrent tasks set on ConsumeKafkaRecord)&lt;/LI-CODE&gt;&lt;P&gt;&lt;BR /&gt;Example:&lt;BR /&gt;Kafka Topic partions = 12 partitions&lt;BR /&gt;NiFI cluster num nodes = 3&lt;BR /&gt;Concurrent tasks set on ConsumeKafkaRecord = 4&lt;BR /&gt;&lt;BR /&gt;4 X 3 = 12 consumers in the consumer group (1&amp;nbsp; consumer per partition in source topic)&lt;BR /&gt;&lt;BR /&gt;Also in NiFI keep in mind that each concurrent task utilizes a thread from NiFi's "Max Timer Driven Thread Count" thread pool.&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="MattWho_0-1760966034129.png" style="width: 483px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/46385i2793069F0A9405A3/image-dimensions/483x263?v=v2" width="483" height="263" role="button" title="MattWho_0-1760966034129.png" alt="MattWho_0-1760966034129.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;The default pool size is 10.&amp;nbsp; &amp;nbsp;This means that only 10 concurrent task can execute at the same time.&amp;nbsp; Generally speaking threads milliseconds, but for optimal performance you'll want to mange your server CPU load average resources and set the pool higher to maximize your throughput performance.&amp;nbsp;&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;NiFi can show you the Core Load Average (1 = 100% utilization of one physical core).&amp;nbsp; &amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="MattWho_1-1760966415499.png" style="width: 400px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/46386iAEF4D1C4F785D9ED/image-size/medium?v=v2&amp;amp;px=400" role="button" title="MattWho_1-1760966415499.png" alt="MattWho_1-1760966415499.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;So if your NiFi server has 4 cores and the load average is 4 or higher, the CPU is saturated.&amp;nbsp; You'll be able to use this Core Load Average data to determine how much your can increase the size of the "Max timer Driven Thread Count" pool setting.&amp;nbsp; &amp;nbsp;NiFi recommends setting your "Max Timer Driven Count" initially to 2 - 4 times the number of Physical cores on the NiFi node.&amp;nbsp; This assume all nodes in the NiFi cluster have same number of physical cores.&amp;nbsp; &amp;nbsp;If nodes have various numbers of physical cores, use the nodes with the fewest to set your initial pool size.&amp;nbsp; Then monitor core load average across all NiFi nodes and adjust accordingly.&amp;nbsp; There is no way to configure different thread pools per node in a NiFi cluster, NiFi expects all all servers in a NiFi cluster to have same hardware configuration.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Please help our community grow. If you found&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;STRONG&gt;any&lt;/STRONG&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;of the suggestions/solutions provided helped you with solving your issue or answering your question, please take a moment to login and click "&lt;SPAN&gt;&lt;EM&gt;&lt;STRONG&gt;&lt;FONT color="#FF0000"&gt;Accept as Solution&lt;/FONT&gt;&lt;/STRONG&gt;&lt;/EM&gt;" on&amp;nbsp;&lt;STRONG&gt;one or more&lt;/STRONG&gt;&amp;nbsp;of them that helped.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Thank you,&lt;BR /&gt;Matt&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Mon, 20 Oct 2025 13:29:54 GMT</pubDate>
    <dc:creator>MattWho</dc:creator>
    <dc:date>2025-10-20T13:29:54Z</dc:date>
    <item>
      <title>using nifi as a kafka streaming- real-time stream processing</title>
      <link>https://community.cloudera.com/t5/Support-Questions/using-nifi-as-a-kafka-streaming-real-time-stream-processing/m-p/412640#M253586</link>
      <description>&lt;P&gt;Hi ,&amp;nbsp;&lt;/P&gt;&lt;P&gt;This is my current NIFI flow where i am consuming data in NIFI using ConsumeKafka_1_0 and publishing using PublishKafka_1_0 . Now&amp;nbsp;&lt;/P&gt;&lt;P&gt;HiveMQ (MQTT) → Kafka Topic A → NiFi consumes → NiFi publishes → Kafka Topic B&lt;/P&gt;&lt;P&gt;My requirement is send data in real time , i want to do real time data streams like kafka streams .&lt;/P&gt;&lt;P&gt;How to achieve that using NIFI . Is kafka streams available in NIFI ?&lt;/P&gt;&lt;P&gt;iam not doing and data transformation or any other operation to data. Iam just simply consuming and publishing data , but now i want to do it in real time .&lt;/P&gt;</description>
      <pubDate>Fri, 10 Oct 2025 12:03:04 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/using-nifi-as-a-kafka-streaming-real-time-stream-processing/m-p/412640#M253586</guid>
      <dc:creator>Rohit1997jio</dc:creator>
      <dc:date>2025-10-10T12:03:04Z</dc:date>
    </item>
    <item>
      <title>Re: using nifi as a kafka streaming- real-time stream processing</title>
      <link>https://community.cloudera.com/t5/Support-Questions/using-nifi-as-a-kafka-streaming-real-time-stream-processing/m-p/412642#M253588</link>
      <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/107711"&gt;@Rohit1997jio&lt;/a&gt;&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;For real-time streaming, Apache NiFi may not going to be the best option.&amp;nbsp; NiFi works with files at rest.&amp;nbsp; So you have an ingest processor that consumes from Kafka (consumed content is written to NiFi's content_repository and associated to a NiFi FlowFile that get moved to the outbound success relationship.&amp;nbsp; ConsumeKafkaRecord processor likely to give better throughput).&amp;nbsp; So even if you connect that success directly to a PublishKafka/PublishKafkaRecord processor, you'll still have a delay of some minimal amount of time as the Publish processor will get scheduled to execute and publish a FlowFiles message or record with multiple messages to the target cluster.&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Please help our community grow. If you found&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;STRONG&gt;any&lt;/STRONG&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;of the suggestions/solutions provided helped you with solving your issue or answering your question, please take a moment to login and click "&lt;SPAN&gt;&lt;EM&gt;&lt;STRONG&gt;&lt;FONT color="#FF0000"&gt;Accept as Solution&lt;/FONT&gt;&lt;/STRONG&gt;&lt;/EM&gt;" on&amp;nbsp;&lt;STRONG&gt;one or more&lt;/STRONG&gt;&amp;nbsp;of them that helped.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Thank you,&lt;BR /&gt;Matt&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 10 Oct 2025 15:03:45 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/using-nifi-as-a-kafka-streaming-real-time-stream-processing/m-p/412642#M253588</guid>
      <dc:creator>MattWho</dc:creator>
      <dc:date>2025-10-10T15:03:45Z</dc:date>
    </item>
    <item>
      <title>Re: using nifi as a kafka streaming- real-time stream processing</title>
      <link>https://community.cloudera.com/t5/Support-Questions/using-nifi-as-a-kafka-streaming-real-time-stream-processing/m-p/412656#M253595</link>
      <description>&lt;P&gt;Ok , so can i make a flow like&amp;nbsp;&lt;SPAN&gt;ConsumeKafkaRecord&amp;nbsp; --&amp;gt; topic A --&amp;gt;&amp;nbsp;PublishKafkaRecord&amp;nbsp; --&amp;gt; topic B , using both record processor for consuming and publishing data ? Will this be fast ?&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 13 Oct 2025 14:19:49 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/using-nifi-as-a-kafka-streaming-real-time-stream-processing/m-p/412656#M253595</guid>
      <dc:creator>Rohit1997jio</dc:creator>
      <dc:date>2025-10-13T14:19:49Z</dc:date>
    </item>
    <item>
      <title>Re: using nifi as a kafka streaming- real-time stream processing</title>
      <link>https://community.cloudera.com/t5/Support-Questions/using-nifi-as-a-kafka-streaming-real-time-stream-processing/m-p/412662#M253601</link>
      <description>&lt;P&gt;&lt;SPAN&gt;Ok , so can i make a flow like&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN&gt;ConsumeKafkaRecord&amp;nbsp; --&amp;gt; topic A --&amp;gt;&amp;nbsp;PublishKafkaRecord&amp;nbsp; --&amp;gt; topic B , using both record processor for consuming and publishing data ? Will this be fast ?&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 14 Oct 2025 07:00:23 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/using-nifi-as-a-kafka-streaming-real-time-stream-processing/m-p/412662#M253601</guid>
      <dc:creator>Rohit1997jio</dc:creator>
      <dc:date>2025-10-14T07:00:23Z</dc:date>
    </item>
    <item>
      <title>Re: using nifi as a kafka streaming- real-time stream processing</title>
      <link>https://community.cloudera.com/t5/Support-Questions/using-nifi-as-a-kafka-streaming-real-time-stream-processing/m-p/412702#M253635</link>
      <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/107711"&gt;@Rohit1997jio&lt;/a&gt;&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;Yes it can be fast, but that depends on your dataflow(s). How many other dataflows is your NiFi also running?&amp;nbsp; All these dataflows share the same JVM resources and utilize server resources.&amp;nbsp; &amp;nbsp;Also concurrency is important so having a match between number of partitions in your source Topic A and the concurrency within your ConsumeKafkaRecord processor is important.&amp;nbsp; Concurrent tasks in the ConsumeKafkaRecord you be set using the following formula:&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;(num partitions in Topic A) / (num NiFi nodes in cluster) = (num concurrent tasks set on ConsumeKafkaRecord)&lt;/LI-CODE&gt;&lt;P&gt;&lt;BR /&gt;Example:&lt;BR /&gt;Kafka Topic partions = 12 partitions&lt;BR /&gt;NiFI cluster num nodes = 3&lt;BR /&gt;Concurrent tasks set on ConsumeKafkaRecord = 4&lt;BR /&gt;&lt;BR /&gt;4 X 3 = 12 consumers in the consumer group (1&amp;nbsp; consumer per partition in source topic)&lt;BR /&gt;&lt;BR /&gt;Also in NiFI keep in mind that each concurrent task utilizes a thread from NiFi's "Max Timer Driven Thread Count" thread pool.&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="MattWho_0-1760966034129.png" style="width: 483px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/46385i2793069F0A9405A3/image-dimensions/483x263?v=v2" width="483" height="263" role="button" title="MattWho_0-1760966034129.png" alt="MattWho_0-1760966034129.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;The default pool size is 10.&amp;nbsp; &amp;nbsp;This means that only 10 concurrent task can execute at the same time.&amp;nbsp; Generally speaking threads milliseconds, but for optimal performance you'll want to mange your server CPU load average resources and set the pool higher to maximize your throughput performance.&amp;nbsp;&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;NiFi can show you the Core Load Average (1 = 100% utilization of one physical core).&amp;nbsp; &amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="MattWho_1-1760966415499.png" style="width: 400px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/46386iAEF4D1C4F785D9ED/image-size/medium?v=v2&amp;amp;px=400" role="button" title="MattWho_1-1760966415499.png" alt="MattWho_1-1760966415499.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;So if your NiFi server has 4 cores and the load average is 4 or higher, the CPU is saturated.&amp;nbsp; You'll be able to use this Core Load Average data to determine how much your can increase the size of the "Max timer Driven Thread Count" pool setting.&amp;nbsp; &amp;nbsp;NiFi recommends setting your "Max Timer Driven Count" initially to 2 - 4 times the number of Physical cores on the NiFi node.&amp;nbsp; This assume all nodes in the NiFi cluster have same number of physical cores.&amp;nbsp; &amp;nbsp;If nodes have various numbers of physical cores, use the nodes with the fewest to set your initial pool size.&amp;nbsp; Then monitor core load average across all NiFi nodes and adjust accordingly.&amp;nbsp; There is no way to configure different thread pools per node in a NiFi cluster, NiFi expects all all servers in a NiFi cluster to have same hardware configuration.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Please help our community grow. If you found&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;STRONG&gt;any&lt;/STRONG&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;of the suggestions/solutions provided helped you with solving your issue or answering your question, please take a moment to login and click "&lt;SPAN&gt;&lt;EM&gt;&lt;STRONG&gt;&lt;FONT color="#FF0000"&gt;Accept as Solution&lt;/FONT&gt;&lt;/STRONG&gt;&lt;/EM&gt;" on&amp;nbsp;&lt;STRONG&gt;one or more&lt;/STRONG&gt;&amp;nbsp;of them that helped.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Thank you,&lt;BR /&gt;Matt&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 20 Oct 2025 13:29:54 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/using-nifi-as-a-kafka-streaming-real-time-stream-processing/m-p/412702#M253635</guid>
      <dc:creator>MattWho</dc:creator>
      <dc:date>2025-10-20T13:29:54Z</dc:date>
    </item>
  </channel>
</rss>

