Support Questions

Find answers, ask questions, and share your expertise

In NiFi queue with Load Balance Strategy RoundRobin flowfiles stuck

avatar
Contributor

Hi All

If in connection(queue between two processors)  I switch on RoundRobin strategy flow files start to stuck there for minutes. 

Some investigation showed, that flow files stuck only on the primary node.

No any Warns or Errors in logs.

I am ready to provide any information

Please, any help.

Thanks

1 ACCEPTED SOLUTION

avatar
Master Mentor

@ilyal 

 

What version of NiFi are you running?  Is it Apache NiFi 1.8.x?

 

There are numerous bugs with the new load balanced connections feature.  The good number of these known bugs have been addressed between NiFi 1.9.0 and NiFi 1.9.2.

https://issues.apache.org/jira/browse/NIFI-5745

https://issues.apache.org/jira/browse/NIFI-5919

https://issues.apache.org/jira/browse/NIFI-5663

https://issues.apache.org/jira/browse/NIFI-5771

https://issues.apache.org/jira/browse/NIFI-6017

 

There are still some additional bugs that are fixed in NiFi 1.10.0

https://issues.apache.org/jira/browse/NIFI-6353

https://issues.apache.org/jira/browse/NIFI-6760

https://issues.apache.org/jira/browse/NIFI-6517

https://issues.apache.org/jira/browse/NIFI-6736

https://issues.apache.org/jira/browse/NIFI-6285

https://issues.apache.org/jira/browse/NIFI-6759

 

I strongly recommend upgrading to Apache NiFi 1.10 as a first step upgrading (releasing soon).

 

Hope this helps,
Matt

View solution in original post

13 REPLIES 13

avatar
New Contributor

@MattWho Upgraded to Nifi 1.10 recently and flowfiles are struck with LoadBalance strategy PartitionBy Attribute

 

avatar
Master Mentor

@venu413 

 

Are these the same FlowFiles that were stuck prior to the upgrade? 
If you clean out your connections, do you see the issue occur later?

 

 

avatar
New Contributor

@MattWho no these are not same flowfiles before upgrade.. 
We cleaned all flow files before upgrade...

avatar
Master Mentor

@JatinSab 

Troubleshooting this here with limited information is going to prove difficult.
Does the downstream processor of the connection with the "stuck" FlowFiles have...
1. Configured for all nodes or primary node Execution?  If primary node only, are the queued FlowFiles on the primary node?
2. Does processor show it has active threads?  (small number in upper right corner of processor) . If so, get some thread dumps from your NiFi and inspect if the thread is progressing or hung.
3. Is back pressure being applied on any of the outbound connections?
4. Is processor getting threads to do work?  What is CPU load on your system and configured "Max Timer Driven Thread Count?"  It may be set to low causing a thread starved situation for your dataflow.

 

Additionally, if you try to perform a "list queue" on the connection of issue, what result do you get?

If you have a support contract with Cloudera, you can open a support case and they can help you to a solution.

 

Thanks,

Matt