Created 10-17-2019 06:35 AM
Hi All
If in connection(queue between two processors) I switch on RoundRobin strategy flow files start to stuck there for minutes.
Some investigation showed, that flow files stuck only on the primary node.
No any Warns or Errors in logs.
I am ready to provide any information
Please, any help.
Thanks
Created 10-17-2019 06:54 AM
What version of NiFi are you running? Is it Apache NiFi 1.8.x?
There are numerous bugs with the new load balanced connections feature. The good number of these known bugs have been addressed between NiFi 1.9.0 and NiFi 1.9.2.
https://issues.apache.org/jira/browse/NIFI-5745
https://issues.apache.org/jira/browse/NIFI-5919
https://issues.apache.org/jira/browse/NIFI-5663
https://issues.apache.org/jira/browse/NIFI-5771
https://issues.apache.org/jira/browse/NIFI-6017
There are still some additional bugs that are fixed in NiFi 1.10.0
https://issues.apache.org/jira/browse/NIFI-6353
https://issues.apache.org/jira/browse/NIFI-6760
https://issues.apache.org/jira/browse/NIFI-6517
https://issues.apache.org/jira/browse/NIFI-6736
https://issues.apache.org/jira/browse/NIFI-6285
https://issues.apache.org/jira/browse/NIFI-6759
I strongly recommend upgrading to Apache NiFi 1.10 as a first step upgrading (releasing soon).
Hope this helps,
Matt
Created 01-06-2020 08:31 AM
@MattWho Upgraded to Nifi 1.10 recently and flowfiles are struck with LoadBalance strategy PartitionBy Attribute
Created 02-11-2020 08:27 AM
Are these the same FlowFiles that were stuck prior to the upgrade?
If you clean out your connections, do you see the issue occur later?
Created 02-11-2020 08:33 AM
@MattWho no these are not same flowfiles before upgrade..
We cleaned all flow files before upgrade...
Created 02-11-2020 10:34 AM
Troubleshooting this here with limited information is going to prove difficult.
Does the downstream processor of the connection with the "stuck" FlowFiles have...
1. Configured for all nodes or primary node Execution? If primary node only, are the queued FlowFiles on the primary node?
2. Does processor show it has active threads? (small number in upper right corner of processor) . If so, get some thread dumps from your NiFi and inspect if the thread is progressing or hung.
3. Is back pressure being applied on any of the outbound connections?
4. Is processor getting threads to do work? What is CPU load on your system and configured "Max Timer Driven Thread Count?" It may be set to low causing a thread starved situation for your dataflow.
Additionally, if you try to perform a "list queue" on the connection of issue, what result do you get?
If you have a support contract with Cloudera, you can open a support case and they can help you to a solution.
Thanks,
Matt