Created 10-17-2019 06:35 AM
Hi All
If in connection(queue between two processors) I switch on RoundRobin strategy flow files start to stuck there for minutes.
Some investigation showed, that flow files stuck only on the primary node.
No any Warns or Errors in logs.
I am ready to provide any information
Please, any help.
Thanks
Created 10-17-2019 06:54 AM
What version of NiFi are you running? Is it Apache NiFi 1.8.x?
There are numerous bugs with the new load balanced connections feature. The good number of these known bugs have been addressed between NiFi 1.9.0 and NiFi 1.9.2.
https://issues.apache.org/jira/browse/NIFI-5745
https://issues.apache.org/jira/browse/NIFI-5919
https://issues.apache.org/jira/browse/NIFI-5663
https://issues.apache.org/jira/browse/NIFI-5771
https://issues.apache.org/jira/browse/NIFI-6017
There are still some additional bugs that are fixed in NiFi 1.10.0
https://issues.apache.org/jira/browse/NIFI-6353
https://issues.apache.org/jira/browse/NIFI-6760
https://issues.apache.org/jira/browse/NIFI-6517
https://issues.apache.org/jira/browse/NIFI-6736
https://issues.apache.org/jira/browse/NIFI-6285
https://issues.apache.org/jira/browse/NIFI-6759
I strongly recommend upgrading to Apache NiFi 1.10 as a first step upgrading (releasing soon).
Hope this helps,
Matt
Created 10-17-2019 06:54 AM
What version of NiFi are you running? Is it Apache NiFi 1.8.x?
There are numerous bugs with the new load balanced connections feature. The good number of these known bugs have been addressed between NiFi 1.9.0 and NiFi 1.9.2.
https://issues.apache.org/jira/browse/NIFI-5745
https://issues.apache.org/jira/browse/NIFI-5919
https://issues.apache.org/jira/browse/NIFI-5663
https://issues.apache.org/jira/browse/NIFI-5771
https://issues.apache.org/jira/browse/NIFI-6017
There are still some additional bugs that are fixed in NiFi 1.10.0
https://issues.apache.org/jira/browse/NIFI-6353
https://issues.apache.org/jira/browse/NIFI-6760
https://issues.apache.org/jira/browse/NIFI-6517
https://issues.apache.org/jira/browse/NIFI-6736
https://issues.apache.org/jira/browse/NIFI-6285
https://issues.apache.org/jira/browse/NIFI-6759
I strongly recommend upgrading to Apache NiFi 1.10 as a first step upgrading (releasing soon).
Hope this helps,
Matt
Created 10-27-2019 12:27 AM
Thanks you Matt. I will try and will update Question
Created 01-06-2020 08:25 AM
Upgraded to 1.10 and now I see flow files are stuck with Load Balance strategy Partition By Attribute
Created 01-06-2020 10:53 PM
Hi, I have issues with other processors, but some processors do not have this issue for example UpdateAttributes. If you add this processor it will be work.
Created 01-07-2020 10:58 AM
Issue seems to be in Queues rather than processor. I tried with different processors and issue remains same For eg: below is the queue that points to updateAttribute Processor.
Created on 01-07-2020 01:49 PM - edited 01-07-2020 02:08 PM
If you open the NiFi summary UI (NiFi UI --> Global menu --> Summary), select the connections tab, locate this connection with the 54 queued flowfiles, and then click the cluster connection summary icon ()to far right, Are all 54 queued FlowFiles on same node?
Is anything being logged in the nifi-app.log on that node were these FlowFiles are queued?
Any observed errors in nifi-app.log during startup if you restart this node?
In your nifi.properties file, what values are configured for these properties:
nifi.cluster.load.balance.comms.timeout=
nifi.cluster.load.balance.connections.per.node=
nifi.cluster.load.balance.host=
nifi.cluster.load.balance.max.thread.count=
nifi.cluster.load.balance.port=
nifi.cluster.node.address=
If recommend that both the "nifi.cluster.node.address=" and "nifi.cluster.load.balance.host=" have been configured uniquely per node in your cluster to the resolvable hostname for the given node. So if you node has a hostname of node1.mycompany.com, then this hostname should be used in both these properties in the NiFi running on that host. Restart is needed anytime you edit the nifi.properties file.
Created 02-10-2020 08:51 AM
@MattWho Here are the values of properties asked
nifi.cluster.load.balance.host=nifi-dev-0.nifi-dev
nifi.cluster.load.balance.port=6342
nifi.cluster.load.balance.connections.per.node=10
nifi.cluster.load.balance.max.thread.count=8
nifi.cluster.load.balance.comms.timeout=300 sec
nifi.cluster.node.address=nifi-dev-0.nifi-dev
Still the issue is appearing.
Actually we have a Kubernetes Cluster using NiFi 1.11.1 image. However, it still appears to happen
Created 02-11-2020 08:20 AM
Apache NiFi 1.11.1 specifically has a fix https://jira.apache.org/jira/browse/NIFI-7059 which introduced a bug with load-balanced connections. This bug is addressed in 1.11.2 and is covered in jira https://jira.apache.org/jira/browse/NIFI-7117.
Thanks,
Matt
Created 02-11-2020 10:40 AM
Thanks @MattWho ..
Joe Asked me to build using PR# 4045, and test it...
M on it to verify if it fixed the issue...
Will let you know...
Thanks for your help..