Member since: 11-22-2025
Posts: 17
Kudos Received: 1
Solutions: 1
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| | 1649 | 12-11-2025 03:06 PM |
06-01-2026
03:21 PM
Hi, thank you very much for this info. We will apply it and let you know how it goes. Cheers
05-28-2026
03:31 PM
Hi, thank you very much for this info. We will try it and let you know how it goes.

One of the other engineers was thinking of starting NiFi with a switch that used a string containing "FFFFFFFFFF" (I only saw it briefly), I think with the idea of forcing NiFi to start with all CPUs initially. I think he was also looking at changing CPU affinity on the workstation via Windows Task Manager, but I don't know where he got to with it.

Hypothetically, say you had nifi.flowcontroller.max.timer.driven.thread.count=36 and 10 processors on your canvas, each set to run 4 concurrent tasks. That is a potential demand of 10 x 4 = 40 threads against a pool of 36. In my mind this means either one downstream processor is completely starved with zero threads and causes a bottleneck, or NiFi reduces the thread allocation to all processors to keep output at a maximum, say by allocating 3 threads per processor and leaving 6 in reserve. Which of these happens?

In this case, would it be better to set nifi.flowcontroller.max.timer.driven.thread.count=48 to ensure there is no thread starvation, with a safety window of 48 - (10 x 4) = 8? If you have a safety window of 8 threads, would these be allocated to, say, garbage collection and other activities, or is that controlled by the threads in use by the JVM? Does NiFi tell the JVM it needs to run with a minimum of 48 threads because nifi.flowcontroller.max.timer.driven.thread.count=48 has been set? How does that JVM/NiFi interplay work, please?

I'm trying to understand the right way to tune this to avoid resource starvation. 🙂 Cheers
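For reference, the thread arithmetic in the question above can be written out as a small Python sketch. It only restates the worst-case sums from the post (every processor wanting all of its concurrent tasks at the same instant); it does not model how NiFi actually hands out threads, and the function name `thread_headroom` is made up for illustration.

```python
def thread_headroom(pool_size: int, processors: int, tasks_each: int) -> int:
    """Return pool size minus worst-case demand.

    A negative result means the worst-case demand exceeds the pool.
    This is plain arithmetic, not a model of NiFi's scheduler.
    """
    worst_case_demand = processors * tasks_each
    return pool_size - worst_case_demand


# The two settings discussed in the post: 10 processors, 4 concurrent tasks each.
for pool in (36, 48):
    print(pool, thread_headroom(pool, processors=10, tasks_each=4))
```

With the numbers from the post, a pool of 36 is 4 threads short of the worst case and a pool of 48 leaves the 8 spare threads mentioned above.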
05-27-2026
09:58 PM
Hi,

We are having problems with slow ingest on a single instance of NiFi 2.5.0 on a physical Windows 11 machine with 64 CPU cores, 64 GB RAM and 12 TB of NVMe storage (4 x 3 TB drives) set up as a striped D: drive. The C: drive is for Windows. I was hoping someone could give a sequential list of things to check, so we are at least approaching this the right way.

The people who set up the machine have put the ingest directory (we ingest files via GetFile) and the NiFi instance (content repo and everything else) on the same striped D: drive. Being a striped set, I wouldn't have thought speed would be an issue, even with both ingest and content repo on the same set.

So one thing I wondered was: can we force NiFi to use all cores, or do we need to keep some cores (maybe 4?) for Windows to use? And if we did use, say, 60 cores, what is the best way to do this? Is it set at the NiFi processor level, or do you need to force NiFi to use all cores at NiFi boot up?

The NiFi JVM has 8 GB of memory allocated and I haven't seen it go much past 50% utilization, which hopefully keeps garbage collection in check.

We ingest individual 200 GB files (but also smaller files), so would it be worth re-formatting the stripe set with much larger block sizes to avoid additional seeks? I'm guessing they used standard 4 KB blocks; I was thinking of maybe using 1 MB blocks.

Standard Windows antivirus is running on this machine and also on our Dev machine, but in the Dev environment, an 8 core Windows 11 machine, ingest is as fast as we would expect.

Would welcome thoughts. Thanks in advance.
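As a side note on the "use 60 of 64 cores" idea: one common way to restrict a Windows process to a subset of cores is a hexadecimal affinity bitmask, where bit N set means logical processor N may be used. The Python sketch below only shows how such a mask is computed. It assumes a bitmask is the mechanism in play (for example the hex value accepted by the Windows `start /affinity` switch); whether pinning helps ingest speed here is not established, and the helper name `affinity_mask` is hypothetical.

```python
def affinity_mask(cores_to_use: int, first_core: int = 0) -> str:
    """Return an uppercase hex bitmask with `cores_to_use` consecutive bits set,
    starting at bit `first_core` (bit 0 = logical processor 0)."""
    if cores_to_use < 1 or first_core < 0:
        raise ValueError("cores_to_use must be >= 1 and first_core >= 0")
    mask = ((1 << cores_to_use) - 1) << first_core
    return format(mask, "X")


# Leave logical processors 0-3 for Windows and use the other 60.
print(affinity_mask(60, first_core=4))

# Ten hex F digits correspond to 40 set bits, i.e. logical processors 0-39.
print(affinity_mask(40))
```

Note that on Windows systems with more than 64 logical processors the processors are divided into groups, and a single mask like this only covers one group, so the logical processor count (not just the core count) matters.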
Labels: Apache NiFi
05-14-2026
04:32 PM
Hi Matt,

Sorry it's taken a few days to post back; we've been running tests and observing the behaviour of the clusters.

We have NiFi 2.5.0 on a 3 node cluster on Ubuntu, and we run ZooKeeper as the cluster manager. We have a client Windows 11 machine with NiFi 2.5.0 on it that sends data files to the cluster. The cluster processes them and sends acknowledgement files (one per data file) back via NiFi to the Windows machine. It's at this point that the acknowledgement files get stuck, between the cluster and the client machine.

We've done belt and braces baselining of ZooKeeper: looking at logs, making sure NTP had node times correct, checking memory, and looking for errors in the logs. That came up clean.

The NiFi cluster is also 3 node, and what we did last night was delete the S2S output port, which seemed to fix it, but only temporarily: the files are getting stuck again this morning. This port connects to a downstream Windows machine with NiFi 2.5.0 on it. We have the port opened right up to 100,000 files and 1 TB for back pressure. Penalty is 30 seconds. Traffic seems to flow reasonably evenly between all 3 NiFi nodes.

Something else we have observed, and I don't know if it's related: in our non-clustered lab setup, we saw slow site-to-site between Docker containers (2 containers on the same Ubuntu machine), so we installed NiFi onto Linux directly, which seemed to fix it. Then it slowed down again, so we deleted everything on the canvas and re-created it by importing the canvas backup JSON file we took before wiping everything. It then ran OK and seemed happy enough, with throughput at the levels we would expect.

We also observed that files would "disappear" when we had 2 downstream sites connected to the same output port. We think there is a not-so-"round robin" happening, whereby some files move to one downstream site and others go to a second site, but not in any particular sequence, which makes it look on one site as if they have "disappeared".
We still have files being stuck again. I'm just keen to get to the bottom of it. 🙂 We
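The "disappearing" files observation above, with two downstream sites pulling from one output port, can be illustrated with a toy Python model of two consumers draining a single shared queue. This is only a sketch under simplifying assumptions: the consumers take turns in a fixed order and pull a fixed batch size, which real site-to-site clients would not do, and the names `drain`, `site_a` and `site_b` are invented for the example. The one point it shows is that each file is delivered to exactly one consumer, so either site on its own sees gaps.

```python
from collections import deque


def drain(files, consumers, batch_size=1):
    """Hand out queued files to consumers that take turns pulling batches.

    Each file is removed from the shared queue when pulled, so it ends up
    at exactly one consumer.
    """
    queue = deque(files)
    received = {name: [] for name in consumers}
    while queue:
        for name in consumers:
            for _ in range(batch_size):
                if not queue:
                    break
                received[name].append(queue.popleft())
    return received


files = [f"ack_{i}.txt" for i in range(6)]
result = drain(files, ["site_a", "site_b"], batch_size=2)
for site, got in result.items():
    print(site, got)
```

Viewed from `site_b` alone, most of the files never arrive, even though nothing was lost overall.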
05-12-2026
03:07 PM
We have NiFi 2.5.0 and a problem whereby some files seem to get "stuck" in a site-to-site connection queue after leaving a 3 node cluster. We ingest 60,000 small files out of a series of folders that are nested to a depth of 10 folders. Most files seem to move through the cluster OK, but a few seem to enter the cluster fine and then have a problem leaving it. It's different files each time, and they sit there for 30-60 minutes and eventually move on.

The network and compute resources are fine, and the memory that NiFi is using is about 47% of the JVM maximum. Disk space doesn't seem to be an issue either. I had a look through the nifi-app.log file and nothing shows up as an issue. Cluster heartbeat averages 10 milliseconds, and GC runs for 50 milliseconds every 30 seconds.

Would welcome some suggestions. Thank you in advance.
Labels: Apache NiFi
02-03-2026
06:59 PM
We were looking at storing the provenance *.gz (Lucene) files so we could have a record of all activity through NiFi, as our IT security people are fussy about such things. It seems NiFi writes to multiple files at the same time, so this may not be possible. Is there a better way of doing this? I would welcome thoughts on how we might do this, or a different way to achieve the same logging outcome. Thanks in advance.
Labels: Apache NiFi
02-02-2026
07:56 PM
1 Kudo
Hi all,

I recently spent a full 2 days over a rainy weekend trying to get a standalone instance of NiFi v2.5.0 on Ubuntu 22.04 to integrate with AD, so that AD is used for NiFi authentication, but with no joy. It almost worked but I couldn't get past a certain point. The problem I discovered is that there doesn't seem to be a simple step-by-step process anywhere on the internet for setting up NiFi with AD/LDAP.

My setup is a Linux PC called nifi1, plus an old Windows 2012 R2 domain controller with LDAP running on it that I decided to use. I created a domain called testdomain.local and a domain user called testuser in the AD User OU that I wanted to use as a NiFi user. I also created a service account called svc-nifi, also in the User OU, that is used to log onto AD as a service. I ran a series of tests using LDAP command line queries from the Linux PC, which proved the service account could access AD, read the AD info it needed and connect to AD OK. The testuser logon also works fine for logging onto the domain on other PCs.

Is there a documented, basic step-by-step process for doing this on a Linux (or Windows) PC, please? I just need a minimum working example of the nifi.properties, authorizers.xml, users.xml and authorizations.xml etc. files. There are so many variables in each file that it is difficult to work through them sequentially, as I don't have expert NiFi knowledge. I wondered if I had maybe skipped a step or something, like:

* Do I need to add the Linux PC to AD itself first, and if so, how is this set up in the config files?
* What do I need to add to nifi.properties, authorizers.xml etc. to get it to work?
* etc.

Any help appreciated. This should really be simple, but it's not. I'm happy to write up a step-by-step process and post it back here once I have it running, to help others. Thanks in advance. 🙂
Labels: Apache NiFi
12-17-2025
07:13 PM
Thanks Matt, that's a very useful explanation.
12-17-2025
07:10 PM
Hi,

I wanted to look into a need to transfer 1 TB files by using chunking in NiFi. Each file also has 20 items of metadata associated with it that must remain intact, so that both the metadata and the data survive breaking the file up into 1000 chunks (chunking) and re-assembling it at the destination (de-chunking). Also, is the metadata for the large file duplicated onto each of the 1000 chunks, or does each chunk carry only a subset of the metadata?

Someone mentioned that NiFi passes the file chunk data through JVM memory on its way to the content repository. Can I confirm whether file chunks pass through JVM memory as they are written to the content repository for a large file (or any file, for that matter)? I was fairly sure they don't, otherwise the JVM size (limited by machine RAM) would limit the reading in of large file data, and that would limit large file transfer speed. Is that correct?

I'm trying to confirm my understanding of how NiFi handles these large files. Any help appreciated.
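To make the chunking question above concrete, here is a minimal Python sketch of one possible scheme: the source is read a fixed number of bytes at a time (so the splitter only ever holds one chunk in memory), every chunk carries a full copy of the original metadata plus a fragment identifier and index, and the destination re-orders by index to rebuild the file. This is an illustration of the idea only, not how NiFi implements it; the function names and the `fragment.identifier` / `fragment.index` keys are chosen for the example.

```python
import io
import uuid


def split_stream(src, chunk_size, metadata):
    """Yield (attributes, data) pairs, reading chunk_size bytes at a time."""
    fragment_id = str(uuid.uuid4())
    index = 0
    while True:
        data = src.read(chunk_size)
        if not data:
            break
        attrs = dict(metadata)  # full copy of the original metadata on every chunk
        attrs["fragment.identifier"] = fragment_id
        attrs["fragment.index"] = index
        yield attrs, data
        index += 1


def reassemble(chunks, dst):
    """Write chunks to dst in index order and return the original metadata.

    For simplicity this sorts all chunks in memory; a real transfer would
    spool them to disk instead.
    """
    ordered = sorted(chunks, key=lambda c: c[0]["fragment.index"])
    if not ordered:
        return {}
    for _, data in ordered:
        dst.write(data)
    first_attrs = ordered[0][0]
    return {k: v for k, v in first_attrs.items() if not k.startswith("fragment.")}


meta = {"filename": "big.bin", "owner": "lab"}
chunks = list(split_stream(io.BytesIO(b"abcdefghij"), 4, meta))
out = io.BytesIO()
restored = reassemble(reversed(chunks), out)  # arrival order does not matter
print(len(chunks), out.getvalue(), restored)
```

Duplicating the full metadata onto each chunk costs a little space but means any single chunk is enough to recover it at the destination.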
Labels: Apache NiFi