Member since
07-30-2019
3421
Posts
1624
Kudos Received
1010
Solutions
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| 58 | 01-13-2026 11:14 AM | |
| 198 | 01-09-2026 06:58 AM | |
| 518 | 12-17-2025 05:55 AM | |
| 579 | 12-15-2025 01:29 PM | |
| 563 | 12-15-2025 06:50 AM |
11-30-2016
03:34 PM
The RPG can be used to redistribute the ingested data of a single node using teh primary node strategy mentind here across every node in your NiFi cluster. This is a great way to distribute the work load while ensuring each node is working a unique set of FlowFiles.
... View more
11-30-2016
03:26 PM
2 Kudos
@Sean Murphy Each Node in a NiFi cluster runs its own threads within its own processor working on its own set of FlowFiles. Nodes in a NiFi cluster have no knowledge of what FlowFiles are being worked on by other nodes. If you are seeing multiple copies of the same output, that suggest that each node in your cluster is processing the same files. I am not sure how your dataflow is designed to ingest the data it works on, but ideally you want to design it in such a way to prevent each node from ingesting the same data/files. Thanks, Matt
... View more
11-28-2016
11:40 PM
1 Kudo
@Mothilal marimuthu The documentation for installing the latest HDF release can be found here: http://docs.hortonworks.com/HDPDocuments/HDF2/HDF-2.0.1/index.html Thanks, Matt
... View more
11-28-2016
07:41 PM
1 Kudo
@Mothilal marimuthu Those processor were not introduced until Apache NiFi 1.0 / HDF 2.0. You screen shot shows you running NiFi 0.3 / HDF 1.1.
... View more
11-28-2016
07:08 PM
The bootstrap port has nothing to do with the web UI port. Please take a look at the nifi-app.log for the cause of the shutdown.
... View more
11-28-2016
07:06 PM
@Sanaz Janbakhsh You should see why NiFi shutdown in either the nifi-bootstrap.log or the nifi-app.log.
... View more
11-21-2016
08:53 PM
Very possible it is related to that bug. With regular queues in excess of the swap threshold of 20,000 FlowFiles, swapping will occur. It is a bug in that swapping that can result in those swapped FlowFiles not getting removed from the content repo. This bug continues to occur until eventually you run out of disk space. On restart all that "orphaned" FlowFile content is then removed because their are no FlowFiles referencing that content anymore. Matt
... View more
11-21-2016
08:39 PM
@Philippe Marseille Apache NiFi 1.1 should be going up for vote very soon..
... View more
11-21-2016
07:36 PM
1 Kudo
@Philippe Marseille The content size displayed in a the UI will not map exactly to disk utilization since Nifi stores multiple FlowFiles in a single claim in the content repo. A claim cannot be deleted until Every FlowFile in contains has reached a point of termination in your dataflow. so it is possible with 450,000 queued FlowFiles you are holding on to a large number of claims still. Try clearing out some of this backlog and see if disk usage drops. Setting backpressure thresholds on connections is good way to prevent your queues from getting so large. Another possibility is that you are running in to https://issues.apache.org/jira/browse/NIFI-2925 . This bug has been addressed for the next Apache NiFi release of 1.1 and HDF 2.1 Thanks, Matt
... View more
11-18-2016
09:06 PM
The source NiFi will initially communicate with the target cluster over the same HTTP(s) port you would use to access the target NiFi cluster's UI. After that initial communication the target cluster will provide your source NiFi with the configured nifi.remote.input.host and nifi.remote.input.port for each node in teh target cluster along with teh current load on each node. If you left the nifi.remote.input.host blank, Java will try to determine the hostname. This may result in either an internal hostname your source can not resolve or even just localhost. I highly recommend setting this property to a public facing FQDN for each node in your cluster. Matt
... View more