Member since 07-30-2019
944 Posts
197 Kudos Received
91 Solutions
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| | 1506 | 10-05-2021 01:53 PM |
| | 16325 | 09-23-2019 06:03 AM |
| | 6817 | 05-04-2019 08:42 PM |
| | 1502 | 06-11-2018 12:45 PM |
| | 12625 | 06-04-2018 01:11 PM |
05-19-2017
12:51 PM
@Thierry Vernhet Try setting the Batch Size property to 1000, and see if that helps.
05-19-2017
12:36 PM
@Muhammad Umar What is the OS of the system? What filesystem is the partition mounted with (xfs, ext4)? Is this a VM or a bare metal server? What is returned when you run the command "mount"?
05-18-2017
10:31 PM
@Prakash Ravi Correct, but the ListSFTP processor does not require a Distributed Cache Service to maintain state, so don't create one. It will use ZooKeeper by default.
05-18-2017
09:42 PM
@Prakash Ravi
Basically correct, which is why NiFi uses ZooKeeper for state information now. I wouldn't use the DistributedMapCache unless I absolutely had to. Which processors are you using?
05-18-2017
05:44 PM
@Muhammad Umar
So, the Clear state is working. The warning you are seeing has nothing to do with the state of the processor.
05-18-2017
05:19 PM
@Muhammad Umar Was there an error? What do you see after you click Clear state?
05-18-2017
04:52 PM
@Muhammad Umar Was the processor stopped when you tried?
05-18-2017
01:35 PM
2 Kudos
@Prakash Ravi
Question 1: No, you do not need a controller service for each processor; they can use the same DistributedCache service.

Question 2: Yes, that is the correct understanding. If no directory is specified, then the cache is in memory only.

Question 3: The way the DistributedMapCacheServer works is that each node in the cluster runs its own server. But data is written via the DistributedMapCacheClient controller service, and in that configuration you specify which of the DistributedMapCacheServers to write the data to. So only one of the servers will have the data. If that node is lost, the server will not be available until the node is back in the cluster.

When you say "primarily running the processor", I am assuming you mean the processor is configured to run on the primary node only. If you lose the primary node, another node will be elected primary, and the processor will start running again. There might be an issue if the lost node also happens to be the node that the DistributedMapCacheClient controller service is configured to use; in that case the state would be unavailable.

FYI, there are only three processors currently that require a DistributedMapCache controller service. Any other processors that retain state use ZooKeeper, so the state of the processor is maintained across the cluster automatically.
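To make the Question 3 failure case concrete, here is a toy model in plain Python. It is not NiFi code: the `CacheServer` and `CacheClient` classes, the node names, and the key are all made up for illustration. It only sketches the idea that every node runs a server, the client targets exactly one of them, and the data becomes unreachable when that one node is lost.

```python
class CacheServer:
    """Toy stand-in for the cache server running on one cluster node."""

    def __init__(self, node, persistence_dir=None):
        self.node = node
        self.persistence_dir = persistence_dir  # None means in-memory only
        self.online = True
        self.data = {}

    def go_offline(self):
        self.online = False
        if self.persistence_dir is None:
            # In this toy model, in-memory entries are lost with the node.
            self.data.clear()

    def come_back(self):
        self.online = True


class CacheClient:
    """Toy stand-in for the client service: it targets exactly one server."""

    def __init__(self, server):
        self.server = server

    def _require_server(self):
        if not self.server.online:
            raise ConnectionError(
                f"cache server on {self.server.node} is unavailable"
            )

    def put(self, key, value):
        self._require_server()
        self.server.data[key] = value

    def get(self, key):
        self._require_server()
        return self.server.data.get(key)


# Every node runs its own server, but the client writes to node1 only.
servers = {name: CacheServer(name) for name in ("node1", "node2", "node3")}
client = CacheClient(servers["node1"])
client.put("last-listed", "file-a.csv")

# Only the server the client points at holds the entry.
holders = [name for name, s in servers.items() if "last-listed" in s.data]
print(holders)

# Losing node1 makes the cached state unreachable, even though
# node2 and node3 are still running their own (empty) servers.
servers["node1"].go_offline()
try:
    client.get("last-listed")
    reachable = True
except ConnectionError:
    reachable = False
print(reachable)
```

The `persistence_dir` argument mirrors the point from Question 2: in this sketch, a server created without one drops its entries when the node goes away, while one created with a directory keeps them.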
05-18-2017
12:16 PM
1 Kudo
@Muhammad Umar What you want to do is clear the state of the processor. Right-click the ListFile processor and, from the menu that pops up, select View state. In the new window that opens, there is a Clear state link in the middle right side. Click it, and the file will be picked up again the next time the processor runs. FYI, if you only want to pull the file once a day, you can configure the ListFile processor with a CRON driven Scheduling strategy and it will only run once per day.
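For the once-a-day schedule, the CRON driven strategy takes a Quartz-style expression whose first field is seconds. The sketch below is plain Python, not NiFi code: `describe_cron` is a hypothetical helper that only labels the fields, and the 1 AM time is an arbitrary example, so pick whatever hour suits you.

```python
# Field names of a Quartz-style cron expression, in order.
QUARTZ_FIELDS = (
    "seconds", "minutes", "hours", "day_of_month", "month", "day_of_week",
)


def describe_cron(expression):
    """Label each field of a 6- or 7-field Quartz-style cron expression."""
    parts = expression.split()
    if len(parts) not in (6, 7):
        raise ValueError(f"expected 6 or 7 fields, got {len(parts)}")
    fields = dict(zip(QUARTZ_FIELDS, parts))
    if len(parts) == 7:
        fields["year"] = parts[6]  # the optional seventh field
    return fields


# Example value for the processor's Run Schedule: every day at 01:00:00.
DAILY_AT_1AM = "0 0 1 * * ?"

for name, value in describe_cron(DAILY_AT_1AM).items():
    print(f"{name:>12}: {value}")
```

With the Scheduling Strategy set to CRON driven, an expression like `0 0 1 * * ?` would go in the Run Schedule field.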
05-17-2017
12:31 PM
@Muhammad Umar If you only want to pull that one file, then just configure ListFile this way, then configure FetchFile this way. It will get the one file and move it.