Member since
06-26-2017
191
Posts
10
Kudos Received
1
Solution
My Accepted Solutions
Title | Views | Posted |
---|---|---|
1448 | 09-22-2017 07:13 PM |
10-25-2017
03:22 PM
Hi All, Thanks a lot to this awesome community. I was reading this blog by @Bryan Bende, and in the last paragraph, as per my interpretation in order to have better performance we should have 1 large flowfile as in output from listenTCP which can be achieved by using configuring "Max Batch size I set it to "100 000" . this will collect 100000 messages in a flowfile and input it to PublishKafka and in the Publish kafka the "Message Demaractor" is set to new line (shift+enter) this will send each message from the flow file thus increasing throughput Am I correct in my understanding? Thanks Dheeru
... View more
Labels:
- Labels:
-
Apache NiFi
10-25-2017
12:18 PM
@Wynner Thanks a lot and appreciate your help always
... View more
10-24-2017
10:14 PM
@Wynner By local time zone we mean the system time on my nifi nodes.
... View more
10-24-2017
09:21 PM
Hi All, Thanks a lot to this awesome community There is a use case where my processor should run daily at time 9:01 am cst. question 1. Does it follow any time zone, do I need to provide it? In addition after reading the docs here https://docs.hortonworks.com/HDPDocuments/HDF1/HDF-1.2/bk_UserGuide/content/scheduling-tab.html my value for Run schedule would look like for every day at 9:01 am cst * 01 09 * * * does the above value look correct?
... View more
Labels:
- Labels:
-
Apache NiFi
10-24-2017
08:58 PM
Hi All, Thanks to this awesome community. From my understanding the listSFTP and fetchSFTP is using Secure FTP protocol behind the scenes. Am I correct in my understanding? Thanks Dheeru
... View more
Labels:
- Labels:
-
Apache NiFi
10-23-2017
09:49 PM
@Shu Thanks for explanation. It is very helpful, appreciate it. Dhieru
... View more
10-23-2017
09:14 PM
Hi All, Thanks a lot to this community. I am working on building data flows, and i have 40 to 50 data flows to be configured. some of them are like 15000 events per second. If I have some of the data flows configured and then a very high event per second data source is being configured, and some how fills up the JVM, everything crashes because of the disk space, or sudden event burst occurs Does it mean we will have to proactively watch for any space issues such as JVM heap size, content repo and provenance repo. etc Thanks Dheiru
... View more
Labels:
- Labels:
-
Apache NiFi
10-23-2017
07:46 PM
Hi @Abdelkrim Hadjidj Thanks for the response. Appreciate it. So in order to plan for failover and HA, I need to mount a network drive which is visible/access to all the nodes in nifi cluster, but for the listfileprocessor schedule it to run on primary node. In case failure happens, new primary will be elected and it will start listfiles since it has access to the network location . and the basically use RPG to distribute files for for further processing and saving it on hdfs. Is my understanding correct. Thanks Dhieru
... View more
10-23-2017
06:36 PM
So accroding to this link, https://community.hortonworks.com/articles/16120/how-do-i-distribute-data-across-a-nifi-cluster.html I need to mount a shared netwrok drive to all nodes, so that each of the nodes will have access and then use listFiles processor on primary node followed up by remote process group. AFter that use a input port with remote process group followed up by fetch files. In this case, even if the primary changes all the nodes in cluster still have access to netwrok drive. Please some one can correct if I am wring here? Thanks Dheiru
... View more