Member since: 09-29-2015
Posts: 871
Kudos Received: 723
Solutions: 255
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
|  | 4260 | 12-03-2018 02:26 PM |
|  | 3202 | 10-16-2018 01:37 PM |
|  | 4306 | 10-03-2018 06:34 PM |
|  | 3165 | 09-05-2018 07:44 PM |
|  | 2424 | 09-05-2018 07:31 PM |
10-26-2016
02:46 PM
1 Kudo
Currently you copy the NAR to the lib directory of every server and restart NiFi; you could script this process. The NiFi community has talked about a future enhancement to have an extension registry where custom NARs could be uploaded, and then a given NiFi instance/cluster could load NARs from the registry. There is still a lot of design to be done, but this could potentially help with this scenario.
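As a sketch of scripting that copy, the loop below uses local directories as stand-in "nodes" so it is runnable as-is; on a real cluster each copy would be an scp and each restart an ssh to the node's nifi.sh (the NAR name and paths here are made up):

```python
# Sketch: push a custom NAR into every node's lib directory.
# The node lib dirs and NAR name are hypothetical stand-ins.
import pathlib
import shutil

nar = pathlib.Path("/tmp/my-custom-processors.nar")
nar.write_bytes(b"")                                # stand-in for the real NAR

lib_dirs = [pathlib.Path(f"/tmp/nifi-node{i}/lib") for i in (1, 2, 3)]
for lib in lib_dirs:
    lib.mkdir(parents=True, exist_ok=True)
    shutil.copy(nar, lib / nar.name)                # copy NAR into node's lib
    # on a real node, restart NiFi to pick up the NAR, e.g.:
    # subprocess.run(["ssh", host, "/opt/nifi/bin/nifi.sh", "restart"])

print(all((lib / nar.name).exists() for lib in lib_dirs))  # True
```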
10-25-2016
06:24 PM
Do you still have the original source of data outside of NiFi? If so, you could completely wipe your NiFi back to a clean slate by stopping NiFi, deleting all of the "_repository" directories, and restarting. Then you could configure appropriate back-pressure before trying to pick up these files again. If that is not an option, you probably need to figure out what is causing things to hang. Is it going out of memory and showing exceptions in nifi-app.log?
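The wipe step can be sketched like this; the install directory below is a throwaway demo path, and the default layout is assumed, where the repositories live under the NiFi install dir and end in "_repository":

```python
# Sketch: reset NiFi to a clean slate by deleting the repositories.
# WARNING: this irreversibly drops all queued/in-flight data.
import pathlib
import shutil

nifi_home = pathlib.Path("/tmp/nifi-wipe-demo")     # hypothetical install dir
for name in ("flowfile_repository", "content_repository", "provenance_repository"):
    (nifi_home / name).mkdir(parents=True, exist_ok=True)  # demo layout

# stop NiFi first on a real system (e.g. nifi.sh stop)
for repo in nifi_home.glob("*_repository"):
    shutil.rmtree(repo)                             # delete each repository
# then start NiFi again (e.g. nifi.sh start)

print(list(nifi_home.glob("*_repository")))         # []
```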
10-25-2016
06:15 PM
3 Kudos
There is no hard-coded limit, but there is definitely a practical limit in terms of performance. Flow file attributes are held in memory (in addition to being persisted to disk), so more attributes mean more Java objects on the heap, which means more garbage-collection pressure. I don't think there is any way to say exactly what the limit is because it depends on how much data is in the attributes, how much memory you have, how many flow files there are, etc.
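A back-of-envelope estimate can make the heap cost concrete. The per-value overhead figures below are rough assumptions for Java strings, not NiFi internals:

```python
# Rough heap estimate for flow file attributes.
# Assumptions: ~2 bytes per char for Java strings, ~40 bytes of object
# overhead per attribute value (both are ballpark figures, not measured).
flow_files = 1_000_000
attrs_per_file = 20
avg_chars = 50

bytes_per_attr = avg_chars * 2 + 40
total = flow_files * attrs_per_file * bytes_per_attr
print(total / 1e9)   # ~2.8, i.e. roughly 2.8 GB of heap for attribute values
```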
10-25-2016
03:03 PM
1 Kudo
I believe that "List Queue" would be a "View Data" policy on the source component, and "Empty Queue" would be a "Modify Data" policy on the source component. Also keep in mind that if you are clustered, all of the nodes in the cluster also need to be part of this policy, because all entities (users + machines) involved in the request need to be authorized for the data.
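As a sketch, a managed entry in authorizations.xml (file-based authorizer) for the "view the data" policy might look like the following; all identifiers, the processor id, and the node identities below are placeholders:

```xml
<!-- Hypothetical fragment of authorizations.xml: "view the data" is
     action R (and "modify the data" would be action W) on the
     component's /data resource; cluster node identities are included
     as users so node-proxied requests are authorized too. -->
<policy identifier="11111111-1111-1111-1111-111111111111"
        resource="/data/processors/22222222-2222-2222-2222-222222222222"
        action="R">
    <user identifier="admin-user-uuid"/>
    <user identifier="node1-identity-uuid"/>
    <user identifier="node2-identity-uuid"/>
</policy>
```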
10-25-2016
01:54 PM
1 Kudo
Well, there would be a listener on each cluster node, but it is up to you to route the data to each of those listeners if you want to use them all. If you have a cluster of 3 NiFi nodes and you set up syslog to push data to node 1, then you are only using the listener on node 1 and the other two listeners aren't doing anything. You would need to have the syslog agent distribute the data to all 3 listeners, or you would need to put a load balancer in front of NiFi, have the syslog agent send to the load balancer, and let the load balancer distribute to the 3 nodes.
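The load-balancer option could be sketched with HAProxy in TCP mode; the hostnames and ports below are made up, and the backend ports should match whatever port the listener processor on each node is configured for:

```
# haproxy.cfg sketch (hypothetical hosts/ports): spread incoming syslog
# TCP traffic across the listeners on all 3 NiFi nodes.
frontend syslog_in
    bind *:514
    mode tcp
    default_backend nifi_syslog

backend nifi_syslog
    mode tcp
    balance roundrobin
    server nifi1 nifi-node1:6514 check
    server nifi2 nifi-node2:6514 check
    server nifi3 nifi-node3:6514 check
```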
10-25-2016
01:24 PM
1 Kudo
You definitely could do this if you wanted to, but it would probably become a bit unmanageable with a significant number of processors. This post from yesterday talks about how to create a separate log file for LogAttribute; it would be the same for any other processor: https://community.hortonworks.com/questions/63071/in-apache-ni-fi-how-can-i-log-all-the-flowfile-att.html
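The approach in that post is a logback change. As a sketch, an entry like this in NiFi's logback.xml routes LogAttribute output to its own file (the appender name, file path, and pattern here are made up; the logger name is the processor's class):

```xml
<!-- Hypothetical logback.xml fragment: a dedicated rolling file for
     one processor class, with additivity off so entries don't also
     land in nifi-app.log. -->
<appender name="LOG_ATTRIBUTE_FILE" class="ch.qos.logback.core.rolling.RollingFileAppender">
    <file>logs/log-attribute.log</file>
    <rollingPolicy class="ch.qos.logback.core.rolling.TimeBasedRollingPolicy">
        <fileNamePattern>logs/log-attribute_%d.log</fileNamePattern>
        <maxHistory>5</maxHistory>
    </rollingPolicy>
    <encoder class="ch.qos.logback.classic.encoder.PatternLayoutEncoder">
        <pattern>%date %level %msg%n</pattern>
    </encoder>
</appender>
<logger name="org.apache.nifi.processors.standard.LogAttribute" level="INFO" additivity="false">
    <appender-ref ref="LOG_ATTRIBUTE_FILE"/>
</logger>
```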
10-25-2016
01:09 PM
There is currently no way to do this. The idea of chaining together a series of processors and having them operate as if they were one processor has been discussed before, and there are some concepts in the framework that could possibly help support this in the future, but currently it does not exist.
10-25-2016
01:04 PM
The Message Queue is in memory, so anything in there would be lost if the node crashed. You could keep the Max Size of Message Queue really small, possibly even set it to 1, to avoid losing anything, but this may not work well for performance. You really need an application-level protocol that can send acknowledgements back to the sender when data is successfully written to a flow file; if the sender never receives an ack, then it can re-send. There is a ListenRELP processor that does this; it is just like ListenTCP, but the RELP protocol allows for acknowledgements.
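To make the idea concrete, here is a minimal sketch of such an ack protocol (this is not RELP itself, just the pattern): the receiver replies "ACK" only after the message is durably written, and the sender treats a timeout as a signal to re-send. It runs sender and receiver in one process for demonstration:

```python
# Minimal application-level ack sketch: the receiver acks only after
# the data is fsync'd to disk; the sender re-sends on timeout.
import os
import socket
import tempfile
import threading

def receiver(server_sock, out_path):
    conn, _ = server_sock.accept()
    with conn, open(out_path, "ab") as out:
        data = conn.makefile("rb").readline()   # one message per line
        out.write(data)
        out.flush()
        os.fsync(out.fileno())                  # durable before acking
        conn.sendall(b"ACK\n")

srv = socket.socket()
srv.bind(("127.0.0.1", 0))                      # ephemeral demo port
srv.listen(1)
port = srv.getsockname()[1]
out_path = tempfile.NamedTemporaryFile(delete=False).name
t = threading.Thread(target=receiver, args=(srv, out_path))
t.start()

with socket.create_connection(("127.0.0.1", port), timeout=5) as c:
    c.sendall(b"hello syslog\n")
    c.settimeout(5)                             # timeout here would mean re-send
    assert c.makefile("rb").readline() == b"ACK\n"
t.join()
print(open(out_path, "rb").read())              # b'hello syslog\n'
```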
10-25-2016
01:46 AM
5 Kudos
Here is what you would need to do to configure PublishKafka to talk to a kerberized Kafka...

1) You can either rely on /etc/krb5.conf, or you can tell NiFi to use a specific krb5.conf by setting nifi.kerberos.krb5.file= in nifi.properties to point to some other krb5.conf file.

2) Create a JAAS file, let's say kafka-jaas.conf, with the following (changing the keyTab path to the appropriate path):

```
KafkaClient {
  com.sun.security.auth.module.Krb5LoginModule required
  useKeyTab=true
  storeKey=true
  keyTab="/path/to/nifi-iotdemot.keytab"
  serviceName="kafka"
  principal="nifi/iotdemo.field.hortonworks.com@LAKE";
};
```

3) Specify this in NiFi's bootstrap.conf:

```
java.arg.15=-Djava.security.auth.login.config=/path/to/kafka-jaas.conf
```

4) Configure PublishKafka:

```
Security Protocol = PLAINTEXTSASL
Service Name = kafka
```

The service name should match what is in the JAAS file above. You don't need to do any of the other stuff related to ZooKeeper, unless your NiFi instance is also using a kerberized ZooKeeper for NiFi's state management. The above steps are only for NiFi talking to Kafka.
10-24-2016
10:39 PM
1 Kudo
That means the user you are logging in as does not have permission to access the UI. You can check nifi-user.log to see the user identity coming from your request (it should be the DN of your cert) and compare that to what is in users.xml and authorizations.xml. If this is your "initial admin" identity, then it should have been entered in authorizers.xml as the initial admin, and that would have granted it all the correct permissions. If you had already tried to set up an initial admin before, then you need to delete users.xml and authorizations.xml before trying to change the "initial admin"; otherwise it won't take effect.
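The reset step can be sketched as follows; the conf path is a throwaway demo directory standing in for NiFi's conf dir, and NiFi must be stopped before deleting the files so it regenerates them from the Initial Admin Identity in authorizers.xml on restart:

```python
# Sketch: delete users.xml and authorizations.xml so NiFi regenerates
# them from authorizers.xml on the next start. Demo path is hypothetical.
import pathlib

conf = pathlib.Path("/tmp/nifi-demo/conf")      # stand-in for NiFi's conf dir
conf.mkdir(parents=True, exist_ok=True)
(conf / "users.xml").touch()                    # pretend these already exist
(conf / "authorizations.xml").touch()

# stop NiFi first on a real system
for name in ("users.xml", "authorizations.xml"):
    p = conf / name
    if p.exists():
        p.unlink()                              # delete so they regenerate
# start NiFi again; the initial admin is re-seeded from authorizers.xml

print(sorted(f.name for f in conf.iterdir()))   # []
```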