About MattWho

MattWho · ‎03-07-2017

@Anishkumar Valsalam All Nodes will register with ZK to become the cluster coordinator when NiFi cluster is first started. Once all nodes have checked in to ZK or 5 minutes has passed a random node from those who connected will be picked as the cluster coordinator. ZK will also register one node as your primary node. Once a cluster coordinator has been elected, all nodes will start sending heartbeats directly to that node. The cluster coordinator assumes the role of disconnecting nodes from the cluster who do not send heartbeats and reconnecting nodes who heartbeat later after previously being disconnected. Nodes in a cluster also heartbeat with ZK. If either the primary node or cluster coordinator fails to heartbeat, another connected node(s) at random is elected to assume those roles. There is no ability for users to manually assign either of these roles to a specific node in a cluster. Thanks, Matt

MattWho · ‎03-07-2017

@Anishkumar Valsalam Not sure exactly what you are looking for. - Verify you can access the NIFi UI from any one of your nodes URLs. - Verify the UI shows 3/3 connected nodes. - Verify you can add components to the canvas (processors and connections) - Verify you can add controller services and reporting tasks. - Go into "cluster" UI and manually disconnect the "cluster coordinator" and/or " primary node". Make sure that a new node(s) is assigned these roles. - From "Cluster" UI make sure you can reconnect a disconnected node. Since your NiFi cluster is running unsecured there isn't much more to validate from an administration standpoint. - You don't have node to node mutual authentication to worry about - You don't have user authentication or authorization to worry about. Dataflows are built in a cluster the same way they are built in a standalone NiFi installation. Just keep in mind that every node in your cluster runs its own copy of the dataflow and works on its own set of FlowFiles. Thanks, Matt

MattWho · ‎03-06-2017

@Ayaskant Das Based on the output from the nifi-user.log, the following user successfully authenticated to NiFi but was not authorized view the NiFi UI. Did you configure this user as your "Initial Admin Identity"? The "Initial Admin Identity" is configured in the authorizers.xml file and is used to initially create both the users.xml and authorizations.xml files. It is case sensitive and white spaces do count. If you NiFi had previously been started with an incorrect value for "Initial Admin Identity" you will need to correct the authorizers.xml with the correct value, delete both users.xml and authorizations.xml files, and restart NiFi so that these files are created again. Thanks, Matt

MattWho · ‎03-06-2017

@Ayaskant Das @Joe Petro NiFi must be secured to run over HTTPS using a server certificates (loaded into a keystore) and a truststore before you can enable some form of user authentication. You can create your own keystore and truststore using the below procedure: https://community.hortonworks.com/articles/17293/how-to-create-user-generated-keys-for-securing-nif.html Note: If you are going to be using LDAP or Kerberos for user authentication, you can skip the last part of the above procedure about generating a user SSL certificate to load in your browser. Thanks, Matt

MattWho · ‎03-06-2017

@Eric Lloyd Start troubleshooting your flow by stepping a FlowFile through one processor at a time. Try right clicking on the connections and select "list queue". Clicking the "details" icon to the far left of a FlowFile in the queue list to get details of that FlowFile in that connection. Does the filename look correct? Thanks, Matt

MattWho · ‎03-06-2017

@Eric Lloyd Did you try using a "Demarcator" between your merged content? The MergeContent processor allows you to designate a Demarcator. Perhaps use new line or something else unique that can then be used later to differentiate between bits of content in the same HDFS file. Thanks, Matt

MattWho · ‎03-06-2017

@Eric Lloyd If you set an attribute on all your FlowFiles with the a value of "<year/month/day>" for the FlowFile, you can use that attribute as your "Correlation Attribute Name" in the mergeContent processor to make sure that only FlowFile from the same day are added to a bin.

MattWho · ‎03-06-2017

@Eric Lloyd The MergeContent processor adds FlowFiles from the incoming queue to virtual bins. Once the configured criteria on a bin is met all the FlowFile in that Bin are merged. So if you want to continue to merge incoming FlowFiles until X amount of time has passed then setting the "Max bin age" property is what you want. Note: Be careful how many FlowFiles you merge. The FlowFile attributes for all incoming FlowFiles being merged in a single bin live in the NiFi JVM heap memory. Merging to many FlowFiles at once can result in OutOfMemory (OOM) errors. There is no formula for the exact number you can merge per bundle/bin. It depends on how many attributes exist on a FlowFile and how large the values are associated to those attributes. Thanks, Matt

MattWho · ‎03-06-2017

@Ayaskant Das NiFi by default will use a user's SSL certificate if it is included by your browser during the connection to NiFi's URL. NiFi can be configured to use LDAP or Kerberos as alternate Authentication methods. Once configured, these alternate methods will be used only if a user does not pass a SSL certificate. Information about setting up LDAP or kerberos can be found in NiFo's Admin guide: https://nifi.apache.org/docs/nifi-docs/html/administration-guide.html#user-authentication Thanks, Matt

MattWho · ‎03-06-2017

@Gaurav Jain Please provide full use case and examples. It is difficult to provide assistance without the details. The more the better.

Member Since	‎07-30-2019 10:41 AM
Last Visited
Posts	3,132
Kudos received	1560

Cloudera Community

Re: Flowfile stuck in Wait in EnforceOrder process...

Re: Untrusted proxy error Authentication Failed o....

Re: REST API Configuration for NiFi 2.0

Re: Fileflow penalized for certain time before all...

Re: Nifi : Implement Sleep Mechanism in nifi witho...

Re: Nifi- Zookeeper - Coordinator Primary Connecte...

Re: Nifi Cluster Test Cases.

Re: Unable to check access status in Apache Nifi(N...

Re: How to display a NiFi login window for http we...

Re: Relative path in absolute URI error

Re: Output to HDFS separate events one per line

Re: Merge Fileflow files based on time rather than...

Re: Merge Fileflow files based on time rather than...

Re: How can we pop up log in window asking user id...

Re: How to read multiple Excel/CSV file and write ...