About MattWho

MattWho · ‎03-06-2017

@Eric Lloyd If you set an attribute on all your FlowFiles with the a value of "<year/month/day>" for the FlowFile, you can use that attribute as your "Correlation Attribute Name" in the mergeContent processor to make sure that only FlowFile from the same day are added to a bin.

MattWho · ‎03-06-2017

@Eric Lloyd The MergeContent processor adds FlowFiles from the incoming queue to virtual bins. Once the configured criteria on a bin is met all the FlowFile in that Bin are merged. So if you want to continue to merge incoming FlowFiles until X amount of time has passed then setting the "Max bin age" property is what you want. Note: Be careful how many FlowFiles you merge. The FlowFile attributes for all incoming FlowFiles being merged in a single bin live in the NiFi JVM heap memory. Merging to many FlowFiles at once can result in OutOfMemory (OOM) errors. There is no formula for the exact number you can merge per bundle/bin. It depends on how many attributes exist on a FlowFile and how large the values are associated to those attributes. Thanks, Matt

MattWho · ‎03-06-2017

@Ayaskant Das NiFi by default will use a user's SSL certificate if it is included by your browser during the connection to NiFi's URL. NiFi can be configured to use LDAP or Kerberos as alternate Authentication methods. Once configured, these alternate methods will be used only if a user does not pass a SSL certificate. Information about setting up LDAP or kerberos can be found in NiFo's Admin guide: https://nifi.apache.org/docs/nifi-docs/html/administration-guide.html#user-authentication Thanks, Matt

MattWho · ‎03-06-2017

@Gaurav Jain Please provide full use case and examples. It is difficult to provide assistance without the details. The more the better.

MattWho · ‎03-06-2017

@Mark Heydenrych While I like the idea, there is currently no way to have a log message written to a FlowFiles attribute upon routing to a failure relationship. You may want to open an Apache NiFi Jira around this idea. Typically the "failure" relationship is routed back on the source processor so that multiple attempts can be made to deliver the file. In cases like network hicups, duplicate files, etc. this makes a lot of sense. When dealing with processor config failures, permissions issues, etc. the file will never be successful. You could set up a failure count loop. This loop would create an attribute on FlowFiles that are routed to "failure" and continue to loop them back on PutHDFS until the count has reached a configured number. Once that count is reached, the FlowFiles could be routed out of the loop. You could then send a notification via putEmail of the failed FlowFile for user investigation. Here is a link to a retry count loop flow NiFi template: https://cwiki.apache.org/confluence/download/attachments/57904847/Retry_Count_Loop.xml?version=1&modificationDate=1433271239000&api=v2 Thanks, Matt

MattWho · ‎03-06-2017

@adrian white You can use the MergeContent processor followed by a RouteOnAttribute processor to accomplish what you are looking to do. The MergeContent processor writes an attribute named "merge.count" to the FlowFile containing all your merged source flowfiles. - Set Min entries to "50" - Set Max Entries to "50" - Set Max Bin age to "1 min" The bin age timer is trigger once the very first FlowFile is added. At the end of 1 min or 50 FlowFiles (whichever occurs first) the Bin will be merged. Connect the "merged" relationship to a RouteOnAttribute processor that checks the "merge.count" on the merged FlowFiles to verify that they contain 50 entries. "small" will become a new relationship to the RouteOnAttribute processor. - Auto-terminate the "small" relationship so that any merged FlowFile with fewer then 50 entries is deleted. - Route the "unmatched" relationship on down the rest of your dataflow. Thanks, Matt

MattWho · ‎03-03-2017

Is there anything in the nifi-user.log when you try to access the https web address? I also noticed the URLs are for 10.x.x.x web addresses. Are these address reachable from the system where you have your web browser loaded? Can you post a screenshot of your browser when you try to access the https web address? Matt

MattWho · ‎03-03-2017

@Ayaskant Das Now we know that your NiFi is up and running. We also know that it has been configured to run securely over https. After being secured you will not be able to access it over http. Https access will only work if user authentication is successful which comes full circle to my initial response. Here are a few links to assist you there: https://nifi.apache.org/docs/nifi-docs/html/administration-guide.html https://nifi.apache.org/docs/nifi-docs/html/administration-guide.html#user-authentication https://community.hortonworks.com/articles/17293/how-to-create-user-generated-keys-for-securing-nif.html Thanks, Matt

MattWho · ‎03-03-2017

those are normal... it is still starting. it may appear to stall at certain points during startup... Di you perhaps issue a shutdown command before it ever finished actually starting? keep tailing the log until you see it either show the URL line or it shuts down again on its own. This may taken several minutes. perhaps sharing your nifi-app.log after is fails again will help.

MattWho · ‎03-03-2017

@Ayaskant Das So your nifi shut back down because of some error that occurred during startup. That error will be in the nifi-app.log somewhere above the lines you posted. It will likely have a full stack trace with the error that includes the cause. Thanks, Matt

Online	Offline
Last Visited	‎01-22-2026 04:06 PM

Member Since	‎07-30-2019 10:41 AM
Last Visited	‎01-22-2026 04:06 PM
Posts	3,426
Kudos received	1627

Cloudera Community

Re: Best Practice for configuring registry flows

Re: Nifi 2.7.2 Start Problem

Re: Error importing NiFi workflow template from ve...

Re: Error importing NiFi workflow template from ve...

Re: How to elevate a default nifi user to admin - ...

Re: Merge Fileflow files based on time rather than...

Re: Merge Fileflow files based on time rather than...

Re: How can we pop up log in window asking user id...

Re: How to read multiple Excel/CSV file and write ...

Re: Capture error message in NiFi PutHDFS

Re: Can NiFi forward a merged content based on the...

Re: Unable to check access status in Apache Nifi(N...

Re: Unable to check access status in Apache Nifi(N...

Re: Unable to check access status in Apache Nifi(N...

Re: Unable to check access status in Apache Nifi(N...