Member since: 11-16-2015
Posts: 911
Kudos Received: 668
Solutions: 249
My Accepted Solutions
| Views | Posted |
|---|---|
| 699 | 09-30-2025 05:23 AM |
| 1071 | 06-26-2025 01:21 PM |
| 930 | 06-19-2025 02:48 PM |
| 1100 | 05-30-2025 01:53 PM |
| 12263 | 02-22-2024 12:38 PM |
05-16-2018
11:52 AM
Looking at the parquet-avro code, I think your suggested workaround of changing decimal values to fixed is the right approach (for now). We could update the version of parquet-avro, but I didn't see anything in there that would improve your situation; it was Impala that needed to support more incoming types.
05-15-2018
04:37 PM
Since this isn't related to the original question, please ask it as its own standalone question and I'd be happy to answer it there. (The short answer is that you might be able to use UpdateAttribute to change the 4 to the correct column number for Table B, if you can figure out whether a flow file is for Table A or B.)
05-15-2018
02:22 PM
It comes from knowing that the ProcessContext passed into the script is actually a StandardProcessContext, which has some member variables you can get at with Groovy (provided there is no SecurityManager, as mentioned).
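If you want to see what's actually in there, here's a minimal ExecuteScript (Groovy) sketch, not taken from the original flow, that inspects the concrete context class via reflection; field names vary by NiFi version, so nothing specific is assumed:

```groovy
// 'context' and 'log' are the standard ExecuteScript bindings. This dumps the declared
// fields of the concrete ProcessContext implementation; setAccessible(true) will fail
// if a SecurityManager forbids it, as mentioned above.
def ctxClass = context.class
log.info("ProcessContext implementation: ${ctxClass.name}") // e.g. StandardProcessContext
ctxClass.declaredFields.each { field ->
    field.accessible = true
    log.info("  ${field.name} = ${field.get(context)}")
}
```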
05-15-2018
02:16 PM
What version of avro-tools are you using? You'll need version 1.8.x or later to support logical types. Also, you won't see human-readable data for a decimal type, as decimals are encoded in Avro as two's-complement, big-endian integer values, so they'll show up as something strange-looking in the JSON output.
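For illustration, here's a small Groovy sketch (the bytes and scale are made up, not from your file) showing how those two's-complement bytes plus the schema's scale map back to the original decimal value:

```groovy
import java.math.BigDecimal
import java.math.BigInteger

// Avro stores a decimal logical type as the two's-complement, big-endian bytes of the
// unscaled value; combined with the scale declared in the schema, you can rebuild the number.
byte[] raw = [0x30, 0x39] as byte[]   // example bytes: unscaled value 12345
int scale = 2                         // comes from the schema's "scale" attribute
def value = new BigDecimal(new BigInteger(raw), scale)
assert value == new BigDecimal('123.45')
println value                         // 123.45
```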
05-15-2018
01:43 AM
2 Kudos
Fetch Size is a hint to the driver that says "when I ask for a row, please send X rows over; if I need more rows, I will iterate through them before asking you to send more". If Fetch Size were 1, the driver would send a network request to the DB, the DB would send a single row back, the client would process it, and then another network request would be sent for the next row, and so on. This is very inefficient, as the network overhead would overtake the amount of data being passed (especially for tables with a small number of rows). On the other hand, if the entire table were sent in response to the first request, the client could run out of memory, especially for tables with a very large number of rows. As a tuning parameter, many drivers let the client specify the Fetch Size in order to select an appropriate number of rows to send per network request/response. For example, I think the Hive driver's default is 50 rows, or at least it used to be.

Maximum Number of Fragments is an advanced user property usually only used for debugging. It basically stops fetching rows after a certain number have been processed, even if the result set has more rows available. For example, if Max Rows Per Flow File were set to 100, Maximum Number of Fragments were set to 1, and a query returned 101 rows, then only 1 flow file (aka "fragment" in this case) would be sent, containing the first 100 rows. The 101st row would not be processed or sent in any flow file. Re-running the processor would have the exact same effect, and thus the 101st row is "unreachable". Maximum Number of Fragments was added to let the user avoid memory problems for large tables, as it puts an upper bound on the number of rows that will be processed. As mentioned, certain rows would not be available, but the processor wouldn't run out of memory either.

Hope this helps; please let me know if you'd like more clarification.
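To make the Fetch Size behavior concrete, here's a hedged plain-JDBC sketch in Groovy of roughly what a processor does while iterating a result set (the URL, credentials, and table name are placeholders, not anything from the question):

```groovy
import java.sql.DriverManager

// Placeholder connection details; fetchSize is only a hint to the driver (it may be
// capped or ignored), but it roughly controls how many rows come back per round trip.
def conn = DriverManager.getConnection('jdbc:hive2://example-host:10000/default', 'user', 'pass')
def stmt = conn.createStatement()
stmt.fetchSize = 1000                  // ask the driver to send ~1000 rows per network fetch
def rs = stmt.executeQuery('SELECT * FROM example_table')
int rows = 0
while (rs.next()) {
    rows++                             // the driver transparently fetches the next batch as needed
}
println "Processed ${rows} rows"
rs.close(); stmt.close(); conn.close()
```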
05-12-2018
07:26 PM
1 Kudo
The schema is a valid Avro schema, but the problem is that the CSVReader is taking only the second field (a double) and trying to convert it into an array; it does not realize you'd like to convert the remaining column values into an array. This seems like a decent improvement for the CSVReader; I'll think it over and perhaps write an Improvement Jira for it (or feel free to write one yourself). Since you want JSON as the target, you can use JoltTransformJSON to get the output you want. If it were an arbitrary output format, you could use UpdateRecord, but you'd have 35 or so user-defined fields to put the individual column values into the array. In any case, you can use the following schema to parse the column names (whether or not you have a header line; just make sure you skip any header line if using this schema):

{
"type": "record",
"namespace": "testavro.schema",
"name": "test",
"fields": [
{ "name": "timestamp", "type": {"type": "long","logicalType": "timestamp-millis"} },
{ "name": "tupvalue0", "type" : "double" },
{ "name": "tupvalue1", "type" : "double" },
{ "name": "tupvalue2", "type" : "double" },
{ "name": "tupvalue3", "type" : "double" },
{ "name": "tupvalue4", "type" : "double" },
{ "name": "tupvalue5", "type" : "double" },
{ "name": "tupvalue6", "type" : "double" },
{ "name": "tupvalue7", "type" : "double" },
{ "name": "tupvalue8", "type" : "double" },
{ "name": "tupvalue9", "type" : "double" },
{ "name": "tupvalue10", "type" : "double" },
{ "name": "tupvalue11", "type" : "double" },
{ "name": "tupvalue12", "type" : "double" },
{ "name": "tupvalue13", "type" : "double" },
{ "name": "tupvalue14", "type" : "double" },
{ "name": "tupvalue15", "type" : "double" },
{ "name": "tupvalue16", "type" : "double" },
{ "name": "tupvalue17", "type" : "double" },
{ "name": "tupvalue18", "type" : "double" },
{ "name": "tupvalue19", "type" : "double" },
{ "name": "tupvalue20", "type" : "double" },
{ "name": "tupvalue21", "type" : "double" },
{ "name": "tupvalue22", "type" : "double" },
{ "name": "tupvalue23", "type" : "double" },
{ "name": "tupvalue24", "type" : "double" },
{ "name": "tupvalue25", "type" : "double" },
{ "name": "tupvalue26", "type" : "double" },
{ "name": "tupvalue27", "type" : "double" },
{ "name": "tupvalue28", "type" : "double" },
{ "name": "tupvalue29", "type" : "double" },
{ "name": "tupvalue30", "type" : "double" },
{ "name": "tupvalue31", "type" : "double" },
{ "name": "tupvalue32", "type" : "double" },
{ "name": "tupvalue33", "type" : "double" },
{ "name": "tupvalue34", "type" : "double" }
]
}

Your JSONRecordSetWriter can inherit the schema or use the same one from the registry. Now you can send the output to either UpdateRecord or (for this case) JoltTransformJSON; the following is a Chain spec you can use to put the tupvalues into an array:

[
{
"operation": "shift",
"spec": {
"*": {
"timestamp": "[&1].timestamp",
"tupvalue*": "[&1].tupvalue[&(0,1)]"
}
}
}
]

This should give you the desired output, and it works on single lines of CSV or (more efficiently) on full CSV files.
05-08-2018
01:07 PM
4 Kudos
Do you mean MergeContent rather than UpdateAttribute? The former merges incoming flow files' content into outgoing flow file(s); the latter just adds/deletes/changes metadata about the flow files. If you mean MergeContent, try setting the Demarcator property to the newline character (\n); that should separate the incoming messages with a new line.
05-05-2018
01:42 AM
Unfortunately, at the time of this answer, that field is not being populated by the framework and thus doesn't show up in the output. I have written NIFI-5155 to cover this improvement. Please feel free to comment on the Jira case as to whether you'd like to see the IP, the hostname, or both. Thanks in advance!
05-04-2018
01:52 PM
I use the Advanced UI in the JoltTransformJSON processor or this webapp to test out specs. There are also a bunch of examples and docs in the javadoc, but they can be a bit difficult to follow. You can also search the jolt tag on StackOverflow for a number of questions, answers, and examples.
05-03-2018
09:33 PM
There's a bulletinNodeAddress field; it's probably an IP rather than a hostname (I didn't check). Would that work?