About mburgess

mburgess · ‎05-29-2019

In nifi-assembly/target you'll find the built system as you mention, including a "conf" folder that contains (among other things) a file called bootstrap.conf. In that file there's a commented out JVM property to enable attachment by a debugger (the preceding line says "Enable Remote Debugging". When you uncomment that argument and start NiFi, it will listen on port 8000 for a debugger to attach. You can then attach a debugger from your IDE (Eclipse, NetBeans, IntelliJ, etc.). You can change the port and/or set "suspend=y" if you want it to wait until a debugger is attached before continuing startup, the latter is helpful if you are debugging something early in the startup sequence. Otherwise you can wait for NiFi to finish starting up and then attach whenever you like.

mburgess · ‎05-24-2019

You were so close! By using the [] syntax it just adds to the outgoing array, but you wanted to associate them with the same index, namely the one matched by the * "above" the fields. Put #2 inside your braces (#2 is a reference to the array index you're iterating over, "two levels up" from where you are in the spec): [{ "operation": "shift", "spec": { "nummer": "Nummer", "table": { "*": { "zn": "Positionen.[#2].ZeileNr", "datum": "Positionen.[#2].Datum" } } } }]

mburgess · ‎05-10-2019

To address your comment below, I missed the part where you want to call the outgoing field "color". Change this line (8): "$": "colorsLove[].&2" To this: "$": "colorsLove[].color"

mburgess · ‎05-10-2019

This Chain spec will add the hardcoded value 20190905 into the array (after removing empty values): [ { "operation": "shift", "spec": { "color_*": { "": "TRASH", "*": { "$": "colorsLove[].&2" } }, "*": "&" } }, { "operation": "shift", "spec": { "colorsLove": { "*": { "#20190905": "colorsLove[#2].date", "*": "colorsLove[#2].&" } }, "*": "&" } }, { "operation": "remove", "spec": { "TRASH": "" } } ] You should be able to replace "#20190905" with a NiFi Expression Language statement, maybe something like: "#${now:toNumber():format('yyyyddMM')}" ... but I didn't try that part.

mburgess · ‎05-07-2019

What does the generated SQL coming from ConvertJSONToSQL look like? Are the fields correctly uppercased? Does your database lowercase the column names? Did you try setting the "Translate Field Names" property to "true" in ConvertJSONToSQL? Does the case of the table name in the SQL match the case of the table name in the DB? If you're using "dbo.xxxx" as the Table Name property in ConvertJSONToSQL, instead try using just "xxxx" as the Table Name, and setting either Catalog Name or Schema Name (depending on your DB) to "dbo" (or DBO if necessary).

mburgess · ‎05-07-2019

You should be able to use SplitXml -> PutHDFS, using the Split Depth property to specify where to do the tag splitting. Each tag at that depth will be output as a separate flow file which you can send to HDFS via the PutHDFS processor. You may need to use UpdateAttribute to set the filename attribute, which is used by PutHDFS as the target filename.

mburgess · ‎05-07-2019

If you are not obtaining keys from the database, not using fragmented transactions, and not rolling back on failure, then you should see the failed flow files in a batch being routed to the failure relationship. If you must configure the processor differently, then the flow files will be treated as a single transaction. In that case, in order to handle individual failures you'll want to not use batches, meaning set PutSQL's Batch Size property to 1.

mburgess · ‎05-02-2019

I don't think your desired output is valid JSON, as the root object only has an array in it, not a key/value pair. If you want a key in there (let's call it root) the following spec will work in JoltTransformJSON: [ { "operation": "shift", "spec": { "*": "root.[]" } } ] Otherwise if you just want to add braces around the array, you can use ReplaceText, replacing the entire thing with {$1}

mburgess · ‎04-29-2019

You can use the SiteToSiteProvenanceReportingTask for this. Filter the reporting task to only emit events at the "certain point" you mention above. Each event has a "timestampMillis" and "lineageStart" field, you should be able to route on the difference of the two using QueryRecord, with something like: SELECT * FROM FLOWFILE WHERE timestampMillis - lineageStart > 60000 Which should emit a flowfile containing all events for which the associated entity (in this case, the flow file in the system) has been in the flow for over a minute.

mburgess · ‎04-25-2019

You'll need to provide the CLOB as an attribute, meaning you've set attributes like sql.args.1.type to 2005 and sql.args.1.value to the CLOB value. Then your SQL statement would have a ? parameter, and the CLOB value will be inserted when the SQL statement is prepared. See NIFI-4352 for more information.

Online	Offline
Last Visited	‎01-18-2026 06:10 PM

Member Since	‎11-16-2015 02:21 PM
Last Visited	‎01-18-2026 06:10 PM
Posts	911
Kudos received	662

Cloudera Community

Re: Compare data within the JSON using NIFI

Re: how to join three csv files like sql on condit...

Re: How to see the Data Provenance and Lineage in ...

Re: Apache NiFi - RouteText has no matches

Re: Nifi Building error when creating a brand new ...

Re: How does one run NIFI in the debugger?

Re: NiFi - Need help with JOLT-Syntax JoltTransfor...

Re: Nifi - Jolt Transform JSON Processor - Create ...

Re: Nifi - Jolt Transform JSON Processor - Create ...

Re: ConvertJSONToSQL Issue: None of the fields in ...

Re: convert xml with multiple row tags to files us...

Re: Apache NiFi Reconciliation flow

Re: NiFi Jolttransform to package json array into ...

Re: Monitoring processing time for flowfiles in Ni...

Re: load flowfile content into clob datatype using...