Member since: 11-16-2015
Posts: 902
Kudos Received: 664
Solutions: 249
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| | 231 | 09-30-2025 05:23 AM |
| | 656 | 06-26-2025 01:21 PM |
| | 497 | 06-19-2025 02:48 PM |
| | 748 | 05-30-2025 01:53 PM |
| | 10964 | 02-22-2024 12:38 PM |
05-15-2017 01:40 PM
1 Kudo
I am having trouble importing the "etree" module; I have tried with brew-installed Python 2.7 and Anaconda 2.7 (where I believe the etree submodule is part of "xml", not "lxml"). Do I need any additional configuration?

Looking in the lxml package, I see some native libraries (.so files, e.g.). If lxml is a native library, Jython (the "python" script engine in ExecuteScript) will not be able to load/execute it. All imported modules (and their dependencies) must be pure Python (no native code such as CPython extensions) for Jython to execute the script successfully. Perhaps there is a different library you can use? If you don't have a requirement on Jython/Python, consider using Javascript, Groovy, or Clojure instead; their Module Directory property allows you to use third-party Java libraries, such as NekoHTML, JTidy, or JSoup, to accomplish this conversion (a Groovy/JSoup sketch follows below).
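If you end up using Groovy with JSoup, a minimal sketch might look like this (assuming the JSoup JAR is available via the Module Directory property; treat it as an illustration, not a drop-in solution):

```groovy
import org.apache.commons.io.IOUtils
import org.apache.nifi.processor.io.StreamCallback
import java.nio.charset.StandardCharsets
import org.jsoup.Jsoup
import org.jsoup.nodes.Document
import org.jsoup.nodes.Entities

def flowFile = session.get()
if (!flowFile) return
flowFile = session.write(flowFile, { inputStream, outputStream ->
    // Parse the (possibly sloppy) HTML content of the flow file
    def doc = Jsoup.parse(IOUtils.toString(inputStream, StandardCharsets.UTF_8))
    // Have JSoup emit XML-syntax (XHTML-style) output
    doc.outputSettings().syntax(Document.OutputSettings.Syntax.xml)
    doc.outputSettings().escapeMode(Entities.EscapeMode.xhtml)
    outputStream.write(doc.outerHtml().getBytes(StandardCharsets.UTF_8))
} as StreamCallback)
session.transfer(flowFile, REL_SUCCESS)
```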
05-13-2017 02:38 AM
In a JOLT spec, if you don't explicitly provide a transformation for a particular field, it will be excluded. So you can include matching rules for the fields you care about (i.e., those that have a certain value), and the rest will be discarded. Check the "Filter data from an Array, based on a leaf level value" example at the JOLT Demo online app.
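For example, a shift spec along these lines (the "entities" and "type" names are hypothetical, mirroring that demo example) keeps only the array entries whose "type" is "alpha":

```json
[
  {
    "operation": "shift",
    "spec": {
      "entities": {
        "*": {
          "type": {
            "alpha": {
              "@2": "entities[]"
            }
          }
        }
      }
    }
  }
]
```

Entries whose "type" has any other value match no rule and are therefore excluded from the output.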
05-13-2017 02:33 AM
As of NiFi 1.2.0, after GetFile you can use the PutDatabaseRecord processor with a CSVReader, giving it a schema if you know the layout; or, if the CSV file has a header row, you can get the column names from it by choosing "Use String Fields From Header" for the Schema Access Strategy property.

Prior to NiFi 1.2.0, after GetFile you probably want a SplitText to get each row into its own flow file. Do you know the number of columns in the CSV file? If so, you can use a regex in ExtractText; assuming there were four columns, you might have a dynamic property called "column" set to something like:

([^,]+),([^,]+),([^,]+),([^,]+)

That should give you attributes "column.1", "column.2", "column.3", and "column.4". Then you can use ReplaceText to generate a SQL statement, perhaps something like:

INSERT INTO myTable VALUES(${column.1},${column.2},${column.3},${column.4})

Then send that to PutSQL.

Another option (prior to NiFi 1.2.0) is to convert the CSV to Avro (see this HCC article), then ConvertAvroToJSON, then ConvertJSONToSQL, then PutSQL.
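To illustrate the ExtractText/ReplaceText approach above with a hypothetical row, if a split flow file contained `1,John,Smith,NY`, ExtractText would produce the attributes below and ReplaceText would then generate the INSERT shown:

```
column.1 = 1
column.2 = John
column.3 = Smith
column.4 = NY

INSERT INTO myTable VALUES(1,John,Smith,NY)
```

Note that for real data you would likely need to quote string columns in the ReplaceText template (e.g. '${column.2}') so the generated statement is valid SQL.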
05-11-2017 03:17 PM
1 Kudo
Since a template is XML, you could use an XSLT to replace the values of those properties. Alternatively, scripting languages such as Python and Groovy can handle XML fairly easily, so you could write a script to replace the values; a sketch follows below. Ideally these properties would support Expression Language; I have written NIFI-3867 to cover this improvement.
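For example, here is a minimal Groovy sketch (the file name, property name, and new value are hypothetical):

```groovy
import groovy.xml.XmlUtil

// NiFi templates store processor properties as <entry><key>...</key><value>...</value></entry>
def root = new XmlParser().parse(new File('my_template.xml'))

// Find the entries for the property of interest and replace their values
root.depthFirst()
    .findAll { it.name() == 'entry' && it.key.text() == 'Remote URL' }
    .each { entry -> if (entry.value) entry.value[0].value = 'http://new-host:8080/contentListener' }

new File('my_template_updated.xml').text = XmlUtil.serialize(root)
```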
05-10-2017 12:52 PM
There are too many IOUtils.toString() calls there; the "text" line should read:

text = IOUtils.toString(inputStream, StandardCharsets.ISO_8859_1)
05-09-2017 03:27 PM
2 Kudos
Try the following for the XPath:

string(/queryResponse/@last)

Also ensure the Destination property is "flowfile-content" and Return Type is "string"; this will ensure the value of the attribute is written as the content of the outgoing flow file.
05-03-2017 01:15 PM
Can you provide some sample input? I tried with a tab-separated file that contained a \n in the column (with the line ending in \n\r), and your script worked fine. I tried replacing the delimiter value with \t instead of an actual tab character, and it seemed to work fine too.
04-24-2017 05:44 PM
2 Kudos
The "Out" number is the (5 minute rolling window) amount of data (count of flow files / size of flow files) that the processor has transferred (not that is queued). Check the Anatomy of a Processor section of the NiFi User's Guide, it has explanations of the statistics and other indicators on a processor.
04-24-2017 04:54 PM
1 Kudo
If the JSON content is not too large to fit in memory, you could use ExecuteScript for this. Groovy has an XmlSlurper that can parse your XML clob (assuming it has been placed in an attribute via EvaluateJsonPath), and a JsonSlurper (and JsonOutput) that can read/write JSON as objects. For example, given the input:

{
"key": "k1",
"clob": "<root><attribute><name>attr1</name><value>Hello</value></attribute><attribute><name>attr2</name><value>World!</value></attribute></root>"
}

You could use the following Groovy script in ExecuteScript:

import org.apache.commons.io.IOUtils
import org.apache.nifi.processor.io.StreamCallback
import java.nio.charset.*
import groovy.json.*
import groovy.util.*
def flowFile = session.get()
if (!flowFile) return
try {
flowFile = session.write(flowFile,
{ inputStream, outputStream ->
def text = IOUtils.toString(inputStream, StandardCharsets.UTF_8)
// Parse JSON into object
def json = new JsonSlurper().parseText(text)
// Parse XML from clob field into object
def xml = new XmlSlurper().parseText(json.clob)
// Add a field to the JSON for each "attribute" tag in the XML
xml.attribute.each { a ->
json[a.name.toString()] = a.value.toString()
}
// Remove the clob field
json.remove('clob')
// Write the updated JSON object as the flow file content
outputStream.write(JsonOutput.prettyPrint(JsonOutput.toJson(json)).getBytes(StandardCharsets.UTF_8))
} as StreamCallback)
flowFile = session.putAttribute(flowFile, "filename", flowFile.getAttribute('filename').tokenize('.')[0]+'_with_clob_fields.json')
session.transfer(flowFile, REL_SUCCESS)
} catch(Exception e) {
log.error('Error extracting XML fields into JSON', e)
session.transfer(flowFile, REL_FAILURE)
}

For the given input, it generates the following output:

{
"attr1": "Hello",
"attr2": "World!",
"key": "k1"
}

The script extracts the XML text from the "clob" field, parses it into an object with XmlSlurper, finds the individual "attribute" tags within, and adds each name/value pair to the original JSON object. For instances where the clob is not too large, it might be helpful to have an "xPath()" or "xmlPath()" function in NiFi Expression Language (like the jsonPath() function added in NIFI-1660). Please feel free to file a Jira case to add this feature.
04-24-2017 04:33 PM
2 Kudos
If a single flow file contains an array and you want to manipulate values within, then @Andy LoPresto's solution is recommended. From your comment on his answer, it appears you want to compute the average across multiple flow files. From a flow perspective, how would you know when you were "done" calculating the average? Will you have a running average calculated from sum-so-far and count-so-far? Or do you want to take X flow files in, calculate the average, then output the X flow files (or perhaps a single one) with the average for those X flow files?

NiFi 1.2.0 (having implemented NIFI-1582) will include the ability to store and calculate state using UpdateAttribute. This can be used to maintain "sum" and "count" attributes, which at any given point would let you calculate the running average.

In the meantime (or alternatively), you could use ExecuteScript or InvokeScriptedProcessor to perform this same function. It would be similar to Andy's approach, but would also store the sum-so-far and count-so-far in the processor's State Map. If you are calculating a running average and want to output each flow file as it comes in (adding a "current average" attribute, for example), you can use ExecuteScript; see the sketch below. If you want to keep the incoming flow files until a total average can be calculated, you'd need InvokeScriptedProcessor.
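Here is a rough sketch of the ExecuteScript approach in Groovy (the "value" attribute name is hypothetical, and state handling is simplified; treat it as an outline rather than production code):

```groovy
import org.apache.nifi.components.state.Scope

def flowFile = session.get()
if (!flowFile) return

// Read sum-so-far and count-so-far from the processor's State Map
def stateManager = context.stateManager
def state = stateManager.getState(Scope.LOCAL).toMap()
def sum = (state['sum'] ?: '0') as double
def count = (state['count'] ?: '0') as long

// Fold this flow file's value into the running totals
sum += (flowFile.getAttribute('value') as double)
count += 1
stateManager.setState([sum: sum as String, count: count as String], Scope.LOCAL)

// Add the current running average as an attribute and pass the flow file along
flowFile = session.putAttribute(flowFile, 'current.average', (sum / count) as String)
session.transfer(flowFile, REL_SUCCESS)
```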