Created 08-06-2019 07:52 PM
I have JSON input of the following format:
{
  "Id": 1000000,
  "ReportName": "TestReport",
  "Results": [{
    "Id": 1,
    "ResultId": "1000000-0",
    "Query": {
      "Id": "001",
      "Name": "TestQuery0"
    }
  }, {
    "Id": 2,
    "ResultId": "1000000-1",
    "Query": {
      "Id": "002",
      "Name": "TestQuery1"
    }
  }]
}
These files can become quite large depending on the number of Results in the Report, and I was hoping to convert the single FlowFile into multiple records for processing. However, due to the format of the JSON, a SplitRecord will result in one record per split. There is one report per FlowFile and therefore only one root-level element.
I am looking for a method or strategy to split the FlowFile into smaller Records while still maintaining the cohesiveness of the report in the end when it is put into HDFS.
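For illustration, splitting on the Results array while repeating the report-level fields on each record would ideally yield something like the following (field layout is just a sketch of what I'm after, not a required schema):

```json
[
  {"Id": 1000000, "ReportName": "TestReport", "ResultId": "1000000-0",
   "Query": {"Id": "001", "Name": "TestQuery0"}},
  {"Id": 1000000, "ReportName": "TestReport", "ResultId": "1000000-1",
   "Query": {"Id": "002", "Name": "TestQuery1"}}
]
```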
Current Strategy:
Any advice would be appreciated! Thank you
Created 08-09-2019 05:46 PM
If I am reading your use case correctly, I think you're looking for what the ForkRecord processor does; it allows you to fork a (usually single) record into multiple records based on a RecordPath (similar to JsonPath but with different syntax and expressiveness), optionally keeping the "root" elements common to each outgoing record.
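To make the fork semantics concrete, here is a small Python sketch (not NiFi code, just an illustration) of what forking the sample report on a path like /Results does when the root-level fields are included on each outgoing record; the function name and merge behavior are my own assumptions for the example:

```python
import json

def fork_results(report: dict) -> list[dict]:
    """Fork one report record into one record per Results element,
    copying the root-level fields onto each outgoing record.
    (Illustrative only; in this sketch a child field such as "Id"
    overwrites the same-named root field on a name clash.)"""
    parent = {k: v for k, v in report.items() if k != "Results"}
    return [{**parent, **result} for result in report.get("Results", [])]

report = json.loads("""
{
  "Id": 1000000,
  "ReportName": "TestReport",
  "Results": [
    {"Id": 1, "ResultId": "1000000-0", "Query": {"Id": "001", "Name": "TestQuery0"}},
    {"Id": 2, "ResultId": "1000000-1", "Query": {"Id": "002", "Name": "TestQuery1"}}
  ]
}
""")

records = fork_results(report)
print(len(records))               # 2
print(records[0]["ReportName"])   # TestReport
```

Each outgoing record then carries the report context (ReportName, etc.) on its own, so downstream processors can treat the Results independently and the report can still be reassembled by its shared fields later.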
Created 10-08-2019 01:36 PM
Thank you for the response. This was the correct answer, but I was unable to verify until recently.