Created on 12-27-2023 10:13 PM - edited 12-27-2023 10:15 PM
Hey there,
I'm trying to create a nested JSON using SQL in NiFi, but the output I'm getting has the nested part as a string. Here's the query I'm using:
SELECT
    order_id,
    JSON_ARRAYAGG(
        JSON_OBJECT(
            'order_Item_Seq_Id', order_Item_Seq_Id,
            'product_Id', product_Id
        )
    ) AS order_item
FROM order_item
GROUP BY order_id;
The output looks like this:
[{
    "order_id": "10000",
    "order_item": "[{\"product_Id\": \"10007\", \"order_Item_Seq_Id\": \"00101\"}]"
}]
But I want it to be a valid nested JSON like this:
[
    {
        "order_id": "10000",
        "order_item": [
            {
                "product_Id": "10007",
                "order_Item_Seq_Id": "00101"
            }
        ]
    }
]
I'm hoping for a solution using Apache NiFi or Jolt.
Any help would be appreciated.
Thank you!
Created 12-28-2023 11:40 AM
Hi @Sadhana21 ,
I can think of two ways to do this:
1- This might take a few processors, but basically, after you get the output:
[{
    "order_id": "10000",
    "order_item": "[{\"product_Id\": \"10007\", \"order_Item_Seq_Id\": \"00101\"}]"
}]
1.1 First, use EvaluateJsonPath to extract the order_id, with the Destination set to flowfile-attribute.
1.2 Then use a second EvaluateJsonPath to extract the order_item, with the Destination set to flowfile-content and the Return Type set to json.
This will produce the following output:
[{"product_Id": "10007", "order_Item_Seq_Id": "00101"}]
1.3 Finally, use JoltTransformJson with the following spec:
[
    {
        "operation": "shift",
        "spec": {
            "*": {
                "#${order_id}": "[&1].order_id",
                "@": "[&1].order_item[]"
            }
        }
    }
]
This should give you the needed result.
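To make the logic of steps 1.1–1.3 concrete, here is a small Python sketch (not NiFi code, just my illustration of what combining the order_id attribute with the parsed flowfile content amounts to):

```python
import json

# After step 1.2 the flowfile content is the order_item array,
# and order_id was stored as a flowfile attribute in step 1.1.
attributes = {"order_id": "10000"}
content = '[{"product_Id": "10007", "order_Item_Seq_Id": "00101"}]'

# The Jolt shift above pairs the attribute with the parsed array in one record.
result = [{
    "order_id": attributes["order_id"],
    "order_item": json.loads(content),
}]
print(json.dumps(result, indent=2))
```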
2- Using just the UpdateRecord processor:
I like this option because it is the only processor you need. The only caveat is that you need to define a JsonRecordReader & JsonRecordSetWriter, where in the latter you have to define an Avro schema for the expected output. In UpdateRecord you can use the built-in function called "unescapeJson" to read a JSON string and return it as JSON:
https://nifi.apache.org/docs/nifi-docs/html/record-path-guide.html#unescapejson
UpdateRecord: (configuration screenshot not preserved)
JsonRecordSetWriter: (configuration screenshot not preserved)
Avro schema in the Schema Text property:
{
  "name": "nifi",
  "type": "record",
  "fields": [
    {
      "name": "order_id",
      "type": "string"
    },
    {
      "name": "order_item",
      "type": {
        "name": "order_item_array",
        "type": "array",
        "items": {
          "name": "order_item",
          "type": "record",
          "fields": [
            {
              "name": "product_Id",
              "type": "string"
            },
            {
              "name": "order_Item_Seq_Id",
              "type": "string"
            }
          ]
        }
      }
    }
  ]
}
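The processor configuration screenshots are not preserved above, but the effect of unescapeJson on each record can be sketched in Python (the field names follow the schema above; json.loads stands in for the record path function):

```python
import json

# A record as UpdateRecord sees it: order_item is still a JSON string.
record = {
    "order_id": "10000",
    "order_item": '[{"product_Id": "10007", "order_Item_Seq_Id": "00101"}]',
}

# unescapeJson parses the embedded JSON string into a real structure,
# which the JsonRecordSetWriter then serializes using the Avro schema above.
record["order_item"] = json.loads(record["order_item"])
```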
If that helps, please accept the solution.
Thanks
Created 12-28-2023 07:24 AM
@Sadhana21 Welcome to the Cloudera Community!
To help you get the best possible solution, I have tagged our NiFi experts @joseomjr and @cotopaul, who may be able to assist you further.
Please keep us updated on your post, and we hope you find a satisfactory solution to your query.
Regards,
Diana Torres
Created on 12-28-2023 12:25 PM - edited 12-28-2023 12:26 PM
...a 3rd option because I like scripted processors 😂... using ExecuteGroovyScript:
import groovy.json.JsonOutput
import groovy.json.JsonSlurper
import java.nio.charset.StandardCharsets

JsonSlurper jsonSlurper = new JsonSlurper()

FlowFile flowFile = session.get()
if (!flowFile) return

flowFile = session.write(flowFile, { inputStream, outputStream ->
    // Parse the incoming JSON array of records
    List<Map> data = jsonSlurper.parse(inputStream)
    // Re-parse each stringified order_item into a real JSON structure
    data.each {
        it.order_item = jsonSlurper.parseText(it.order_item)
    }
    // Serialize the repaired records back to the flowfile content
    outputStream.write(JsonOutput.toJson(data).getBytes(StandardCharsets.UTF_8))
} as StreamCallback)

session.transfer(flowFile, REL_SUCCESS)
Looks like a lot, but the jsonSlurper.parseText(it.order_item) line is what takes the string JSON and converts it to real JSON.
Created on 01-01-2024 09:29 PM - edited 01-01-2024 09:32 PM
Your input means a lot, and I appreciate your assistance @DianaTorres, @SAMSAL and @joseomjr.
Thank you all for sharing your knowledge and insights!