- Subscribe to RSS Feed
- Mark Question as New
- Mark Question as Read
- Float this Question for Current User
- Bookmark
- Subscribe
- Mute
- Printer Friendly Page
JSONtoSQL with JSON containing an array with a variable amount of values
- Labels:
-
Apache NiFi
Created ‎03-07-2017 02:02 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hi,
I want to put the following JSON-File into a DB-table:
{ "Time":"03/07/2017 11:45:46.365", "Subscription":"1234567", "ServiceCodes":[ "SERVICE_CODE1", "SERVICE_CODE2", "SERVICE_CODE3", "SERVICE_CODE4" ] }
The amount of servicecodes is variable, so there are JSONs, which consists 4 ServiceCodes but there are also some which just have 1 to 3 servicecodes.
It would be perfect to put it in the table having one row per Servicecode:
TIME | SUBSCRIPTION | SERVICECODE |
03/07/2017 11:45:46.365 | 1234567 | SERVICE_CODE1 |
03/07/2017 11:45:46.365 | 1234567 | SERVICE_CODE2 |
But I don't know if this is possible with nifi.
If not, I would be ok with having it as a comma-seperated list:
TIME | SUBSCRIPTION | SERVICECODE |
03/07/2017 11:45:46.365 |
1234567 |
SERVICE_CODE1, SERVICE_CODE2, SERVICE_CODE3, SERVICE_CODE4 |
At the moment I am using a CONVERTJSONTOSQL-Processor in Combination with a PUTSQL-Processor. This is working for TIME and SUBSCRIPTION, but SERVICECODE is always empty. In the OracleDB ServiceCode is a varchar2(250), but this could be changed.
Any suggestions how to do this?
Thanks a lot for your help.
Rgrds,
Chris
Created ‎03-07-2017 02:42 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
To put each of the ServiceCodes values into its own "row", you can use JoltTransformJSON with the following shift specification:
{ "operation": "shift", "spec": { "ServiceCodes": { "*": { "@(2,Time)": "[&].Time", "@(2,Subscription)": "[&].Subscription", "@": "[&].ServiceCode" } } } }
Given your input above, it will produce the following:
[ { "ServiceCode" : "SERVICE_CODE1", "Subscription" : "1234567", "Time" : "03/07/2017 11:45:46.365" }, { "ServiceCode" : "SERVICE_CODE2", "Subscription" : "1234567", "Time" : "03/07/2017 11:45:46.365" }, { "ServiceCode" : "SERVICE_CODE3", "Subscription" : "1234567", "Time" : "03/07/2017 11:45:46.365" }, { "ServiceCode" : "SERVICE_CODE4", "Subscription" : "1234567", "Time" : "03/07/2017 11:45:46.365" } ]
This might be able to go directly into ConvertJSONToSQL, but if it doesn't, you can use SplitJSON with $[*] or $.* as the JSON Path expression, and it will divide the array up into one flow file per object in the array. Then you should be able to transform it to SQL.
Created ‎03-07-2017 02:38 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
This should be possible using JOLTJsonTransform processor.
It helps you flatten your json. Output still remains json. It's a little more complex to learn so might take a day or two to understand all of it but its very handy. You should look at both the links below:
Created ‎03-08-2017 09:00 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Thanks. Managed to do so with the Jolt Processor.
Created ‎03-07-2017 02:42 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
To put each of the ServiceCodes values into its own "row", you can use JoltTransformJSON with the following shift specification:
{ "operation": "shift", "spec": { "ServiceCodes": { "*": { "@(2,Time)": "[&].Time", "@(2,Subscription)": "[&].Subscription", "@": "[&].ServiceCode" } } } }
Given your input above, it will produce the following:
[ { "ServiceCode" : "SERVICE_CODE1", "Subscription" : "1234567", "Time" : "03/07/2017 11:45:46.365" }, { "ServiceCode" : "SERVICE_CODE2", "Subscription" : "1234567", "Time" : "03/07/2017 11:45:46.365" }, { "ServiceCode" : "SERVICE_CODE3", "Subscription" : "1234567", "Time" : "03/07/2017 11:45:46.365" }, { "ServiceCode" : "SERVICE_CODE4", "Subscription" : "1234567", "Time" : "03/07/2017 11:45:46.365" } ]
This might be able to go directly into ConvertJSONToSQL, but if it doesn't, you can use SplitJSON with $[*] or $.* as the JSON Path expression, and it will divide the array up into one flow file per object in the array. Then you should be able to transform it to SQL.
Created ‎03-08-2017 08:59 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Perfect answer. Worked out perfectly. Thanks a lot.
