How to scale SplitJson queues?
- Labels: Apache NiFi
Created on 07-11-2018 05:15 PM - edited 08-18-2019 01:54 AM
I have a flow that captures data from a DBMS (SGDB) => converts it to Avro/JSON => SplitJson => publishes to Google Pub/Sub.
But flowfiles are accumulating in the queues, and I would like to scale the Split step (for example by using 3 processors) and then the publish to Google. Is that possible?
Created on 07-12-2018 04:27 AM - edited 08-18-2019 01:54 AM
Use the record-oriented processors to split the JSON array.
Try the approach below.
Feed the success relationship to a SplitRecord processor, then configure a Record Reader controller service to read the contents of the flowfile, and set the Record Writer to JsonRecordSetWriter.
Set the Records Per Split property to 1 and connect only the splits relationship of the SplitRecord processor to the PublishGCPubSub processor.
If you run into out-of-memory (OOM) issues, use a series of SplitRecord processors that reduce the split size in steps, so that each flowfile ends up holding 1 record (message).
Refer to this and this link regarding the use of a series of split processors.
Refer to this link regarding configuring the Record Reader/Writer controller services.
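To make the "series of splits" idea concrete, here is a minimal Python sketch of the logic only. It is not NiFi code and does not use any NiFi API. The function names `split_records` and `two_stage_split`, the first-stage size of 1000, and the choice to emit each record as a bare JSON object are all assumptions made for illustration. The point is that a coarse first split keeps the number of pieces created in one step small, and a second split then brings each piece down to 1 record.

```python
import json
from typing import Iterator, List


def split_records(records: List[dict], records_per_split: int) -> Iterator[List[dict]]:
    """Yield consecutive chunks holding at most records_per_split records."""
    if records_per_split < 1:
        raise ValueError("records_per_split must be >= 1")
    for start in range(0, len(records), records_per_split):
        yield records[start:start + records_per_split]


def two_stage_split(payload: str, first_stage_size: int = 1000) -> Iterator[str]:
    """Split a JSON array into one-record messages in two stages.

    Stage 1 mimics a first split with a large records-per-split value
    (first_stage_size is a hypothetical number, tune it to your data).
    Stage 2 mimics a second split with records per split set to 1.
    """
    records = json.loads(payload)
    for chunk in split_records(records, first_stage_size):
        for single in split_records(chunk, 1):
            # Assumption: each message is emitted as a bare JSON object.
            yield json.dumps(single[0])


if __name__ == "__main__":
    sample = json.dumps([{"id": i} for i in range(5)])
    for message in two_stage_split(sample, first_stage_size=2):
        print(message)
```

In the actual flow the same shape would be two SplitRecord processors in a row, the first with a larger Records Per Split value and the second with 1, with only the splits relationship of the last one feeding the publish processor.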
Created 07-12-2018 08:04 PM
Thanks very much @Shu