Split large JSON file into multiple files with a specified number of records
Labels: Apache NiFi
Created on 11-27-2019 12:05 PM - last edited on 11-27-2019 12:10 PM by ask_bill_brooks
Hi,
I am new to NiFi and would like some guidance, please.
We want to split a large JSON file into multiple files, each with a specified number of records. I can split the file into individual records using SplitJson with the JsonPath Expression set to $.<objectthatcontainsthearray>.* I have also added an UpdateAttribute processor with filename set to ${filename}_${fragment.index} so that the files keep their sequence, as order is important.
However, we might want to split, say, 100,000 records into 100 files of 1,000 records each. What is the easiest way to do this?
Thanks very much in advance
Created ‎12-04-2019 06:23 AM
You'll want to use the SplitRecord processor to accomplish this.
Configure it with a Record Reader and a Record Writer that match your input and desired output data formats, and set Records Per Split to 1000.
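To make the expected result concrete, here is a minimal Python sketch of fixed-size record chunking. It is not NiFi code and not how SplitRecord is implemented; it only shows the outcome you should expect (100,000 records in, 100 ordered splits of 1,000 out). It assumes the records are already parsed into a list, and `split_records`, `name_splits`, and the `input.json` base name are hypothetical names chosen for illustration.

```python
import json
from typing import Dict, Iterator, List


def split_records(records: List[dict], records_per_split: int) -> Iterator[List[dict]]:
    """Yield consecutive chunks of at most records_per_split records, preserving order."""
    if records_per_split < 1:
        raise ValueError("records_per_split must be at least 1")
    for start in range(0, len(records), records_per_split):
        yield records[start:start + records_per_split]


def name_splits(base_name: str, records: List[dict], records_per_split: int) -> Dict[str, str]:
    """Map an indexed name (base_name + '_' + index) to the JSON text of each chunk.

    The naming mirrors the ${filename}_${fragment.index} pattern from the question;
    the zero-based index here is an assumption of this sketch.
    """
    return {
        f"{base_name}_{index}": json.dumps(chunk)
        for index, chunk in enumerate(split_records(records, records_per_split))
    }


# Hypothetical input: 100,000 small records standing in for the real array.
records = [{"id": i} for i in range(100_000)]
splits = name_splits("input.json", records, 1000)
print(len(splits))  # number of chunks produced
```

If the record count is not an exact multiple of the split size, the last chunk in this sketch simply holds the remainder.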
Hope this helps,
Matt
