- Subscribe to RSS Feed
- Mark Question as New
- Mark Question as Read
- Float this Question for Current User
- Bookmark
- Subscribe
- Mute
- Printer Friendly Page
Converting Large CSV into JSON
- Labels:
-
Apache NiFi
Created 06-18-2019 12:05 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
I have a relatively large CSV (~80GB) I need to transform into multiple JSON documents/records. I'm using a ConvertRecord processor with a CSVReader and AvroRecordSetWriter and that's where my CSV gets stuck. What's the best approach? Break up the CSV prior to converting it or try to get more horsepower on the server?
- Server Mem: 16GB
- Cores: 4
- Maximum Timer Driven Thread Count : 16
- Java Min/Max Heap: 2GB/10GB
Created 06-19-2019 01:21 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Try with series of SplitRecord processors to create smaller chunks of files.
Follow the similar approach mentioned in this thread and see if you get any performance with this approach.
Created 06-19-2019 01:21 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Try with series of SplitRecord processors to create smaller chunks of files.
Follow the similar approach mentioned in this thread and see if you get any performance with this approach.
