Created 01-25-2023 02:13 AM
We using PutHive3Streaming processor to send data to Hive from NiFi, I have an issue where we are getting LOTS of small delta files on our busier feeds which is causing issues with compaction etc.
I have used a series of merges in NiFi to ensure each flowfile contains many thousand records but it still creates many delta files.
I wondered if anyone had any advice on tuning 'Records Per Transaction' and 'Transactions per Batch' options on the PutHive3Streaming processor, I believe this could help with my issue but have had mixed/confusing results from testing. There isn't a great deal of information on best practice that I have found.
Has anyone else had similar issues/found adjustments helpful?
Created 02-02-2023 07:07 AM
Want to get a detailed solution you have to login/registered on the community
Register/LoginCreated 02-02-2023 07:07 AM
Want to get a detailed solution you have to login/registered on the community
Register/LoginCreated 02-05-2023 11:08 PM
@Griggsy, Has the reply helped resolve your issue? If so, please mark the appropriate reply as the solution, as it will make it easier for others to find the answer in the future.
Regards,
Vidya Sargur,