Created 06-12-2020 03:16 PM
Hello,
Could you help me understand why my inserts into a transactional table (bucketed, stored as ORC) always involve one last reducer task?
Hive: 3.1.2, Tez: 0.9
I mean, the Hive query plan looks good: it creates an appropriate number of stages, mapper tasks, and reducer tasks for the volume of data, but it always ends with one last reducer that has only a single task.
I cannot understand why this is so, and it seems inefficient. I read that if there are multiple buckets, Hive is able to write to a transactional table simultaneously using multiple reducer tasks.
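To make the setup concrete, here is a minimal sketch of the kind of table and insert in question. The table names (`events_txn`, `events_staging`), the columns, and the bucket count are placeholders chosen for illustration, not the real schema. Running `EXPLAIN` on the insert is one way to see which operators end up in the final single-task reducer.

```sql
-- Hypothetical bucketed, ORC-backed transactional table (names are placeholders).
CREATE TABLE events_txn (
  id      BIGINT,
  payload STRING
)
CLUSTERED BY (id) INTO 8 BUCKETS
STORED AS ORC
TBLPROPERTIES ('transactional' = 'true');

-- Print the plan for the insert without running it; the reducer vertices
-- and the operators assigned to each one are listed in the output.
EXPLAIN
INSERT INTO TABLE events_txn
SELECT id, payload
FROM events_staging;  -- events_staging is an assumed source table
```

The operator tree under the last reducer in the `EXPLAIN` output should show what that stage is doing, which is usually the quickest way to tell whether it is writing data or doing some other work.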
Created 06-15-2020 06:53 AM
Does anyone have any thoughts on this?
Created on 06-17-2020 08:55 AM - edited 06-17-2020 08:58 AM
Could you provide some examples?