Support Questions

Find answers, ask questions, and share your expertise
Announcements
Now Live: Explore expert insights and technical deep dives on the new Cloudera Community BlogsRead the Announcement

Why doesn't hive bucketing work at partition level?

avatar
Frequent Visitor

Any hive bucketing experts here?

I have this scenario where my each of my individual partitions needs to be bucketed at a different level. I have tried the following scenarios but it doesn't work.

  1. Created a bucketed table and created a partition (date_id='2022-10-22') and set the bucketing number to 64.

Result: No of files created in the target partition is equivalent to no of buckets in the source table (6). Expected file count is 64.

  1. Created a non bucketed table and created a partition (date_id='2022-10-22') and set the bucketing number to 64. Result: No of files created the target is not related to source files count either

Please share your thoughts.

Thanks

1 REPLY 1

avatar
Master Collaborator

Could you kindly provide the DDL and a sample dataset to facilitate a more in-depth explanation?