Support Questions
Find answers, ask questions, and share your expertise
Announcements
Alert: Please see the Cloudera blog for information on the Cloudera Response to CVE-2021-4428

in some ORC tables in Hive we get duplicate partition "base" directory inside base directory

Contributor

​Hi All,

 

Periodically, in some ORC tables in Hive we get duplicate partition "base" directory inside /table_name/partition_date=/base/ meaning: all contents of /table_name/partition_date=/base/* are in /table_name/partition_date=/base/base/*. After that partition become bad and from this bad partition we can’t do select count(*) or any other selects because of error occurring.

 

But when we dropping duplicate “base” directory problem goes away.

 

Why we got this duplicate folder in our buckets?

0 REPLIES 0