Support Questions
Find answers, ask questions, and share your expertise
Announcements
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

Can columnar format occupy more space than row format in hive compression?

Solved Go to solution
Highlighted

Can columnar format occupy more space than row format in hive compression?

Explorer

Even after columnar compression techniques like parquet my files are turning out to be bigger than sequence files. I wanted to know that is columnar compression a sure shot way to compression or is there some kind of data which fails here.

1 ACCEPTED SOLUTION

Accepted Solutions

Re: Can columnar format occupy more space than row format in hive compression?

Yess its possible only and only if there are no repetition in a column.
In this case one will end up with the file and meta info of the columnar file format.

View solution in original post

1 REPLY 1

Re: Can columnar format occupy more space than row format in hive compression?

Yess its possible only and only if there are no repetition in a column.
In this case one will end up with the file and meta info of the columnar file format.

View solution in original post

Don't have an account?
Coming from Hortonworks? Activate your account here