Can a columnar format occupy more space than a row format with Hive compression?

Even after converting to a columnar format such as Parquet, my files are turning out bigger than the equivalent sequence files. Is columnar storage a guaranteed way to reduce size, or are there kinds of data for which it fails?

1 ACCEPTED SOLUTION

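Yes, it is possible, mainly when a column contains little or no repetition.
Columnar encodings such as dictionary and run-length encoding rely on repeated values, so with nothing to exploit you end up storing the full data plus the columnar format's own metadata.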
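
To make this concrete, here is a rough sketch of why repetition matters. It is a toy run-length encoder, not Parquet's actual on-disk encoding; the function names and the "cells stored" cost model are assumptions made up for illustration.

```python
from itertools import groupby


def rle_encode(column):
    """Run-length encode a column into (value, run_length) pairs."""
    return [(value, sum(1 for _ in run)) for value, run in groupby(column)]


def encoded_cells(column):
    """Toy cost model (an assumption): each run stores a value and a count."""
    return 2 * len(rle_encode(column))


# Low-cardinality column with long runs: encodes very compactly.
repetitive = ["US"] * 500 + ["IN"] * 500

# Column where every value is different: every run has length 1.
unique = [f"id-{i}" for i in range(1000)]

repetitive_cells = encoded_cells(repetitive)
unique_cells = encoded_cells(unique)

print(len(repetitive), repetitive_cells)  # 1000 4
print(len(unique), unique_cells)          # 1000 2000
```

In this toy model the repetitive column shrinks from 1000 cells to 4, while the all-unique column grows to 2000 because each value now carries an extra count. Real formats fall back to plain encoding rather than doubling the data, but the overhead of the format's metadata remains.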

