Archives of Support Questions (Read Only)

This is an archived board for historical reference. Information and links may no longer be available or relevant
Announcements
This board is archived and read-only for historical reference. To ask a new question, please post a new topic on the appropriate active board.

Can columnar format occupy more space than row format in hive compression?

avatar
Contributor

Even after columnar compression techniques like parquet my files are turning out to be bigger than sequence files. I wanted to know that is columnar compression a sure shot way to compression or is there some kind of data which fails here.

1 ACCEPTED SOLUTION

avatar

Yess its possible only and only if there are no repetition in a column.
In this case one will end up with the file and meta info of the columnar file format.

View solution in original post

1 REPLY 1

avatar

Yess its possible only and only if there are no repetition in a column.
In this case one will end up with the file and meta info of the columnar file format.