- Subscribe to RSS Feed
- Mark Question as New
- Mark Question as Read
- Float this Question for Current User
- Bookmark
- Subscribe
- Mute
- Printer Friendly Page
Can columnar format occupy more space than row format in hive compression?
- Labels:
-
Apache Hadoop
-
Apache Hive
Created ‎02-16-2018 06:36 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Even after columnar compression techniques like parquet my files are turning out to be bigger than sequence files. I wanted to know that is columnar compression a sure shot way to compression or is there some kind of data which fails here.
Created ‎02-19-2018 11:42 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Yess its possible only and only if there are no repetition in a column.
In this case one will end up with the file and meta info of the columnar file format.
Created ‎02-19-2018 11:42 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Yess its possible only and only if there are no repetition in a column.
In this case one will end up with the file and meta info of the columnar file format.
