Created 01-24-2014 03:46 AM
I'm benchmarking some tables and want to confirm whether compression is enabled and, if so, which type. My understanding is that there is input, intermediate (map output), and final output compression. Is there a way to identify a table's compression status? I've set output compression like this:
SET hive.exec.compress.output=true;
SET mapred.max.split.size=256000000;
SET mapred.output.compression.type=BLOCK;
SET mapred.output.compression.codec=org.apache.hadoop.io.compress.SnappyCodec;
I then used INSERT OVERWRITE to populate a table and ran:
hive> desc formatted table_name;
...which displays Compressed: No. So which kind of compression does desc formatted actually report? And is there a way to see the status of every compression type for a table?
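A minimal sketch of one way to check this from the Hive CLI, offered as a starting point rather than a definitive answer. As far as I know, the Compressed row in desc formatted reflects a flag stored with the table definition in the metastore, not the files themselves, so it can read No even when the session settings above produced compressed output. Looking at the session settings and the data files directly is a rough cross-check. The warehouse path below is an assumption (the default warehouse directory); substitute whatever the Location row reports for your table.

```sql
-- Print the current value of each compression-related setting in this session.
SET hive.exec.compress.output;
SET hive.exec.compress.intermediate;
SET mapred.output.compression.codec;

-- Find the table's HDFS directory in the "Location:" row of the output.
DESCRIBE FORMATTED table_name;

-- List the data files. The path is an assumed default location;
-- replace it with the Location value reported above.
dfs -ls /user/hive/warehouse/table_name;
```

For a text-format table written with SnappyCodec, the data files would normally carry a .snappy extension. For container formats such as SequenceFile or RCFile the compression is recorded inside the file, so the file name alone won't show it. Note that input and intermediate compression are properties of the files read and of the job, not of the table, so there is no per-table status to look up for those.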
Created 08-07-2014 07:08 AM
Created 08-07-2014 07:30 AM
Sorry, Sreeman, I never got an answer. I even spoke to a few of the Cloudera engineers, and it didn't seem to be a big issue for anybody else -- I gave up.