Sometimes I insert data into a Hive table in two ways: Hive and Hive on Tez. The HDFS output files are twice as large when using Hive on Tez, so they take up more HDFS space. Are there any configuration settings to reduce the size?
Have you looked into CompressedStorage features on Hive?
You should be able to use this (for Snappy at least):
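As a rough sketch, assuming the target table is stored as text or sequence files, session settings along these lines turn on Snappy compression for the final output. The property names are standard Hive and Hadoop settings, but whether they match the exact snippet intended in this answer is an assumption, and the right values depend on your cluster and file format:

```sql
-- Compress the final query output written to HDFS
SET hive.exec.compress.output=true;
-- Use the Snappy codec for the output files (assumes Snappy is installed on the cluster)
SET mapreduce.output.fileoutputformat.compress.codec=org.apache.hadoop.io.compress.SnappyCodec;
-- Block-level compression; only relevant for sequence files
SET mapreduce.output.fileoutputformat.compress.type=BLOCK;
-- Optionally compress intermediate data between stages as well
SET hive.exec.compress.intermediate=true;
-- Note: ORC tables take their codec from the table property orc.compress instead
```

Running the same INSERT with these settings under both engines and comparing the result with `hdfs dfs -du -h` on the table directory should show whether the size difference comes from compression.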
@Jun Chen are you still having issues with this? Can you accept the best answer or provide your own solution?