Support Questions
Find answers, ask questions, and share your expertise
Announcements
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

hive table compaction

SOLVED Go to solution

hive table compaction

New Contributor

Havent found anything in cloudera on hive table compaction as described in

http://www.openkb.info/2015/06/hive-transaction-feature-in-hive-10.html

 

Has anyone tried this in CDH and any performance impact?

1 ACCEPTED SOLUTION

Accepted Solutions

Re: hive table compaction

Cloudera Employee
Here's what the most recent version of the CDH Hive documentation says about this: http://www.cloudera.com/documentation/enterprise/latest/topics/hive.html#hive_transaction_support "Transaction (ACID) Support in Hive The CDH distribution of Hive does not support transactions (HIVE-5317). Currently, transaction support in Hive is an experimental feature that only works with the ORC file format. Cloudera recommends using the Parquet file format, which works across many tools. Merge updates in Hive tables using existing functionality, including statements such as INSERT, INSERT OVERWRITE, and CREATE TABLE AS SELECT."
3 REPLIES 3

Re: hive table compaction

Contributor

This feature is not supported in CDH yet. We consider this feature is experimental, and incomplete (only works with ORC file format).

Highlighted

Re: hive table compaction

New Contributor

Any updates to this feature? Is it still experimental? what about parquet formats?

Re: hive table compaction

Cloudera Employee
Here's what the most recent version of the CDH Hive documentation says about this: http://www.cloudera.com/documentation/enterprise/latest/topics/hive.html#hive_transaction_support "Transaction (ACID) Support in Hive The CDH distribution of Hive does not support transactions (HIVE-5317). Currently, transaction support in Hive is an experimental feature that only works with the ORC file format. Cloudera recommends using the Parquet file format, which works across many tools. Merge updates in Hive tables using existing functionality, including statements such as INSERT, INSERT OVERWRITE, and CREATE TABLE AS SELECT."