Archives of Support Questions (Read Only)

This is an archived board for historical reference. Information and links may no longer be available or relevant

hive table compaction

New Contributor

I haven't found anything in the Cloudera documentation on Hive table compaction as described in:

http://www.openkb.info/2015/06/hive-transaction-feature-in-hive-10.html

Has anyone tried this in CDH, and is there any performance impact?

1 ACCEPTED SOLUTION

Expert Contributor
Here's what the most recent version of the CDH Hive documentation says about this:

http://www.cloudera.com/documentation/enterprise/latest/topics/hive.html#hive_transaction_support

"Transaction (ACID) Support in Hive

The CDH distribution of Hive does not support transactions (HIVE-5317). Currently, transaction support in Hive is an experimental feature that only works with the ORC file format. Cloudera recommends using the Parquet file format, which works across many tools. Merge updates in Hive tables using existing functionality, including statements such as INSERT, INSERT OVERWRITE, and CREATE TABLE AS SELECT."
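
For anyone wondering what "merge updates using existing functionality" might look like in practice, here is a minimal sketch, not an official Cloudera recipe. It is a small Python helper that builds one INSERT OVERWRITE statement: rows from a staging table replace rows with the same key in the target, and all other target rows are carried over unchanged. The helper name, the table names (customers, customers_updates), and the column names are hypothetical. The sketch also assumes a non-partitioned table whose key is unique in both tables, and that your Hive version allows overwriting a table that the same query reads from. Try it on a copy of the table first.

```python
def build_merge_overwrite(target, staging, key, columns):
    """Build a HiveQL statement that rewrites `target` so that rows from
    `staging` win over rows in `target` that share the same `key`.

    All names are supplied by the caller; nothing here is Hive-specific API,
    the function only formats a SQL string.
    """
    s_cols = ", ".join(f"s.{c}" for c in columns)
    t_cols = ", ".join(f"t.{c}" for c in columns)
    return (
        f"INSERT OVERWRITE TABLE {target}\n"
        f"SELECT u.* FROM (\n"
        # 1) every new or updated row from the staging table
        f"  SELECT {s_cols} FROM {staging} s\n"
        f"  UNION ALL\n"
        # 2) every existing row that has no replacement in staging
        f"  SELECT {t_cols} FROM {target} t\n"
        f"  LEFT OUTER JOIN {staging} s ON t.{key} = s.{key}\n"
        f"  WHERE s.{key} IS NULL\n"
        f") u"
    )


if __name__ == "__main__":
    # Hypothetical table and column names, for illustration only.
    print(build_merge_overwrite(
        "customers", "customers_updates", "id", ["id", "name", "city"]
    ))
```

Note that this rewrites the whole table on every merge, so the cost grows with table size rather than with the number of changed rows. On large tables it is usually worth limiting the rewrite to the affected partitions.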


3 REPLIES

Rising Star

This feature is not supported in CDH yet. We consider it experimental and incomplete (it only works with the ORC file format).

Visitor

Are there any updates on this feature? Is it still experimental? What about the Parquet format?
