Support Questions

Find answers, ask questions, and share your expertise

Hive Data replication using Hbase

Explorer

Hello,

I would like to replicate hive data from one cluster to another cluster, in the beginning I can do snapshot and I can us distcp to copy to other cluster, but for me to keep in sync rather than running disctcp cronjob. Can I have use hbase replication feature to sync the hive data? Since it is in hdfs I just need to replicate /apps/warehouse/hive/dir1

Is some thing I can do using habase?

2 REPLIES 2

Explorer

The hive metastore, I can replicate using regular sql dump.

The "Cadillac Answer" from Hortonworks is to use Data Lifecycle Manager, https://hortonworks.com/products/data-platforms/dataplane/data-lifecycle-manager/, as it handles Hive replication as documented at https://docs.hortonworks.com/HDPDocuments/DLM1/DLM-1.2.0/administration/content/dlm_replication_of_d.... Hive does not natively have the same cluster-to-cluster replication features that HBase has.

Take a Tour of the Community
Don't have an account?
Your experience may be limited. Sign in to explore more.