I would like to replicate hive data from one cluster to another cluster, in the beginning I can do snapshot and I can us distcp to copy to other cluster, but for me to keep in sync rather than running disctcp cronjob. Can I have use hbase replication feature to sync the hive data? Since it is in hdfs I just need to replicate /apps/warehouse/hive/dir1
Is some thing I can do using habase?
The "Cadillac Answer" from Hortonworks is to use Data Lifecycle Manager, https://hortonworks.com/products/data-platforms/dataplane/data-lifecycle-manager/, as it handles Hive replication as documented at https://docs.hortonworks.com/HDPDocuments/DLM1/DLM-1.2.0/administration/content/dlm_replication_of_d.... Hive does not natively have the same cluster-to-cluster replication features that HBase has.