Created 09-04-2022 04:17 AM
Hi Experts,
We are planning to implement BDR in our project. As a back up and recovery solution I have proposed the BDR cloudera cluster where in the HDFS and Hive data will be backed up, But I want to know if there is an alternative way by backing up the data in the cold storage, Not sure if this approach can be the best as per cost perspective since we want the options available to reduce cost.
If any one share a use case and best solution for this.
Thanks.
Created on 09-05-2022 09:16 AM - edited 09-05-2022 09:16 AM
@HanzalaShaikh You may consider DLM replication. This is explained here and here.
You set the hive.repl.rootdir to set the location where you you want to store the backup, and use the REPL DUMP command to dump your data and metadata:
e.g.
REPL DUMP db1 WITH('hive.repl.rootdir'='s3a://blah/');
Refer to the Cloudera documentation for for more details and examples.
Created on 09-05-2022 09:16 AM - edited 09-05-2022 09:16 AM
@HanzalaShaikh You may consider DLM replication. This is explained here and here.
You set the hive.repl.rootdir to set the location where you you want to store the backup, and use the REPL DUMP command to dump your data and metadata:
e.g.
REPL DUMP db1 WITH('hive.repl.rootdir'='s3a://blah/');
Refer to the Cloudera documentation for for more details and examples.
Created 09-08-2022 07:51 AM
Thanks for your quick reply @smruti , really appreciated.
I have gone through this approach and will surely consider it for DR strategy.