02-27-2018 03:10 AM - edited 02-27-2018 03:11 AM
Not sure where to post this question, let me know if this is the wrong section.
I've an Oozie bundle with some coordinators inside which import data from various sources and generate hive tables with some transformations. This is scheduled once every day.
I need to design a rollback procedure that brings the cluster to the status of the previous day.
I was thinking to add these two operations before starting the daily import/transformation tasks:
Then when I need to rollback I can stop the current Oozie bundle, overwrite the hdfs Hive data with the data in the backup folder and restore the Hive Metastore database.
Thanks for any information