Created 05-16-2017 09:58 AM
I have an HDP2.3 cluster. I need to setup backup and restore of hive tables in S3. Could you please suggest the best way to do the same? Can Falcon Web UI be used? Can I schedule the replication activity from Falcon?
Created 05-17-2017 09:27 PM
@Phoncy Joseph Read through this post, it has some good insight into methods of copying Hive data to S3: https://community.hortonworks.com/questions/39405/options-for-copying-hive-data-to-s3.html. This can be scheduled using Oozie, or if you need additional functionality, Falcon can be used to build a more complex data pipeline. Also be aware of a new feature called Ambari Workflow Manager, which is available now. Refer to http://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.6.0/bk_workflow-management/content/ch_wfm_basics... for details. Please don't forget to accept this answer if you find it helpful.
Created 05-18-2017 05:48 AM
@Mike Riggs Thanks for the response. I'm looking for the availability of a backup and recovery option for Hive tables without much scripting work to be done. There is an option to mirror HDFS data to S3 from Falcon Web UI, is something similar available for Hive table?