
Consistent DATA Backup


Now that I know how to back up and restore my metadata, I am thinking about how to back up all the data stored in HDFS. My idea is to run a cron job that executes distcp and stores the copy in S3 on a regular schedule, but I am not sure how consistent these backups would be.
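To make the idea concrete, here is a minimal sketch of the script the cron job could call. The NameNode address, source path, and bucket name are placeholders I made up for illustration, and the only distcp option used is `-update`. This only shows the plain copy step and does nothing by itself to address the consistency concern.

```python
import subprocess

# Hypothetical locations, not taken from a real cluster.
SOURCE = "hdfs://namenode.example.com:8020/data"
TARGET = "s3a://my-backup-bucket/hdfs-backup/data"


def build_distcp_command(source, target):
    """Return the argument list for an incremental distcp copy."""
    # -update copies only files that are missing or differ at the target.
    return ["hadoop", "distcp", "-update", source, target]


def run_backup(source=SOURCE, target=TARGET):
    """Run distcp and raise CalledProcessError if it exits non-zero."""
    subprocess.run(build_distcp_command(source, target), check=True)


if __name__ == "__main__":
    # Dry run: print the command instead of executing it.
    print(" ".join(build_distcp_command(SOURCE, TARGET)))
```

The cron entry would then invoke a small wrapper that calls `run_backup()`, so a failed copy shows up as a non-zero exit status.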

Does anyone have experience with DR or with restoring data using this method?

 

Thoughts?

Thanks!
