Created on 02-24-2017 10:39 AM - edited 09-16-2022 04:09 AM
I have a folder in HDFS that will have files coming in everyday. I want to duplicate the folder in such a way that whenever a new file comes to the original folder, I want that to be duplicated/synced in the duplicate folder.
Basically, I want to sync a folder with another in HDFS
How can we achieve that in hadoop?
Created 02-24-2017 01:08 PM
Created 02-24-2017 02:01 PM
@mbigelow I would go with syncing and scheduling the sync on regular basis. But I am confused with distcp and cron usage together.
Could you please give me an example on how we can achieve this
Thank you
Created 02-24-2017 02:11 PM