Support Questions

Find answers, ask questions, and share your expertise
Announcements
Celebrating as our community reaches 100,000 members! Thank you!

Export/Import HDFS snapshots

avatar
Contributor

Hi

Is it possible to copy HDFS snapshots to another cluster and use them (via distCP for instance)? What does this add compared to copy data directly (not snapshots) via distCP for DR?

Thanks

1 ACCEPTED SOLUTION

avatar

Hi @Houssam Manik,

The big benefit that you get by utilizing snapshots with distCP is that you can do incremental backups when distCP'ing the snapshotted directory in the future by leveraging the differential between the snapshots. Jing provides some context around this in the second answer here. The work to complete this is discussed in HDFS-7535 and some more context is provided there. This was first pulled into Hadoop 2.7.0

View solution in original post

1 REPLY 1

avatar

Hi @Houssam Manik,

The big benefit that you get by utilizing snapshots with distCP is that you can do incremental backups when distCP'ing the snapshotted directory in the future by leveraging the differential between the snapshots. Jing provides some context around this in the second answer here. The work to complete this is discussed in HDFS-7535 and some more context is provided there. This was first pulled into Hadoop 2.7.0