Support Questions

Find answers, ask questions, and share your expertise
Announcements
Check out our newest addition to the community, the Cloudera Data Analytics (CDA) group hub.

Export/Import HDFS snapshots

Hi

Is it possible to copy HDFS snapshots to another cluster and use them (via distCP for instance)? What does this add compared to copy data directly (not snapshots) via distCP for DR?

Thanks

1 ACCEPTED SOLUTION

Hi @Houssam Manik,

The big benefit that you get by utilizing snapshots with distCP is that you can do incremental backups when distCP'ing the snapshotted directory in the future by leveraging the differential between the snapshots. Jing provides some context around this in the second answer here. The work to complete this is discussed in HDFS-7535 and some more context is provided there. This was first pulled into Hadoop 2.7.0

View solution in original post

1 REPLY 1

Hi @Houssam Manik,

The big benefit that you get by utilizing snapshots with distCP is that you can do incremental backups when distCP'ing the snapshotted directory in the future by leveraging the differential between the snapshots. Jing provides some context around this in the second answer here. The work to complete this is discussed in HDFS-7535 and some more context is provided there. This was first pulled into Hadoop 2.7.0

Take a Tour of the Community
Don't have an account?
Your experience may be limited. Sign in to explore more.