Archives of Support Questions (Read Only)

This is an archived board for historical reference. Information and links may no longer be available or relevant

Export/Import HDFS snapshots


Hi

Is it possible to copy HDFS snapshots to another cluster and use them there (via distCP, for instance)? What does this add compared to copying the data directly (not snapshots) via distCP for DR?

Thanks

1 ACCEPTED SOLUTION

Hi @Houssam Manik,

The big benefit of using snapshots with distCP is that you can do incremental backups: when you distCP the snapshotted directory again in the future, it can use the difference between two snapshots and copy only what changed, rather than comparing the full directory trees. Jing provides some context around this in the second answer here. The work to complete this is discussed in HDFS-7535, which provides some more context. This was first pulled into Hadoop 2.7.0.
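To make the incremental flow concrete, here is a minimal sketch of how it could look on the command line. The NameNode addresses, directory, and snapshot names (s1, s2) are hypothetical placeholders, and the `run`, `baseline`, and `incremental` helpers are illustrative only. As far as I know, `-diff` has to be combined with `-update`, and the target needs a snapshot with the same name as the older source snapshot and must not have changed since it was taken; please check this against the DistCp documentation for your Hadoop version.

```shell
#!/usr/bin/env bash
# Sketch of snapshot-based incremental distCP for DR.
# All hosts, paths, and snapshot names below are hypothetical placeholders.
SRC_NN="hdfs://prod-nn:8020"
DST_NN="hdfs://dr-nn:8020"
DIR="/data/warehouse"

# Prints each command by default; set RUN=1 to actually execute it.
run() {
  if [ "${RUN:-0}" = "1" ]; then "$@"; else echo "$@"; fi
}

# One-time setup: full copy of snapshot s1, then mark the same point on the target.
baseline() {
  run hdfs dfsadmin -fs "$SRC_NN" -allowSnapshot "$DIR"
  run hdfs dfs -createSnapshot "$SRC_NN$DIR" s1
  run hadoop distcp -update "$SRC_NN$DIR/.snapshot/s1" "$DST_NN$DIR"
  run hdfs dfsadmin -fs "$DST_NN" -allowSnapshot "$DIR"
  run hdfs dfs -createSnapshot "$DST_NN$DIR" s1
}

# Later runs: copy only the changes between snapshot $1 (old) and $2 (new).
incremental() {
  local old="$1" new="$2"
  run hdfs dfs -createSnapshot "$SRC_NN$DIR" "$new"
  run hadoop distcp -update -diff "$old" "$new" "$SRC_NN$DIR" "$DST_NN$DIR"
  # Snapshot the target under the same name so the next run can diff from it.
  run hdfs dfs -createSnapshot "$DST_NN$DIR" "$new"
}
```

Calling `baseline` once and then `incremental s1 s2` (and later `incremental s2 s3`, and so on) prints the commands in order; with RUN=1 they would be executed instead. A plain distCP of the live directory has to walk and compare everything on every run, which is the cost the snapshot diff is meant to avoid.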
