Support Questions
Find answers, ask questions, and share your expertise
Announcements
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

HDFS Snapshot

HDFS Snapshot

New Contributor

Can someone explain me about HDFS Snapshot. I know it is point in time copies of file system, does it mean it keeps another copy of data? Lets say i have consumed 10 TB out of 100 TB and taken HDFS snapshot from / directory.

 

1. Does it keep another copy of 10TB under /.snapshot?

2. If snapshot is only metadata, how does NN constructs data when datablocks are deleted from datanode? or woudn't it delete datablocks at all? 

 

I'm confused can someone explain this?

 

 

 

2 REPLIES 2
Highlighted

Re: HDFS Snapshot

Rising Star
1. No, snapshot is just for the metadata operation.

2. Once you make particular directory snapshottable, the blocks belonging
the underlying files never be deleted.

Re: HDFS Snapshot

Master Guru
In addition to Dice's notes, please also read the design and efficiency overview at http://archive.cloudera.com/cdh5/cdh/5/hadoop/hadoop-project-dist/hadoop-hdfs/HdfsSnapshots.html#Ove... It will help gain a better understanding of the feature.