New Contributor

HDFS Snapshot

Can someone explain HDFS snapshots to me? I know a snapshot is a point-in-time copy of the file system, but does that mean it keeps another copy of the data? Let's say I have used 10 TB out of 100 TB and have taken an HDFS snapshot of the / directory.


1. Does it keep another copy of the 10 TB under /.snapshot?

2. If a snapshot is only metadata, how does the NN reconstruct the data when data blocks are deleted from the DataNodes? Or wouldn't it delete the data blocks at all?


I'm confused. Can someone explain this?




Cloudera Employee

Re: HDFS Snapshot

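1. No, taking a snapshot is a metadata-only operation; no data blocks are copied.

2. Once you make a directory snapshottable and take a snapshot of it, the blocks
belonging to the files captured in that snapshot are not deleted from the
DataNodes, even if the files themselves are later deleted. They are freed only
when no snapshot references them any longer.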
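
To make the two points above concrete, here is a minimal sketch in Python. It is a toy model, not the real NameNode implementation: the class `ToyNameNode`, its methods, and the block IDs are all hypothetical names invented for illustration. It only shows the idea that a snapshot records block references (metadata) and that a block stays stored for as long as either the live namespace or some snapshot still refers to it.

```python
class ToyNameNode:
    """Toy model of snapshot bookkeeping; NOT the real HDFS implementation."""

    def __init__(self):
        self.live = {}              # path -> list of block IDs in the current namespace
        self.snapshots = {}         # snapshot name -> frozen {path: block IDs} mapping
        self.stored_blocks = set()  # blocks physically held on the (imaginary) DataNodes

    def write(self, path, block_ids):
        self.live[path] = list(block_ids)
        self.stored_blocks.update(block_ids)

    def create_snapshot(self, name):
        # Metadata only: remember which blocks each file had; no data is copied.
        self.snapshots[name] = {p: tuple(b) for p, b in self.live.items()}

    def delete(self, path):
        del self.live[path]
        self._collect_garbage()

    def delete_snapshot(self, name):
        del self.snapshots[name]
        self._collect_garbage()

    def _collect_garbage(self):
        # A block survives if the live namespace or any snapshot references it.
        referenced = {b for blocks in self.live.values() for b in blocks}
        for snap in self.snapshots.values():
            for blocks in snap.values():
                referenced.update(blocks)
        self.stored_blocks &= referenced


nn = ToyNameNode()
nn.write("/data/a", ["blk_1", "blk_2"])

nn.create_snapshot("s1")
blocks_after_snapshot = len(nn.stored_blocks)         # 2: the snapshot added no blocks

nn.delete("/data/a")
blocks_after_delete = len(nn.stored_blocks)           # 2: snapshot s1 still references them

nn.delete_snapshot("s1")
blocks_after_snapshot_delete = len(nn.stored_blocks)  # 0: nothing references them now
```

In this model the 10 TB from the question would not be duplicated by taking the snapshot; extra space is consumed only as files captured in the snapshot are deleted or changed in the live directory while the snapshot keeps their old blocks alive.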


Re: HDFS Snapshot

In addition to Dice's notes, please also read the design and efficiency overview at It will help gain a better understanding of the feature.
