01-24-2018 07:31 AM - last edited on 01-24-2018 08:45 AM by cjervis
The snapshot feature we have in Cloudera, i would like to know if it take a full backup or incremental backup.
Because if it is taking a full backup everytime then there is a drawback to it as for an example i make changes and add a 2MB data to my 1000GB file then it will again take a backup of entire 1000GB file and not just a 2MB changes i made.
Would request someone to answer on above query.
01-24-2018 03:23 PM
The snapshot is not a full copy of the data, rather a copy of the metadata at that point in time. Blocks in datanodes are not copied: the snapshot files record the block list and the file size. There is no data copying.
01-24-2018 11:45 PM
Does that mean, when i am copying my data from Prod to DR cluster it is just copying the Meta Data and not the actual data, if that is the case then i may loose my data if my Prod goes down because my DR only has meta data and not the actual data.
01-25-2018 12:00 AM