We have recently upgraded to CDH5.4 with the help of Cloudera engineer. We have build our cluster as a POC, it is running fine at the moment. We are looking at devising a Backup Strategy for HDFS Metadata, and would like to test them. Please can you help me understand the following
a. How much space would it need. Assuming HDFS Namenode has 100GB of namenode dir (fsimage & editlogs+)
b. What are the security privileges required for the user to take the backup
c. How can we test the backup strategy.
d. How can we ensure that the namenode dir is courrption free?
How frequent these backups should be taken? Looking forward to your advice.