Support Questions
Find answers, ask questions, and share your expertise

hadoop cluster + backup strategy , is it necessary to backup HA namnode cluster?

hadoop cluster + backup strategy , is it necessary to backup HA namnode cluster?

The most importat data on namenode machine are :

 

/var/hadoop/hdfs/namenode/current folders

 

folders include the fsimage end edit_logs

 

Since we have HA cluster , its means one active namenode and the secondary is the standby namenode

 

I am wondering if we need to backup every point of time the folder - /var/hadoop/hdfs/namenode/current

 

I will give here practical example

We have two namenode machines

Namenode1
Namenode2

in spite both namenodes react as HA cluster

 

I want to add additional machines let say – backup_server1 machines

That will backup every 10 second the folder /var/hadoop/hdfs/namenode/current from the active namenode to backup_server1:/data/backup/current

 

What is caludera  users opinion about this , or other opinions ?

Michael-Bronson