Created on 09-16-2015 11:57 PM - edited 09-16-2022 02:41 AM
Hello Guys,
We are planning to reboot our secondary name node. Below is our hdfs-site.xml file. Please let me know best step by step procedure to reboot the secondary namenode. Do we have to run "hdfs secondarynamenode -checkpoint " after the reboot or need to check uncheckpointed transactions before reboot. thanks in advance for your help
=========
<configuration>
<property>
<name>dfs.datanode.max.xcievers</name>
<value>4096</value>
</property>
<property>
<name>dfs.data.dir</name>
<value>/mnt/scecondary/dfs-data</value>
</property>
<property>
<name>dfs.datanode.socket.write.timeout</name>
<value>0</value>
</property><property>
<name>fs.checkpoint.period</name>
<value>1800</value>
</property>
</configuration>
=================
Created 09-17-2015 12:11 AM
Created 09-17-2015 10:24 PM
Created 09-17-2015 12:11 AM
Created 09-17-2015 01:15 AM
Hello Harsh,
The issue is that this server is having high load everytime. The configuration seems to be like DN as you said, but the dfsadmin report is not showing this server.
The jps is showing as
=======
# jps
8014 SecondaryNameNode
22290 Jps
=======
SNN process is runnnig.
hdfs 8014 7.6 3.8 1427044 149836 ? Sl 2013 90941:12 java -Dproc_secondarynamenode -Xmx1000m -Dhadoop.log.dir=/usr/lib/hadoop-0.20/logs -Dhadoop.log.file=hadoop-hadoop-secondarynamenode
=======
How can I confirm this ?
Thanks
Created 09-17-2015 10:24 PM
Created 05-02-2016 06:29 AM
Hi Harsh,
I just observed that in one of the clusters where HA is not enabled, there has been no checkpointing right from the start.
Can you confirm why this behaviour?
The graph is like a linear line(never came down) for 'Transactions since last log checkpoint'
Created 05-02-2016 06:33 AM
Created 05-02-2016 08:08 AM
Hi Harsh,
The Error is
Exception in doCheckpoint
java.io.exception:Inconsistent checkpoint fields
There is mismatch between namespaceID and blockpoolID
The LV,Ctime and clusterID are matching
I will create this in another thread as well but for urgent solution can you please reply here?
Parallely creating new thread now
Created 05-02-2016 08:15 AM
Hi Harsh, I have created a new thread with topic
Exception in doCheckpoint