Support Questions
Find answers, ask questions, and share your expertise

Re: Checkpoint Status on name node

delete the dir is unsafe.
After I restart the hdfs cluster,the error message gone away.

Re: Checkpoint Status on name node

Hi Harsh,


I am getting below exception on the namenode though it doesnt affect my services. But once there wasnt an automatic failover though it was enabled. I found out following error logs :




Forwardable Ticket true
Forwarded Ticket false
Proxiable Ticket false
Proxy Ticket false
Postdated Ticket false
Renewable Ticket false

Initial Ticket false
Auth Time = Wed Feb 03 13:49:37 CET 2016
Start Time = Wed Feb 03 13:49:40 CET 2016
End Time = Wed Feb 03 23:49:37 CET 2016
Renew Till = null
Client Addresses  Null

2016-02-03 14:49:49,093 ERROR org.apache.hadoop.hdfs.server.namenode.ha.StandbyCheckpointer: Exception in doCheckpoint Exception during image upload: GSSException: No valid credentials provided (Mechanism level: Server not found in Kerberos database (7))
        at org.apache.hadoop.hdfs.server.namenode.ha.StandbyCheckpointer.doCheckpoint(
        at org.apache.hadoop.hdfs.server.namenode.ha.StandbyCheckpointer.access$1400(
        at org.apache.hadoop.hdfs.server.namenode.ha.StandbyCheckpointer$CheckpointerThread.doWork(
        at org.apache.hadoop.hdfs.server.namenode.ha.StandbyCheckpointer$CheckpointerThread.access$700(
        at org.apache.hadoop.hdfs.server.namenode.ha.StandbyCheckpointer$CheckpointerThread$
        at Method)
        at org.apache.hadoop.hdfs.server.namenode.ha.StandbyCheckpointer$
Caused by: GSSException: No valid credentials provided (Mechanism level: Server not found in Kerberos database (7))
        at org.apache.hadoop.hdfs.server.namenode.TransferFsImage.uploadImage(
        at org.apache.hadoop.hdfs.server.namenode.TransferFsImage.uploadImageFromStorage(
        at org.apache.hadoop.hdfs.server.namenode.ha.StandbyCheckpointer$







Re: Checkpoint Status on name node

Community Manager

This is a Kerberos configuration issue, most likely with the principal for the second NameNode.  When a checkpoint is attempted (copying the fsimage file from the Standby NameNode to the Active), the connection is failing due to the GSSAPI authentication with the Kerberos credential.


The failover controller logs will probably contain similar messages.


Since the server is able to start, your basic Kerberos setup is allowing the server to obtain it's initial credential but it appears it is expiring.


A few possible causes:


* The principal needs to have renewable tickets.  In your output this is set to false.  The problem could be with the /etc/krb5.conf file on the Standby or with the principal in your KDC.


* Reverse DNS lookup for the hostname is not working.  The packet sent from one server has the information "my hostname is:, IP:".  The source does a reverse DNS lookup for and is not receiving a hostname or is receiving a hostname that does not match the one provided.


* You are having an intermittent outage with your KDC or DNS that is causing the above mentioned problems.


Depending upon the type of KDC in use and how it is configured, there may be additional issues.   Since you report the rest of the cluster is functional (no loss to the DataNodes), this is most likely isolated to the one NameNode's principal. 

David Wilder, Community Manager

Was your question answered? Make sure to mark the answer as the accepted solution.
If you find a reply useful, say thanks by clicking on the thumbs up button.

Learn more about the Cloudera Community:

Terms of Service

Community Guidelines

How to use the forum

Re: Checkpoint Status on name node

I found out that

principal of namenode : hdfs/

hostname :

hostname --fqdn :


So from above 3 values you can see that hostname is not same as principal and FQDN.

But as per my knowledge, only FQDN matters.

Still, do you think incorrect hostname can cause this issue?

Re: Checkpoint Status on name node

Please find a part of krb5.cnf


dns_lookup_realm = false
  dns_lookup_kdc = false
  ticket_lifetime = 24h
  renew_lifetime = 7d
  forwardable = true

Re: Checkpoint Status on name node

New Contributor

In my case 


reinstalled hdfs in CDH.


in SNN machine /hadoop/dfs/snn/current/fsimage_* is different NN  /hadoop/dfs/nn/current/fsimage_*


Delete /hadoop/dfs/snn in SNN machine and then restart the SNN.