Created 02-25-2017 11:15 PM
Could not determine the age of the last HDFS checkpoint. Please ensure that you have a recent checkpoint. Otherwise, the NameNode(s) can take a very long time to start up.
Created 02-25-2017 11:54 PM
I saw this happening on a relatively idle cluster. You can create a checkpoint manually, I think the instructions are given on the dialog showing the warning, but here they are: Login to the active Namenode and run
su - hdfs hdfs dfsadmin -safemode enter hdfs dfsadmin -saveNamespace hdfs dfsadmin -safemode leave
Created 02-25-2017 11:54 PM
I saw this happening on a relatively idle cluster. You can create a checkpoint manually, I think the instructions are given on the dialog showing the warning, but here they are: Login to the active Namenode and run
su - hdfs hdfs dfsadmin -safemode enter hdfs dfsadmin -saveNamespace hdfs dfsadmin -safemode leave
Created 02-26-2017 02:20 AM
HDFS metadata consists of two parts:
Checkpointing is a process of reconciling fsimage with edits to produce a new version of fsimage. There are two benefits arising out of this:
The following properties can help to set how often Checkpointing happens:
Also, if you would like to manually checkpoint you can follow:
https://community.hortonworks.com/content/supportkb/49438/how-to-manually-checkpoint.html
Created 02-27-2017 05:35 PM
From HDFS perspective, in some rare circumstances it was noticed that secondary (or standby) namenode fails to consume edit log. This results in more complicated situations if active namenode is restarted meanwhile (unconsumed edit logs will have to be ignored). The simpler solution to handle such scenario more gracefully is to always make sure that fsimage is updated before stopping namenode.
So as precautionary measure work was done in Ambari to check and warn user if user tries to stop NameNode that has a checkpoint older than 12 hours. [1]
HDFS-3.0.0.0 has implemented this check natively and going forward Ambari might skip this warning. [2]
Following Jira's and their description are used as references for this answer: