Created 11-29-2015 04:30 PM
I have Cluster Express with 1 master role, 1 management role, and 6 worker roles. My NameNode and Master show as not running, and I cannot restart the NameNode. The error is below:
2015-11-29 19:27:32,758 FATAL org.apache.hadoop.hdfs.server.namenode.NameNode: Exception in namenode join
java.io.IOException: Failed to load an FSImage file!
    at org.apache.hadoop.hdfs.server.namenode.FSImage.loadFSImage(FSImage.java:657)
    at org.apache.hadoop.hdfs.server.namenode.FSImage.recoverTransitionRead(FSImage.java:275)
    at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.loadFSImage(FSNamesystem.java:880)
    at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.loadFromDisk(FSNamesystem.java:639)
    at org.apache.hadoop.hdfs.server.namenode.NameNode.loadNamesystem(NameNode.java:500)
    at org.apache.hadoop.hdfs.server.namenode.NameNode.initialize(NameNode.java:556)
    at org.apache.hadoop.hdfs.server.namenode.NameNode.<init>(NameNode.java:721)
    at org.apache.hadoop.hdfs.server.namenode.NameNode.<init>(NameNode.java:705)
    at org.apache.hadoop.hdfs.server.namenode.NameNode.createNameNode(NameNode.java:1355)
    at org.apache.hadoop.hdfs.server.namenode.NameNode.main(NameNode.java:1421)
I ssh'ed into the master role VM and tried to fix it manually:
hdfs namenode -recover
But the result is still the same. Please let me know where I should troubleshoot next. Thanks.
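For anyone hitting the same "Failed to load an FSImage file!" error, one possible next step is to check whether any fsimage file in the NameNode metadata directory still matches its recorded checksum. The following is only a minimal sketch, not an official recovery procedure. It assumes a Linux host with md5sum, that the images live under the current/ subdirectory of the directory configured in dfs.namenode.name.dir, and that each fsimage_N has an fsimage_N.md5 sidecar whose first field is the hex digest. The function name check_fsimages is made up for illustration.

```shell
#!/usr/bin/env bash
# Sketch: report which fsimage files in a NameNode metadata directory
# match the MD5 digest recorded in their .md5 sidecar file.
# Assumption: $1 is a directory listed in dfs.namenode.name.dir and
# the images live under $1/current. check_fsimages is a hypothetical name.

check_fsimages() {
    local current="$1/current"
    local img name expected actual found=0

    if [ ! -d "$current" ]; then
        echo "MISSING_DIR $current"
        return 1
    fi

    for img in "$current"/fsimage_*; do
        # Skip an unmatched glob, and skip the checksum sidecar files themselves
        [ -e "$img" ] || continue
        case "$img" in *.md5) continue ;; esac

        found=1
        name=$(basename "$img")

        if [ ! -f "$img.md5" ]; then
            echo "NO_MD5 $name"
            continue
        fi

        # Assumption: the first whitespace-separated field is the hex digest
        expected=$(awk '{print $1; exit}' "$img.md5")
        actual=$(md5sum "$img" | awk '{print $1}')

        if [ "$expected" = "$actual" ]; then
            echo "OK $name"
        else
            echo "CORRUPT $name"
        fi
    done

    # Report explicitly when the directory holds no image at all
    [ "$found" -eq 1 ] || echo "NO_FSIMAGE $current"
}
```

It would be run on the master VM as a user that can read the metadata directory, for example check_fsimages /dfs/nn, where /dfs/nn is only a placeholder for whatever dfs.namenode.name.dir is set to on your cluster. The script only reads files. It is worth copying the whole metadata directory somewhere safe before trying anything that writes to it, including hdfs namenode -recover.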
Created 11-30-2015 02:05 AM
Hi hawkphil,
What did you actually do in the cluster before you faced the error?
Which version are you running?
Dice.
Created 11-30-2015 05:51 AM
I am on the latest version, whichever one I downloaded last week (v5.5), and I installed it in vSphere.
My lab had an outage, so the entire cluster went down abruptly. That was the main cause of all the weird behavior. I didn't have much valuable data in there yet.
Created 12-02-2015 10:16 AM
Just a gentle nudge...
Any suggestion please?
Created 12-06-2015 10:44 PM
Created 12-07-2015 09:04 AM