Support Questions

Find answers, ask questions, and share your expertise
Announcements
Celebrating as our community reaches 100,000 members! Thank you!

Zookeeper not starting on one node inconsistent epoch

avatar

Dear All,

I recently upgraded our cluster (3 node not secure) from HDP 2.3 to 2.5. After the upgrade was finalized and when i was restarting some of the services I saw that zookeeper went down on one of the nodes with the following errors in the log

2016-10-04 05:38:06,427 - INFO [main:QuorumPeer@1038] - initLimit set to 10 2016-10-04 05:38:06,439 - INFO [main:FileSnap@83] - Reading snapshot /hadoop/zookeeper/version-2/snapshot.4400000000 2016-10-04 05:38:06,556 - ERROR [main:QuorumPeer@530] - Unable to load database on disk java.io.IOException: The current epoch, 45 is less than the accepted epoch, 32 at org.apache.zookeeper.server.quorum.QuorumPeer.loadDataBase(QuorumPeer.java:527) at org.apache.zookeeper.server.quorum.QuorumPeer.start(QuorumPeer.java:473) at org.apache.zookeeper.server.quorum.QuorumPeerMain.runFromConfig(QuorumPeerMain.java:153) at org.apache.zookeeper.server.quorum.QuorumPeerMain.initializeAndRun(QuorumPeerMain.java:111) at org.apache.zookeeper.server.quorum.QuorumPeerMain.main(QuorumPeerMain.java:78) 2016-10-04 05:38:06,558 - ERROR [main:QuorumPeerMain@89] - Unexpected exception, exiting abnormally java.lang.RuntimeException: Unable to run quorum server at org.apache.zookeeper.server.quorum.QuorumPeer.loadDataBase(QuorumPeer.java:531) at org.apache.zookeeper.server.quorum.QuorumPeer.start(QuorumPeer.java:473) at org.apache.zookeeper.server.quorum.QuorumPeerMain.runFromConfig(QuorumPeerMain.java:153) at org.apache.zookeeper.server.quorum.QuorumPeerMain.initializeAndRun(QuorumPeerMain.java:111) at org.apache.zookeeper.server.quorum.QuorumPeerMain.main(QuorumPeerMain.java:78) Caused by: java.io.IOException: The current epoch, 45 is less than the accepted epoch, 32 at org.apache.zookeeper.server.quorum.QuorumPeer.loadDataBase(QuorumPeer.java:527) ... 4 more 2016-10-04 05:50:32,198 - INFO [main:FourLetterWordMain@43] - connecting to localhost 2181 2016-10-04 06:01:35,510 - INFO [main:FourLetterWordMain@43] - connecting to localhost 2181



I saw that there is a JIRA https://issues.apache.org/jira/browse/ZOOKEEPER-2307, do i need to apply this patch. Has anyone else faced this problem.

Thanks,

Anand

1 ACCEPTED SOLUTION

avatar
Super Collaborator
3 REPLIES 3

avatar
Super Collaborator

avatar

Thanks Mahesh. I followed the resolution in the https://community.mapr.com/docs/DOC-1215 and it worked.

avatar
Explorer

The link https://community.mapr.com/docs/DOC-1215 is not working now.