RM goes down after enabling Kerberos

Contributor

 

Hi Friends

The ResourceManager (RM) goes down after enabling Kerberos; the error is below. Kindly note that I have already removed the znode and tried to start the RM, but it still goes down.

 

[zk: Host1(CONNECTED) 3] rmr /yarn-leader-election/yarn-cluster
Authentication is not valid : /yarn-leader-election/yarn-cluster
[zk: Host1(CONNECTED) 3] ls /yarn-leader-election
Authentication is not valid : /yarn-leader-election
[zk: Host1(CONNECTED) 4] addauth digest super:password
[zk: Host1(CONNECTED) 5] ls /yarn-leader-election
[]
[zk: Host1(CONNECTED) 2] rmr /yarn-leader-election/yarn-cluster
[zk: Host1(CONNECTED) 6] rmr /yarn-leader-election
[zk: Host1(CONNECTED) 7] ls /yarn-leader-election
Node does not exist: /yarn-leader-election
[zk: Host1(CONNECTED) 1] ls /yarn-leader-election/yarn-cluster
Node does not exist: /yarn-leader-election/yarn-cluster

 

 

 

java.lang.InterruptedException
at java.util.concurrent.locks.AbstractQueuedSynchronizer.doAcquireSharedInterruptibly(AbstractQueuedSynchronizer.java:998)
at java.util.concurrent.locks.AbstractQueuedSynchronizer.acquireSharedInterruptibly(AbstractQueuedSynchronizer.java:1304)
at java.util.concurrent.Semaphore.acquire(Semaphore.java:312)
at org.apache.ranger.audit.provider.AuditProviderFactory$RangerAsyncAuditCleanup.run(AuditProviderFactory.java:495)
at java.lang.Thread.run(Thread.java:748)


Caused by: java.io.IOException: Couldn't create /yarn-leader-election/yarn-cluster
at org.apache.hadoop.ha.ActiveStandbyElector.ensureParentZNode(ActiveStandbyElector.java:361)
at org.apache.hadoop.yarn.server.resourcemanager.ActiveStandbyElectorBasedElectorService.serviceInit(ActiveStandbyElectorBasedElectorService.java:110)
at org.apache.hadoop.service.AbstractService.init(AbstractService.java:164)
... 4 more
Caused by: org.apache.zookeeper.KeeperException$NoAuthException: KeeperErrorCode = NoAuth for /yarn-leader-election/yarn-cluster
at org.apache.zookeeper.KeeperException.create(KeeperException.java:113)

 

3 REPLIES

Master Guru

@mohammad_shamim 

You might be hitting a known bug in recent Java versions (OpenJDK 1.8u242 or JDK 11.0.6), described in TSB-394.

 

To resolve this issue, take the following action on all impacted nodes, as appropriate for the environment.

  1. Edit the java.security file of the active JDK on each node of the cluster.
  2. Add or alter the sun.security.krb5.disableReferrals parameter so that it is set to true:
    sun.security.krb5.disableReferrals=true
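
As a rough sketch of steps 1 and 2, the edit could be scripted per node along the following lines. The function name set_disable_referrals and the .bak backup suffix are hypothetical, and the path to java.security is not given in this thread: it is typically $JAVA_HOME/jre/lib/security/java.security on a JDK 8 install and $JAVA_HOME/conf/security/java.security on JDK 11, so confirm it for your environment first.

```shell
#!/usr/bin/env bash
# Sketch only: make sure sun.security.krb5.disableReferrals=true is set
# in the java.security file passed as the first argument.
# set_disable_referrals is a hypothetical helper name, not a vendor tool.
set_disable_referrals() {
  local file="$1"
  local key='sun.security.krb5.disableReferrals'
  local key_re='sun\.security\.krb5\.disableReferrals'   # dots escaped for regex use
  local tmp

  [ -f "$file" ] || { echo "not found: $file" >&2; return 1; }

  # Keep a backup next to the original before touching it.
  cp -p "$file" "$file.bak" || return 1

  if grep -q "^[[:space:]]*${key_re}[[:space:]]*=" "$file"; then
    # The parameter already exists (uncommented): alter its value to true.
    tmp="$(mktemp)" || return 1
    sed "s|^[[:space:]]*${key_re}[[:space:]]*=.*|${key}=true|" "$file" > "$tmp" || return 1
    cat "$tmp" > "$file"   # write in place so ownership and mode are preserved
    rm -f "$tmp"
  else
    # The parameter is absent (or only present as a comment): append it.
    printf '\n%s=true\n' "$key" >> "$file"
  fi
}
```

For example, on a JDK 8 node one might run set_disable_referrals "$JAVA_HOME/jre/lib/security/java.security" as a user with write access to the file. Since the JVM reads java.security at startup, the affected services would presumably need a restart afterwards.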

     


Cheers!

Master Mentor

@GangWar 

Can you regenerate the keytabs through Cloudera Manager? That could resolve the problem. If it doesn't, please reply with the error you encountered.

Contributor

Thanks for your reply. I have already tried that, but I am still getting the same error.