<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: RM Crashed -  STATE_STORE_OP_FAILED. in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/RM-Crashed-STATE-STORE-OP-FAILED/m-p/132131#M27284</link>
    <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/3560/anandranganathan.html" nodeid="3560"&gt;@Anandha L Ranganathan&lt;/A&gt; &lt;/P&gt;&lt;P&gt;I will recommend to contact hortonworks support for such cases.&lt;/P&gt;</description>
    <pubDate>Sun, 20 Nov 2016 17:00:13 GMT</pubDate>
    <dc:creator>rpathak</dc:creator>
    <dc:date>2016-11-20T17:00:13Z</dc:date>
    <item>
      <title>RM Crashed -  STATE_STORE_OP_FAILED.</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/RM-Crashed-STATE-STORE-OP-FAILED/m-p/132129#M27282</link>
      <description>&lt;P&gt;Today , we have seen the RM crashed and threw the following error message. There are bunch of JIRA tickets related to that error .    One of my job is killed but the application is running in orphaned mode. The app_id is displaying in RM-UI. &lt;/P&gt;&lt;P&gt;I  am unable to kill that App_id using yarn -application &amp;lt;app_id&amp;gt; . I restarted the RM and ZK but unable to remove that from displaying in RM -UI.  It is not consuming any resources.  How do I remove it from displaying ? &lt;/P&gt;&lt;PRE&gt;t: maxCompletedAppsInMemory = 10000, removing app application_1452798563961_0971 from memory:
2016-05-04 19:00:30,449 INFO  capacity.CapacityScheduler (CapacityScheduler.java:completedContainer(1193)) - Null container completed...
2016-05-04 19:00:30,568 INFO  capacity.CapacityScheduler (CapacityScheduler.java:completedContainer(1193)) - Null container completed...
2016-05-04 19:00:31,251 INFO  capacity.CapacityScheduler (CapacityScheduler.java:completedContainer(1193)) - Null container completed...
2016-05-04 19:00:32,252 INFO  capacity.CapacityScheduler (CapacityScheduler.java:completedContainer(1193)) - Null container completed...
2016-05-04 19:00:45,325 FATAL resourcemanager.ResourceManager (ResourceManager.java:handle(753)) - Received a org.apache.hadoop.yarn.server.resourcemanager.RMFatalEvent of type STATE_STORE_OP_FAILED. Cause:
java.io.IOException: Wait for ZKClient creation timed out
        at org.apache.hadoop.yarn.server.resourcemanager.recovery.ZKRMStateStore$ZKAction.runWithCheck(ZKRMStateStore.java:1073)
        at org.apache.hadoop.yarn.server.resourcemanager.recovery.ZKRMStateStore$ZKAction.runWithRetries(ZKRMStateStore.java:1097)
        at org.apache.hadoop.yarn.server.resourcemanager.recovery.ZKRMStateStore.doMultiWithRetries(ZKRMStateStore.java:934)
        at org.apache.hadoop.yarn.server.resourcemanager.recovery.ZKRMStateStore.storeRMDelegationTokenAndSequenceNumberState(ZKRMStateStore.java:734)
        at org.apache.hadoop.yarn.server.resourcemanager.recovery.RMStateStore.storeRMDelegationTokenAndSequenceNumber(RMStateStore.java:650)
        at org.apache.hadoop.yarn.server.resourcemanager.security.RMDelegationTokenSecretManager.storeNewToken(RMDelegationTokenSecretManager.java:112)
        at org.apache.hadoop.yarn.server.resourcemanager.security.RMDelegationTokenSecretManager.storeNewToken(RMDelegationTokenSecretManager.java:49)
        at org.apache.hadoop.security.token.delegation.AbstractDelegationTokenSecretManager.storeToken(AbstractDelegationTokenSecretManager.java:272)
        at org.apache.hadoop.security.token.delegation.AbstractDelegationTokenSecretManager.createPassword(AbstractDelegationTokenSecretManager.java:391)
        at org.apache.hadoop.security.token.delegation.AbstractDelegationTokenSecretManager.createPassword(AbstractDelegationTokenSecretManager.java:47)
        at org.apache.hadoop.security.token.Token.&amp;lt;init&amp;gt;(Token.java:59)
        at org.apache.hadoop.yarn.server.resourcemanager.ClientRMService.getDelegationToken(ClientRMService.java:907)
        at org.apache.hadoop.yarn.api.impl.pb.service.ApplicationClientProtocolPBServiceImpl.getDelegationToken(ApplicationClientProtocolPBServiceImpl.java:291)
        at org.apache.hadoop.yarn.proto.ApplicationClientProtocol$ApplicationClientProtocolService$2.callBlockingMethod(ApplicationClientProtocol.java:417)
        at org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:619)
        at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:962)
        at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2039)
        at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2035)
        at java.security.AccessController.doPrivileged(Native Method)
        at javax.security.auth.Subject.doAs(Subject.java:422)
        at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1628)
        at org.apache.hadoop.ipc.Server$Handler.run(Server.java:2033)
				
&lt;/PRE&gt;</description>
      <pubDate>Fri, 16 Sep 2022 10:17:19 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/RM-Crashed-STATE-STORE-OP-FAILED/m-p/132129#M27282</guid>
      <dc:creator>anand_ranganath</dc:creator>
      <dc:date>2022-09-16T10:17:19Z</dc:date>
    </item>
    <item>
      <title>Re: RM Crashed -  STATE_STORE_OP_FAILED.</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/RM-Crashed-STATE-STORE-OP-FAILED/m-p/132130#M27283</link>
      <description>&lt;A rel="user" href="https://community.cloudera.com/users/3560/anandranganathan.html" nodeid="3560"&gt;@Anandha L Ranganathan&lt;/A&gt;&lt;P&gt;Can you try to delete it using rest api. pls find sample link below -&lt;/P&gt;&lt;PRE&gt;curl -v -X PUT -d '{"state": "KILLED"}''http://localhost:8088/ws/v1/cluster/apps/application_xxxxxxxx_xxxx'&lt;/PRE&gt;</description>
      <pubDate>Sun, 20 Nov 2016 13:28:06 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/RM-Crashed-STATE-STORE-OP-FAILED/m-p/132130#M27283</guid>
      <dc:creator>sshimpi</dc:creator>
      <dc:date>2016-11-20T13:28:06Z</dc:date>
    </item>
    <item>
      <title>Re: RM Crashed -  STATE_STORE_OP_FAILED.</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/RM-Crashed-STATE-STORE-OP-FAILED/m-p/132131#M27284</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/3560/anandranganathan.html" nodeid="3560"&gt;@Anandha L Ranganathan&lt;/A&gt; &lt;/P&gt;&lt;P&gt;I will recommend to contact hortonworks support for such cases.&lt;/P&gt;</description>
      <pubDate>Sun, 20 Nov 2016 17:00:13 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/RM-Crashed-STATE-STORE-OP-FAILED/m-p/132131#M27284</guid>
      <dc:creator>rpathak</dc:creator>
      <dc:date>2016-11-20T17:00:13Z</dc:date>
    </item>
  </channel>
</rss>

