<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: CDH5.2: yarn :Error starting yarn nodemanagers in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/CDH5-2-yarn-Error-starting-yarn-nodemanagers/m-p/24878#M38844</link>
    <description>&lt;P&gt;Thanks for the update!&lt;/P&gt;</description>
    <pubDate>Fri, 20 Feb 2015 17:55:58 GMT</pubDate>
    <dc:creator>harsha_v</dc:creator>
    <dc:date>2015-02-20T17:55:58Z</dc:date>
    <item>
      <title>CDH5.2: yarn :Error starting yarn nodemanagers</title>
      <link>https://community.cloudera.com/t5/Support-Questions/CDH5-2-yarn-Error-starting-yarn-nodemanagers/m-p/21700#M38836</link>
      <description>&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Trying to start yarn when i get the following error on some of the nodes , anyone see this before? &amp;nbsp;( Not sure what caused this corruption since yarnm was running ok for a couple of days )&lt;/P&gt;&lt;P&gt;If the files expected are missing, how to recover to prior state ?&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Error starting NodeManager&lt;BR /&gt;org.apache.hadoop.service.ServiceStateException: org.fusesource.leveldbjni.internal.NativeDB$DBException: Corruption: 3 missing files; e.g.: /tmp/hadoop-yarn/yarn-nm-recovery/yarn-nm-state/000032.sst&lt;BR /&gt;at org.apache.hadoop.service.ServiceStateException.convert(ServiceStateException.java:59)&lt;BR /&gt;at org.apache.hadoop.service.AbstractService.init(AbstractService.java:172)&lt;BR /&gt;at org.apache.hadoop.yarn.server.nodemanager.NodeManager.initAndStartRecoveryStore(NodeManager.java:152)&lt;BR /&gt;at org.apache.hadoop.yarn.server.nodemanager.NodeManager.serviceInit(NodeManager.java:190)&lt;BR /&gt;at org.apache.hadoop.service.AbstractService.init(AbstractService.java:163)&lt;BR /&gt;at org.apache.hadoop.yarn.server.nodemanager.NodeManager.initAndStartNodeManager(NodeManager.java:445)&lt;BR /&gt;at org.apache.hadoop.yarn.server.nodemanager.NodeManager.main(NodeManager.java:492)&lt;BR /&gt;Caused by: org.fusesource.leveldbjni.internal.NativeDB$DBException: Corruption: 3 missing files; e.g.: /tmp/hadoop-yarn/yarn-nm-recovery/yarn-nm-state/000032.sst&lt;BR /&gt;at org.fusesource.leveldbjni.internal.NativeDB.checkStatus(NativeDB.java:200)&lt;BR /&gt;at org.fusesource.leveldbjni.internal.NativeDB.open(NativeDB.java:218)&lt;BR /&gt;at org.fusesource.leveldbjni.JniDBFactory.open(JniDBFactory.java:168)&lt;BR /&gt;at org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService.initStorage(NMLeveldbStateStoreService.java:842)&lt;BR /&gt;at org.apache.hadoop.yarn.server.nodemanager.recovery.NMStateStoreService.serviceInit(NMStateStoreService.java:195)&lt;BR /&gt;at org.apache.hadoop.service.AbstractService.init(AbstractService.java:163)&lt;BR /&gt;org.apache.hadoop.service.ServiceStateException: org.fusesource.leveldbjni.internal.NativeDB$DBException: Corruption: 3 missing files; e.g.: /tmp/hadoop-yarn/yarn-nm-recovery/yarn-nm-state/000032.sst&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 16 Sep 2022 09:13:23 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/CDH5-2-yarn-Error-starting-yarn-nodemanagers/m-p/21700#M38836</guid>
      <dc:creator>harsha_v</dc:creator>
      <dc:date>2022-09-16T09:13:23Z</dc:date>
    </item>
    <item>
      <title>Re: CDH5.2: yarn :Error starting yarn nodemanagers</title>
      <link>https://community.cloudera.com/t5/Support-Questions/CDH5-2-yarn-Error-starting-yarn-nodemanagers/m-p/21777#M38837</link>
      <description>&lt;P&gt;same issue for me too&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;chmod: changing permissions of `/var/run/cloudera-scm-agent/process/3669-yarn-NODEMANAGER/container-executor.cfg': Operation not permitted&lt;BR /&gt;chmod: changing permissions of `/var/run/cloudera-scm-agent/process/3669-yarn-NODEMANAGER/topology.map': Operation not permitted&lt;BR /&gt;+ exec /usr/lib/hadoop-yarn/bin/yarn nodemanager&lt;/P&gt;</description>
      <pubDate>Wed, 19 Nov 2014 03:50:10 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/CDH5-2-yarn-Error-starting-yarn-nodemanagers/m-p/21777#M38837</guid>
      <dc:creator>praveen25</dc:creator>
      <dc:date>2014-11-19T03:50:10Z</dc:date>
    </item>
    <item>
      <title>Re: CDH5.2: yarn :Error starting yarn nodemanagers</title>
      <link>https://community.cloudera.com/t5/Support-Questions/CDH5-2-yarn-Error-starting-yarn-nodemanagers/m-p/21824#M38838</link>
      <description>This is a totally different issue, as the error messages are different.&lt;BR /&gt;&lt;BR /&gt;What version of Cloudera Manager are you using? This may be a problem with /var/run being a noexec mount by default on your OS, which CM works around in more recent versions.</description>
      <pubDate>Wed, 19 Nov 2014 19:52:56 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/CDH5-2-yarn-Error-starting-yarn-nodemanagers/m-p/21824#M38838</guid>
      <dc:creator>Darren</dc:creator>
      <dc:date>2014-11-19T19:52:56Z</dc:date>
    </item>
    <item>
      <title>Re: CDH5.2: yarn :Error starting yarn nodemanagers</title>
      <link>https://community.cloudera.com/t5/Support-Questions/CDH5-2-yarn-Error-starting-yarn-nodemanagers/m-p/21894#M38839</link>
      <description>&lt;P&gt;Hi Harsha,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;This is a known issue with NM and restart recovrey turned on. We are not 100% sure how and why it happens yet and are looking for as much data as we can. Before we fix this please make a copy of the whole directory and zip it up :&lt;/P&gt;&lt;P&gt;&amp;nbsp; tar czf&amp;nbsp; yarn-recovery.tgz /tmp/hadoop-yarn&lt;/P&gt;&lt;P&gt;After you have done that remove the directory and start it again.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Can you also tell me how long the NM was up for and if you have a /tmp cleaner running on that host?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thank you,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Wlfred&lt;/P&gt;</description>
      <pubDate>Fri, 21 Nov 2014 00:59:34 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/CDH5-2-yarn-Error-starting-yarn-nodemanagers/m-p/21894#M38839</guid>
      <dc:creator>Wilfred</dc:creator>
      <dc:date>2014-11-21T00:59:34Z</dc:date>
    </item>
    <item>
      <title>Re: CDH5.2: yarn :Error starting yarn nodemanagers</title>
      <link>https://community.cloudera.com/t5/Support-Questions/CDH5-2-yarn-Error-starting-yarn-nodemanagers/m-p/21895#M38840</link>
      <description>&lt;P&gt;Praveen,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;This does not loo like the NM recovery issue.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;For this case can you tell me when this happens? This sounds and looks like the agent trying to change the permissions during the distribution. The two files have special settings and as dlo said in his update it is most likely a non execute mount or directory permission issue.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Wilfred&lt;/P&gt;</description>
      <pubDate>Fri, 21 Nov 2014 01:03:21 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/CDH5-2-yarn-Error-starting-yarn-nodemanagers/m-p/21895#M38840</guid>
      <dc:creator>Wilfred</dc:creator>
      <dc:date>2014-11-21T01:03:21Z</dc:date>
    </item>
    <item>
      <title>Re: CDH5.2: yarn :Error starting yarn nodemanagers</title>
      <link>https://community.cloudera.com/t5/Support-Questions/CDH5-2-yarn-Error-starting-yarn-nodemanagers/m-p/23927#M38841</link>
      <description>&lt;P&gt;&lt;SPAN&gt;Fixed the issue by deleting /tmp/hadoop-yarn/yarn-nm-recovery. LevelDB never writes in place. It always appends to a log file.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 22 Jan 2015 00:03:38 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/CDH5-2-yarn-Error-starting-yarn-nodemanagers/m-p/23927#M38841</guid>
      <dc:creator>Gurpreet27</dc:creator>
      <dc:date>2015-01-22T00:03:38Z</dc:date>
    </item>
    <item>
      <title>Re: CDH5.2: yarn :Error starting yarn nodemanagers</title>
      <link>https://community.cloudera.com/t5/Support-Questions/CDH5-2-yarn-Error-starting-yarn-nodemanagers/m-p/24821#M38842</link>
      <description>&lt;P&gt;Hi Wilfred,&lt;/P&gt;&lt;P&gt;Sorry for the late reply, never got notifed about movement on this thread..&lt;/P&gt;&lt;P&gt;I was able to resolve it then by having the /tmp/.../yarn-nm-state dir deleted and retstarting yarn..&lt;/P&gt;&lt;P&gt;But, to answer your question:&lt;/P&gt;&lt;P&gt;The NM was up atleast for a week and there may have been a /tmp cleaner for large files only..&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Do you have any more info as to why the issue occurs and timeline when this issue could be fixed?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 18 Feb 2015 19:04:45 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/CDH5-2-yarn-Error-starting-yarn-nodemanagers/m-p/24821#M38842</guid>
      <dc:creator>harsha_v</dc:creator>
      <dc:date>2015-02-18T19:04:45Z</dc:date>
    </item>
    <item>
      <title>Re: CDH5.2: yarn :Error starting yarn nodemanagers</title>
      <link>https://community.cloudera.com/t5/Support-Questions/CDH5-2-yarn-Error-starting-yarn-nodemanagers/m-p/24869#M38843</link>
      <description>&lt;P&gt;We have made a configuration&amp;nbsp; change in Cloudera Manager 5.2.1 which solves this issue. After upgrading the files will be moved to a different area which is not affected by the tmp cleaner.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Wilfred&lt;/P&gt;</description>
      <pubDate>Fri, 20 Feb 2015 06:33:25 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/CDH5-2-yarn-Error-starting-yarn-nodemanagers/m-p/24869#M38843</guid>
      <dc:creator>Wilfred</dc:creator>
      <dc:date>2015-02-20T06:33:25Z</dc:date>
    </item>
    <item>
      <title>Re: CDH5.2: yarn :Error starting yarn nodemanagers</title>
      <link>https://community.cloudera.com/t5/Support-Questions/CDH5-2-yarn-Error-starting-yarn-nodemanagers/m-p/24878#M38844</link>
      <description>&lt;P&gt;Thanks for the update!&lt;/P&gt;</description>
      <pubDate>Fri, 20 Feb 2015 17:55:58 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/CDH5-2-yarn-Error-starting-yarn-nodemanagers/m-p/24878#M38844</guid>
      <dc:creator>harsha_v</dc:creator>
      <dc:date>2015-02-20T17:55:58Z</dc:date>
    </item>
    <item>
      <title>Re: CDH5.2: yarn :Error starting yarn nodemanagers</title>
      <link>https://community.cloudera.com/t5/Support-Questions/CDH5-2-yarn-Error-starting-yarn-nodemanagers/m-p/41279#M38845</link>
      <description>&lt;P&gt;I have tried the solutions mentioned but still getting the ERROR. Its CDH5.7.Please help me to get it resolved.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;PRE&gt;Error starting NodeManager
org.apache.hadoop.service.ServiceStateException: EPERM: Operation not permitted
	at org.apache.hadoop.service.ServiceStateException.convert(ServiceStateException.java:59)
	at org.apache.hadoop.service.AbstractService.init(AbstractService.java:172)
	at org.apache.hadoop.yarn.server.nodemanager.NodeManager.initAndStartNodeManager(NodeManager.java:474)
	at org.apache.hadoop.yarn.server.nodemanager.NodeManager.main(NodeManager.java:521)
Caused by: EPERM: Operation not permitted
	at org.apache.hadoop.io.nativeio.NativeIO$POSIX.chmodImpl(Native Method)
	at org.apache.hadoop.io.nativeio.NativeIO$POSIX.chmod(NativeIO.java:230)
	at org.apache.hadoop.fs.RawLocalFileSystem.setPermission(RawLocalFileSystem.java:660)
	at org.apache.hadoop.fs.RawLocalFileSystem.mkdirs(RawLocalFileSystem.java:452)
	at org.apache.hadoop.fs.FilterFileSystem.mkdirs(FilterFileSystem.java:309)
	at org.apache.hadoop.yarn.server.nodemanager.NodeManager.initAndStartRecoveryStore(NodeManager.java:152)
	at org.apache.hadoop.yarn.server.nodemanager.NodeManager.serviceInit(NodeManager.java:195)
	at org.apache.hadoop.service.AbstractService.init(AbstractService.java:163)
	... 2 more&lt;/PRE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 25 May 2016 12:14:15 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/CDH5-2-yarn-Error-starting-yarn-nodemanagers/m-p/41279#M38845</guid>
      <dc:creator>Sidharth27</dc:creator>
      <dc:date>2016-05-25T12:14:15Z</dc:date>
    </item>
    <item>
      <title>Re: CDH5.2: yarn :Error starting yarn nodemanagers</title>
      <link>https://community.cloudera.com/t5/Support-Questions/CDH5-2-yarn-Error-starting-yarn-nodemanagers/m-p/41287#M38846</link>
      <description>&lt;P&gt;Sidharth,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Please create a new thread for a new issue, re-using an old thread could lead to strange comments when people make assumptions based on irrelevant information.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;For your issue: EPERM means that the OS is not allowing you to create the NM recovery DB and you have recovery turned on. Check the access to the recovery DB directory that you have configured.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Wilfred&lt;/P&gt;</description>
      <pubDate>Wed, 25 May 2016 07:34:00 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/CDH5-2-yarn-Error-starting-yarn-nodemanagers/m-p/41287#M38846</guid>
      <dc:creator>Wilfred</dc:creator>
      <dc:date>2016-05-25T07:34:00Z</dc:date>
    </item>
  </channel>
</rss>

