<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Nodemanager fails to start in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Nodemanager-fails-to-start/m-p/132432#M55912</link>
    <description>&lt;P&gt;Hey Juan, &lt;/P&gt;&lt;P&gt;Thanks for this answer.  This actually did fix the nodemanager situation.  &lt;/P&gt;</description>
    <pubDate>Mon, 06 Mar 2017 23:00:28 GMT</pubDate>
    <dc:creator>jose_bonilla</dc:creator>
    <dc:date>2017-03-06T23:00:28Z</dc:date>
    <item>
      <title>Nodemanager fails to start</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Nodemanager-fails-to-start/m-p/132428#M55908</link>
      <description>&lt;P&gt;Hello Hortonworks Community,&lt;/P&gt;&lt;P&gt;I'm having issues with two of the NodeManagers on a 4-node cluster. The cluster is running CentOS 7 with HDP 2.5. I noticed that only 2/4 NodeManagers were started, so my first attempt to resolve the situation was to start the other two from the Ambari front end. After starting both of them, the same count was still reported: 2/4 started. Next, I removed the two NodeManagers that did not start and reinstalled them. This did not work either. The log (/var/log/hadoop-yarn/yarn/yarn-yarn-nodemanager-&amp;lt;FQDN&amp;gt;.log) shows this as the reason for the failed start:&lt;/P&gt;&lt;PRE&gt;2017-03-01 09:51:05,115 INFO  service.AbstractService (AbstractService.java:noteFailure(272)) - Service NodeManager failed in state INITED; cause: org.apache.hadoop.service.ServiceStateException: org.fusesource.leveldbjni.internal.NativeDB$DBException: Corruption: checksum mismatch
org.apache.hadoop.service.ServiceStateException: org.fusesource.leveldbjni.internal.NativeDB$DBException: Corruption: checksum mismatch
        at org.apache.hadoop.service.ServiceStateException.convert(ServiceStateException.java:59)
        at org.apache.hadoop.service.AbstractService.init(AbstractService.java:172)
        at org.apache.hadoop.yarn.server.nodemanager.NodeManager.initAndStartRecoveryStore(NodeManager.java:178)
        at org.apache.hadoop.yarn.server.nodemanager.NodeManager.serviceInit(NodeManager.java:220)
        at org.apache.hadoop.service.AbstractService.init(AbstractService.java:163)
        at org.apache.hadoop.yarn.server.nodemanager.NodeManager.initAndStartNodeManager(NodeManager.java:546)
        at org.apache.hadoop.yarn.server.nodemanager.NodeManager.main(NodeManager.java:594)
Caused by: org.fusesource.leveldbjni.internal.NativeDB$DBException: Corruption: checksum mismatch
        at org.fusesource.leveldbjni.internal.NativeDB.checkStatus(NativeDB.java:200)
        at org.fusesource.leveldbjni.internal.NativeDB.open(NativeDB.java:218)
        at org.fusesource.leveldbjni.JniDBFactory.open(JniDBFactory.java:168)
        at org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService.openDatabase(NMLeveldbStateStoreService.java:966)
        at org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService.initStorage(NMLeveldbStateStoreService.java:953)
        at org.apache.hadoop.yarn.server.nodemanager.recovery.NMStateStoreService.serviceInit(NMStateStoreService.java:200)
        at org.apache.hadoop.service.AbstractService.init(AbstractService.java:163)
        ... 5 more
2017-03-01 09:51:05,116 FATAL nodemanager.NodeManager (NodeManager.java:initAndStartNodeManager(549)) - Error starting NodeManager
org.apache.hadoop.service.ServiceStateException: org.fusesource.leveldbjni.internal.NativeDB$DBException: Corruption: checksum mismatch
        at org.apache.hadoop.service.ServiceStateException.convert(ServiceStateException.java:59)
        at org.apache.hadoop.service.AbstractService.init(AbstractService.java:172)
        at org.apache.hadoop.yarn.server.nodemanager.NodeManager.initAndStartRecoveryStore(NodeManager.java:178)
        at org.apache.hadoop.yarn.server.nodemanager.NodeManager.serviceInit(NodeManager.java:220)
        at org.apache.hadoop.service.AbstractService.init(AbstractService.java:163)
        at org.apache.hadoop.yarn.server.nodemanager.NodeManager.initAndStartNodeManager(NodeManager.java:546)
        at org.apache.hadoop.yarn.server.nodemanager.NodeManager.main(NodeManager.java:594)
Caused by: org.fusesource.leveldbjni.internal.NativeDB$DBException: Corruption: checksum mismatch
        at org.fusesource.leveldbjni.internal.NativeDB.checkStatus(NativeDB.java:200)
        at org.fusesource.leveldbjni.internal.NativeDB.open(NativeDB.java:218)
        at org.fusesource.leveldbjni.JniDBFactory.open(JniDBFactory.java:168)
        at org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService.openDatabase(NMLeveldbStateStoreService.java:966)
        at org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService.initStorage(NMLeveldbStateStoreService.java:953)
        at org.apache.hadoop.yarn.server.nodemanager.recovery.NMStateStoreService.serviceInit(NMStateStoreService.java:200)
        at org.apache.hadoop.service.AbstractService.init(AbstractService.java:163)
        ... 5 more
2017-03-01 09:51:05,120 INFO  nodemanager.NodeManager (LogAdapter.java:info(45)) - SHUTDOWN_MSG:
/************************************************************
SHUTDOWN_MSG: Shutting down NodeManager at &amp;lt;FQDN&amp;gt;/&amp;lt;IP&amp;gt;
************************************************************/
&lt;/PRE&gt;&lt;P&gt;Does anyone have any ideas on how to resolve this problem?&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;-Jose&lt;/P&gt;</description>
      <pubDate>Wed, 01 Mar 2017 23:01:46 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Nodemanager-fails-to-start/m-p/132428#M55908</guid>
      <dc:creator>jose_bonilla</dc:creator>
      <dc:date>2017-03-01T23:01:46Z</dc:date>
    </item>
    <item>
      <title>Re: Nodemanager fails to start</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Nodemanager-fails-to-start/m-p/132429#M55909</link>
      <description>&lt;P&gt;Hi Jose,&lt;/P&gt;&lt;P&gt;From the error log, it looks like the failure is caused by a checksum mismatch. Please refer to the links below. I hope they help.&lt;/P&gt;&lt;P&gt;1. &lt;A href="http://stackoverflow.com/questions/15434709/checksum-exception-when-reading-from-or-copying-to-hdfs-in-apache-hadoop" target="_blank"&gt;http://stackoverflow.com/questions/15434709/checksum-exception-when-reading-from-or-copying-to-hdfs-in-apache-hadoop&lt;/A&gt;&lt;/P&gt;&lt;P&gt;2. &lt;A href="https://issues.apache.org/jira/browse/HDFS-6804" target="_blank"&gt;https://issues.apache.org/jira/browse/HDFS-6804&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;Mahesh&lt;/P&gt;</description>
      <pubDate>Thu, 02 Mar 2017 23:05:54 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Nodemanager-fails-to-start/m-p/132429#M55909</guid>
      <dc:creator>maheshmsh88</dc:creator>
      <dc:date>2017-03-02T23:05:54Z</dc:date>
    </item>
    <item>
      <title>Re: Nodemanager fails to start</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Nodemanager-fails-to-start/m-p/132430#M55910</link>
      <description>&lt;P&gt;Thank you for the answer; however, neither link is about LevelDB, which is what the NodeManager uses.&lt;/P&gt;&lt;P&gt;Do you have any idea how to initialize LevelDB? I tried to find out, but I couldn't find any good article.&lt;/P&gt;</description>
      <pubDate>Fri, 03 Mar 2017 04:35:35 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Nodemanager-fails-to-start/m-p/132430#M55910</guid>
      <dc:creator>shigeru_takehar</dc:creator>
      <dc:date>2017-03-03T04:35:35Z</dc:date>
    </item>
    <item>
      <title>Re: Nodemanager fails to start</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Nodemanager-fails-to-start/m-p/132431#M55911</link>
      <description>&lt;P&gt;Hi Jose,&lt;/P&gt;&lt;P&gt;Maybe an SST file in the NodeManager's LevelDB recovery store got corrupted. Can you try removing the /var/log/hadoop-yarn/nodemanager/recovery-state folder from the failed NodeManagers and check whether they start?&lt;/P&gt;&lt;P&gt;These files stay on the system even if you decommission the nodes.&lt;/P&gt;</description>
      <pubDate>Sat, 04 Mar 2017 17:51:46 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Nodemanager-fails-to-start/m-p/132431#M55911</guid>
      <dc:creator>juan_manuel_nie</dc:creator>
      <dc:date>2017-03-04T17:51:46Z</dc:date>
    </item>
    <item>
      <title>Re: Nodemanager fails to start</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Nodemanager-fails-to-start/m-p/132432#M55912</link>
      <description>&lt;P&gt;Hey Juan, &lt;/P&gt;&lt;P&gt;Thanks for this answer.  This actually did fix the nodemanager situation.  &lt;/P&gt;</description>
      <pubDate>Mon, 06 Mar 2017 23:00:28 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Nodemanager-fails-to-start/m-p/132432#M55912</guid>
      <dc:creator>jose_bonilla</dc:creator>
      <dc:date>2017-03-06T23:00:28Z</dc:date>
    </item>
  </channel>
</rss>

