<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Failed to start role     -YARN- NodeManager (node) in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Failed-to-start-role-YARN-NodeManager-node/m-p/89500#M37119</link>
    <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/4054"&gt;@bgooley&lt;/a&gt;,&lt;/P&gt;&lt;P&gt;Thank you for your feedback and your clear explanation.&lt;/P&gt;&lt;P&gt;In fact, the problem was resolved by removing &lt;SPAN&gt;the contents of the /var/lib/hadoop-yarn/yarn-nm-recovery/ directory, after which the NodeManager role started successfully.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;I found the solution here:&lt;BR /&gt;&lt;A href="https://community.cloudera.com/t5/Batch-Processing-and-Workflow/Yarn-NodeManager-fails-to-start-and-crashing-with-SIGBUS/m-p/66590#M3611" target="_blank"&gt;https://community.cloudera.com/t5/Batch-Processing-and-Workflow/Yarn-NodeManager-fails-to-start-and-crashing-with-SIGBUS/m-p/66590#M3611&lt;/A&gt;&lt;/P&gt;</description>
    <pubDate>Wed, 24 Apr 2019 09:27:42 GMT</pubDate>
    <dc:creator>Bildervic</dc:creator>
    <dc:date>2019-04-24T09:27:42Z</dc:date>
    <item>
      <title>Failed to start role     -YARN- NodeManager (node)</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Failed-to-start-role-YARN-NodeManager-node/m-p/89470#M37117</link>
      <description>&lt;P&gt;Hello folks,&lt;/P&gt;&lt;P&gt;The NodeManager suddenly stopped on one instance (while it is still running on the other nodes/instances).&lt;BR /&gt;When I try to start or restart it via Cloudera Manager, an error is shown in the first step:&lt;/P&gt;&lt;PRE&gt;Failed to start role.&lt;/PRE&gt;&lt;P&gt;I'm using CentOS release 6.10 (Final).&lt;BR /&gt;What would you suggest I check in order to resolve this problem?&lt;BR /&gt;&lt;BR /&gt;Here's my stdout log:&lt;/P&gt;&lt;PRE&gt;Tue Apr 23 10:18:56 PDT 2019
JAVA_HOME=/usr/java/jdk.1.8.0_144
using /usr/java/jdk.1.8.0_144 as JAVA_HOME
using 5 as CDH_VERSION
using /opt/cloudera/parcels/CDH-5.14.2-1.cdh5.14.2.p0.3/lib/hadoop-yarn as CDH_YARN_HOME
using /opt/cloudera/parcels/CDH-5.14.2-1.cdh5.14.2.p0.3/lib/hadoop-mapreduce as CDH_MR2_HOME
using /var/run/cloudera-scm-agent/process/23960-yarn-NODEMANAGER as CONF_DIR
CONF_DIR=/var/run/cloudera-scm-agent/process/23960-yarn-NODEMANAGER
CMF_CONF_DIR=/etc/cloudera-scm-agent
#
# A fatal error has been detected by the Java Runtime Environment:
#
#  SIGBUS (0x7) at pc=0x00007f8c1fde51a1, pid=3004, tid=0x00007f8c4f44c700
#
# JRE version: Java(TM) SE Runtime Environment (8.0_144-b01) (build 1.8.0_144-b01)
# Java VM: Java HotSpot(TM) 64-Bit Server VM (25.144-b01 mixed mode linux-amd64 compressed oops)
# Problematic frame:
# C  [libleveldbjni-64-1-8170950501904951615.8+0x491a1]  leveldb::ReadBlock(leveldb::RandomAccessFile*, leveldb::ReadOptions const&amp;amp;, leveldb::BlockHandle const&amp;amp;, leveldb::BlockContents*)+0x191
#
# Failed to write core dump. Core dumps have been disabled. To enable core dumping, try "ulimit -c unlimited" before starting Java again
#&lt;/PRE&gt;&lt;P&gt;And this is the error output from my log.out:&lt;/P&gt;&lt;P&gt;NodeManager: Node Manager health check script is not available or doesn't have execute permission, so not starting the node health script runner.&lt;/P&gt;&lt;P&gt;AsyncDispatcher: Registering class org.apache.hadoop.yarn.server.nodemanager.containermanager.container.ContainerEventType for class org.apache.hadoop.yarn.server.nodemanager.containermanager.ContainerManagerImpl$ContainerEventDispatcher&lt;/P&gt;&lt;P&gt;AsyncDispatcher: Registering class org.apache.hadoop.yarn.server.nodemanager.containermanager.application.ApplicationEventType for class org.apache.hadoop.yarn.server.nodemanager.containermanager.ContainerManagerImpl$ApplicationEventDispatcher&lt;/P&gt;&lt;P&gt;AsyncDispatcher: Registering class org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.event.LocalizationEventType for class org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService&lt;/P&gt;&lt;P&gt;AsyncDispatcher: Registering class org.apache.hadoop.yarn.server.nodemanager.containermanager.AuxServicesEventType for class org.apache.hadoop.yarn.server.nodemanager.containermanager.AuxServices&lt;/P&gt;&lt;P&gt;AsyncDispatcher: Registering class org.apache.hadoop.yarn.server.nodemanager.containermanager.monitor.ContainersMonitorEventType for class org.apache.hadoop.yarn.server.nodemanager.containermanager.monitor.ContainersMonitorImpl&lt;/P&gt;&lt;P&gt;AsyncDispatcher: Registering class org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainersLauncherEventType for class org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainersLauncher&lt;/P&gt;</description>
      <pubDate>Fri, 16 Sep 2022 14:19:52 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Failed-to-start-role-YARN-NodeManager-node/m-p/89470#M37117</guid>
      <dc:creator>Bildervic</dc:creator>
      <dc:date>2022-09-16T14:19:52Z</dc:date>
    </item>
    <item>
      <title>Re: Failed to start role     -YARN- NodeManager (node)</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Failed-to-start-role-YARN-NodeManager-node/m-p/89471#M37118</link>
      <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/31926"&gt;@Bildervic&lt;/a&gt;,&lt;/P&gt;&lt;P&gt;"SIGBUS (0x7)" can mean a few things, but one of the most common causes is that a directory Java needs to use is full (no free disk space left).&lt;/P&gt;&lt;P&gt;The fact that your NodeManager was running, then failed, and then failed to start again supports that kind of cause.&lt;/P&gt;&lt;P&gt;Since the crash is in libleveldbjni, that gives us more evidence that a directory may be full, because it indicates Java was accessing local files (on disk).&lt;/P&gt;&lt;P&gt;I would suggest checking disk space on all volumes on that host. If there is a volume that is full, try freeing up some space and then start the NodeManager again.&lt;/P&gt;</description>
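      <content:encoded><![CDATA[
<P>A minimal sketch of the disk-space check suggested above, assuming Python 3 is available on the affected host. The list of paths and the 95% threshold are illustrative assumptions, not values from this thread; the function name <CODE>find_full_volumes</CODE> is hypothetical.</P>
<PRE>
```python
import os
import shutil
from typing import Iterable, List, Tuple


def find_full_volumes(paths: Iterable[str], threshold: float = 0.95) -> List[Tuple[str, float]]:
    """Return (path, used_fraction) for each existing path at or above threshold."""
    full = []
    for path in paths:
        if not os.path.exists(path):
            continue  # skip paths that are absent on this host
        usage = shutil.disk_usage(path)  # named tuple: total, used, free (bytes)
        fraction = usage.used / usage.total if usage.total else 0.0
        if fraction >= threshold:
            full.append((path, fraction))
    return full


if __name__ == "__main__":
    # Hypothetical candidates; list the mount points that matter on your host.
    candidates = ["/", "/tmp", "/var/log", "/var/lib/hadoop-yarn"]
    for path, fraction in find_full_volumes(candidates):
        print(f"{path}: {fraction:.1%} used")
```
</PRE>
<P>If nothing is printed, none of the listed paths sits on a volume at or above the threshold, and a full disk becomes a less likely explanation for the SIGBUS.</P>
]]></content:encoded>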
      <pubDate>Tue, 23 Apr 2019 18:44:08 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Failed-to-start-role-YARN-NodeManager-node/m-p/89471#M37118</guid>
      <dc:creator>bgooley</dc:creator>
      <dc:date>2019-04-23T18:44:08Z</dc:date>
    </item>
    <item>
      <title>Re: Failed to start role     -YARN- NodeManager (node)</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Failed-to-start-role-YARN-NodeManager-node/m-p/89500#M37119</link>
      <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/4054"&gt;@bgooley&lt;/a&gt;,&lt;/P&gt;&lt;P&gt;Thank you for your feedback and your clear explanation.&lt;/P&gt;&lt;P&gt;In fact, the problem was resolved by removing &lt;SPAN&gt;the contents of the /var/lib/hadoop-yarn/yarn-nm-recovery/ directory, after which the NodeManager role started successfully.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;I found the solution here:&lt;BR /&gt;&lt;A href="https://community.cloudera.com/t5/Batch-Processing-and-Workflow/Yarn-NodeManager-fails-to-start-and-crashing-with-SIGBUS/m-p/66590#M3611" target="_blank"&gt;https://community.cloudera.com/t5/Batch-Processing-and-Workflow/Yarn-NodeManager-fails-to-start-and-crashing-with-SIGBUS/m-p/66590#M3611&lt;/A&gt;&lt;/P&gt;</description>
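      <content:encoded><![CDATA[
<P>The fix described above (emptying /var/lib/hadoop-yarn/yarn-nm-recovery/) could be done a little more cautiously by moving the contents aside instead of deleting them. This is only a sketch under stated assumptions: Python 3 is available, the NodeManager role is stopped first, and /var/tmp is a hypothetical backup location with enough free space. The function name <CODE>move_aside_contents</CODE> is illustrative, and the dry-run default only lists what would be moved.</P>
<PRE>
```python
import shutil
import time
from pathlib import Path
from typing import List


def move_aside_contents(recovery_dir: str, backup_root: str, dry_run: bool = True) -> List[str]:
    """Move every entry of recovery_dir into a timestamped folder under backup_root.

    Returns the names of the entries that were (or, in a dry run, would be) moved.
    """
    src = Path(recovery_dir)
    dest = Path(backup_root) / ("yarn-nm-recovery-" + time.strftime("%Y%m%d-%H%M%S"))
    moved = []
    for entry in sorted(src.iterdir()):
        moved.append(entry.name)
        if not dry_run:
            dest.mkdir(parents=True, exist_ok=True)
            shutil.move(str(entry), str(dest / entry.name))
    return moved


if __name__ == "__main__":
    recovery = "/var/lib/hadoop-yarn/yarn-nm-recovery"  # path from this thread
    backup = "/var/tmp"  # hypothetical backup location
    if Path(recovery).is_dir():
        # Dry run only: prints the entries without touching them.
        for name in move_aside_contents(recovery, backup, dry_run=True):
            print("would move:", name)
```
</PRE>
<P>Calling it with <CODE>dry_run=False</CODE> performs the move. Whatever state the NodeManager kept in that directory is then no longer used at the next start, so treat this as a last resort once a full disk has been ruled out.</P>
]]></content:encoded>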
      <pubDate>Wed, 24 Apr 2019 09:27:42 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Failed-to-start-role-YARN-NodeManager-node/m-p/89500#M37119</guid>
      <dc:creator>Bildervic</dc:creator>
      <dc:date>2019-04-24T09:27:42Z</dc:date>
    </item>
  </channel>
</rss>

