<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Yarn NodeManager fails to start and crashing with SIGBUS in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Yarn-NodeManager-fails-to-start-and-crashing-with-SIGBUS/m-p/66590#M77469</link>
    <description>&lt;P&gt;Hi,&lt;BR /&gt;&lt;SPAN&gt;Hi,&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;In &lt;STRONG&gt;CDH 5.12.0&lt;/STRONG&gt; and &lt;STRONG&gt;5.14.2&lt;/STRONG&gt; releases (centos 6.9) the Yarn &lt;STRONG&gt;NodeManager fails to start&lt;/STRONG&gt; and crashing with &lt;STRONG&gt;SIGBUS&lt;/STRONG&gt;.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;Here is the error msg in :&lt;/SPAN&gt;&lt;/P&gt;&lt;PRE&gt;#
# A fatal error has been detected by the Java Runtime Environment:
#
#  SIGBUS (0x7) at pc=0x00007f4d5b1aff4f, pid=20067, tid=0x00007f4d869dd700
#
# JRE version: Java(TM) SE Runtime Environment (8.0_144-b01) (build 1.8.0_144-b01)
# Java VM: Java HotSpot(TM) 64-Bit Server VM (25.144-b01 mixed mode linux-amd64 compressed oops)
# Problematic frame:
# C  [libleveldbjni-64-1-5336493915245210176.8+0x4af4f]  snappy::RawUncompress(snappy::Source*, char*)+0x31f
#
# Failed to write core dump. Core dumps have been disabled. To enable core dumping, try "ulimit -c unlimited" before starting Java again
#
# An error report file with more information is saved as:
# /var/run/cloudera-scm-agent/process/14104-yarn-NODEMANAGER/hs_err_pid20067.log
#
# If you would like to submit a bug report, please visit:
#   http://bugreport.java.com/bugreport/crash.jsp
# The crash happened outside the Java Virtual Machine in native code.
# See problematic frame for where to report the bug.
#&lt;/PRE&gt;&lt;P&gt;&lt;SPAN&gt;Here is the&amp;nbsp;&lt;/SPAN&gt;&lt;U&gt;&lt;FONT color="#333333"&gt;hs_err_pid20067.log&lt;/FONT&gt;&lt;/U&gt;&lt;SPAN&gt; file:&amp;nbsp;&lt;A href="https://ufile.io/dl8lu" target="_self"&gt;https://ufile.io/dl8lu&lt;/A&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;BR /&gt;&lt;EM&gt;JIRA link&lt;/EM&gt;:&amp;nbsp;&lt;A href="https://issues.apache.org/jira/browse/YARN-8190" target="_self"&gt;https://issues.apache.org/jira/browse/YARN-8190&lt;/A&gt;&lt;/P&gt;</description>
    <pubDate>Fri, 16 Sep 2022 13:07:38 GMT</pubDate>
    <dc:creator>AcharkiMed</dc:creator>
    <dc:date>2022-09-16T13:07:38Z</dc:date>
    <item>
      <title>Yarn NodeManager fails to start and crashing with SIGBUS</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Yarn-NodeManager-fails-to-start-and-crashing-with-SIGBUS/m-p/66590#M77469</link>
      <description>&lt;P&gt;Hi,&lt;BR /&gt;&lt;SPAN&gt;Hi,&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;In &lt;STRONG&gt;CDH 5.12.0&lt;/STRONG&gt; and &lt;STRONG&gt;5.14.2&lt;/STRONG&gt; releases (centos 6.9) the Yarn &lt;STRONG&gt;NodeManager fails to start&lt;/STRONG&gt; and crashing with &lt;STRONG&gt;SIGBUS&lt;/STRONG&gt;.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;Here is the error msg in :&lt;/SPAN&gt;&lt;/P&gt;&lt;PRE&gt;#
# A fatal error has been detected by the Java Runtime Environment:
#
#  SIGBUS (0x7) at pc=0x00007f4d5b1aff4f, pid=20067, tid=0x00007f4d869dd700
#
# JRE version: Java(TM) SE Runtime Environment (8.0_144-b01) (build 1.8.0_144-b01)
# Java VM: Java HotSpot(TM) 64-Bit Server VM (25.144-b01 mixed mode linux-amd64 compressed oops)
# Problematic frame:
# C  [libleveldbjni-64-1-5336493915245210176.8+0x4af4f]  snappy::RawUncompress(snappy::Source*, char*)+0x31f
#
# Failed to write core dump. Core dumps have been disabled. To enable core dumping, try "ulimit -c unlimited" before starting Java again
#
# An error report file with more information is saved as:
# /var/run/cloudera-scm-agent/process/14104-yarn-NODEMANAGER/hs_err_pid20067.log
#
# If you would like to submit a bug report, please visit:
#   http://bugreport.java.com/bugreport/crash.jsp
# The crash happened outside the Java Virtual Machine in native code.
# See problematic frame for where to report the bug.
#&lt;/PRE&gt;&lt;P&gt;&lt;SPAN&gt;Here is the&amp;nbsp;&lt;/SPAN&gt;&lt;U&gt;&lt;FONT color="#333333"&gt;hs_err_pid20067.log&lt;/FONT&gt;&lt;/U&gt;&lt;SPAN&gt; file:&amp;nbsp;&lt;A href="https://ufile.io/dl8lu" target="_self"&gt;https://ufile.io/dl8lu&lt;/A&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;BR /&gt;&lt;EM&gt;JIRA link&lt;/EM&gt;:&amp;nbsp;&lt;A href="https://issues.apache.org/jira/browse/YARN-8190" target="_self"&gt;https://issues.apache.org/jira/browse/YARN-8190&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 16 Sep 2022 13:07:38 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Yarn-NodeManager-fails-to-start-and-crashing-with-SIGBUS/m-p/66590#M77469</guid>
      <dc:creator>AcharkiMed</dc:creator>
      <dc:date>2022-09-16T13:07:38Z</dc:date>
    </item>
    <item>
      <title>Re: Yarn NodeManager fails to start and crashing with SIGBUS</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Yarn-NodeManager-fails-to-start-and-crashing-with-SIGBUS/m-p/67382#M77470</link>
      <description>The pattern of your issue isn't clear - could you help answer a few more questions?&lt;BR /&gt;&lt;BR /&gt;- Is this consistently occurring on all your NodeManagers?&lt;BR /&gt;- Did this start occurring after you upgraded? If yes, what was the earlier version and the upgraded version?&lt;BR /&gt;- Did this instead start occurring after an abrupt restart of the daemon or the host?&lt;BR /&gt;- Do you have NodeManager logs covering the earliest time period this issue was observed? Could you share those here?&lt;BR /&gt;&lt;BR /&gt;Overall this appears to be related to NodeManager's container recovery feature (a corruption of the data stored for this feature in the local filesystem of the NodeManager) and you should be able to bypass the issue if you (re)moved the contents of /var/lib/hadoop-yarn/yarn-nm-recovery/ directory on the affected NodeManagers. This effectively resets the states maintained, which should be OK to perform on a NodeManager that is down.&lt;BR /&gt;&lt;BR /&gt;Full trace for posterity:&lt;BR /&gt;&lt;BR /&gt;Java frames: (J=compiled Java code, j=interpreted, Vv=VM code)&lt;BR /&gt;j org.fusesource.leveldbjni.internal.NativeDB$DBJNI.Get(JLorg/fusesource/leveldbjni/internal/NativeReadOptions;Lorg/fusesource/leveldbjni/internal/NativeSlice;J)J+0&lt;BR /&gt;j org.fusesource.leveldbjni.internal.NativeDB.get(Lorg/fusesource/leveldbjni/internal/NativeReadOptions;Lorg/fusesource/leveldbjni/internal/NativeSlice;)[B+22&lt;BR /&gt;j org.fusesource.leveldbjni.internal.NativeDB.get(Lorg/fusesource/leveldbjni/internal/NativeReadOptions;Lorg/fusesource/leveldbjni/internal/NativeBuffer;)[B+10&lt;BR /&gt;j org.fusesource.leveldbjni.internal.NativeDB.get(Lorg/fusesource/leveldbjni/internal/NativeReadOptions;[B)[B+20&lt;BR /&gt;j org.fusesource.leveldbjni.internal.JniDB.get([BLorg/iq80/leveldb/ReadOptions;)[B+27&lt;BR /&gt;j org.fusesource.leveldbjni.internal.JniDB.get([B)[B+26&lt;BR /&gt;j org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService.loadVersion()Lorg/apache/hadoop/yarn/server/records/Version;+9&lt;BR /&gt;j org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService.checkVersion()V+1&lt;BR /&gt;j org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService.initStorage(Lorg/apache/hadoop/conf/Configuration;)V+10&lt;BR /&gt;j org.apache.hadoop.yarn.server.nodemanager.recovery.NMStateStoreService.serviceInit(Lorg/apache/hadoop/conf/Configuration;)V+2&lt;BR /&gt;j org.apache.hadoop.service.AbstractService.init(Lorg/apache/hadoop/conf/Configuration;)V+80&lt;BR /&gt;j org.apache.hadoop.yarn.server.nodemanager.NodeManager.initAndStartRecoveryStore(Lorg/apache/hadoop/conf/Configuration;)V+98&lt;BR /&gt;j org.apache.hadoop.yarn.server.nodemanager.NodeManager.serviceInit(Lorg/apache/hadoop/conf/Configuration;)V+20&lt;BR /&gt;j org.apache.hadoop.service.AbstractService.init(Lorg/apache/hadoop/conf/Configuration;)V+80&lt;BR /&gt;j org.apache.hadoop.yarn.server.nodemanager.NodeManager.initAndStartNodeManager(Lorg/apache/hadoop/conf/Configuration;Z)V+50&lt;BR /&gt;j org.apache.hadoop.yarn.server.nodemanager.NodeManager.main([Ljava/lang/String;)V+39</description>
      <pubDate>Thu, 17 May 2018 08:12:07 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Yarn-NodeManager-fails-to-start-and-crashing-with-SIGBUS/m-p/67382#M77470</guid>
      <dc:creator>Harsh J</dc:creator>
      <dc:date>2018-05-17T08:12:07Z</dc:date>
    </item>
    <item>
      <title>Re: Yarn NodeManager fails to start and crashing with SIGBUS</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Yarn-NodeManager-fails-to-start-and-crashing-with-SIGBUS/m-p/67425#M77471</link>
      <description>&lt;P&gt;Hi &lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/213"&gt;@Harsh J&lt;/a&gt;&lt;BR /&gt;&lt;BR /&gt;It's only in one NodeManager, its happen suddenly without any upgrade in CDH 5.12.0 and even if I upgrade to 5.14.2 the issue persist..&lt;BR /&gt;Anyway your solution has resolve the issue.&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Thank you.&lt;/P&gt;</description>
      <pubDate>Fri, 18 May 2018 14:18:22 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Yarn-NodeManager-fails-to-start-and-crashing-with-SIGBUS/m-p/67425#M77471</guid>
      <dc:creator>AcharkiMed</dc:creator>
      <dc:date>2018-05-18T14:18:22Z</dc:date>
    </item>
  </channel>
</rss>

