<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Fails to start ambari-metrics-collector in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Fails-to-start-ambari-metrics-collector/m-p/235251#M197071</link>
    <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/115264/bze16036.html"&gt;@YOSUKE SHIBUYA&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;In your "&lt;A href="https://community.cloudera.com/legacyfs/online/attachments/109826-hbase-ams-master-kvm07log.txt"&gt;hbase-ams-master-kvm07log.txt&lt;/A&gt;" log we see the following message.&lt;/P&gt;&lt;PRE&gt;2019-07-11 19:11:58,731 INFO &amp;nbsp;[Thread-23] wal.ProcedureWALFile: Opening file:/var/lib/ambari-metrics-collector/hbase/MasterProcWALs/pv2-00000000000000000001.log length=45336
2019-07-11 19:11:58,743 WARN &amp;nbsp;[Thread-23] wal.WALProcedureStore: Unable to read tracker for file:/var/lib/ambari-metrics-collector/hbase/MasterProcWALs/pv2-00000000000000000001.log
org.apache.hadoop.hbase.procedure2.store.wal.ProcedureWALFormat$InvalidWALDataException: Invalid Trailer version. got 48 expected 1
&amp;nbsp; &amp;nbsp; at org.apache.hadoop.hbase.procedure2.store.wal.ProcedureWALFormat.readTrailer(ProcedureWALFormat.java:189)&lt;/PRE&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Looks like the WAL Data "/var/lib/ambari-metrics-collector/hbase/MasterProcWALs/" got corrupted.&lt;/P&gt;&lt;PRE&gt;# ls -lart /var/lib/ambari-metrics-collector/hbase/MasterProcWALs/*&lt;/PRE&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;May be you can take a backup of the dir "/var/lib/ambari-metrics-collector/hbase/"&lt;/P&gt;&lt;P&gt;and then try to clean the file present inside the "/var/lib/ambari-metrics-collector/hbase/MasterProcWALs/*"&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Then try to perform a tmp dir cleanup. After taking a backup of "/var/lib/ambari-metrics-collector/hbase-tmp/" Then&lt;/P&gt;&lt;P&gt;remove the AMS zookeeper data by backing up and removing the contents of 'hbase.tmp.dir'/zookeeper AND any Phoenix spool files from 'hbase.tmp.dir'/phoenix-spool folder&lt;/P&gt;&lt;P&gt;"hbase.tmp.dir": (default value: /var/lib/ambari-metrics-collector/hbase-tmp) This is on local filesystem for both modes:&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;PRE&gt;# rm -fr /var/lib/ambari-metrics-collector/hbase-tmp/zookeeper/*
# rm -fr /var/lib/ambari-metrics-collector/hbase-tmp/phoenix-spool/*&lt;/PRE&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Then try to restart the AMS.&lt;/P&gt;&lt;P&gt;Better if you also increase the Metrics Collector Heap Size 1024MB and HBase Master Maximum Memory 2048MB. (or 4096MB) if you repeatedly see similar issue.&lt;/P&gt;</description>
    <pubDate>Thu, 11 Jul 2019 20:06:43 GMT</pubDate>
    <dc:creator>jsensharma</dc:creator>
    <dc:date>2019-07-11T20:06:43Z</dc:date>
    <item>
      <title>Fails to start ambari-metrics-collector</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Fails-to-start-ambari-metrics-collector/m-p/235248#M197068</link>
      <description>&lt;P&gt;We have failed to start ambari-metrics-collector.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;The following error appeared in hbase-ams-master.log.&lt;/P&gt;&lt;P&gt;I can not find another ERROR, what should I check?&lt;/P&gt;&lt;P&gt;----------------&lt;/P&gt;&lt;P&gt;/var/log/ambari-metrics-collector/hbase-ams-master-host.log&lt;/P&gt;&lt;P&gt;2019-07-11 15: 34: 41,040 ERROR [main] master.HMasterCommandLine: Master exiting&lt;/P&gt;&lt;P&gt;java.lang.RuntimeException: Master not initialized after 200000ms&lt;/P&gt;&lt;P&gt;        at org.apache.hadoop.hbase.util.JVMClusterUtil.waitForEvent (JVMClusterUtil.java: 229)&lt;/P&gt;&lt;P&gt;        at org.apache.hadoop.hbase.util.JVMClusterUtil.startup (JVMClusterUtil.java: 197)&lt;/P&gt;&lt;P&gt;        at org.apache.hadoop.hbase.LocalHBaseCluster.startup (LocalHBaseCluster.java:413)&lt;/P&gt;&lt;P&gt;        at org.apache.hadoop.hbase.master.HMasterCommandLine.startMaster (HMasterCommandLine.java: 232)&lt;/P&gt;&lt;P&gt;        at org.apache.hadoop.hbase.master.HMasterCommandLine.run (HMasterCommandLine.java: 140)&lt;/P&gt;&lt;P&gt;        at org.apache.hadoop.util.ToolRunner.run (ToolRunner.java: 76)&lt;/P&gt;&lt;P&gt;        at org.apache.hadoop.hbase.util.ServerCommandLine.doMain (ServerCommandLine.java: 149)&lt;/P&gt;&lt;P&gt;        at org.apache.hadoop.hbase.master.HMaster.main (HMaster.java:3100)&lt;/P&gt;&lt;P&gt;2019-07-11 15: 34: 41,043 INFO [shutdown-hook-0] regionserver.ShutdownHook: Shutdown hook starting; hbase.shutdown.hook = true; fsShutdownHook = org.apache.hadoop.fs.FileSystem $ Cache $ ClientFinalizer @ 4a29f290&lt;/P&gt;&lt;P&gt;2019-07-11 15: 34: 41,044 INFO [shutdown-hook-0] regionserver.HRegionServer: ***** STOPPING region server 'areaportal-kvm07, 61320, 1562826676313' *****&lt;/P&gt;&lt;P&gt;2019-07-11 15: 34: 41,044 INFO [shutdown-hook-0] regionserver.HRegionServer: STOPPED: Shutdown hook&lt;/P&gt;</description>
      <pubDate>Thu, 11 Jul 2019 16:24:57 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Fails-to-start-ambari-metrics-collector/m-p/235248#M197068</guid>
      <dc:creator>bze16036</dc:creator>
      <dc:date>2019-07-11T16:24:57Z</dc:date>
    </item>
    <item>
      <title>Re: Fails to start ambari-metrics-collector</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Fails-to-start-ambari-metrics-collector/m-p/235249#M197069</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/115264/bze16036.html" nodeid="115264"&gt;@YOSUKE SHIBUYA&lt;/A&gt;&lt;/P&gt;&lt;P&gt;The error snippet which you posted is just the after effect of the actual cause and a very generic message.&lt;/P&gt;&lt;P&gt;Can you please share the following logs for initial review?&lt;/P&gt;&lt;PRE&gt;/var/log/ambari-metrics-collector/ambari-metrics-collector.log
/var/log/ambari-metrics-collector/hbase-ams-master-xxxxxxxx.log
/var/log/ambari-metrics-collector/gc.log
/var/log/ambari-metrics-collector/collector-gc.log&lt;/PRE&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Also most probably the AMS failure can happen due to incorrect tuning or heavy load. So can you please let us know the following:&lt;/P&gt;&lt;P&gt;1. How many nodes are there in your cluster?&lt;/P&gt;&lt;P&gt;2. How much memory have you allocated to the AMS collector and HMaster.&lt;/P&gt;&lt;P&gt;3. I guess you might be using default Embedded Mode AMS (not distributed) Both require slightly different kind of tuning.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 11 Jul 2019 16:31:14 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Fails-to-start-ambari-metrics-collector/m-p/235249#M197069</guid>
      <dc:creator>jsensharma</dc:creator>
      <dc:date>2019-07-11T16:31:14Z</dc:date>
    </item>
    <item>
      <title>Re: Fails to start ambari-metrics-collector</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Fails-to-start-ambari-metrics-collector/m-p/235250#M197070</link>
      <description>&lt;P&gt;@&lt;A rel="user" href="https://community.hortonworks.com/users/3418/jsensharma.html"&gt;Jay Kumar SenSharma&lt;/A&gt;&lt;/P&gt;&lt;P&gt;I have attached a log file.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;The cluster has four nodes. Each node has 32GB of memory.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;The memory is specified as follows.&lt;/P&gt;&lt;P&gt;   Metrics Collector Heap Size 512MB&lt;/P&gt;&lt;P&gt;   HBase Master Maximum Memory 1408 MB&lt;/P&gt;&lt;P&gt;   hbase_master_maxperm_size 128MB&lt;/P&gt;&lt;P&gt;   HBase Master maximum value for Xmn 1024MB&lt;/P&gt;&lt;P&gt;   HBase RegionServer Maximum Memory 768 MB&lt;/P&gt;</description>
      <pubDate>Thu, 11 Jul 2019 18:38:21 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Fails-to-start-ambari-metrics-collector/m-p/235250#M197070</guid>
      <dc:creator>bze16036</dc:creator>
      <dc:date>2019-07-11T18:38:21Z</dc:date>
    </item>
    <item>
      <title>Re: Fails to start ambari-metrics-collector</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Fails-to-start-ambari-metrics-collector/m-p/235251#M197071</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/115264/bze16036.html"&gt;@YOSUKE SHIBUYA&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;In your "&lt;A href="https://community.cloudera.com/legacyfs/online/attachments/109826-hbase-ams-master-kvm07log.txt"&gt;hbase-ams-master-kvm07log.txt&lt;/A&gt;" log we see the following message.&lt;/P&gt;&lt;PRE&gt;2019-07-11 19:11:58,731 INFO &amp;nbsp;[Thread-23] wal.ProcedureWALFile: Opening file:/var/lib/ambari-metrics-collector/hbase/MasterProcWALs/pv2-00000000000000000001.log length=45336
2019-07-11 19:11:58,743 WARN &amp;nbsp;[Thread-23] wal.WALProcedureStore: Unable to read tracker for file:/var/lib/ambari-metrics-collector/hbase/MasterProcWALs/pv2-00000000000000000001.log
org.apache.hadoop.hbase.procedure2.store.wal.ProcedureWALFormat$InvalidWALDataException: Invalid Trailer version. got 48 expected 1
&amp;nbsp; &amp;nbsp; at org.apache.hadoop.hbase.procedure2.store.wal.ProcedureWALFormat.readTrailer(ProcedureWALFormat.java:189)&lt;/PRE&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Looks like the WAL Data "/var/lib/ambari-metrics-collector/hbase/MasterProcWALs/" got corrupted.&lt;/P&gt;&lt;PRE&gt;# ls -lart /var/lib/ambari-metrics-collector/hbase/MasterProcWALs/*&lt;/PRE&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;May be you can take a backup of the dir "/var/lib/ambari-metrics-collector/hbase/"&lt;/P&gt;&lt;P&gt;and then try to clean the file present inside the "/var/lib/ambari-metrics-collector/hbase/MasterProcWALs/*"&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Then try to perform a tmp dir cleanup. After taking a backup of "/var/lib/ambari-metrics-collector/hbase-tmp/" Then&lt;/P&gt;&lt;P&gt;remove the AMS zookeeper data by backing up and removing the contents of 'hbase.tmp.dir'/zookeeper AND any Phoenix spool files from 'hbase.tmp.dir'/phoenix-spool folder&lt;/P&gt;&lt;P&gt;"hbase.tmp.dir": (default value: /var/lib/ambari-metrics-collector/hbase-tmp) This is on local filesystem for both modes:&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;PRE&gt;# rm -fr /var/lib/ambari-metrics-collector/hbase-tmp/zookeeper/*
# rm -fr /var/lib/ambari-metrics-collector/hbase-tmp/phoenix-spool/*&lt;/PRE&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Then try to restart the AMS.&lt;/P&gt;&lt;P&gt;Better if you also increase the Metrics Collector Heap Size 1024MB and HBase Master Maximum Memory 2048MB. (or 4096MB) if you repeatedly see similar issue.&lt;/P&gt;</description>
      <pubDate>Thu, 11 Jul 2019 20:06:43 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Fails-to-start-ambari-metrics-collector/m-p/235251#M197071</guid>
      <dc:creator>jsensharma</dc:creator>
      <dc:date>2019-07-11T20:06:43Z</dc:date>
    </item>
    <item>
      <title>Re: Fails to start ambari-metrics-collector</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Fails-to-start-ambari-metrics-collector/m-p/235252#M197072</link>
      <description>&lt;P&gt;@&lt;A rel="user" href="https://community.hortonworks.com/users/3418/jsensharma.html"&gt;Jay Kumar SenSharma&lt;/A&gt;&lt;/P&gt;&lt;P&gt;I am able to start ambari-metrics-collector.&lt;/P&gt;&lt;P&gt;Thank you for your support.&lt;/P&gt;</description>
      <pubDate>Thu, 11 Jul 2019 20:28:05 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Fails-to-start-ambari-metrics-collector/m-p/235252#M197072</guid>
      <dc:creator>bze16036</dc:creator>
      <dc:date>2019-07-11T20:28:05Z</dc:date>
    </item>
    <item>
      <title>Re: Fails to start ambari-metrics-collector</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Fails-to-start-ambari-metrics-collector/m-p/235253#M197073</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/115264/bze16036.html" nodeid="115264"&gt;@YOSUKE SHIBUYA&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Good to know that your issue is resolved. It will be great if you can mark this thread as Answered by clicking on the "Accept" button on the helpful answer.&lt;/P&gt;</description>
      <pubDate>Thu, 11 Jul 2019 20:46:32 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Fails-to-start-ambari-metrics-collector/m-p/235253#M197073</guid>
      <dc:creator>jsensharma</dc:creator>
      <dc:date>2019-07-11T20:46:32Z</dc:date>
    </item>
    <item>
      <title>Re: Fails to start ambari-metrics-collector</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Fails-to-start-ambari-metrics-collector/m-p/235254#M197074</link>
      <description>&lt;P&gt;The above question was originally posted in the &lt;A href="https://community.hortonworks.com/spaces/101/index.html"&gt;Community Help&lt;/A&gt; track. On Sun Jul 14 17:04 UTC 2019, a member of the HCC moderation staff moved it to the &lt;A href="https://community.hortonworks.com/spaces/61/operations-track_2.html"&gt;Cloud &amp;amp; Operations&lt;/A&gt; track. The &lt;EM&gt;Community Help Track&lt;/EM&gt; is intended for questions about using the HCC site itself.&lt;/P&gt;</description>
      <pubDate>Mon, 15 Jul 2019 00:06:19 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Fails-to-start-ambari-metrics-collector/m-p/235254#M197074</guid>
      <dc:creator>ask_bill_brooks</dc:creator>
      <dc:date>2019-07-15T00:06:19Z</dc:date>
    </item>
  </channel>
</rss>

