<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: HBASE &amp;quot;archive&amp;quot;.   How to clean?   My disk space is vanishing.... in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/HBASE-quot-archive-quot-How-to-clean-My-disk-space-is/m-p/212910#M174840</link>
    <description>&lt;P&gt;Check whether you have hbase.master.hfilecleaner.ttl configuration property in hbase-site.xml. It defines TTL for archived files.&lt;/P&gt;&lt;P&gt;Archive directory can keep:&lt;/P&gt;&lt;P&gt;1. old WAL files&lt;/P&gt;&lt;P&gt;2. Old region files after compaction&lt;/P&gt;&lt;P&gt;3. files for snapshots. &lt;/P&gt;&lt;P&gt;I believe that you have some old snapshots and that's why you have so big archive directory. Delete snapshots that are not required  and those files will be deleted automatically.  &lt;/P&gt;</description>
    <pubDate>Sat, 05 Aug 2017 01:58:16 GMT</pubDate>
    <dc:creator>ssoldatov</dc:creator>
    <dc:date>2017-08-05T01:58:16Z</dc:date>
    <item>
      <title>HBASE "archive".   How to clean?   My disk space is vanishing....</title>
      <link>https://community.cloudera.com/t5/Support-Questions/HBASE-quot-archive-quot-How-to-clean-My-disk-space-is/m-p/212909#M174839</link>
      <description>&lt;P&gt;hi!  So, I'm the sysadmin of a hadoop cluster.  I am not a developer, nor do I "use" it.   But...   I make sure it's running and happy and secure and...  so on.&lt;BR /&gt;&lt;BR /&gt;In reviewing HDFS disk use lately, I noticed our numbers are kinda high.&lt;BR /&gt;&lt;BR /&gt;After some digging, it appears all of the space is going into hbase.  OK cool, that's what our developers are doing.   Stuffing things in hbase.  &lt;BR /&gt;&lt;BR /&gt;But I appear to be losing a bunch of disk space to the hbase "archives" folder.    Which is something I assume that hbase is putting stuff in when tables are deleted or...?&lt;BR /&gt;&lt;BR /&gt;I checked with one of our developers, he sees that in the archive there's tables he deleted long ago.&lt;BR /&gt;So... my simple question is, how do I clean out unneeded things from the hbase "archive"?   I assume manually deleting stuff via hdfs is **not** the way to go.&lt;BR /&gt;&lt;BR /&gt;[hdfs dfs -du -s -h /apps/hbase/data/*
&lt;BR /&gt;338.6 K  /apps/hbase/data/.hbase-snapshot
&lt;BR /&gt;0  /apps/hbase/data/.tmp
&lt;BR /&gt;20  /apps/hbase/data/MasterProcWALs
&lt;BR /&gt;830  /apps/hbase/data/WALs
&lt;BR /&gt;6.6 T  /apps/hbase/data/archive    &amp;lt;=== THIS.   &lt;BR /&gt;0  /apps/hbase/data/corrupt
&lt;BR /&gt;4.1 T  /apps/hbase/data/data
&lt;BR /&gt;42  /apps/hbase/data/hbase.id
&lt;BR /&gt;7  /apps/hbase/data/hbase.version
&lt;BR /&gt;30.7 K  /apps/hbase/data/oldWALs&lt;BR /&gt;&lt;BR /&gt;ANY and all help for an hbase newbie would be really appreciated&lt;/P&gt;</description>
      <pubDate>Sat, 05 Aug 2017 01:35:28 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/HBASE-quot-archive-quot-How-to-clean-My-disk-space-is/m-p/212909#M174839</guid>
      <dc:creator>kbrodie</dc:creator>
      <dc:date>2017-08-05T01:35:28Z</dc:date>
    </item>
    <item>
      <title>Re: HBASE "archive".   How to clean?   My disk space is vanishing....</title>
      <link>https://community.cloudera.com/t5/Support-Questions/HBASE-quot-archive-quot-How-to-clean-My-disk-space-is/m-p/212910#M174840</link>
      <description>&lt;P&gt;Check whether you have hbase.master.hfilecleaner.ttl configuration property in hbase-site.xml. It defines TTL for archived files.&lt;/P&gt;&lt;P&gt;Archive directory can keep:&lt;/P&gt;&lt;P&gt;1. old WAL files&lt;/P&gt;&lt;P&gt;2. Old region files after compaction&lt;/P&gt;&lt;P&gt;3. files for snapshots. &lt;/P&gt;&lt;P&gt;I believe that you have some old snapshots and that's why you have so big archive directory. Delete snapshots that are not required  and those files will be deleted automatically.  &lt;/P&gt;</description>
      <pubDate>Sat, 05 Aug 2017 01:58:16 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/HBASE-quot-archive-quot-How-to-clean-My-disk-space-is/m-p/212910#M174840</guid>
      <dc:creator>ssoldatov</dc:creator>
      <dc:date>2017-08-05T01:58:16Z</dc:date>
    </item>
    <item>
      <title>Re: HBASE "archive".   How to clean?   My disk space is vanishing....</title>
      <link>https://community.cloudera.com/t5/Support-Questions/HBASE-quot-archive-quot-How-to-clean-My-disk-space-is/m-p/212911#M174841</link>
      <description>&lt;P&gt;As far as I can fine, the hbase.master.hfilecleaner.ttl value was not set at all.   (does that then mean..  NO cleaning?).   I set it to 900000 (15 minutes) and we'll see if anything happens.&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;</description>
      <pubDate>Sat, 05 Aug 2017 02:14:31 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/HBASE-quot-archive-quot-How-to-clean-My-disk-space-is/m-p/212911#M174841</guid>
      <dc:creator>kbrodie</dc:creator>
      <dc:date>2017-08-05T02:14:31Z</dc:date>
    </item>
    <item>
      <title>Re: HBASE "archive".   How to clean?   My disk space is vanishing....</title>
      <link>https://community.cloudera.com/t5/Support-Questions/HBASE-quot-archive-quot-How-to-clean-My-disk-space-is/m-p/212912#M174842</link>
      <description>&lt;P&gt;Actually that's supposed to be something like 5 minutes by default. So, check whether you have any old snapshots that you don't need anymore. &lt;/P&gt;</description>
      <pubDate>Sat, 05 Aug 2017 02:21:29 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/HBASE-quot-archive-quot-How-to-clean-My-disk-space-is/m-p/212912#M174842</guid>
      <dc:creator>ssoldatov</dc:creator>
      <dc:date>2017-08-05T02:21:29Z</dc:date>
    </item>
    <item>
      <title>Re: HBASE "archive".   How to clean?   My disk space is vanishing....</title>
      <link>https://community.cloudera.com/t5/Support-Questions/HBASE-quot-archive-quot-How-to-clean-My-disk-space-is/m-p/212913#M174843</link>
      <description>&lt;P&gt;You're exactly right that you shouldn't delete things by hand &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;&lt;P&gt;If you're on &amp;gt;=HDP-2.5.x, make sure to disable the HBase backup feature. This can hold on to archived WALs. You'd want to set hbase.backup.enable=false in hbase-site.xml.&lt;/P&gt;&lt;P&gt;If you have HBase replication set up, that's also another potential candidate for why those files are not being automatically removed. Lots of HBase snapshots are another candidate (like Sergey suggested already) -- drop the old snapshots you don't need anymore).&lt;/P&gt;&lt;P&gt;Turning on DEBUG in the HBase master should give you some insight to the various "Chores" that run inside the Master to automatically remove (or retain) data.&lt;/P&gt;</description>
      <pubDate>Sat, 05 Aug 2017 02:26:45 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/HBASE-quot-archive-quot-How-to-clean-My-disk-space-is/m-p/212913#M174843</guid>
      <dc:creator>elserj</dc:creator>
      <dc:date>2017-08-05T02:26:45Z</dc:date>
    </item>
    <item>
      <title>Re: HBASE "archive".   How to clean?   My disk space is vanishing....</title>
      <link>https://community.cloudera.com/t5/Support-Questions/HBASE-quot-archive-quot-How-to-clean-My-disk-space-is/m-p/212914#M174844</link>
      <description>&lt;A rel="user" href="https://community.cloudera.com/users/13333/brodie.html" nodeid="13333"&gt;@Kent Brodie&lt;/A&gt;&lt;P&gt;I am assuming you run major compactions probably once a week or some regular schedule. So that is not an issue.&lt;/P&gt;&lt;P&gt;Do you have a lot of snapshots? Here is how snapshots work. When you create a snapshot, it only captures metadata at that point in time. So in case you ever have to restore to that point in time, you restore snapshot. Through metadata that was captured, Snapshot knows which data to restore.&lt;/P&gt;&lt;P&gt;Now, as HBase is running, you might be deleting data. Usually when Major compaction runs, your deleted data is gone for good. Disk space is recovered. However, if you have Snapshots created which are pointing to data that is being deleted, HBase will not delete that data because what if you trying to recover to that particular point in time by restoring the snapshot? So, in that case, the data that snapshot is pointing to is moved to archive folder.&lt;BR /&gt;&lt;BR /&gt;The more Snapshots you have, the more archive folder will grow as needed by Snapshots.&lt;/P&gt;&lt;P&gt;I can only guess, but a reasonable guess of what you are seeing is that you have too many snapshots.&lt;/P&gt;</description>
      <pubDate>Sat, 05 Aug 2017 02:28:11 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/HBASE-quot-archive-quot-How-to-clean-My-disk-space-is/m-p/212914#M174844</guid>
      <dc:creator>mqureshi</dc:creator>
      <dc:date>2017-08-05T02:28:11Z</dc:date>
    </item>
    <item>
      <title>Re: HBASE "archive".   How to clean?   My disk space is vanishing....</title>
      <link>https://community.cloudera.com/t5/Support-Questions/HBASE-quot-archive-quot-How-to-clean-My-disk-space-is/m-p/212915#M174845</link>
      <description>&lt;P&gt;yup yup yup.   Found the snapshots....      guessing THAT is the culprit.  Time to have a conversation with the developers....   there's.. a lot.  &lt;/P&gt;</description>
      <pubDate>Sat, 05 Aug 2017 02:39:57 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/HBASE-quot-archive-quot-How-to-clean-My-disk-space-is/m-p/212915#M174845</guid>
      <dc:creator>kbrodie</dc:creator>
      <dc:date>2017-08-05T02:39:57Z</dc:date>
    </item>
    <item>
      <title>Re: HBASE "archive".   How to clean?   My disk space is vanishing....</title>
      <link>https://community.cloudera.com/t5/Support-Questions/HBASE-quot-archive-quot-How-to-clean-My-disk-space-is/m-p/212916#M174846</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/13333/brodie.html" nodeid="13333"&gt;@Kent Brodie&lt;BR /&gt;&lt;/A&gt;Did you get a solution? Please share&lt;/P&gt;</description>
      <pubDate>Sat, 23 Dec 2017 11:13:46 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/HBASE-quot-archive-quot-How-to-clean-My-disk-space-is/m-p/212916#M174846</guid>
      <dc:creator>mayank_mahajan0</dc:creator>
      <dc:date>2017-12-23T11:13:46Z</dc:date>
    </item>
    <item>
      <title>Re: HBASE "archive".   How to clean?   My disk space is vanishing....</title>
      <link>https://community.cloudera.com/t5/Support-Questions/HBASE-quot-archive-quot-How-to-clean-My-disk-space-is/m-p/212917#M174847</link>
      <description>&lt;P&gt;I deleted all the snapshots and data after getting a go-ahead from the developers...&lt;/P&gt;</description>
      <pubDate>Wed, 27 Dec 2017 22:48:42 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/HBASE-quot-archive-quot-How-to-clean-My-disk-space-is/m-p/212917#M174847</guid>
      <dc:creator>kbrodie</dc:creator>
      <dc:date>2017-12-27T22:48:42Z</dc:date>
    </item>
  </channel>
</rss>

