<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: How to delete data older than x days on hbase tables? in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-delete-data-older-than-x-days-on-hbase-tables/m-p/25639#M5348</link>
    <description>You should be able to simply set a TTL on your tables and run a major&lt;BR /&gt;compaction to delete older-than-TTL-time data. More on TTL at&lt;BR /&gt;&lt;A target="_blank" href="http://archive.cloudera.com/cdh5/cdh/5/hbase/book.html#ttl."&gt;http://archive.cloudera.com/cdh5/cdh/5/hbase/book.html#ttl.&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;</description>
    <pubDate>Tue, 17 Mar 2015 07:20:34 GMT</pubDate>
    <dc:creator>Harsh J</dc:creator>
    <dc:date>2015-03-17T07:20:34Z</dc:date>
    <item>
      <title>How to delete data older than x days on hbase tables?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-delete-data-older-than-x-days-on-hbase-tables/m-p/25638#M5347</link>
      <description>&lt;P&gt;Hi All,&lt;BR /&gt;&amp;nbsp;&lt;BR /&gt;Since my hadoop cluster capacity is low and there is no business need to keep old data, I'm trying to find and delete records older than 200 days in hbase tables. I found that there is no tool or ready to use program available to achieve this.&lt;BR /&gt;&amp;nbsp;&lt;BR /&gt;Can someone give me the best approach to accomplish this? Should I write a MR Job? If yes, is there any pseudo code or algorithm?&lt;BR /&gt;&amp;nbsp;&lt;BR /&gt;Thanks&lt;/P&gt;</description>
      <pubDate>Fri, 16 Sep 2022 09:24:27 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-delete-data-older-than-x-days-on-hbase-tables/m-p/25638#M5347</guid>
      <dc:creator>YBSNR</dc:creator>
      <dc:date>2022-09-16T09:24:27Z</dc:date>
    </item>
    <item>
      <title>Re: How to delete data older than x days on hbase tables?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-delete-data-older-than-x-days-on-hbase-tables/m-p/25639#M5348</link>
      <description>You should be able to simply set a TTL on your tables and run a major&lt;BR /&gt;compaction to delete older-than-TTL-time data. More on TTL at&lt;BR /&gt;&lt;A target="_blank" href="http://archive.cloudera.com/cdh5/cdh/5/hbase/book.html#ttl."&gt;http://archive.cloudera.com/cdh5/cdh/5/hbase/book.html#ttl.&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Tue, 17 Mar 2015 07:20:34 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-delete-data-older-than-x-days-on-hbase-tables/m-p/25639#M5348</guid>
      <dc:creator>Harsh J</dc:creator>
      <dc:date>2015-03-17T07:20:34Z</dc:date>
    </item>
    <item>
      <title>Re: How to delete data older than x days on hbase tables?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-delete-data-older-than-x-days-on-hbase-tables/m-p/25643#M5349</link>
      <description>&lt;P&gt;Thank you.. Looks like TTL is a good option. But I remember, Major compaction was running for days. When we keep the frequent/ periodic compaction enabled, regions were going offline. how to optimize and control the compactions? To enable TTL, should we compromize on the availability of region?&lt;/P&gt;&lt;P&gt;Please guide me&lt;/P&gt;</description>
      <pubDate>Tue, 17 Mar 2015 11:26:57 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-delete-data-older-than-x-days-on-hbase-tables/m-p/25643#M5349</guid>
      <dc:creator>YBSNR</dc:creator>
      <dc:date>2015-03-17T11:26:57Z</dc:date>
    </item>
    <item>
      <title>Re: How to delete data older than x days on hbase tables?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-delete-data-older-than-x-days-on-hbase-tables/m-p/25790#M5350</link>
      <description>&lt;P&gt;Hi Harsh,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The TTL option works well on most of the tables/cases. But, flume agents loads data to staging tables contineously. In this case, when we run compaction, the regions will go offline and data load fails. So, I had to turnoff the major compaction. Can you help me on how to handle major compaction on these tables to purge old data using TTL?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;</description>
      <pubDate>Mon, 23 Mar 2015 06:43:31 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-delete-data-older-than-x-days-on-hbase-tables/m-p/25790#M5350</guid>
      <dc:creator>YBSNR</dc:creator>
      <dc:date>2015-03-23T06:43:31Z</dc:date>
    </item>
  </channel>
</rss>

