<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Impala/Kudu non-covering range partition to support rolling window data retention in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Impala-Kudu-non-covering-range-partition-to-support-rolling/m-p/50801#M54233</link>
    <description>&lt;P&gt;Hi Impala/Kudu gurus,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I'm extremely excited by the new Impala/Kudu release that supports non-covering range partition, as described here:&amp;nbsp;&lt;A href="https://github.com/cloudera/kudu/blob/master/docs/design-docs/non-covering-range-partitions.md" target="_self"&gt;https://github.com/cloudera/kudu/blob/master/docs/design-docs/non-covering-range-partitions.md&lt;/A&gt;&lt;/P&gt;&lt;P&gt;and here:&amp;nbsp;&lt;A href="https://gerrit.cloudera.org/#/c/4856/" target="_self"&gt;https://gerrit.cloudera.org/#/c/4856/&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Yet I haven't figured out how exactly to use it to&amp;nbsp;support rolling window data retention that our business&amp;nbsp;needs. The syntax descibed in the 2nd document above still seems to require static partition specification.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;What we need is the ability to auto-create new partitions based on a timestamp expression so that each partition contains x days of data only. We then can drop the old partitions based on our data retention policy on a per table basis.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;As a comparison, the similar function is provided by Oracle's range interval partition:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;PRE&gt;PARTITION BY RANGE (CREATION_DATE)
INTERVAL (NUMTODSINTERVAL(7, 'DAY'))&lt;/PRE&gt;&lt;P&gt;and Vertica's partition key expression:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;PRE&gt;PARTITION BY (floor((((tbl.creation_ts)::date - '0001-12-31 BC'::date) / 3)))&lt;/PRE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;Brian&lt;/P&gt;</description>
    <pubDate>Fri, 16 Sep 2022 11:04:18 GMT</pubDate>
    <dc:creator>brianwhu</dc:creator>
    <dc:date>2022-09-16T11:04:18Z</dc:date>
    <item>
      <title>Impala/Kudu non-covering range partition to support rolling window data retention</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Impala-Kudu-non-covering-range-partition-to-support-rolling/m-p/50801#M54233</link>
      <description>&lt;P&gt;Hi Impala/Kudu gurus,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I'm extremely excited by the new Impala/Kudu release that supports non-covering range partition, as described here:&amp;nbsp;&lt;A href="https://github.com/cloudera/kudu/blob/master/docs/design-docs/non-covering-range-partitions.md" target="_self"&gt;https://github.com/cloudera/kudu/blob/master/docs/design-docs/non-covering-range-partitions.md&lt;/A&gt;&lt;/P&gt;&lt;P&gt;and here:&amp;nbsp;&lt;A href="https://gerrit.cloudera.org/#/c/4856/" target="_self"&gt;https://gerrit.cloudera.org/#/c/4856/&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Yet I haven't figured out how exactly to use it to&amp;nbsp;support rolling window data retention that our business&amp;nbsp;needs. The syntax descibed in the 2nd document above still seems to require static partition specification.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;What we need is the ability to auto-create new partitions based on a timestamp expression so that each partition contains x days of data only. We then can drop the old partitions based on our data retention policy on a per table basis.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;As a comparison, the similar function is provided by Oracle's range interval partition:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;PRE&gt;PARTITION BY RANGE (CREATION_DATE)
INTERVAL (NUMTODSINTERVAL(7, 'DAY'))&lt;/PRE&gt;&lt;P&gt;and Vertica's partition key expression:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;PRE&gt;PARTITION BY (floor((((tbl.creation_ts)::date - '0001-12-31 BC'::date) / 3)))&lt;/PRE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;Brian&lt;/P&gt;</description>
      <pubDate>Fri, 16 Sep 2022 11:04:18 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Impala-Kudu-non-covering-range-partition-to-support-rolling/m-p/50801#M54233</guid>
      <dc:creator>brianwhu</dc:creator>
      <dc:date>2022-09-16T11:04:18Z</dc:date>
    </item>
    <item>
      <title>Re: Impala/Kudu non-covering range partition to support rolling window data retention</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Impala-Kudu-non-covering-range-partition-to-support-rolling/m-p/52524#M54234</link>
      <description>&lt;P&gt;Hi Brian,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Unfortunately Kudu partitions must be pre-defined as you suspected, so the Oracle syntax you described won't work for Impala. However, you can add and drop range partitions even after the table is created, so you can manually add the next hour/day/week partition, and drop some historical partition. The syntax is described in the latest version of the CDH documentation:&lt;/P&gt;&lt;P&gt;&lt;A href="https://www.cloudera.com/documentation/enterprise/latest/topics/impala_kudu.html#kudu_range_partitioning" target="_blank"&gt;https://www.cloudera.com/documentation/enterprise/latest/topics/impala_kudu.html#kudu_range_partitioning&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Best,&lt;/P&gt;&lt;P&gt;Matt&lt;/P&gt;</description>
      <pubDate>Wed, 22 Mar 2017 18:11:43 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Impala-Kudu-non-covering-range-partition-to-support-rolling/m-p/52524#M54234</guid>
      <dc:creator>Matt Jacobs</dc:creator>
      <dc:date>2017-03-22T18:11:43Z</dc:date>
    </item>
    <item>
      <title>Re: Impala/Kudu non-covering range partition to support rolling window data retention</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Impala-Kudu-non-covering-range-partition-to-support-rolling/m-p/53634#M54235</link>
      <description>&lt;P&gt;Hi Matt,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;It seems we are going down this path for now. It is close enough to what we have in&amp;nbsp;Vertica, which does grow new partitions automatically though.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;Brian&lt;/P&gt;</description>
      <pubDate>Thu, 13 Apr 2017 13:36:28 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Impala-Kudu-non-covering-range-partition-to-support-rolling/m-p/53634#M54235</guid>
      <dc:creator>brianwhu</dc:creator>
      <dc:date>2017-04-13T13:36:28Z</dc:date>
    </item>
  </channel>
</rss>

