<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Impala compute incremental stats on specific columns in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Impala-compute-incremental-stats-on-specific-columns/m-p/277966#M207749</link>
    <description>&lt;BLOCKQUOTE&gt;&lt;HR /&gt;&lt;P&gt;So it looks like column specific is only on a table without partitions (&lt;SPAN&gt;non-incremental&lt;/SPAN&gt;)&lt;/P&gt;&lt;HR /&gt;&lt;/BLOCKQUOTE&gt;&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/24831"&gt;@hores&lt;/a&gt;&amp;nbsp;that's incorrect, non-incremental compute stats works on partitioned tables and is generally the preferred method for collecting stats on partitioned tables.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;We've generally tried to steer people away from incremental stats because of the size issues on large tables,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;It would also be error-prone to use correctly and complex to implement - what happens if you compute incremental stats with different subsets of the columns? You can end up with different subsets of the columns on different partitions and then you have to somehow reconcile it all each time.&lt;/P&gt;</description>
    <pubDate>Fri, 20 Sep 2019 17:11:27 GMT</pubDate>
    <dc:creator>Tim Armstrong</dc:creator>
    <dc:date>2019-09-20T17:11:27Z</dc:date>
  </channel>
</rss>

