<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Why use "COMPUTE STATS" instead of "COMPUTE INCREMENTAL STATS" if the Incremental stats size exceeds 200MB? in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Why-use-quot-COMPUTE-STATS-quot-instead-of-quot-COMPUTE/m-p/356768#M237373</link>
    <description>&lt;P&gt;&amp;gt; No metrics/graph to check "inc_stats_size"&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;That's what I thought.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&amp;gt; If 1GB is insufficient, try to use "compute stats" instead of "compute incremental stats"&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;However, there is a problem in that case.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;This table is updated every hour with a new partition,&lt;BR /&gt;but "compute stats" takes well over an hour to complete.&lt;/P&gt;</description>
    <pubDate>Wed, 02 Nov 2022 15:51:28 GMT</pubDate>
    <dc:creator>yassan</dc:creator>
    <dc:date>2022-11-02T15:51:28Z</dc:date>
    <item>
      <title>Why use "COMPUTE STATS" instead of "COMPUTE INCREMENTAL STATS" if the Incremental stats size exceeds 200MB?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Why-use-quot-COMPUTE-STATS-quot-instead-of-quot-COMPUTE/m-p/354959#M237032</link>
      <description>&lt;P&gt;Updating statistics using "COMPUTE INCREMENTAL STATS" produced the following error.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;LI-CODE lang="java"&gt;Server version: impalad version 3.2.0-cdh6.3.2 RELEASE (build 1bb9836227301b839a32c6bc230e35439d5984ac)
Query: COMPUTE INCREMENTAL STATS hoge_db.huga_tbl PARTITION ( dt &amp;gt;= "20221015" )
ERROR: AnalysisException: Incremental stats size estimate exceeds 200.00MB. Please try COMPUTE STATS instead.&lt;/LI-CODE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The error message says "Please try COMPUTE STATS instead."&lt;BR /&gt;I don't understand why. Wouldn't "COMPUTE STATS" give the same result?&lt;/P&gt;</description>
      <pubDate>Mon, 17 Oct 2022 04:47:23 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Why-use-quot-COMPUTE-STATS-quot-instead-of-quot-COMPUTE/m-p/354959#M237032</guid>
      <dc:creator>yassan</dc:creator>
      <dc:date>2022-10-17T04:47:23Z</dc:date>
    </item>
    <item>
      <title>Re: Why use "COMPUTE STATS" instead of "COMPUTE INCREMENTAL STATS" if the Incremental stats size exceeds 200MB?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Why-use-quot-COMPUTE-STATS-quot-instead-of-quot-COMPUTE/m-p/356118#M237196</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/30784"&gt;@yassan&lt;/a&gt;,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P class="p1"&gt;The default value of the inc_stats_size_limit_bytes flag is 200 MB. It acts as a safety check to prevent Impala from hitting the maximum limit for the table metadata.&lt;/P&gt;&lt;P class="p1"&gt;&amp;nbsp;&lt;/P&gt;&lt;P class="p1"&gt;The error usually indicates that 'COMPUTE INCREMENTAL STATS' should not be used on this particular table. Consider splitting the table and using the regular 'COMPUTE STATS' statement if possible.&lt;/P&gt;&lt;P class="p1"&gt;&amp;nbsp;&lt;/P&gt;&lt;P class="p1"&gt;However, if you are not able to use the 'COMPUTE STATS' statement, you could try increasing the limit set by the inc_stats_size_limit_bytes flag. The value is measured in bytes and should be set to less than 1 GB.&lt;/P&gt;&lt;P class="p1"&gt;&amp;nbsp;&lt;/P&gt;&lt;P class="p1"&gt;Below are the steps:&lt;/P&gt;&lt;P class="p1"&gt;&amp;nbsp;&lt;/P&gt;&lt;P class="p1"&gt;1. CM &amp;gt; Impala Service &amp;gt; Configuration &amp;gt; Search "Impala Command Line Argument Advanced Configuration Snippet (Safety Valve)"&lt;/P&gt;&lt;P class="p1"&gt;2. Add --inc_stats_size_limit_bytes= followed by the new limit. Please note that the value is in bytes.&lt;/P&gt;&lt;P class="p1"&gt;For example, if you want to set 400 MB, please input 419430400 (400*1024*1024).&lt;/P&gt;&lt;P class="p1"&gt;3. Save and restart the Impala service.&lt;/P&gt;&lt;P class="p1"&gt;&amp;nbsp;&lt;/P&gt;&lt;P class="p1"&gt;&lt;EM&gt;Note: If I answered your question please give a thumbs up and Accept it as a solution.&lt;/EM&gt;&lt;/P&gt;&lt;P class="p1"&gt;&amp;nbsp;&lt;/P&gt;&lt;P class="p1"&gt;Regards,&lt;/P&gt;&lt;P class="p1"&gt;Chethan YM&lt;/P&gt;</description>
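      <!--
The byte arithmetic in step 2 of the reply above can be sketched in Python.
This is only a minimal illustration under stated assumptions, not an Impala
or Cloudera Manager tool: the helper name inc_stats_limit_bytes is
hypothetical, and the below-1-GB check simply mirrors the advice in the reply.

```python
MIB = 1024 * 1024          # bytes in one mebibyte
ONE_GIB = 1024 * MIB       # upper bound suggested in the reply above


def inc_stats_limit_bytes(mebibytes):
    """Convert a size in MiB to the byte value for inc_stats_size_limit_bytes.

    Hypothetical helper: rejects values that are not positive or that reach
    the 1 GB bound mentioned in the reply.
    """
    value = mebibytes * MIB
    if value <= 0:
        raise ValueError("limit must be positive")
    if value >= ONE_GIB:
        raise ValueError("limit should stay below 1 GB")
    return value


# The 400 MB example from the reply: 400 * 1024 * 1024
print(inc_stats_limit_bytes(400))  # 419430400
```

The printed number is the value to place after the equals sign of the flag in
the safety valve field; Impala still has to be restarted for it to take effect.
      -->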
      <pubDate>Tue, 25 Oct 2022 11:47:34 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Why-use-quot-COMPUTE-STATS-quot-instead-of-quot-COMPUTE/m-p/356118#M237196</guid>
      <dc:creator>ChethanYM</dc:creator>
      <dc:date>2022-10-25T11:47:34Z</dc:date>
    </item>
    <item>
      <title>Re: Why use "COMPUTE STATS" instead of "COMPUTE INCREMENTAL STATS" if the Incremental stats size exceeds 200MB?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Why-use-quot-COMPUTE-STATS-quot-instead-of-quot-COMPUTE/m-p/356761#M237370</link>
      <description>&lt;P&gt;Hi &lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/75213"&gt;@ChethanYM&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I’m sorry for the late reply.&lt;/P&gt;&lt;P&gt;Thank you for the helpful information.&lt;/P&gt;&lt;P&gt;COMPUTE STATS Statement | 6.3.x | Cloudera Documentation&lt;BR /&gt;&lt;A href="https://docs.cloudera.com/documentation/enterprise/6/6.3/topics/impala_compute_stats.html#compute_stats" target="_blank"&gt;https://docs.cloudera.com/documentation/enterprise/6/6.3/topics/impala_compute_stats.html#compute_stats&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;gt; If this metadata for all tables exceeds 2 GB, you might experience service downtime. In Impala 3.1 and higher, the issue was alleviated with an improved handling of incremental stats.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;As stated above, my concern is whether the whole Impala service would go down (or just one impalad?).&lt;BR /&gt;Are there any metrics that report the inc_stats_size?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Also, what should we do if inc_stats_size_limit_bytes is still insufficient at 1GB?&lt;BR /&gt;We assume it would be insufficient because the number of columns and partitions is too large.&lt;BR /&gt;In that case, what countermeasures should we take?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Regards,&lt;BR /&gt;yassan&lt;/P&gt;</description>
      <pubDate>Wed, 02 Nov 2022 12:52:02 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Why-use-quot-COMPUTE-STATS-quot-instead-of-quot-COMPUTE/m-p/356761#M237370</guid>
      <dc:creator>yassan</dc:creator>
      <dc:date>2022-11-02T12:52:02Z</dc:date>
    </item>
    <item>
      <title>Re: Why use "COMPUTE STATS" instead of "COMPUTE INCREMENTAL STATS" if the Incremental stats size exceeds 200MB?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Why-use-quot-COMPUTE-STATS-quot-instead-of-quot-COMPUTE/m-p/356766#M237372</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;gt; The document says "service downtime", so I think it means the complete Impala service goes down. (However, I haven't seen the issue on a live cluster.)&lt;/P&gt;&lt;P&gt;&amp;gt; No metrics/graph to check "inc_stats_size"&lt;/P&gt;&lt;P&gt;&amp;gt; If 1GB is insufficient, try to use "compute stats" instead of "compute incremental stats"&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Regards,&lt;/P&gt;&lt;P&gt;Chethan YM&lt;/P&gt;</description>
      <pubDate>Wed, 02 Nov 2022 14:28:45 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Why-use-quot-COMPUTE-STATS-quot-instead-of-quot-COMPUTE/m-p/356766#M237372</guid>
      <dc:creator>ChethanYM</dc:creator>
      <dc:date>2022-11-02T14:28:45Z</dc:date>
    </item>
    <item>
      <title>Re: Why use "COMPUTE STATS" instead of "COMPUTE INCREMENTAL STATS" if the Incremental stats size exceeds 200MB?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Why-use-quot-COMPUTE-STATS-quot-instead-of-quot-COMPUTE/m-p/356768#M237373</link>
      <description>&lt;P&gt;&amp;gt; No metrics/graph to check "inc_stats_size"&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;That's what I thought.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&amp;gt; If 1GB is insufficient, try to use "compute stats" instead of "compute incremental stats"&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;However, there is a problem in that case.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;This table is updated every hour with a new partition,&lt;BR /&gt;but "compute stats" takes well over an hour to complete.&lt;/P&gt;</description>
      <pubDate>Wed, 02 Nov 2022 15:51:28 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Why-use-quot-COMPUTE-STATS-quot-instead-of-quot-COMPUTE/m-p/356768#M237373</guid>
      <dc:creator>yassan</dc:creator>
      <dc:date>2022-11-02T15:51:28Z</dc:date>
    </item>
  </channel>
</rss>

