<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Refresh the Impala metadata from Hive Metastore? in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Refresh-the-Impala-metadata-from-Hive-Metastore/m-p/37716#M15957</link>
    <description>&lt;P&gt;Thanks for the great question! We have even created a &lt;A href="https://community.cloudera.com/t5/Interactive-Short-cycle-SQL/Refresh-the-Impala-metadata-from-Hive-Metastore/ta-p/30186" target="_blank"&gt;Community Knowledge Article&lt;/A&gt; based on this thread.&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Mon, 22 Feb 2016 22:00:42 GMT</pubDate>
    <dc:creator>cjervis</dc:creator>
    <dc:date>2016-02-22T22:00:42Z</dc:date>
    <item>
      <title>Refresh the Impala metadata from Hive Metastore?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Refresh-the-Impala-metadata-from-Hive-Metastore/m-p/28120#M15955</link>
      <description>&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;STRONG&gt;Refresh the Impala metadata from Hive Metastore? (INVALIDATE METADATA / REFRESH)&lt;/STRONG&gt;&lt;UL&gt;&lt;LI&gt;From our analysis, INVALIDATE METADATA is a costly operation, so when new data files are added to an existing table, we can run REFRESH rather than INVALIDATE METADATA.&lt;/LI&gt;&lt;LI&gt;What is considered best practice?&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;/UL&gt;</description>
      <pubDate>Fri, 16 Sep 2022 09:30:32 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Refresh-the-Impala-metadata-from-Hive-Metastore/m-p/28120#M15955</guid>
      <dc:creator>scott.bader-1997506906</dc:creator>
      <dc:date>2022-09-16T09:30:32Z</dc:date>
    </item>
    <item>
      <title>Re: Refresh the Impala metadata from Hive Metastore?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Refresh-the-Impala-metadata-from-Hive-Metastore/m-p/28726#M15956</link>
      <description>&lt;P&gt;Scott,&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I'll refer you to the documentation on this topic here:&lt;/P&gt;&lt;P&gt;&lt;A target="_blank" href="http://www.cloudera.com/content/cloudera/en/documentation/cloudera-impala/latest/topics/impala_refresh.html"&gt;http://www.cloudera.com/content/cloudera/en/documentation/cloudera-impala/latest/topics/impala_refresh.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;and&lt;/P&gt;&lt;P&gt;&lt;A target="_blank" href="http://www.cloudera.com/content/cloudera/en/documentation/cloudera-impala/latest/topics/impala_invalidate_metadata.html"&gt;http://www.cloudera.com/content/cloudera/en/documentation/cloudera-impala/latest/topics/impala_invalidate_metadata.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;In terms of "best practice":&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P class="p"&gt;Use the REFRESH statement to load the latest metastore metadata and block location data for a particular table in these scenarios:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;After loading new data files into the HDFS data directory for the table. (Once you have set up an ETL pipeline to bring data into Impala on a regular basis, this is typically the most frequent reason why metadata needs to be refreshed.)&lt;/LI&gt;&lt;LI&gt;After issuing an ALTER TABLE, INSERT, LOAD DATA, or another table-modifying SQL statement in Hive.&lt;/LI&gt;&lt;/UL&gt;&lt;P class="p"&gt;INVALIDATE METADATA&lt;SPAN&gt; and &lt;/SPAN&gt;REFRESH&lt;SPAN&gt; are counterparts: &lt;/SPAN&gt;INVALIDATE METADATA&lt;SPAN&gt; waits to reload the metadata until it is needed for a subsequent query, but then reloads all the metadata for the table, which can be an expensive operation, especially for large tables with many partitions. &lt;/SPAN&gt;REFRESH&lt;SPAN&gt; reloads the metadata immediately, but only loads the block location data for newly added data files, making it a less expensive operation overall. 
If data was altered in some more extensive way, such as being reorganized by the HDFS balancer, use &lt;/SPAN&gt;INVALIDATE METADATA&lt;SPAN&gt; to avoid a performance penalty from reduced local reads. If you used Impala version 1.0, the &lt;/SPAN&gt;INVALIDATE METADATA&lt;SPAN&gt; statement works just like the Impala 1.0 &lt;/SPAN&gt;REFRESH&lt;SPAN&gt; statement did, while the Impala 1.1 &lt;/SPAN&gt;REFRESH&lt;SPAN&gt; is optimized for the common use case of adding new data files to an existing table, so the table name argument is now required.&lt;/SPAN&gt;&lt;/P&gt;&lt;P class="p"&gt;&amp;nbsp;&lt;/P&gt;&lt;P class="p"&gt;&lt;SPAN&gt;Let me know if this doesn't answer your question.&lt;/SPAN&gt;&lt;/P&gt;&lt;P class="p"&gt;&amp;nbsp;&lt;/P&gt;&lt;P class="p"&gt;&lt;SPAN&gt;Thanks&lt;/SPAN&gt;&lt;/P&gt;&lt;P class="p"&gt;&amp;nbsp;&lt;/P&gt;&lt;P class="p"&gt;&lt;SPAN&gt;Jeff&lt;/SPAN&gt;&lt;/P&gt;&lt;P class="p"&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 19 Jun 2015 13:04:24 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Refresh-the-Impala-metadata-from-Hive-Metastore/m-p/28726#M15956</guid>
      <dc:creator>jholoman</dc:creator>
      <dc:date>2015-06-19T13:04:24Z</dc:date>
    </item>
    <item>
      <title>Re: Refresh the Impala metadata from Hive Metastore?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Refresh-the-Impala-metadata-from-Hive-Metastore/m-p/37716#M15957</link>
      <description>&lt;P&gt;Thanks for the great question! We have even created a &lt;A href="https://community.cloudera.com/t5/Interactive-Short-cycle-SQL/Refresh-the-Impala-metadata-from-Hive-Metastore/ta-p/30186" target="_blank"&gt;Community Knowledge Article&lt;/A&gt; based on this thread.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 22 Feb 2016 22:00:42 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Refresh-the-Impala-metadata-from-Hive-Metastore/m-p/37716#M15957</guid>
      <dc:creator>cjervis</dc:creator>
      <dc:date>2016-02-22T22:00:42Z</dc:date>
    </item>
    <item>
      <title>Re: Refresh the Impala metadata from Hive Metastore?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Refresh-the-Impala-metadata-from-Hive-Metastore/m-p/37727#M15958</link>
      <description>&lt;P&gt;Jeff,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I understand what you explained.&lt;/P&gt;&lt;P&gt;However, what if HDFS rebalances data automatically?&lt;/P&gt;&lt;P&gt;In this&amp;nbsp;&lt;SPAN&gt;scenario, it seems the only option left is to use INVALIDATE METADATA.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 23 Feb 2016 08:31:55 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Refresh-the-Impala-metadata-from-Hive-Metastore/m-p/37727#M15958</guid>
      <dc:creator>thewayofthinkin</dc:creator>
      <dc:date>2016-02-23T08:31:55Z</dc:date>
    </item>
  </channel>
</rss>

