<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Very slow catalog update after insert overwrite in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Very-slow-catalog-update-after-insert-overwrite/m-p/34698#M11205</link>
    <description>&lt;P&gt;Hi Mauricio,&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;You're probably running into IMPALA-1480. If a table has a substantial number of partitions (&amp;gt;10K) it take a long time to perform certain DDL operations even though only a small fraction of metadata changes.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Dimitris&lt;/P&gt;</description>
    <pubDate>Thu, 03 Dec 2015 01:42:09 GMT</pubDate>
    <dc:creator>Dimitris</dc:creator>
    <dc:date>2015-12-03T01:42:09Z</dc:date>
    <item>
      <title>Very slow catalog update after insert overwrite</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Very-slow-catalog-update-after-insert-overwrite/m-p/34629#M11204</link>
      <description>&lt;P&gt;Could anyone help point me where I should look into why catalog update takes so long after an insert overwrite? &amp;nbsp;From profile I can see&amp;nbsp;that data was written in about 7s but it took another 107s to update catalog.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Remote fragments started: 2,249,771,266&lt;BR /&gt;DML data written: 7,268,752,004&lt;BR /&gt;DML Metastore update finished: 114,757,731,166&lt;BR /&gt;Request finished: 114,786,664,689&lt;/P&gt;&lt;P&gt;...&lt;/P&gt;&lt;P&gt;- MetastoreUpdateTimer: 107,517,902,343&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;It uses dynamic partitioning, though I don't think that's a factor here:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;insert overwrite table action_fact_a partition(p_action_date_ym='201511',p_campaign_id_mod=5, p_publisher_id_mod) select field1, etc....&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;There are 10&amp;nbsp;&lt;SPAN&gt;p_publisher_id_mod partitions so 10 files generated, each only a couple MB, so no more than 30MB altogether, and I don't think any more than 10 blocks were deleted and 10 inserted.&amp;nbsp;&lt;/SPAN&gt;Cluster is not particularly under load and performance of this operation is pretty stable. &amp;nbsp;10 DNs. Source table is text, target (partitioned) is parquet.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Much appreciated!&lt;/P&gt;</description>
      <pubDate>Wed, 02 Dec 2015 00:08:48 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Very-slow-catalog-update-after-insert-overwrite/m-p/34629#M11204</guid>
      <dc:creator>mauricio</dc:creator>
      <dc:date>2015-12-02T00:08:48Z</dc:date>
    </item>
    <item>
      <title>Re: Very slow catalog update after insert overwrite</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Very-slow-catalog-update-after-insert-overwrite/m-p/34698#M11205</link>
      <description>&lt;P&gt;Hi Mauricio,&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;You're probably running into IMPALA-1480. If a table has a substantial number of partitions (&amp;gt;10K) it take a long time to perform certain DDL operations even though only a small fraction of metadata changes.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Dimitris&lt;/P&gt;</description>
      <pubDate>Thu, 03 Dec 2015 01:42:09 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Very-slow-catalog-update-after-insert-overwrite/m-p/34698#M11205</guid>
      <dc:creator>Dimitris</dc:creator>
      <dc:date>2015-12-03T01:42:09Z</dc:date>
    </item>
    <item>
      <title>Re: Very slow catalog update after insert overwrite</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Very-slow-catalog-update-after-insert-overwrite/m-p/34741#M11206</link>
      <description>&lt;P&gt;Thanks Dimitris. &amp;nbsp;I've commented in the ticket and upvoted it. &amp;nbsp;Hope you guys can get it in progress soon!&lt;/P&gt;</description>
      <pubDate>Thu, 03 Dec 2015 23:02:20 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Very-slow-catalog-update-after-insert-overwrite/m-p/34741#M11206</guid>
      <dc:creator>mauricio</dc:creator>
      <dc:date>2015-12-03T23:02:20Z</dc:date>
    </item>
  </channel>
</rss>

