<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Difference between hive analyze commands in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Difference-between-hive-analyze-commands/m-p/155140#M48994</link>
    <description>&lt;P&gt;with columns stats you will be able to update basic stat also&lt;/P&gt;</description>
    <pubDate>Thu, 15 Dec 2016 21:18:52 GMT</pubDate>
    <dc:creator>rajkumar_singh</dc:creator>
    <dc:date>2016-12-15T21:18:52Z</dc:date>
    <item>
      <title>Difference between hive analyze commands</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Difference-between-hive-analyze-commands/m-p/155137#M48991</link>
      <description>&lt;P&gt;Can someone help me explain what the difference is between these 2 hive analyze commands:&lt;/P&gt;&lt;PRE&gt;analyze table svcrpt.predictive_customers compute statistics;
analyze table svcrpt.predictive_customers compute statistics for columns;&lt;/PRE&gt;&lt;P&gt;What more does the "for columns" part do?&lt;/P&gt;</description>
      <pubDate>Thu, 15 Dec 2016 20:48:06 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Difference-between-hive-analyze-commands/m-p/155137#M48991</guid>
      <dc:creator>chad_woodhead</dc:creator>
      <dc:date>2016-12-15T20:48:06Z</dc:date>
    </item>
    <item>
      <title>Re: Difference between hive analyze commands</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Difference-between-hive-analyze-commands/m-p/155138#M48992</link>
      <description>&lt;P&gt;1. analyze table svcrpt.predictive_customers compute statistics;&lt;/P&gt;&lt;P&gt;will compute basic stats of the table like numFiles, numRows, totalSize, rawDataSize in the table, these are stored in &lt;/P&gt;&lt;P&gt;TABLE_PARAMS table under hive metastore db.&lt;/P&gt;&lt;P&gt;2. analyze table svcrpt.predictive_customers compute statistics for columns;&lt;/P&gt;&lt;P&gt;create/update column level stats like NUM_DISTINCTS,LOW_VALUE,HIGH_VALUE,NUM_NULLS etc in TAB_COL_STATS table under metastore db&lt;/P&gt;</description>
      <pubDate>Thu, 15 Dec 2016 21:02:24 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Difference-between-hive-analyze-commands/m-p/155138#M48992</guid>
      <dc:creator>rajkumar_singh</dc:creator>
      <dc:date>2016-12-15T21:02:24Z</dc:date>
    </item>
    <item>
      <title>Re: Difference between hive analyze commands</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Difference-between-hive-analyze-commands/m-p/155139#M48993</link>
      <description>&lt;P&gt;Got it, thanks! Does the for columns command also do the basic stats that the first analyze command does, or would I have to run them both to get both sets of stats computed?&lt;/P&gt;</description>
      <pubDate>Thu, 15 Dec 2016 21:06:10 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Difference-between-hive-analyze-commands/m-p/155139#M48993</guid>
      <dc:creator>chad_woodhead</dc:creator>
      <dc:date>2016-12-15T21:06:10Z</dc:date>
    </item>
    <item>
      <title>Re: Difference between hive analyze commands</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Difference-between-hive-analyze-commands/m-p/155140#M48994</link>
      <description>&lt;P&gt;with columns stats you will be able to update basic stat also&lt;/P&gt;</description>
      <pubDate>Thu, 15 Dec 2016 21:18:52 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Difference-between-hive-analyze-commands/m-p/155140#M48994</guid>
      <dc:creator>rajkumar_singh</dc:creator>
      <dc:date>2016-12-15T21:18:52Z</dc:date>
    </item>
    <item>
      <title>Re: Difference between hive analyze commands</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Difference-between-hive-analyze-commands/m-p/155141#M48995</link>
      <description>&lt;P&gt;Thanks. I just did my own testing to see if "for columns" would also update TABLE_PARAMS table and I found that it did not. &lt;/P&gt;&lt;P&gt;For instance, when I run "analyze table svcrpt.predictive_customers compute statistics;" the column transient_lastDdlTime in the table TABLE_PARAMS gets updated, but if I run "analyze table svcrpt.predictive_customers compute statistics for columns;" transient_lastDdlTime does not updated.&lt;/P&gt;&lt;P&gt;So does this mean "for columns" does not update the basic stats?&lt;/P&gt;</description>
      <pubDate>Thu, 15 Dec 2016 21:24:26 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Difference-between-hive-analyze-commands/m-p/155141#M48995</guid>
      <dc:creator>chad_woodhead</dc:creator>
      <dc:date>2016-12-15T21:24:26Z</dc:date>
    </item>
  </channel>
</rss>

