<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Hive Explain Plan Predicate Question in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-Explain-Plan-Predicate-Question/m-p/209608#M66447</link>
    <description>&lt;P&gt;Thanks! This is exactly what I was looking for.&lt;/P&gt;</description>
    <pubDate>Thu, 17 Aug 2017 01:18:04 GMT</pubDate>
    <dc:creator>nfleming</dc:creator>
    <dc:date>2017-08-17T01:18:04Z</dc:date>
    <item>
      <title>Hive Explain Plan Predicate Question</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-Explain-Plan-Predicate-Question/m-p/209606#M66445</link>
      <description>&lt;P&gt;Could you remind me whats going on here in this example explain plan. The table contains 611 rows, which I see are being read. Then it appears as the key is not null predicate is applied and the number of rows drops to 306. There are no null fields in this dataset. &lt;/P&gt;&lt;P&gt;How is this pruning data? I would have expected that to be the same as the input size. 
&lt;/P&gt;&lt;P&gt; Map Operator Tree:&lt;/P&gt;&lt;P&gt;          TableScan&lt;/P&gt;&lt;P&gt;            alias: a&lt;/P&gt;&lt;P&gt;            filterExpr: key is not null (type: boolean)&lt;/P&gt;&lt;P&gt;            Statistics: &lt;STRONG&gt;Num rows: 611 Data size: 1833&lt;/STRONG&gt; Basic stats: COMPLETE Column stats: NONE&lt;/P&gt;&lt;P&gt;            Filter Operator&lt;/P&gt;&lt;P&gt;              predicate: key is not null (type: boolean)&lt;/P&gt;&lt;P&gt;              Statistics:&lt;STRONG&gt; Num rows: 306 Data size: 918 &lt;/STRONG&gt;Basic stats: COMPLETE Column stats: NONE&lt;/P&gt;&lt;P&gt;              Reduce Output Operator&lt;/P&gt;&lt;P&gt;                key expressions: key (type: string)&lt;/P&gt;&lt;P&gt;                sort order: +&lt;/P&gt;&lt;P&gt;                Map-reduce partition columns: key (type: string)&lt;/P&gt;&lt;P&gt;                Statistics: Num rows: 306 Data size: 918 Basic stats: COMPLETE Column stats: NONE&lt;/P&gt;</description>
      <pubDate>Fri, 11 Aug 2017 02:08:53 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-Explain-Plan-Predicate-Question/m-p/209606#M66445</guid>
      <dc:creator>nfleming</dc:creator>
      <dc:date>2017-08-11T02:08:53Z</dc:date>
    </item>
    <item>
      <title>Re: Hive Explain Plan Predicate Question</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-Explain-Plan-Predicate-Question/m-p/209607#M66446</link>
      <description>&lt;P&gt;These numbers (Num rows, Data size) are estimated by Hive (optimizer) and do not represent actual numbers. You can run EXPLAIN + ANALYZE to see both Estimated and Actual numbers.&lt;/P&gt;</description>
      <pubDate>Sat, 12 Aug 2017 03:02:31 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-Explain-Plan-Predicate-Question/m-p/209607#M66446</guid>
      <dc:creator>vgarg</dc:creator>
      <dc:date>2017-08-12T03:02:31Z</dc:date>
    </item>
    <item>
      <title>Re: Hive Explain Plan Predicate Question</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-Explain-Plan-Predicate-Question/m-p/209608#M66447</link>
      <description>&lt;P&gt;Thanks! This is exactly what I was looking for.&lt;/P&gt;</description>
      <pubDate>Thu, 17 Aug 2017 01:18:04 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-Explain-Plan-Predicate-Question/m-p/209608#M66447</guid>
      <dc:creator>nfleming</dc:creator>
      <dc:date>2017-08-17T01:18:04Z</dc:date>
    </item>
  </channel>
</rss>

