<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question What is _col0 in hive explain plan? in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/What-is-col0-in-hive-explain-plan/m-p/191214#M153303</link>
    <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;I am trying to better understand hive explain and hive performance. I have a simple count(*) that is failing with OOM. Rather than just increasing tez container size memory, im trying to understand why its failing. Explain plan below:&lt;/P&gt;&lt;P&gt;It looks like its grouping by _col0. What is the real name of this column? Is there a way to find out?&lt;/P&gt;&lt;PRE&gt;+----------------------------------------------------+
|                      Explain                       |
+----------------------------------------------------+
| Plan optimized by CBO.                             |
|                                                    |
| Vertex dependency in root stage                    |
| Reducer 2 &amp;lt;- Map 1 (CUSTOM_SIMPLE_EDGE)            |
|                                                    |
| Stage-0                                            |
|   Fetch Operator                                   |
|     limit:-1                                       |
|     Stage-1                                        |
|       Reducer 2                                    |
|       File Output Operator [FS_6]                  |
|         Group By Operator [GBY_4] (rows=1 width=8) |
|           Output:["_col0"],aggregations:["count(VALUE._col0)"] |
|         &amp;lt;-Map 1 [CUSTOM_SIMPLE_EDGE]               |
|           PARTITION_ONLY_SHUFFLE [RS_3]            |
|             Group By Operator [GBY_2] (rows=1 width=8) |
|               Output:["_col0"],aggregations:["count()"] |
|               Select Operator [SEL_1] (rows=1 width=53922865152) |
|                 TableScan [TS_0] (rows=1 width=53922865152) |
|                   db1@tb1,tb1,Tbl:COMPLETE,Col:COMPLETE |
&lt;/PRE&gt;</description>
    <pubDate>Tue, 15 May 2018 04:22:29 GMT</pubDate>
    <dc:creator>crayonml</dc:creator>
    <dc:date>2018-05-15T04:22:29Z</dc:date>
  </channel>
</rss>

