<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Hive query error with Vertex failed on partitioned table by date. Any recommendation? in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Hive-query-error-with-Vertex-failed-on-partitioned-table-by/m-p/209141#M171098</link>
    <description>&lt;P&gt;&lt;A href="https://community.cloudera.com/legacyfs/online/attachments/39712-q3.tar.gz"&gt;q3.tar.gz&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Hello, &lt;/P&gt;&lt;P&gt;I am taking part in a PoC project where we are looking for a solution for interactive analytics (Tableau client).&lt;/P&gt;&lt;P&gt;1. Apache Hive (version 2.1.0.2.6.1.0-129); driver: Hive JDBC (version 1.2.1000.2.6.1.0-129).&lt;/P&gt;&lt;P&gt;2. We have configured a 3-node HDP cluster with Hive + LLAP. All our test tables were created in ORC format with the &lt;STRONG&gt;"orc.compress"="ZLIB"&lt;/STRONG&gt; option. &lt;/P&gt;&lt;P&gt;3. The fact table is &lt;STRONG&gt;PARTITIONED BY (`date` date) &lt;/STRONG&gt; with dynamic partitions. &lt;/P&gt;&lt;P&gt;4. Column statistics were collected for all tables. &lt;/P&gt;&lt;P&gt;Unfortunately, some of our test queries fail with the following error:&lt;/P&gt;&lt;P&gt;&lt;EM&gt;ERROR : Vertex failed, vertexName=Map 1, vertexId=vertex_1507032990279_0050_1_11, diagnostics=[Task failed, taskId=task_1507032990279_0050_1_11_000000, diagnostics=[TaskAttempt 0 failed, info=[Error: Error while running task ( failure ) : attempt_1507032990279_0050_1_11_000000_0:java.lang.RuntimeException: java.lang.RuntimeException: org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime Error while processing row&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;The query runs with the following parameters set explicitly: &lt;/P&gt;&lt;PRE&gt;set tez.queue.name=llap;
set hive.llap.execution.mode=all;
set hive.execution.engine=tez;
set mapred.reduce.tasks=-1;
set hive.exec.parallel=true;
set hive.exec.dynamic.partition=true;
set hive.exec.dynamic.partition.mode = nonstrict;
set hive.exec.max.dynamic.partitions.pernode=256;
set hive.exec.max.dynamic.partitions=10000;
set hive.optimize.sort.dynamic.partition=true;
set hive.enforce.sorting=true;
set optimize.sort.dynamic.partitioning=true;
set hive.tez.exec.print.summary=true;
set hive.optimize.ppd=true;
set hive.optimize.ppd.storage=true;
set hive.vectorized.execution.enabled=true;
set hive.vectorized.execution.reduce.enabled = true;
set hive.cbo.enable=true;
set hive.compute.query.using.stats=true;
set hive.stats.fetch.column.stats=true;
set hive.stats.fetch.partition.stats=true;
set hive.tez.auto.reducer.parallelism=true;
set hive.tez.max.partition.factor=20;
set hive.exec.reducers.bytes.per.reducer=128000000;

set hive.optimize.index.filter=true;
set hive.exec.orc.skip.corrupt.data=true;
set hive.exec.compress.output=true;
set tez.am.container.reuse.enabled=TRUE;
set hive.compute.query.using.stats=true;
set stats.reliable=true;
set hive.merge.tezfiles=true;
&lt;/PRE&gt;&lt;P&gt;Our findings: &lt;/P&gt;&lt;P&gt;1. The query works well on non-partitioned tables. &lt;/P&gt;&lt;P&gt;2. The query works fine when run with Tez or MR, but fails with LLAP. &lt;/P&gt;&lt;P&gt;3. If I remove "CAST(DATE_ADD(NEXT_DAY(`f_daily_funnel_report`.`date`,'SU'),-7) AS DATE) AS `twk_calculation_1485062019336982529_ok`" from both the SELECT list and the GROUP BY list, the query starts working. &lt;/P&gt;&lt;P&gt;The attachment contains the following files: &lt;/P&gt;&lt;P&gt;q3.sql - the original queries that failed &lt;/P&gt;&lt;P&gt;q3.err - the full execution log from the beeline client&lt;/P&gt;&lt;P&gt;Any ideas? &lt;/P&gt;&lt;P&gt;Thank you, &lt;/P&gt;</description>
    <pubDate>Mon, 09 Oct 2017 03:16:26 GMT</pubDate>
    <dc:creator>yevgen_shramko</dc:creator>
    <dc:date>2017-10-09T03:16:26Z</dc:date>
  </channel>
</rss>

