<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: While executing a select sql on Hive we are seeing &amp;quot;Execution Error, return code 2 from org.apache.hadoop.hive.ql.exec.tez.TezTask. Vertex failed, vertexName=Map 1&amp;quot; error . in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/While-executing-a-select-sql-on-Hive-we-are-seeing-quot/m-p/166026#M37000</link>
    <description>&lt;P&gt;@&lt;A href="https://community.hortonworks.com/users/12061/mahipalramidi.html"&gt;Mahipal Ramidi&lt;/A&gt;! &lt;/P&gt;&lt;P&gt;Ideally, keep Tez global settings as they are and set tez.grouping.max-size to a value &lt;STRONG&gt;that makes sense for the query you execute, &lt;/STRONG&gt;always higher than the&lt;STRONG&gt;  &lt;/STRONG&gt;tez.grouping.min-size which you set globally. If your global tez.grouping_min-size is not low enough to allow you to set your session tez.grouping.max-size  to a value higher than the global tez.grouping.min-size, you may want to change the global tez.grouping.min-size to a lower value to satisfy the condition. Low values of min and max create a lot of small tasks. Each task has allocated a container. Has such a lot of parallel tasks will do the work but it could also consume all resources of the cluster. This approach needs always a careful analysis of how many tasks created and resources used. Anyhow, mappers will chunk the input data to sizes between min and max and most likely there will be no impact on other jobs requiring a larger chunking. Your query seems to have a not so large data volume, but it requires a lot of parallelism to complete faster.&lt;/P&gt;</description>
    <pubDate>Tue, 09 Aug 2016 21:29:10 GMT</pubDate>
    <dc:creator>cstanca</dc:creator>
    <dc:date>2016-08-09T21:29:10Z</dc:date>
    <item>
      <title>While executing a select sql on Hive we are seeing "Execution Error, return code 2 from org.apache.hadoop.hive.ql.exec.tez.TezTask. Vertex failed, vertexName=Map 1" error .</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/While-executing-a-select-sql-on-Hive-we-are-seeing-quot/m-p/166024#M36998</link>
      <description>&lt;P&gt;While executing a select query when other queries are running in Hive we are facing below error . Can you some one suggest me what could be the reason for below error.&lt;/P&gt;&lt;P&gt;Error while
processing statement: FAILED: Execution Error, return code 2 from
org.apache.hadoop.hive.ql.exec.tez.TezTask. Vertex failed, vertexName=Map 1,
vertexId=vertex_1470078643944_0469_5_00, diagnostics=[Vertex
vertex_1470078643944_0469_5_00 [Map 1] killed/failed due
to:ROOT_INPUT_INIT_FAILURE, Vertex Input: se initializer failed,
vertex=vertex_1470078643944_0469_5_00 [Map 1],
org.apache.tez.dag.api.TezUncheckedException: Invalid max/min group lengths.
Required min&amp;gt;0, max&amp;gt;=min.  max: 4194304 min: 16777216&lt;/P&gt;&lt;P&gt; 
at
org.apache.hadoop.mapred.split.TezMapredSplitsGrouper.getGroupedSplits(TezMapredSplitsGrouper.java:147)&lt;/P&gt;&lt;P&gt; 
at org.apache.hadoop.hive.ql.exec.tez.SplitGrouper.group(SplitGrouper.java:89)&lt;/P&gt;&lt;P&gt; 
at org.apache.hadoop.hive.ql.exec.tez.SplitGrouper.generateGroupedSplits(SplitGrouper.java:168)&lt;/P&gt;&lt;P&gt; 
at
org.apache.hadoop.hive.ql.exec.tez.SplitGrouper.generateGroupedSplits(SplitGrouper.java:138)&lt;/P&gt;&lt;P&gt; 
at org.apache.hadoop.hive.ql.exec.tez.HiveSplitGenerator.initialize(HiveSplitGenerator.java:159)&lt;/P&gt;&lt;P&gt; 
at
org.apache.tez.dag.app.dag.RootInputInitializerManager$InputInitializerCallable$1.run(RootInputInitializerManager.java:273)&lt;/P&gt;&lt;P&gt; 
at org.apache.tez.dag.app.dag.RootInputInitializerManager$InputInitializerCallable$1.run(RootInputInitializerManager.java:266)&lt;/P&gt;&lt;P&gt; 
at java.security.AccessController.doPrivileged(Native Method)&lt;/P&gt;&lt;P&gt; 
at javax.security.auth.Subject.doAs(Subject.java:422)&lt;/P&gt;&lt;P&gt; 
at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1709)&lt;/P&gt;&lt;P&gt; 
at
org.apache.tez.dag.app.dag.RootInputInitializerManager$InputInitializerCallable.call(RootInputInitializerManager.java:266)&lt;/P&gt;&lt;P&gt; 
at org.apache.tez.dag.app.dag.RootInputInitializerManager$InputInitializerCallable.call(RootInputInitializerManager.java:253)&lt;/P&gt;&lt;P&gt; 
at java.util.concurrent.FutureTask.run(FutureTask.java:266)&lt;/P&gt;&lt;P&gt; 
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)&lt;/P&gt;&lt;P&gt; 
at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)&lt;/P&gt;&lt;P&gt; 
at java.lang.Thread.run(Thread.java:745)&lt;/P&gt;&lt;P&gt;]Vertex killed,
vertexName=Map 2, vertexId=vertex_1470078643944_0469_5_01, diagnostics=[Vertex
received Kill in INITED state., Vertex vertex_1470078643944_0469_5_01 [Map 2]
killed/failed due to:OTHER_VERTEX_FAILURE]Vertex killed, vertexName=Reducer 3,
vertexId=vertex_1470078643944_0469_5_02, diagnostics=[Vertex received Kill in
INITED state., Vertex vertex_1470078643944_0469_5_02 [Reducer 3] killed/failed
due to:OTHER_VERTEX_FAILURE]Vertex killed, vertexName=Reducer 4,
vertexId=vertex_1470078643944_0469_5_03, diagnostics=[Vertex received Kill in
INITED state., Vertex vertex_1470078643944_0469_5_03 [Reducer 4] killed/failed
due to:OTHER_VERTEX_FAILURE]DAG did not succeed due to VERTEX_FAILURE.
failedVertices:1 killedVertices:3&lt;/P&gt;&lt;P&gt;Appreciated for your help in advance.&lt;/P&gt;</description>
      <pubDate>Fri, 05 Aug 2016 10:03:06 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/While-executing-a-select-sql-on-Hive-we-are-seeing-quot/m-p/166024#M36998</guid>
      <dc:creator>mahipal_ramidi</dc:creator>
      <dc:date>2016-08-05T10:03:06Z</dc:date>
    </item>
    <item>
      <title>Re: While executing a select sql on Hive we are seeing "Execution Error, return code 2 from org.apache.hadoop.hive.ql.exec.tez.TezTask. Vertex failed, vertexName=Map 1" error .</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/While-executing-a-select-sql-on-Hive-we-are-seeing-quot/m-p/166025#M36999</link>
      <description>&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;&lt;U&gt;Issue :-&lt;/U&gt;&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;Vertex failed, vertexName=Map
1, vertexId=vertex_1470078643944_0307_5_00, diagnostics=[Vertex
vertex_1470078643944_0307_5_00 [Map 1] killed/failed due
to:ROOT_INPUT_INIT_FAILURE, Vertex Input: se initializer failed,
vertex=vertex_1470078643944_0307_5_00 [Map 1],
org.apache.tez.dag.api.TezUncheckedException: Invalid max/min group lengths.
Required min&amp;gt;0, max&amp;gt;=min.  &lt;STRONG&gt;max: 4194304 min: 16777216&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;&lt;U&gt;Cause:-&lt;/U&gt;&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;In the client app ( Jmeter ) we are setting 3
parameters as below:-&lt;/P&gt;&lt;P&gt;set mapreduce.job.maps=64;&lt;/P&gt;&lt;P&gt;set tez.grouping.max-size=&lt;STRONG&gt;4194304;&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;set tez.grouping.min-size=&lt;STRONG&gt;2097152;&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;In Tez  configuration ,
the max size is  set as &lt;STRONG&gt;1GB &lt;/STRONG&gt;and min size as &lt;STRONG&gt;16777216.&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;Now if you see the error ,
max shown as &lt;STRONG&gt;4194304&lt;/STRONG&gt; which is that we exactly set through Jmeter and min
is &lt;STRONG&gt;16777216&lt;/STRONG&gt; which is the tez default parameter. So that means hive is
taking the max from jmeter and min from tez default configuration and this is
the cause of the issue.&lt;/P&gt;</description>
      <pubDate>Tue, 09 Aug 2016 21:20:52 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/While-executing-a-select-sql-on-Hive-we-are-seeing-quot/m-p/166025#M36999</guid>
      <dc:creator>sivakumar_sudha</dc:creator>
      <dc:date>2016-08-09T21:20:52Z</dc:date>
    </item>
    <item>
      <title>Re: While executing a select sql on Hive we are seeing "Execution Error, return code 2 from org.apache.hadoop.hive.ql.exec.tez.TezTask. Vertex failed, vertexName=Map 1" error .</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/While-executing-a-select-sql-on-Hive-we-are-seeing-quot/m-p/166026#M37000</link>
      <description>&lt;P&gt;@&lt;A href="https://community.hortonworks.com/users/12061/mahipalramidi.html"&gt;Mahipal Ramidi&lt;/A&gt;! &lt;/P&gt;&lt;P&gt;Ideally, keep Tez global settings as they are and set tez.grouping.max-size to a value &lt;STRONG&gt;that makes sense for the query you execute, &lt;/STRONG&gt;always higher than the&lt;STRONG&gt;  &lt;/STRONG&gt;tez.grouping.min-size which you set globally. If your global tez.grouping_min-size is not low enough to allow you to set your session tez.grouping.max-size  to a value higher than the global tez.grouping.min-size, you may want to change the global tez.grouping.min-size to a lower value to satisfy the condition. Low values of min and max create a lot of small tasks. Each task has allocated a container. Has such a lot of parallel tasks will do the work but it could also consume all resources of the cluster. This approach needs always a careful analysis of how many tasks created and resources used. Anyhow, mappers will chunk the input data to sizes between min and max and most likely there will be no impact on other jobs requiring a larger chunking. Your query seems to have a not so large data volume, but it requires a lot of parallelism to complete faster.&lt;/P&gt;</description>
      <pubDate>Tue, 09 Aug 2016 21:29:10 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/While-executing-a-select-sql-on-Hive-we-are-seeing-quot/m-p/166026#M37000</guid>
      <dc:creator>cstanca</dc:creator>
      <dc:date>2016-08-09T21:29:10Z</dc:date>
    </item>
    <item>
      <title>Re: While executing a select sql on Hive we are seeing "Execution Error, return code 2 from org.apache.hadoop.hive.ql.exec.tez.TezTask. Vertex failed, vertexName=Map 1" error .</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/While-executing-a-select-sql-on-Hive-we-are-seeing-quot/m-p/166027#M37001</link>
      <description>&lt;P&gt;&lt;A href="https://community.hortonworks.com/users/12317/sivakumarsudhakarannairgirijakumari.html"&gt;@sivakumar sudhakarannair girijakumari&lt;/A&gt; &lt;/P&gt;&lt;P&gt;This is a great finding. I did not realize that setting the ez.grouping.min-size at the session level will not override the global value. It should override. Maybe this is a bug which is a rare condition because nobody was thinking to override at the session level the tez.grouping.max-size so low that it would be lower than the tez.grouping.min-size set at global level. It is a small issue and it could be workarounded as I specified in my response below.&lt;/P&gt;</description>
      <pubDate>Tue, 09 Aug 2016 21:32:28 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/While-executing-a-select-sql-on-Hive-we-are-seeing-quot/m-p/166027#M37001</guid>
      <dc:creator>cstanca</dc:creator>
      <dc:date>2016-08-09T21:32:28Z</dc:date>
    </item>
    <item>
      <title>Re: While executing a select sql on Hive we are seeing "Execution Error, return code 2 from org.apache.hadoop.hive.ql.exec.tez.TezTask. Vertex failed, vertexName=Map 1" error .</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/While-executing-a-select-sql-on-Hive-we-are-seeing-quot/m-p/166028#M37002</link>
      <description>&lt;P&gt;@All    I also similar issue. Can you please help me in that - &lt;A href="http://stackoverflow.com/questions/39547001/why-hive-staging-file-is-missing-in-aws-emr" target="_blank"&gt;http://stackoverflow.com/questions/39547001/why-hive-staging-file-is-missing-in-aws-emr&lt;/A&gt;?&lt;/P&gt;</description>
      <pubDate>Mon, 19 Sep 2016 20:36:06 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/While-executing-a-select-sql-on-Hive-we-are-seeing-quot/m-p/166028#M37002</guid>
      <dc:creator>mail2jhamb</dc:creator>
      <dc:date>2016-09-19T20:36:06Z</dc:date>
    </item>
  </channel>
</rss>

