<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: hive.auto.convert.join Execution Error, return code 3 from MapredLocalTask (state=08S01,code=3) in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/hive-auto-convert-join-Execution-Error-return-code-3-from/m-p/41022#M9876</link>
    <description>&lt;P&gt;I'm not seeing the same issue here.&lt;/P&gt;&lt;P&gt;Check the YARN application logs. They should contain more information about the failure.&lt;/P&gt;</description>
    <pubDate>Thu, 19 May 2016 07:41:24 GMT</pubDate>
    <dc:creator>mathieu.d</dc:creator>
    <dc:date>2016-05-19T07:41:24Z</dc:date>
    <item>
      <title>hive.auto.convert.join Execution Error, return code 3 from MapredLocalTask (state=08S01,code=3)</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/hive-auto-convert-join-Execution-Error-return-code-3-from/m-p/33715#M9873</link>
      <description>&lt;P&gt;Hi&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I set up&amp;nbsp;&lt;SPAN&gt;CDH 5.4.8 to AWS.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;hive&amp;gt; set hive.auto.convert.join;&lt;BR /&gt;hive.auto.convert.join=&lt;STRONG&gt;true&lt;/STRONG&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Executing query:&lt;/P&gt;&lt;P&gt;select AVG(commute_time) AS avg_commute_time from transportation JOIN person p ON persons = id WHERE commute_time &amp;lt; 10 GROUP BY nationality;&lt;/P&gt;&lt;P&gt;Total jobs = 1&lt;BR /&gt;Execution log at: /tmp/demo/demo_20151103083131_9c7f8815-a9fd-42d2-b7a3-9028fcdd81fb.log&lt;BR /&gt;2015-11-03 08:31:49 Starting to launch local task to process map join; maximum memory = 1029701632&lt;BR /&gt;2015-11-03 08:31:51 Processing rows: 200000 Hashtable size: 199999 Memory usage: 118909728 percentage: 0.115&lt;BR /&gt;2015-11-03 08:31:51 Processing rows: 300000 Hashtable size: 299999 Memory usage: 160068072 percentage: 0.155&lt;BR /&gt;2015-11-03 08:31:51 Dump the side-table for tag: 1 with group count: 300000 into file: file:/tmp/demo/facf066b-778f-447e-86b5-a9152a74c41b/hive_2015-11-03_08-31-44_617_3475565247267897601-1/-local-10004/HashTable-Stage-2/MapJoin-mapfile21--.hashtable&lt;BR /&gt;2015-11-03 08:31:51 Uploaded 1 File to: file:/tmp/demo/facf066b-778f-447e-86b5-a9152a74c41b/hive_2015-11-03_08-31-44_617_3475565247267897601-1/-local-10004/HashTable-Stage-2/MapJoin-mapfile21--.hashtable (20655402 bytes)&lt;BR /&gt;2015-11-03 08:31:51 End of local task; Time Taken: 2.361 sec.&lt;BR /&gt;Execution completed successfully&lt;BR /&gt;MapredLocal task succeeded&lt;BR /&gt;Launching Job 1 out of 1&lt;BR /&gt;Number of reduce tasks not specified. 
Estimated from input data size: 1&lt;BR /&gt;In order to change the average load for a reducer (in bytes):&lt;BR /&gt;set hive.exec.reducers.bytes.per.reducer=&amp;lt;number&amp;gt;&lt;BR /&gt;In order to limit the maximum number of reducers:&lt;BR /&gt;set hive.exec.reducers.max=&amp;lt;number&amp;gt;&lt;BR /&gt;In order to set a constant number of reducers:&lt;BR /&gt;set mapreduce.job.reduces=&amp;lt;number&amp;gt;&lt;BR /&gt;Starting Job = job_1446191535269_0136, Tracking URL = http://[server]:8088/proxy/application_1446191535269_0136/&lt;BR /&gt;Kill Command = /opt/cloudera/parcels/CDH-5.4.8-1.cdh5.4.8.p0.4/lib/hadoop/bin/hadoop job -kill job_1446191535269_0136&lt;BR /&gt;Hadoop job information for Stage-2: number of mappers: 1; number of reducers: 1&lt;BR /&gt;2015-11-03 08:32:00,237 Stage-2 map = 0%, reduce = 0%&lt;BR /&gt;2015-11-03 08:32:11,603 Stage-2 map = 100%, reduce = 0%, Cumulative CPU 10.73 sec&lt;BR /&gt;2015-11-03 08:32:18,828 Stage-2 map = 100%, reduce = 100%, Cumulative CPU 12.44 sec&lt;BR /&gt;MapReduce Total cumulative CPU time: 12 seconds 440 msec&lt;BR /&gt;Ended Job = job_1446191535269_0136&lt;BR /&gt;MapReduce Jobs Launched:&lt;BR /&gt;Stage-Stage-2: Map: 1 Reduce: 1 Cumulative CPU: 12.44 sec HDFS Read: 64673839 HDFS Write: 84 SUCCESS&lt;BR /&gt;Total MapReduce CPU Time Spent: 12 seconds 440 msec&lt;BR /&gt;OK&lt;BR /&gt;9.22561984510033&lt;BR /&gt;6.97536844275076&lt;BR /&gt;3.4043091344593&lt;BR /&gt;8.97108984313809&lt;BR /&gt;9.47274416580304&lt;BR /&gt;Time taken: 36.344 seconds, Fetched: 5 row(s)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;When I execute the same query via beeline:&lt;/P&gt;&lt;P&gt;0: jdbc:hive2://localhost:10000&amp;gt; select AVG(commute_time) AS avg_commute_time from transportation JOIN person p ON persons = id WHERE commute_time &amp;lt; 10 GROUP BY nationality;&lt;BR /&gt;ERROR : Execution failed with exit status: 3&lt;BR /&gt;ERROR : Obtaining error information&lt;BR /&gt;ERROR :&lt;BR /&gt;Task 
failed!&lt;BR /&gt;Task ID:&lt;BR /&gt;Stage-5&lt;/P&gt;&lt;P&gt;Logs:&lt;/P&gt;&lt;P&gt;ERROR : /var/log/hive/hadoop-cmf-hive-HIVESERVER2-[server].log.out&lt;BR /&gt;Getting log thread is interrupted, since query is done!&lt;BR /&gt;Error: Error while processing statement: FAILED: Execution Error, return code 3 from org.apache.hadoop.hive.ql.exec.mr.MapredLocalTask (state=08S01,code=3)&lt;BR /&gt;java.sql.SQLException: Error while processing statement: FAILED: Execution Error, return code 3 from org.apache.hadoop.hive.ql.exec.mr.MapredLocalTask&lt;BR /&gt;at org.apache.hive.jdbc.HiveStatement.execute(HiveStatement.java:279)&lt;BR /&gt;at org.apache.hive.beeline.Commands.execute(Commands.java:814)&lt;BR /&gt;at org.apache.hive.beeline.Commands.sql(Commands.java:670)&lt;BR /&gt;at org.apache.hive.beeline.BeeLine.dispatch(BeeLine.java:974)&lt;BR /&gt;at org.apache.hive.beeline.BeeLine.execute(BeeLine.java:810)&lt;BR /&gt;at org.apache.hive.beeline.BeeLine.begin(BeeLine.java:767)&lt;BR /&gt;at org.apache.hive.beeline.BeeLine.mainWithInputRedirection(BeeLine.java:480)&lt;BR /&gt;at org.apache.hive.beeline.BeeLine.main(BeeLine.java:463)&lt;BR /&gt;at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)&lt;BR /&gt;at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)&lt;BR /&gt;at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)&lt;BR /&gt;at java.lang.reflect.Method.invoke(Method.java:606)&lt;BR /&gt;at org.apache.hadoop.util.RunJar.run(RunJar.java:221)&lt;BR /&gt;at org.apache.hadoop.util.RunJar.main(RunJar.java:136)&lt;BR /&gt;0: jdbc:hive2://localhost:10000&amp;gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;But when I turn&amp;nbsp;hive.auto.convert.join to false it works:&lt;/P&gt;&lt;P&gt;0: jdbc:hive2://localhost:10000&amp;gt; set hive.auto.convert.join=&lt;STRONG&gt;false&lt;/STRONG&gt;;&lt;/P&gt;&lt;P&gt;0: jdbc:hive2://localhost:10000&amp;gt; select AVG(commute_time) AS avg_commute_time 
from transportation JOIN person p ON persons = id WHERE commute_time &amp;lt; 10 GROUP BY nationality;&lt;BR /&gt;INFO : Number of reduce tasks not specified. Estimated from input data size: 2&lt;BR /&gt;INFO : In order to change the average load for a reducer (in bytes):&lt;BR /&gt;INFO : set hive.exec.reducers.bytes.per.reducer=&amp;lt;number&amp;gt;&lt;BR /&gt;INFO : In order to limit the maximum number of reducers:&lt;BR /&gt;INFO : set hive.exec.reducers.max=&amp;lt;number&amp;gt;&lt;BR /&gt;INFO : In order to set a constant number of reducers:&lt;BR /&gt;INFO : set mapreduce.job.reduces=&amp;lt;number&amp;gt;&lt;BR /&gt;WARN : Hadoop command-line option parsing not performed. Implement the Tool interface and execute your application with ToolRunner to remedy this.&lt;BR /&gt;INFO : number of splits:2&lt;BR /&gt;INFO : Submitting tokens for job: job_1446191535269_0137&lt;BR /&gt;INFO : The url to track the job: http://[ip]:8088/proxy/application_1446191535269_0137/&lt;BR /&gt;INFO : Starting Job = job_1446191535269_0137, Tracking URL = http://[ip]:8088/proxy/application_1446191535269_0137/&lt;BR /&gt;INFO : Kill Command = /opt/cloudera/parcels/CDH-5.4.8-1.cdh5.4.8.p0.4/lib/hadoop/bin/hadoop job -kill job_1446191535269_0137&lt;BR /&gt;INFO : Hadoop job information for Stage-1: number of mappers: 2; number of reducers: 2&lt;BR /&gt;INFO : 2015-11-03 08:37:40,662 Stage-1 map = 0%, reduce = 0%&lt;BR /&gt;INFO : 2015-11-03 08:37:48,910 Stage-1 map = 50%, reduce = 0%, Cumulative CPU 4.66 sec&lt;BR /&gt;INFO : 2015-11-03 08:37:49,941 Stage-1 map = 100%, reduce = 0%, Cumulative CPU 9.75 sec&lt;BR /&gt;INFO : 2015-11-03 08:37:58,192 Stage-1 map = 100%, reduce = 100%, Cumulative CPU 16.76 sec&lt;BR /&gt;INFO : MapReduce Total cumulative CPU time: 16 seconds 760 msec&lt;BR /&gt;INFO : Ended Job = job_1446191535269_0137&lt;BR /&gt;INFO : Number of reduce tasks not specified. 
Estimated from input data size: 1&lt;BR /&gt;INFO : In order to change the average load for a reducer (in bytes):&lt;BR /&gt;INFO : set hive.exec.reducers.bytes.per.reducer=&amp;lt;number&amp;gt;&lt;BR /&gt;INFO : In order to limit the maximum number of reducers:&lt;BR /&gt;INFO : set hive.exec.reducers.max=&amp;lt;number&amp;gt;&lt;BR /&gt;INFO : In order to set a constant number of reducers:&lt;BR /&gt;INFO : set mapreduce.job.reduces=&amp;lt;number&amp;gt;&lt;BR /&gt;WARN : Hadoop command-line option parsing not performed. Implement the Tool interface and execute your application with ToolRunner to remedy this.&lt;BR /&gt;INFO : number of splits:1&lt;BR /&gt;INFO : Submitting tokens for job: job_1446191535269_0138&lt;BR /&gt;INFO : The url to track the job: http://[ip]:8088/proxy/application_1446191535269_0138/&lt;BR /&gt;INFO : Starting Job = job_1446191535269_0138, Tracking URL = http://[ip]:8088/proxy/application_1446191535269_0138/&lt;BR /&gt;INFO : Kill Command = /opt/cloudera/parcels/CDH-5.4.8-1.cdh5.4.8.p0.4/lib/hadoop/bin/hadoop job -kill job_1446191535269_0138&lt;BR /&gt;INFO : Hadoop job information for Stage-2: number of mappers: 1; number of reducers: 1&lt;BR /&gt;INFO : 2015-11-03 08:38:05,525 Stage-2 map = 0%, reduce = 0%&lt;BR /&gt;INFO : 2015-11-03 08:38:12,738 Stage-2 map = 100%, reduce = 0%, Cumulative CPU 1.18 sec&lt;BR /&gt;INFO : 2015-11-03 08:38:18,932 Stage-2 map = 100%, reduce = 100%, Cumulative CPU 2.87 sec&lt;BR /&gt;INFO : MapReduce Total cumulative CPU time: 2 seconds 870 msec&lt;BR /&gt;INFO : Ended Job = job_1446191535269_0138&lt;BR /&gt;Getting log thread is interrupted, since query is done!&lt;BR /&gt;+--------------------------------+--+&lt;BR /&gt;| avg_commute_time |&lt;BR /&gt;+--------------------------------+--+&lt;BR /&gt;| 9.22561984510033 |&lt;BR /&gt;| 6.97536844275076 |&lt;BR /&gt;| 3.4043091344593 |&lt;BR /&gt;| 8.97108984313809 |&lt;BR /&gt;| 9.47274416580304 |&lt;BR /&gt;+--------------------------------+--+&lt;BR 
/&gt;5 rows selected (46.722 seconds)&lt;BR /&gt;0: jdbc:hive2://localhost:10000&amp;gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;How can this be explained? I'd like to keep&amp;nbsp;hive.auto.convert.join=true because users who run queries via HUE do not know what map-side joins are.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Best regards, Margus (margusja) Roo&lt;/P&gt;</description>
      <pubDate>Tue, 21 Apr 2026 13:56:29 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/hive-auto-convert-join-Execution-Error-return-code-3-from/m-p/33715#M9873</guid>
      <dc:creator>margusja</dc:creator>
      <dc:date>2026-04-21T13:56:29Z</dc:date>
    </item>
    <item>
      <title>Re: hive.auto.convert.join Execution Error, return code 3 from MapredLocalTask (state=08S01,code=3)</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/hive-auto-convert-join-Execution-Error-return-code-3-from/m-p/33773#M9874</link>
      <description>&lt;P&gt;Hi,&lt;BR /&gt;&lt;BR /&gt;Errors involving "MapredLocalTask" together with "hive.auto.convert.join" are often related to the JVM heap size.&lt;BR /&gt;&lt;BR /&gt;If you run the query from the Hive CLI, try increasing the JVM heap size of the Hive Gateway.&lt;BR /&gt;If you run the query through HUE (or over a JDBC connection), try increasing the JVM heap size of HiveServer2.&lt;BR /&gt;&lt;BR /&gt;Good luck.&lt;/P&gt;</description>
      <pubDate>Wed, 04 Nov 2015 15:21:50 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/hive-auto-convert-join-Execution-Error-return-code-3-from/m-p/33773#M9874</guid>
      <dc:creator>mathieu.d</dc:creator>
      <dc:date>2015-11-04T15:21:50Z</dc:date>
    </item>
    <item>
      <title>Re: hive.auto.convert.join Execution Error, return code 3 from MapredLocalTask (state=08S01,code=3)</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/hive-auto-convert-join-Execution-Error-return-code-3-from/m-p/40994#M9875</link>
      <description>&lt;P&gt;Greetings&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I am having the same problem. I increased the heap size as suggested, but my Hive/MR job fails:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;hive&amp;gt; set mapreduce.map.memory.mb=2048;&lt;BR /&gt;hive&amp;gt; set mapreduce.reduce.memory.mb=4096;&lt;BR /&gt;hive&amp;gt; select count(distinct warctype) from commoncrawl18 where warctype='warcinfo';&lt;BR /&gt;Query ID = jmill383_20160518141345_91d2a202-049e-4546-a9f7-e7183f2ff4bf&lt;BR /&gt;Total jobs = 1&lt;BR /&gt;Launching Job 1 out of 1&lt;BR /&gt;Number of reduce tasks determined at compile time: 1&lt;BR /&gt;In order to change the average load for a reducer (in bytes):&lt;BR /&gt;set hive.exec.reducers.bytes.per.reducer=&amp;lt;number&amp;gt;&lt;BR /&gt;In order to limit the maximum number of reducers:&lt;BR /&gt;set hive.exec.reducers.max=&amp;lt;number&amp;gt;&lt;BR /&gt;In order to set a constant number of reducers:&lt;BR /&gt;set mapreduce.job.reduces=&amp;lt;number&amp;gt;&lt;BR /&gt;Starting Job = job_1463594979064_0001, Tracking URL = http://starchild:8088/proxy/application_1463594979064_0001/&lt;BR /&gt;Kill Command = /opt/hadoop/bin/hadoop job -kill job_1463594979064_0001&lt;BR /&gt;Hadoop job information for Stage-1: number of mappers: 0; number of reducers: 0&lt;BR /&gt;2016-05-18 14:13:52,640 Stage-1 map = 0%, reduce = 0%&lt;BR /&gt;Ended Job = job_1463594979064_0001 with errors&lt;BR /&gt;Error during job, obtaining debugging information...&lt;BR /&gt;Job Tracking URL: http://starchild:8088/cluster/app/application_1463594979064_0001&lt;BR /&gt;FAILED: Execution Error, return code 2 from org.apache.hadoop.hive.ql.exec.mr.MapRedTask&lt;BR /&gt;MapReduce Jobs Launched:&lt;BR /&gt;Stage-Stage-1: HDFS Read: 0 HDFS Write: 0 FAIL&lt;BR /&gt;Total MapReduce CPU Time Spent: 0 msec&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Is there an alternative remedy for 
this? Please advise.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;John M&lt;/P&gt;</description>
      <pubDate>Wed, 18 May 2016 18:18:08 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/hive-auto-convert-join-Execution-Error-return-code-3-from/m-p/40994#M9875</guid>
      <dc:creator>FIBERNACHI</dc:creator>
      <dc:date>2016-05-18T18:18:08Z</dc:date>
    </item>
    <item>
      <title>Re: hive.auto.convert.join Execution Error, return code 3 from MapredLocalTask (state=08S01,code=3)</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/hive-auto-convert-join-Execution-Error-return-code-3-from/m-p/41022#M9876</link>
      <description>&lt;P&gt;I'm not seeing the same issue here.&lt;/P&gt;&lt;P&gt;Check the YARN application logs. They should contain more information about the failure.&lt;/P&gt;</description>
      <pubDate>Thu, 19 May 2016 07:41:24 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/hive-auto-convert-join-Execution-Error-return-code-3-from/m-p/41022#M9876</guid>
      <dc:creator>mathieu.d</dc:creator>
      <dc:date>2016-05-19T07:41:24Z</dc:date>
    </item>
  </channel>
</rss>

