<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Hive on Tez query Map output OutOfMemoryError: Java heap space at java.nio.HeapByteBuffer.(HeapByteBuffer.java:57) in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Hive-on-Tez-query-Map-output-OutOfMemoryError-Java-heap/m-p/363447#M238989</link>
    <description>&lt;P&gt;I'm running into the same situation:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;LI-CODE lang="java"&gt; Error: Error while running task ( failure ) : java.lang.OutOfMemoryError: Java heap space
  at java.nio.HeapByteBuffer.&amp;lt;init&amp;gt;(HeapByteBuffer.java:57)
  at java.nio.ByteBuffer.allocate(ByteBuffer.java:335)
  at org.apache.tez.runtime.library.common.sort.impl.PipelinedSorter.allocateSpace(PipelinedSorter.java:250)
  at org.apache.tez.runtime.library.common.sort.impl.PipelinedSorter$SortSpan.end(PipelinedSorter.java:1054)
  at org.apache.tez.runtime.library.common.sort.impl.PipelinedSorter$SortSpan.next(PipelinedSorter.java:1009)
  at org.apache.tez.runtime.library.common.sort.impl.PipelinedSorter.sort(PipelinedSorter.java:318)
  at org.apache.tez.runtime.library.common.sort.impl.PipelinedSorter.collect(PipelinedSorter.java:423)
  at org.apache.tez.runtime.library.common.sort.impl.PipelinedSorter.write(PipelinedSorter.java:379)
  at org.apache.tez.runtime.library.output.OrderedPartitionedKVOutput$1.write(OrderedPartitionedKVOutput.java:167)
  at org.apache.hadoop.hive.ql.exec.tez.TezProcessor$TezKVOutputCollector.collect(TezProcessor.java:355)
  at org.apache.hadoop.hive.ql.exec.ReduceSinkOperator.collect(ReduceSinkOperator.java:511)
  at org.apache.hadoop.hive.ql.exec.ReduceSinkOperator.process(ReduceSinkOperator.java:367)
  at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:939)
  at org.apache.hadoop.hive.ql.exec.GroupByOperator.forward(GroupByOperator.java:1050)
  at org.apache.hadoop.hive.ql.exec.GroupByOperator.flushHashTable(GroupByOperator.java:998)
  at org.apache.hadoop.hive.ql.exec.GroupByOperator.process(GroupByOperator.java:750)
  at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:939)
  at org.apache.hadoop.hive.ql.exec.SelectOperator.process(SelectOperator.java:95)
  at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:939)
  at org.apache.hadoop.hive.ql.exec.CommonJoinOperator.internalForward(CommonJoinOperator.java:825)
  at org.apache.hadoop.hive.ql.exec.CommonJoinOperator.genAllOneUniqueJoinObject(CommonJoinOperator.java:857)
  at org.apache.hadoop.hive.ql.exec.CommonJoinOperator.checkAndGenObject(CommonJoinOperator.java:941)
  at org.apache.hadoop.hive.ql.exec.MapJoinOperator.process(MapJoinOperator.java:590)
  at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:939)
  at org.apache.hadoop.hive.ql.exec.SelectOperator.process(SelectOperator.java:95)
  at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:939)
  at org.apache.hadoop.hive.ql.exec.FilterOperator.process(FilterOperator.java:126)
  at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:939)
  at org.apache.hadoop.hive.ql.exec.TableScanOperator.process(TableScanOperator.java:128)
  at org.apache.hadoop.hive.ql.exec.MapOperator$MapOpCtx.forward(MapOperator.java:153)
  at org.apache.hadoop.hive.ql.exec.MapOperator.process(MapOperator.java:555)
  at org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.processRow(MapRecordSource.java:92)&lt;/LI-CODE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I'm not sure why we need to set the container or buffer size manually. Shouldn't Tez do the calculations and use only what's available?&lt;/P&gt;</description>
    <pubDate>Thu, 09 Feb 2023 03:05:14 GMT</pubDate>
    <dc:creator>dz902</dc:creator>
    <dc:date>2023-02-09T03:05:14Z</dc:date>
    <item>
      <title>Hive on Tez query Map output OutOfMemoryError: Java heap space at java.nio.HeapByteBuffer.(HeapByteBuffer.java:57)</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Hive-on-Tez-query-Map-output-OutOfMemoryError-Java-heap/m-p/98436#M61549</link>
      <description>&lt;P&gt;When I use Hive on Tez to run INSERT OVERWRITE TABLE from another table, I get the following error. It does not happen every time; sometimes the query succeeds:&lt;/P&gt;&lt;PRE&gt;"Error while processing statement: FAILED: Execution Error, return code 2 from org.apache.hadoop.hive.ql.exec.tez.TezTask. Vertex failed, vertexName=Map 1, vertexId=vertex_1449486079177_5239_1_01, diagnostics=[Task failed, taskId=task_1449486079177_5239_1_01_000018, diagnostics=[TaskAttempt 0 failed, info=[Error: Failure while running task: attempt_1449486079177_5239_1_01_000018_0:java.lang.RuntimeException: java.lang.OutOfMemoryError: Java heap space
  at org.apache.hadoop.hive.ql.exec.tez.TezProcessor.initializeAndRunProcessor(TezProcessor.java:157)
  at org.apache.hadoop.hive.ql.exec.tez.TezProcessor.run(TezProcessor.java:137)
  at org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.run(LogicalIOProcessorRuntimeTask.java:348)
  at org.apache.tez.runtime.task.TaskRunner2Callable$1.run(TaskRunner2Callable.java:71)
  at org.apache.tez.runtime.task.TaskRunner2Callable$1.run(TaskRunner2Callable.java:60)
  at java.security.AccessController.doPrivileged(Native Method)
  at javax.security.auth.Subject.doAs(Subject.java:415)
  at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1614)
  at org.apache.tez.runtime.task.TaskRunner2Callable.callInternal(TaskRunner2Callable.java:60)
  at org.apache.tez.runtime.task.TaskRunner2Callable.callInternal(TaskRunner2Callable.java:35)
  at org.apache.tez.common.CallableWithNdc.call(CallableWithNdc.java:36)
  at java.util.concurrent.FutureTask.run(FutureTask.java:262)
  at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
  at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
  at java.lang.Thread.run(Thread.java:745)
Caused by: java.lang.OutOfMemoryError: Java heap space
  at java.nio.HeapByteBuffer.&amp;lt;init&amp;gt;(HeapByteBuffer.java:57)
  at java.nio.ByteBuffer.allocate(ByteBuffer.java:331)
  at org.apache.tez.runtime.library.common.sort.impl.PipelinedSorter.&amp;lt;init&amp;gt;(PipelinedSorter.java:173)
  at org.apache.tez.runtime.library.common.sort.impl.PipelinedSorter.&amp;lt;init&amp;gt;(PipelinedSorter.java:117)
  at org.apache.tez.runtime.library.output.OrderedPartitionedKVOutput.start(OrderedPartitionedKVOutput.java:141)
  at org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.init(MapRecordProcessor.java:141)
  at org.apache.hadoop.hive.ql.exec.tez.TezProcessor.initializeAndRunProcessor(TezProcessor.java:147)
  ... 14 more
], TaskAttempt 1 failed, info=[Error: Failure while running task: attempt_1449486079177_5239_1_01_000018_1:java.lang.RuntimeException: java.lang.OutOfMemoryError: Java heap space
  at org.apache.hadoop.hive.ql.exec.tez.TezProcessor.initializeAndRunProcessor(TezProcessor.java:157)
  at org.apache.hadoop.hive.ql.exec.tez.TezProcessor.run(TezProcessor.java:137)
  at org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.run(LogicalIOProcessorRuntimeTask.java:348)
  at org.apache.tez.runtime.task.TaskRunner2Callable$1.run(TaskRunner2Callable.java:71)
  at org.apache.tez.runtime.task.TaskRunner2Callable$1.run(TaskRunner2Callable.java:60)
  at java.security.AccessController.doPrivileged(Native Method)
  at javax.security.auth.Subject.doAs(Subject.java:415)
  at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1614)
  at org.apache.tez.runtime.task.TaskRunner2Callable.callInternal(TaskRunner2Callable.java:60)
  at org.apache.tez.runtime.task.TaskRunner2Callable.callInternal(TaskRunner2Callable.java:35)
  at org.apache.tez.common.CallableWithNdc.call(CallableWithNdc.java:36)
  at java.util.concurrent.FutureTask.run(FutureTask.java:262)
  at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
  at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
  at java.lang.Thread.run(Thread.java:745)
Caused by: java.lang.OutOfMemoryError: Java heap space
  at java.nio.HeapByteBuffer.&amp;lt;init&amp;gt;(HeapByteBuffer.java:57)
  at java.nio.ByteBuffer.allocate(ByteBuffer.java:331)
  at org.apache.tez.runtime.library.common.sort.impl.PipelinedSorter.&amp;lt;init&amp;gt;(PipelinedSorter.java:173)
  at org.apache.tez.runtime.library.common.sort.impl.PipelinedSorter.&amp;lt;init&amp;gt;(PipelinedSorter.java:117)
  at org.apache.tez.runtime.library.output.OrderedPartitionedKVOutput.start(OrderedPartitionedKVOutput.java:141)
  at org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.init(MapRecordProcessor.java:141)
  at org.apache.hadoop.hive.ql.exec.tez.TezProcessor.initializeAndRunProcessor(TezProcessor.java:147)
  ... 14 more
], TaskAttempt 2&lt;/PRE&gt;</description>
      <pubDate>Thu, 10 Dec 2015 14:48:22 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Hive-on-Tez-query-Map-output-OutOfMemoryError-Java-heap/m-p/98436#M61549</guid>
      <dc:creator>connectchen</dc:creator>
      <dc:date>2015-12-10T14:48:22Z</dc:date>
    </item>
    <item>
      <title>Re: Hive on Tez query Map output OutOfMemoryError: Java heap space at java.nio.HeapByteBuffer.(HeapByteBuffer.java:57)</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Hive-on-Tez-query-Map-output-OutOfMemoryError-Java-heap/m-p/98437#M61550</link>
      <description>&lt;P&gt;This error occurs when the combined size of the data loaded for the join exceeds the memory reserved for the hash tables (see the config parameter below).&lt;/P&gt;&lt;PRE&gt;hive.auto.convert.join.noconditionaltask.size=1370MB&lt;/PRE&gt;</description>
      <pubDate>Thu, 10 Dec 2015 18:46:15 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Hive-on-Tez-query-Map-output-OutOfMemoryError-Java-heap/m-p/98437#M61550</guid>
      <dc:creator>vt0084883</dc:creator>
      <dc:date>2015-12-10T18:46:15Z</dc:date>
    </item>
    <item>
      <title>Re: Hive on Tez query Map output OutOfMemoryError: Java heap space at java.nio.HeapByteBuffer.(HeapByteBuffer.java:57)</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Hive-on-Tez-query-Map-output-OutOfMemoryError-Java-heap/m-p/98438#M61551</link>
      <description>&lt;P&gt;Setting hive.auto.convert.join.noconditionaltask.size=1436549120 did not help. After setting hive.tez.container.size=2048 and hive.tez.java.opts=-Xmx1700m, the OOM problem was solved.&lt;/P&gt;</description>
      <pubDate>Fri, 11 Dec 2015 10:36:58 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Hive-on-Tez-query-Map-output-OutOfMemoryError-Java-heap/m-p/98438#M61551</guid>
      <dc:creator>connectchen</dc:creator>
      <dc:date>2015-12-11T10:36:58Z</dc:date>
    </item>
    <item>
      <title>Re: Hive on Tez query Map output OutOfMemoryError: Java heap space at java.nio.HeapByteBuffer.(HeapByteBuffer.java:57)</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Hive-on-Tez-query-Map-output-OutOfMemoryError-Java-heap/m-p/98439#M61552</link>
      <description>&lt;P&gt;Try the following, assuming your hive.tez.container.size=2048:&lt;/P&gt;&lt;P&gt;set hive.tez.java.opts=-Xmx1640m (about 0.8 times hive.tez.container.size)&lt;/P&gt;&lt;P&gt;set tez.runtime.io.sort.mb=820 (about 0.4 times hive.tez.container.size)&lt;/P&gt;&lt;P&gt;set tez.runtime.unordered.output.buffer.size-mb=205 (about 0.1 times hive.tez.container.size)&lt;/P&gt;</description>
      <pubDate>Fri, 11 Dec 2015 12:42:36 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Hive-on-Tez-query-Map-output-OutOfMemoryError-Java-heap/m-p/98439#M61552</guid>
      <dc:creator>deepesh1</dc:creator>
      <dc:date>2015-12-11T12:42:36Z</dc:date>
    </item>
    <item>
      <title>Re: Hive on Tez query Map output OutOfMemoryError: Java heap space at java.nio.HeapByteBuffer.(HeapByteBuffer.java:57)</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Hive-on-Tez-query-Map-output-OutOfMemoryError-Java-heap/m-p/98440#M61553</link>
      <description>&lt;UL&gt;
&lt;LI&gt;&lt;EM&gt;Total jobs = 3
Launching Job 1 out of 3
Number of reduce tasks is set to 0 since there's no reduce operator
Starting Job = job_1455546410616_13085, Tracking URL = &lt;A href="http://ndrm:8088/proxy/application_1455546410616_13085/" target="_blank"&gt;http://ndrm:8088/proxy/application_1455546410616_13085/&lt;/A&gt;
Kill Command = /usr/lib/hadoop/bin/hadoop job  -kill job_1455546410616_13085
Hadoop job information for Stage-1: number of mappers: 7; number of reducers: 0
2016-03-03 13:39:54,224 Stage-1 map = 0%,  reduce = 0%
2016-03-03 13:40:04,733 Stage-1 map = 57%,  reduce = 0%, Cumulative CPU 13.0 sec
2016-03-03 13:40:26,943 Stage-1 map = 86%,  reduce = 0%, Cumulative CPU 112.9 sec
2016-03-03 13:40:30,114 Stage-1 map = 96%,  reduce = 0%, Cumulative CPU 142.98 sec
2016-03-03 13:40:48,010 Stage-1 map = 86%,  reduce = 0%, Cumulative CPU 104.61 sec
2016-03-03 13:41:22,610 Stage-1 map = 96%,  reduce = 0%, Cumulative CPU 142.05 sec
2016-03-03 13:41:40,425 Stage-1 map = 86%,  reduce = 0%, Cumulative CPU 104.61 sec
2016-03-03 13:42:16,026 Stage-1 map = 96%,  reduce = 0%, Cumulative CPU 143.26 sec
2016-03-03 13:42:34,857 Stage-1 map = 86%,  reduce = 0%, Cumulative CPU 104.61 sec
2016-03-03 13:43:09,393 Stage-1 map = 96%,  reduce = 0%, Cumulative CPU 144.34 sec
2016-03-03 13:43:28,197 Stage-1 map = 100%,  reduce = 0%, Cumulative CPU 104.61 sec
MapReduce Total cumulative CPU time: 1 minutes 44 seconds 610 msec
Ended Job = job_1455546410616_13085 with errors
Error during job, obtaining debugging information...
Examining task ID: task_1455546410616_13085_m_000003 (and more) from job job_1455546410616_13085
Task with the most failures(4):
-----
Task ID:
  task_1455546410616_13085_m_000000
URL:
  &lt;A href="http://ndrm:8088/taskdetails.jsp?jobid=job_1455546410616_13085&amp;amp;tipid=task_1455546410616_13085_m_000000" target="_blank"&gt;http://ndrm:8088/taskdetails.jsp?jobid=job_1455546410616_13085&amp;amp;tipid=task_1455546410616_13085_m_000000&lt;/A&gt;
-----
Diagnostic Messages for this Task:
Error: Java heap space
&lt;/EM&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;Increasing the JVM heap and the map memory allocated to the container helped in my case.&lt;/P&gt;&lt;P&gt;These are the values I used:&lt;/P&gt;&lt;P&gt;hive&amp;gt; set mapreduce.map.memory.mb=4096;&lt;/P&gt;&lt;P&gt;hive&amp;gt; set mapreduce.map.java.opts=-Xmx3600M;&lt;/P&gt;&lt;P&gt;If you still get the Java heap error, try higher values, but make sure that the heap size in mapreduce.map.java.opts does not exceed mapreduce.map.memory.mb.&lt;/P&gt;&lt;P&gt;In the case of Tez, you may have to run: set hive.tez.java.opts=-Xmx3600M;&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;</description>
      <pubDate>Fri, 04 Mar 2016 11:28:38 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Hive-on-Tez-query-Map-output-OutOfMemoryError-Java-heap/m-p/98440#M61553</guid>
      <dc:creator>Neethu PL</dc:creator>
      <dc:date>2016-03-04T11:28:38Z</dc:date>
    </item>
    <item>
      <title>Re: Hive on Tez query Map output OutOfMemoryError: Java heap space at java.nio.HeapByteBuffer.(HeapByteBuffer.java:57)</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Hive-on-Tez-query-Map-output-OutOfMemoryError-Java-heap/m-p/98441#M61554</link>
      <description>&lt;P&gt;I am running on Azure using the Maria_dev login. Where can I enter these 3 lines of code? In the Tez config, everything is locked and cannot be edited.&lt;/P&gt;</description>
      <pubDate>Sat, 16 Apr 2016 21:01:42 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Hive-on-Tez-query-Map-output-OutOfMemoryError-Java-heap/m-p/98441#M61554</guid>
      <dc:creator>chen_zhi</dc:creator>
      <dc:date>2016-04-16T21:01:42Z</dc:date>
    </item>
    <item>
      <title>Re: Hive on Tez query Map output OutOfMemoryError: Java heap space at java.nio.HeapByteBuffer.(HeapByteBuffer.java:57)</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Hive-on-Tez-query-Map-output-OutOfMemoryError-Java-heap/m-p/98442#M61555</link>
      <description>&lt;P&gt;
	&lt;A rel="user" href="https://community.cloudera.com/users/1169/connectchen.html" nodeid="1169"&gt;@Jun Chen&lt;/A&gt;&lt;/P&gt;&lt;P&gt;
SSH to your server, open /etc/tez/conf/tez-site.xml, and make these changes. If that does not work, try larger values:&lt;/P&gt;&lt;UL&gt;
  
&lt;LI&gt;tez.am.resource.memory.mb &amp;gt; 768 &lt;/LI&gt;  
&lt;LI&gt;tez.task.resource.memory.mb &amp;gt; 768&lt;/LI&gt;  
&lt;LI&gt;tez.am.java.opts: &amp;gt; -Xmx560m -Xms560m&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;
Do the same for /etc/tez/conf/hive-site.xml:&lt;/P&gt;&lt;UL&gt;
  
&lt;LI&gt;hive.tez.container.size:  -&amp;gt; 768&lt;/LI&gt;  
&lt;LI&gt;hive.tez.java.opts: -&amp;gt; -Xmx560m -Xms560m&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;
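As a rough sketch of the hive-site.xml entries listed above, assuming the usual Hadoop-style property layout (the values are only the starting points from the list, so raise them if the error persists):&lt;/P&gt;&lt;PRE&gt;&amp;lt;!-- Illustrative only: starting values from the list above, not tuned recommendations --&amp;gt;
&amp;lt;property&amp;gt;
  &amp;lt;name&amp;gt;hive.tez.container.size&amp;lt;/name&amp;gt;
  &amp;lt;value&amp;gt;768&amp;lt;/value&amp;gt;
&amp;lt;/property&amp;gt;
&amp;lt;property&amp;gt;
  &amp;lt;name&amp;gt;hive.tez.java.opts&amp;lt;/name&amp;gt;
  &amp;lt;value&amp;gt;-Xmx560m -Xms560m&amp;lt;/value&amp;gt;
&amp;lt;/property&amp;gt;&lt;/PRE&gt;&lt;P&gt;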
Then run &lt;/P&gt;&lt;P&gt;
  $&amp;gt; su hive&lt;/P&gt;&lt;P&gt;  $&amp;gt; hive&lt;/P&gt;&lt;P&gt;
and run your query.&lt;/P&gt;</description>
      <pubDate>Fri, 29 Apr 2016 07:32:13 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Hive-on-Tez-query-Map-output-OutOfMemoryError-Java-heap/m-p/98442#M61555</guid>
      <dc:creator>masoud</dc:creator>
      <dc:date>2016-04-29T07:32:13Z</dc:date>
    </item>
    <item>
      <title>Re: Hive on Tez query Map output OutOfMemoryError: Java heap space at java.nio.HeapByteBuffer.(HeapByteBuffer.java:57)</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Hive-on-Tez-query-Map-output-OutOfMemoryError-Java-heap/m-p/98443#M61556</link>
      <description>&lt;P&gt;Thanks, your solution worked for me - but there's a minor typo, I think you mean /etc/hive/conf/hive-site.xml for the second file, not /etc/tez/conf/hive-site.xml.&lt;/P&gt;</description>
      <pubDate>Thu, 16 Feb 2017 06:34:32 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Hive-on-Tez-query-Map-output-OutOfMemoryError-Java-heap/m-p/98443#M61556</guid>
      <dc:creator>richard_d_corfi</dc:creator>
      <dc:date>2017-02-16T06:34:32Z</dc:date>
    </item>
    <item>
      <title>Re: Hive on Tez query Map output OutOfMemoryError: Java heap space at java.nio.HeapByteBuffer.(HeapByteBuffer.java:57)</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Hive-on-Tez-query-Map-output-OutOfMemoryError-Java-heap/m-p/98444#M61557</link>
      <description>&lt;P&gt;Worked super! Thank you.&lt;/P&gt;</description>
      <pubDate>Mon, 20 Mar 2017 19:24:03 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Hive-on-Tez-query-Map-output-OutOfMemoryError-Java-heap/m-p/98444#M61557</guid>
      <dc:creator>carlos_guimarae</dc:creator>
      <dc:date>2017-03-20T19:24:03Z</dc:date>
    </item>
    <item>
      <title>Re: Hive on Tez query Map output OutOfMemoryError: Java heap space at java.nio.HeapByteBuffer.(HeapByteBuffer.java:57)</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Hive-on-Tez-query-Map-output-OutOfMemoryError-Java-heap/m-p/363447#M238989</link>
      <description>&lt;P&gt;I'm running into the same situation:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;LI-CODE lang="java"&gt; Error: Error while running task ( failure ) : java.lang.OutOfMemoryError: Java heap space
  at java.nio.HeapByteBuffer.&amp;lt;init&amp;gt;(HeapByteBuffer.java:57)
  at java.nio.ByteBuffer.allocate(ByteBuffer.java:335)
  at org.apache.tez.runtime.library.common.sort.impl.PipelinedSorter.allocateSpace(PipelinedSorter.java:250)
  at org.apache.tez.runtime.library.common.sort.impl.PipelinedSorter$SortSpan.end(PipelinedSorter.java:1054)
  at org.apache.tez.runtime.library.common.sort.impl.PipelinedSorter$SortSpan.next(PipelinedSorter.java:1009)
  at org.apache.tez.runtime.library.common.sort.impl.PipelinedSorter.sort(PipelinedSorter.java:318)
  at org.apache.tez.runtime.library.common.sort.impl.PipelinedSorter.collect(PipelinedSorter.java:423)
  at org.apache.tez.runtime.library.common.sort.impl.PipelinedSorter.write(PipelinedSorter.java:379)
  at org.apache.tez.runtime.library.output.OrderedPartitionedKVOutput$1.write(OrderedPartitionedKVOutput.java:167)
  at org.apache.hadoop.hive.ql.exec.tez.TezProcessor$TezKVOutputCollector.collect(TezProcessor.java:355)
  at org.apache.hadoop.hive.ql.exec.ReduceSinkOperator.collect(ReduceSinkOperator.java:511)
  at org.apache.hadoop.hive.ql.exec.ReduceSinkOperator.process(ReduceSinkOperator.java:367)
  at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:939)
  at org.apache.hadoop.hive.ql.exec.GroupByOperator.forward(GroupByOperator.java:1050)
  at org.apache.hadoop.hive.ql.exec.GroupByOperator.flushHashTable(GroupByOperator.java:998)
  at org.apache.hadoop.hive.ql.exec.GroupByOperator.process(GroupByOperator.java:750)
  at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:939)
  at org.apache.hadoop.hive.ql.exec.SelectOperator.process(SelectOperator.java:95)
  at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:939)
  at org.apache.hadoop.hive.ql.exec.CommonJoinOperator.internalForward(CommonJoinOperator.java:825)
  at org.apache.hadoop.hive.ql.exec.CommonJoinOperator.genAllOneUniqueJoinObject(CommonJoinOperator.java:857)
  at org.apache.hadoop.hive.ql.exec.CommonJoinOperator.checkAndGenObject(CommonJoinOperator.java:941)
  at org.apache.hadoop.hive.ql.exec.MapJoinOperator.process(MapJoinOperator.java:590)
  at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:939)
  at org.apache.hadoop.hive.ql.exec.SelectOperator.process(SelectOperator.java:95)
  at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:939)
  at org.apache.hadoop.hive.ql.exec.FilterOperator.process(FilterOperator.java:126)
  at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:939)
  at org.apache.hadoop.hive.ql.exec.TableScanOperator.process(TableScanOperator.java:128)
  at org.apache.hadoop.hive.ql.exec.MapOperator$MapOpCtx.forward(MapOperator.java:153)
  at org.apache.hadoop.hive.ql.exec.MapOperator.process(MapOperator.java:555)
  at org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.processRow(MapRecordSource.java:92)&lt;/LI-CODE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I'm not sure why we need to set the container or buffer size manually. Shouldn't Tez do the calculations and use only what's available?&lt;/P&gt;</description>
      <pubDate>Thu, 09 Feb 2023 03:05:14 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Hive-on-Tez-query-Map-output-OutOfMemoryError-Java-heap/m-p/363447#M238989</guid>
      <dc:creator>dz902</dc:creator>
      <dc:date>2023-02-09T03:05:14Z</dc:date>
    </item>
    <item>
      <title>Re: Hive on Tez query Map output OutOfMemoryError: Java heap space at java.nio.HeapByteBuffer.(HeapByteBuffer.java:57)</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Hive-on-Tez-query-Map-output-OutOfMemoryError-Java-heap/m-p/363450#M238991</link>
      <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/103391"&gt;@dz902&lt;/a&gt;, as this is an older post, you would have a better chance of receiving a resolution by &lt;A href="https://community.cloudera.com/t5/forums/postpage/board-id/Questions" target="_blank"&gt;starting a new thread&lt;/A&gt;. This will also be an opportunity to provide details specific to your environment that could aid others in assisting you with a more accurate answer to your question. You can link this thread as a reference in your new post.&lt;/P&gt;</description>
      <pubDate>Thu, 09 Feb 2023 06:09:14 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Hive-on-Tez-query-Map-output-OutOfMemoryError-Java-heap/m-p/363450#M238991</guid>
      <dc:creator>VidyaSargur</dc:creator>
      <dc:date>2023-02-09T06:09:14Z</dc:date>
    </item>
  </channel>
</rss>

