<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Out of Memory Error in Hive in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Out-of-Memory-Error-in-Hive/m-p/125850#M34537</link>
    <description>&lt;P&gt;Where exactly is the OOM occurring? Is it in the AM? The table are partitioned?&lt;/P&gt;&lt;P&gt;Does this work for you with a scaled down dataset? Table definition will be helpful yo look at&lt;/P&gt;</description>
    <pubDate>Wed, 13 Jul 2016 04:46:02 GMT</pubDate>
    <dc:creator>schauhan</dc:creator>
    <dc:date>2016-07-13T04:46:02Z</dc:date>
    <item>
      <title>Out of Memory Error in Hive</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Out-of-Memory-Error-in-Hive/m-p/125845#M34532</link>
      <description>&lt;P&gt;I am getting the below error while trying to execute a query like "select * from a where a.col1 not in (select b.col1 from b)"&lt;/P&gt;&lt;P&gt;Exception in thread "main"
java.lang.OutOfMemoryError: Java heap space&lt;/P&gt;&lt;P&gt;  at
java.util.Arrays.copyOfRange(Arrays.java:2694)&lt;/P&gt;&lt;P&gt;  at java.lang.String.&amp;lt;init&amp;gt;(String.java:203)&lt;/P&gt;&lt;P&gt;  at
java.lang.StringBuilder.toString(StringBuilder.java:405)&lt;/P&gt;&lt;P&gt;  at
org.apache.hadoop.fs.Path.toString(Path.java:390)&lt;/P&gt;&lt;P&gt;  at
org.apache.hadoop.hive.ql.optimizer.AbstractBucketJoinProc.getBucketFilePathsOfPartition(AbstractBucketJoinProc.java:87)&lt;/P&gt;&lt;P&gt;  at
org.apache.hadoop.hive.ql.optimizer.metainfo.annotation.OpTraitsRulesProcFactory$TableScanRule.checkBucketedTable(OpTraitsRulesProcFactory.java:147)&lt;/P&gt;&lt;P&gt;  at
org.apache.hadoop.hive.ql.optimizer.metainfo.annotation.OpTraitsRulesProcFactory$TableScanRule.process(OpTraitsRulesProcFactory.java:174)&lt;/P&gt;&lt;P&gt;  at
org.apache.hadoop.hive.ql.lib.DefaultRuleDispatcher.dispatch(DefaultRuleDispatcher.java:90)&lt;/P&gt;&lt;P&gt;  at org.apache.hadoop.hive.ql.lib.DefaultGraphWalker.dispatchAndReturn(DefaultGraphWalker.java:95)&lt;/P&gt;&lt;P&gt;  at
org.apache.hadoop.hive.ql.lib.DefaultGraphWalker.dispatch(DefaultGraphWalker.java:79)&lt;/P&gt;&lt;P&gt;  at
org.apache.hadoop.hive.ql.lib.PreOrderWalker.walk(PreOrderWalker.java:56)&lt;/P&gt;&lt;P&gt;  at
org.apache.hadoop.hive.ql.lib.DefaultGraphWalker.startWalking(DefaultGraphWalker.java:110)&lt;/P&gt;&lt;P&gt;  at
org.apache.hadoop.hive.ql.optimizer.metainfo.annotation.AnnotateWithOpTraits.transform(AnnotateWithOpTraits.java:91)&lt;/P&gt;&lt;P&gt;  at
org.apache.hadoop.hive.ql.parse.TezCompiler.runStatsAnnotation(TezCompiler.java:249)&lt;/P&gt;&lt;P&gt;  at
org.apache.hadoop.hive.ql.parse.TezCompiler.optimizeOperatorPlan(TezCompiler.java:122)&lt;/P&gt;&lt;P&gt;  at
org.apache.hadoop.hive.ql.parse.TaskCompiler.compile(TaskCompiler.java:102)&lt;/P&gt;&lt;P&gt;  at
org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.analyzeInternal(SemanticAnalyzer.java:10188)&lt;/P&gt;&lt;P&gt;  at
org.apache.hadoop.hive.ql.parse.CalcitePlanner.analyzeInternal(CalcitePlanner.java:211)&lt;/P&gt;&lt;P&gt;  at
org.apache.hadoop.hive.ql.parse.BaseSemanticAnalyzer.analyze(BaseSemanticAnalyzer.java:227)&lt;/P&gt;&lt;P&gt;  at
org.apache.hadoop.hive.ql.Driver.compile(Driver.java:424)&lt;/P&gt;&lt;P&gt;  at
org.apache.hadoop.hive.ql.Driver.compile(Driver.java:308)&lt;/P&gt;&lt;P&gt;   at
org.apache.hadoop.hive.ql.Driver.compileInternal(Driver.java:1122)&lt;/P&gt;&lt;P&gt;  at
org.apache.hadoop.hive.ql.Driver.runInternal(Driver.java:1170)&lt;/P&gt;&lt;P&gt;  at
org.apache.hadoop.hive.ql.Driver.run(Driver.java:1059)&lt;/P&gt;&lt;P&gt;  at
org.apache.hadoop.hive.ql.Driver.run(Driver.java:1049)&lt;/P&gt;&lt;P&gt;  at org.apache.hadoop.hive.cli.CliDriver.processLocalCmd(CliDriver.java:213)&lt;/P&gt;&lt;P&gt;  at
org.apache.hadoop.hive.cli.CliDriver.processCmd(CliDriver.java:165)&lt;/P&gt;&lt;P&gt;  at
org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:376)&lt;/P&gt;&lt;P&gt;  at
org.apache.hadoop.hive.cli.CliDriver.executeDriver(CliDriver.java:736)&lt;/P&gt;&lt;P&gt;  at
org.apache.hadoop.hive.cli.CliDriver.run(CliDriver.java:681)&lt;/P&gt;&lt;P&gt;  at
org.apache.hadoop.hive.cli.CliDriver.main(CliDriver.java:621)&lt;/P&gt;&lt;P&gt;  at
sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)&lt;/P&gt;&lt;P&gt;I tried increasing value for below properties, but it is not working. Hive is running on TEZ&lt;/P&gt;&lt;P&gt;mapreduce.map.memory.mb &lt;/P&gt;&lt;P&gt;mapreduce.reduce.memory.mb &lt;/P&gt;&lt;P&gt;hive.tez.container.size &lt;/P&gt;&lt;P&gt;hive.tez.java.opts&lt;/P&gt;</description>
      <pubDate>Tue, 12 Jul 2016 23:30:29 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Out-of-Memory-Error-in-Hive/m-p/125845#M34532</guid>
      <dc:creator>pooja_khandelwa</dc:creator>
      <dc:date>2016-07-12T23:30:29Z</dc:date>
    </item>
    <item>
      <title>Re: Out of Memory Error in Hive</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Out-of-Memory-Error-in-Hive/m-p/125846#M34533</link>
      <description>&lt;P&gt;You might want to take a look at this&lt;/P&gt;&lt;P&gt;&lt;A href="https://community.hortonworks.com/questions/35403/hive-query-outofmemoryerror-java-heap-space.html" target="_blank"&gt;https://community.hortonworks.com/questions/35403/hive-query-outofmemoryerror-java-heap-space.html&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 12 Jul 2016 23:34:32 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Out-of-Memory-Error-in-Hive/m-p/125846#M34533</guid>
      <dc:creator>icocio</dc:creator>
      <dc:date>2016-07-12T23:34:32Z</dc:date>
    </item>
    <item>
      <title>Re: Out of Memory Error in Hive</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Out-of-Memory-Error-in-Hive/m-p/125847#M34534</link>
      <description>&lt;P&gt;Hello pooja&lt;/P&gt;&lt;P&gt;From your stack trace your table seems to be bucketed. Can you share your table definition&lt;/P&gt;&lt;P&gt;could you also try running the query with the setting: hive.auto.convert.join.noconditionaltask=false&lt;/P&gt;</description>
      <pubDate>Tue, 12 Jul 2016 23:44:49 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Out-of-Memory-Error-in-Hive/m-p/125847#M34534</guid>
      <dc:creator>nmaillard1</dc:creator>
      <dc:date>2016-07-12T23:44:49Z</dc:date>
    </item>
    <item>
      <title>Re: Out of Memory Error in Hive</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Out-of-Memory-Error-in-Hive/m-p/125848#M34535</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;I had tried with hive.auto.convert.join.noconditionaltask=false, but didn't work.&lt;/P&gt;&lt;P&gt;No table is bucketed. &lt;/P&gt;</description>
      <pubDate>Tue, 12 Jul 2016 23:56:28 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Out-of-Memory-Error-in-Hive/m-p/125848#M34535</guid>
      <dc:creator>pooja_khandelwa</dc:creator>
      <dc:date>2016-07-12T23:56:28Z</dc:date>
    </item>
    <item>
      <title>Re: Out of Memory Error in Hive</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Out-of-Memory-Error-in-Hive/m-p/125849#M34536</link>
      <description>&lt;P&gt;The problem is probably because there is too much data moving through the shuffle phase.  You can reduce the amount of data moving between tasks as part of the SHUFFLE steps by using more aggressive queries and by looking carefully at your input splits and reduce summary steps.  If you have the Ambari TEZ View installed, then I would recommend inspecting each of the TEZ tasks and look at the SHUFFLE BYTES counters to see how much data is moving between the steps.  If you see the early steps are moving a lot of data between the tasks then you have probably found the root cause of your out of memory exception and you should be able to tune your  Hive query to filter data earlier in the p&lt;/P&gt;</description>
      <pubDate>Wed, 13 Jul 2016 02:59:01 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Out-of-Memory-Error-in-Hive/m-p/125849#M34536</guid>
      <dc:creator>mjohnson</dc:creator>
      <dc:date>2016-07-13T02:59:01Z</dc:date>
    </item>
    <item>
      <title>Re: Out of Memory Error in Hive</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Out-of-Memory-Error-in-Hive/m-p/125850#M34537</link>
      <description>&lt;P&gt;Where exactly is the OOM occurring? Is it in the AM? The table are partitioned?&lt;/P&gt;&lt;P&gt;Does this work for you with a scaled down dataset? Table definition will be helpful yo look at&lt;/P&gt;</description>
      <pubDate>Wed, 13 Jul 2016 04:46:02 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Out-of-Memory-Error-in-Hive/m-p/125850#M34537</guid>
      <dc:creator>schauhan</dc:creator>
      <dc:date>2016-07-13T04:46:02Z</dc:date>
    </item>
    <item>
      <title>Re: Out of Memory Error in Hive</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Out-of-Memory-Error-in-Hive/m-p/125851#M34538</link>
      <description>&lt;A rel="user" href="https://community.cloudera.com/users/131/nmaillard.html" nodeid="131"&gt;@nmaillard&lt;/A&gt;&lt;P&gt; I am have a same problem. I am on &lt;A href="http://host:8080/#/main/services/HIVE/configs" target="_blank"&gt;http://host:8080/#/main/services/HIVE/configs&lt;/A&gt; but not sure what to change in the configs.&lt;/P&gt;</description>
      <pubDate>Thu, 21 Jul 2016 00:28:30 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Out-of-Memory-Error-in-Hive/m-p/125851#M34538</guid>
      <dc:creator>rajinder_kaur</dc:creator>
      <dc:date>2016-07-21T00:28:30Z</dc:date>
    </item>
    <item>
      <title>Re: Out of Memory Error in Hive</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Out-of-Memory-Error-in-Hive/m-p/125852#M34539</link>
      <description>&lt;P&gt;I am facing the same issue . Can someone please help&lt;/P&gt;</description>
      <pubDate>Wed, 11 Apr 2018 04:59:43 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Out-of-Memory-Error-in-Hive/m-p/125852#M34539</guid>
      <dc:creator>chsanthosh_sep</dc:creator>
      <dc:date>2018-04-11T04:59:43Z</dc:date>
    </item>
  </channel>
</rss>

