<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Sqoop virtual memory error in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Sqoop-virtual-memory-error/m-p/88920#M45540</link>
    <description>One possibility could be the fetch size (combined with some unexpectedly wide rows). Does lowering the result fetch size help?&lt;BR /&gt;&lt;BR /&gt;From &lt;A href="http://sqoop.apache.org/docs/1.4.7/SqoopUserGuide.html#idp774390917888" target="_blank"&gt;http://sqoop.apache.org/docs/1.4.7/SqoopUserGuide.html#idp774390917888&lt;/A&gt;:&lt;BR /&gt;--fetch-size Number of entries to read from database at once.&lt;BR /&gt;&lt;BR /&gt;Also, do you always see it fail with the YARN memory kill (due to pmem exhaustion) or do you also observe an actual java.lang.OutOfMemoryError occasionally? If it is always the former, then another suspect would be some off-heap memory use done by the JDBC driver in use, although I've not come across such a problem.</description>
    <pubDate>Wed, 10 Apr 2019 07:31:51 GMT</pubDate>
    <dc:creator>Harsh J</dc:creator>
    <dc:date>2019-04-10T07:31:51Z</dc:date>
    <item>
      <title>Sqoop virtual memory error</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Sqoop-virtual-memory-error/m-p/88919#M45539</link>
      <description>&lt;P&gt;Hi.&amp;nbsp; I am having a "What the heck" moment.&amp;nbsp; Could someone please explain the theory behind this?&amp;nbsp; I have always presumed that Sqoop - unlike other MR processes that might require the entire dataset to be in memory to work - should never have an OOM issue.&amp;nbsp; After all, it is using its memory as a buffer, copying the data from the DB to the staging area in HDFS, and when complete, moving it from staging to --target-dir.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;So, we were moving a fairly large DB (500GB) but our client would only allow us to use 1 mapper (don't ask why...gulp).&amp;nbsp; About 90 minutes into the process, it terminated with:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;PRE&gt;Container is running beyond the 'PHYSICAL' memory limit. Current usage: 1.0Gib of 1 GB physical memory used; 2.7GB of 2.1 GB virtual memory used. Killing container&lt;/PRE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;This is really confusing me.&amp;nbsp; I suppose I can solve the problem by&lt;/P&gt;
&lt;P&gt;a) increasing the vmem/pmem ratio (yarn.nodemanager.vmem-pmem-ratio = xyz)&lt;/P&gt;
&lt;P&gt;OR&lt;/P&gt;
&lt;P&gt;b) not checking for this error (yarn.nodemanager.vmem-check-enabled = false).&lt;/P&gt;
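&lt;P&gt;For reference, a minimal sketch of how those two options might look in yarn-site.xml on the NodeManagers. The value 4 is only an illustrative placeholder for "xyz", not a recommendation, and since both are NodeManager settings they presumably need a NodeManager restart to take effect. The stock default ratio is 2.1, which would account for the 2.1 GB virtual limit on a 1 GB container in the message above.&lt;/P&gt;
&lt;PRE&gt;&amp;lt;!-- yarn-site.xml (sketch): pick option (a) or option (b) --&amp;gt;

&amp;lt;!-- Option (a): raise the virtual-to-physical ratio. 4 is an arbitrary example value. --&amp;gt;
&amp;lt;property&amp;gt;
  &amp;lt;name&amp;gt;yarn.nodemanager.vmem-pmem-ratio&amp;lt;/name&amp;gt;
  &amp;lt;value&amp;gt;4&amp;lt;/value&amp;gt;
&amp;lt;/property&amp;gt;

&amp;lt;!-- Option (b): turn off the virtual memory check entirely. --&amp;gt;
&amp;lt;property&amp;gt;
  &amp;lt;name&amp;gt;yarn.nodemanager.vmem-check-enabled&amp;lt;/name&amp;gt;
  &amp;lt;value&amp;gt;false&amp;lt;/value&amp;gt;
&amp;lt;/property&amp;gt;&lt;/PRE&gt;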
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;But WHY is this error coming up?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks in advance and cheers.&lt;/P&gt;</description>
      <pubDate>Fri, 16 Sep 2022 14:18:04 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Sqoop-virtual-memory-error/m-p/88919#M45539</guid>
      <dc:creator>HenryPark</dc:creator>
      <dc:date>2022-09-16T14:18:04Z</dc:date>
    </item>
    <item>
      <title>Re: Sqoop virtual memory error</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Sqoop-virtual-memory-error/m-p/88920#M45540</link>
      <description>One possibility could be the fetch size (combined with some unexpectedly wide rows). Does lowering the result fetch size help?&lt;BR /&gt;&lt;BR /&gt;From &lt;A href="http://sqoop.apache.org/docs/1.4.7/SqoopUserGuide.html#idp774390917888" target="_blank"&gt;http://sqoop.apache.org/docs/1.4.7/SqoopUserGuide.html#idp774390917888&lt;/A&gt;:&lt;BR /&gt;--fetch-size Number of entries to read from database at once.&lt;BR /&gt;&lt;BR /&gt;Also, do you always see it fail with the YARN memory kill (due to pmem exhaustion) or do you also observe an actual java.lang.OutOfMemoryError occasionally? If it is always the former, then another suspect would be some off-heap memory use done by the JDBC driver in use, although I've not come across such a problem.</description>
      <pubDate>Wed, 10 Apr 2019 07:31:51 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Sqoop-virtual-memory-error/m-p/88920#M45540</guid>
      <dc:creator>Harsh J</dc:creator>
      <dc:date>2019-04-10T07:31:51Z</dc:date>
    </item>
  </channel>
</rss>

