<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question worker uses more ram than it should in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/worker-uses-more-ram-than-it-should/m-p/223472#M79645</link>
    <description>&lt;P&gt;
	&lt;BR /&gt;
	We have the following server to act as workers&lt;/P&gt;
&lt;DIV&gt;
	2*6 cores (24 threads)
64 gb ram
based on ambari 2.6.1.5 
&lt;/DIV&gt;
&lt;DIV&gt;
	our process uses approx 1gb, for example  when i submit 100 workers with the settings: 
&lt;/DIV&gt;
&lt;PRE&gt;spark-submit ..... --executor-memory 2gb
&lt;/PRE&gt;
&lt;DIV&gt;&lt;P&gt;the total ram used us 302 (100*3), because the ram usage is 3 gb, i cant fully use all the computation power, 3*24 &amp;gt;60 (i set the limit to 60) what did i miss?&lt;/P&gt;&lt;P&gt;both answers helped, each improved the ram usage&lt;/P&gt;&lt;/DIV&gt;</description>
    <pubDate>Mon, 18 Jun 2018 19:50:42 GMT</pubDate>
    <dc:creator>ilia987</dc:creator>
    <dc:date>2018-06-18T19:50:42Z</dc:date>
    <item>
      <title>worker uses more ram than it should</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/worker-uses-more-ram-than-it-should/m-p/223472#M79645</link>
      <description>&lt;P&gt;
	&lt;BR /&gt;
	We have the following server to act as workers&lt;/P&gt;
&lt;DIV&gt;
	2*6 cores (24 threads)
64 gb ram
based on ambari 2.6.1.5 
&lt;/DIV&gt;
&lt;DIV&gt;
	our process uses approx 1gb, for example  when i submit 100 workers with the settings: 
&lt;/DIV&gt;
&lt;PRE&gt;spark-submit ..... --executor-memory 2gb
&lt;/PRE&gt;
&lt;DIV&gt;&lt;P&gt;the total ram used us 302 (100*3), because the ram usage is 3 gb, i cant fully use all the computation power, 3*24 &amp;gt;60 (i set the limit to 60) what did i miss?&lt;/P&gt;&lt;P&gt;both answers helped, each improved the ram usage&lt;/P&gt;&lt;/DIV&gt;</description>
      <pubDate>Mon, 18 Jun 2018 19:50:42 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/worker-uses-more-ram-than-it-should/m-p/223472#M79645</guid>
      <dc:creator>ilia987</dc:creator>
      <dc:date>2018-06-18T19:50:42Z</dc:date>
    </item>
    <item>
      <title>Re: worker uses more ram than it should</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/worker-uses-more-ram-than-it-should/m-p/223473#M79646</link>
      <description>&lt;P&gt;&lt;A href="https://community.hortonworks.com/questions/198074/worker-uses-more-ram-than-it-should.html#"&gt;@ilia kheifets&lt;/A&gt;&lt;/P&gt;&lt;P&gt;The difference may come from yarn.scheduler.minimum-allocation-mb, spark memory overhead and jvm. For more information you may want to read the following article: &lt;A href="https://blog.csdn.net/oufuji/article/details/50387104" target="_blank"&gt;https://blog.csdn.net/oufuji/article/details/50387104&lt;/A&gt;&lt;/P&gt;&lt;P&gt;HTH&lt;/P&gt;&lt;P&gt;*** If you found this answer addressed your question, please take a moment to login and click the "accept" link on the answer.&lt;/P&gt;</description>
      <pubDate>Mon, 18 Jun 2018 20:11:12 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/worker-uses-more-ram-than-it-should/m-p/223473#M79646</guid>
      <dc:creator>falbani</dc:creator>
      <dc:date>2018-06-18T20:11:12Z</dc:date>
    </item>
    <item>
      <title>Re: worker uses more ram than it should</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/worker-uses-more-ram-than-it-should/m-p/223474#M79647</link>
      <description>&lt;P&gt;You may not be accounting for the driver RAM. Spark creates a driver process to act as a "parent" from which the executor processes spawn as separate YARN jobs. You are specifying the executor memory as 2GB but you did not specify the driver's memory limit.&lt;/P&gt;&lt;P&gt;By default, the driver is allocated 1GB of RAM thus explaining your calculations.&lt;/P&gt;&lt;P&gt;&lt;A href="https://spark.apache.org/docs/latest/configuration.html" target="_blank"&gt;https://spark.apache.org/docs/latest/configuration.html&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 18 Jun 2018 20:42:13 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/worker-uses-more-ram-than-it-should/m-p/223474#M79647</guid>
      <dc:creator>anarasimham</dc:creator>
      <dc:date>2018-06-18T20:42:13Z</dc:date>
    </item>
    <item>
      <title>Re: worker uses more ram than it should</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/worker-uses-more-ram-than-it-should/m-p/223475#M79648</link>
      <description>&lt;P&gt;setting yarn.scheduler.minimum-allocation-mb to smaller size improved the allocated memory by 30%&lt;/P&gt;</description>
      <pubDate>Mon, 18 Jun 2018 20:46:52 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/worker-uses-more-ram-than-it-should/m-p/223475#M79648</guid>
      <dc:creator>ilia987</dc:creator>
      <dc:date>2018-06-18T20:46:52Z</dc:date>
    </item>
    <item>
      <title>Re: worker uses more ram than it should</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/worker-uses-more-ram-than-it-should/m-p/223476#M79649</link>
      <description>&lt;P&gt;I have set it to 512M , it works.&lt;/P&gt;&lt;P&gt;when tried to go lower for example 128 i have got an error:&lt;/P&gt;&lt;PRE&gt;java.lang.IllegalArgumentException: System memory 119537664 must be at least 471859200. Please increase heap size using the --driver-memory option or spark.driver.memory in Spark configuration.&lt;/PRE&gt;</description>
      <pubDate>Mon, 18 Jun 2018 20:58:46 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/worker-uses-more-ram-than-it-should/m-p/223476#M79649</guid>
      <dc:creator>ilia987</dc:creator>
      <dc:date>2018-06-18T20:58:46Z</dc:date>
    </item>
    <item>
      <title>Re: worker uses more ram than it should</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/worker-uses-more-ram-than-it-should/m-p/223477#M79650</link>
      <description>&lt;P&gt;It looks like the application you've written uses almost 500 MB of driver memory. It sounds like your goal is to utilize all the CPU that your nodes carry - you'll have to either change the way your application works (to reduce the driver RAM) or reduce the executor memory to use all of the threads that your cluster offers.&lt;/P&gt;</description>
      <pubDate>Mon, 18 Jun 2018 21:13:09 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/worker-uses-more-ram-than-it-should/m-p/223477#M79650</guid>
      <dc:creator>anarasimham</dc:creator>
      <dc:date>2018-06-18T21:13:09Z</dc:date>
    </item>
    <item>
      <title>Re: worker uses more ram than it should</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/worker-uses-more-ram-than-it-should/m-p/223478#M79651</link>
      <description>&lt;P&gt;it uses 300-1200mb, but you are right it cpu heavy. and i am trying to maximize the processing power&lt;/P&gt;</description>
      <pubDate>Mon, 18 Jun 2018 21:25:45 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/worker-uses-more-ram-than-it-should/m-p/223478#M79651</guid>
      <dc:creator>ilia987</dc:creator>
      <dc:date>2018-06-18T21:25:45Z</dc:date>
    </item>
  </channel>
</rss>

