<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Yarn memory allocation &amp; utilization in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Yarn-memory-allocation-utilization/m-p/285220#M211690</link>
    <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;I hope this document clarifies your doubts. It is a tuning guide for Spark jobs.&lt;/P&gt;&lt;P&gt;&lt;A href="https://blog.cloudera.com/how-to-tune-your-apache-spark-jobs-part-2/" target="_blank"&gt;https://blog.cloudera.com/how-to-tune-your-apache-spark-jobs-part-2/&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;&lt;P&gt;AK&lt;/P&gt;</description>
    <pubDate>Tue, 10 Dec 2019 07:07:36 GMT</pubDate>
    <dc:creator>AKR</dc:creator>
    <dc:date>2019-12-10T07:07:36Z</dc:date>
    <item>
      <title>Yarn memory allocation &amp; utilization</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Yarn-memory-allocation-utilization/m-p/216290#M178199</link>
      <description>&lt;P&gt;I am running a 2-node cluster in which the master and the worker have the configuration below.&lt;/P&gt;&lt;P&gt;Master: 8 cores, 16 GB RAM&lt;/P&gt;&lt;P&gt;Worker: 16 cores, 64 GB RAM&lt;/P&gt;&lt;P&gt;YARN configuration:&lt;/P&gt;&lt;PRE&gt;yarn.scheduler.minimum-allocation-mb: 1024
yarn.scheduler.maximum-allocation-mb: 22145
yarn.nodemanager.resource.cpu-vcores: 6
yarn.nodemanager.resource.memory-mb: 25145
&lt;/PRE&gt;&lt;P&gt;Capacity Scheduler:&lt;/P&gt;&lt;PRE&gt;yarn.scheduler.capacity.default.minimum-user-limit-percent=100
yarn.scheduler.capacity.maximum-am-resource-percent=0.5
yarn.scheduler.capacity.maximum-applications=100
yarn.scheduler.capacity.node-locality-delay=40
yarn.scheduler.capacity.root.accessible-node-labels=*
yarn.scheduler.capacity.root.acl_administer_queue=*
yarn.scheduler.capacity.root.capacity=100
yarn.scheduler.capacity.root.default.acl_administer_jobs=*
yarn.scheduler.capacity.root.default.acl_submit_applications=*
yarn.scheduler.capacity.root.default.capacity=100
yarn.scheduler.capacity.root.default.maximum-capacity=100
yarn.scheduler.capacity.root.default.state=RUNNING
yarn.scheduler.capacity.root.default.user-limit-factor=1
yarn.scheduler.capacity.root.queues=default&lt;/PRE&gt;&lt;P&gt;We have 23 Spark jobs (scheduled in Oozie) running on YARN every hour. Some jobs are taking longer to complete. I am not sure whether the YARN memory and vcores allocation is configured properly.&lt;/P&gt;&lt;P&gt;Please suggest the recommended YARN memory, vcores, and scheduler configuration based on the number of cores and the RAM available.&lt;/P&gt;&lt;P&gt;
Thanks, &lt;/P&gt;&lt;P&gt;Sampath&lt;/P&gt;</description>
      <pubDate>Sat, 10 Feb 2018 18:56:38 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Yarn-memory-allocation-utilization/m-p/216290#M178199</guid>
      <dc:creator>sampathkumar_ma</dc:creator>
      <dc:date>2018-02-10T18:56:38Z</dc:date>
    </item>
    <item>
      <title>Re: Yarn memory allocation &amp; utilization</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Yarn-memory-allocation-utilization/m-p/216291#M178200</link>
      <description>&lt;P&gt;It looks like you are letting YARN use only about 25 GB of your worker node's 64 GB of RAM and only 6 of its 16 CPU cores, so these values should be raised. See &lt;A href="https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.6.4/bk_command-line-installation/content/determine-hdp-memory-config.html" target="_blank"&gt;https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.6.4/bk_command-line-installation/content/determine-hdp-memory-config.html&lt;/A&gt; for a script that can help you set baseline values for these properties.&lt;/P&gt;&lt;P&gt;As for the Spark jobs: each of them requests a certain size and number of containers, and I'm betting each job is a bit different. Since a Spark job gets its resources up front, it would seem normal for a specific job to take a comparable time to run from invocation to invocation, as long as neither the resource request nor the fundamental input dataset size changes. That isn't necessarily the case across different Spark jobs, which may be doing entirely different things.&lt;/P&gt;&lt;P&gt;Good luck and happy Hadooping/Sparking!&lt;/P&gt;</description>
      <pubDate>Wed, 14 Feb 2018 05:14:37 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Yarn-memory-allocation-utilization/m-p/216291#M178200</guid>
      <dc:creator>LesterMartin</dc:creator>
      <dc:date>2018-02-14T05:14:37Z</dc:date>
    </item>
    <item>
      <title>Re: Yarn memory allocation &amp; utilization</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Yarn-memory-allocation-utilization/m-p/216292#M178201</link>
      <description>&lt;P&gt;Thanks for your inputs.&lt;/P&gt;</description>
      <pubDate>Thu, 15 Mar 2018 16:10:22 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Yarn-memory-allocation-utilization/m-p/216292#M178201</guid>
      <dc:creator>sampathkumar_ma</dc:creator>
      <dc:date>2018-03-15T16:10:22Z</dc:date>
    </item>
    <item>
      <title>Re: Yarn memory allocation &amp; utilization</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Yarn-memory-allocation-utilization/m-p/285220#M211690</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;I hope this document clarifies your doubts. It is a tuning guide for Spark jobs.&lt;/P&gt;&lt;P&gt;&lt;A href="https://blog.cloudera.com/how-to-tune-your-apache-spark-jobs-part-2/" target="_blank"&gt;https://blog.cloudera.com/how-to-tune-your-apache-spark-jobs-part-2/&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;&lt;P&gt;AK&lt;/P&gt;</description>
      <pubDate>Tue, 10 Dec 2019 07:07:36 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Yarn-memory-allocation-utilization/m-p/285220#M211690</guid>
      <dc:creator>AKR</dc:creator>
      <dc:date>2019-12-10T07:07:36Z</dc:date>
    </item>
  </channel>
</rss>

