<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question How does virtualization affect &amp;quot;python yarn-utils.py&amp;quot; output and settings? in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-does-virtualization-affect-quot-python-yarn-utils-py/m-p/152761#M20495</link>
    <description>&lt;P&gt;We are running an 8 node virtualized cluster with 5 datanodes. Each datanode is allocated 8 vcores, 54 GB of RAM, and use shared SAN storage. The output of yarn-utils (v=8, m=54, d=4) is:&lt;/P&gt;&lt;P&gt;yarn.scheduler.minimum-allocation-mb=6656 &lt;/P&gt;&lt;P&gt;yarn.scheduler.maximum-allocation-mb=53248 &lt;/P&gt;&lt;P&gt;yarn.nodemanager.resource.memory-mb=53248 &lt;/P&gt;&lt;P&gt; mapreduce.map.memory.mb=6656 &lt;/P&gt;&lt;P&gt; mapreduce.map.java.opts=-Xmx5324m &lt;/P&gt;&lt;P&gt; mapreduce.reduce.memory.mb=6656&lt;/P&gt;&lt;P&gt;
 mapreduce.reduce.java.opts=-Xmx5324m &lt;/P&gt;&lt;P&gt; yarn.app.mapreduce.am.resource.mb=6656 &lt;/P&gt;&lt;P&gt; yarn.app.mapreduce.am.command-opts=-Xmx5324m &lt;/P&gt;&lt;P&gt; mapreduce.task.io.sort.mb=2662&lt;/P&gt;&lt;P&gt;Some questions I have is 1) what do you put for disks value when data node disks are running on shared SAN storage? and; 2) The maximum container size only shows 8 GB even though each node is assigned 54 GB. Does this have something to do with over commitment in the virtual environment? yarn-utils wants it set to 53 GB.&lt;/P&gt;</description>
    <pubDate>Tue, 23 Feb 2016 04:28:33 GMT</pubDate>
    <dc:creator>SQLShaw</dc:creator>
    <dc:date>2016-02-23T04:28:33Z</dc:date>
    <item>
      <title>How does virtualization affect "python yarn-utils.py" output and settings?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-does-virtualization-affect-quot-python-yarn-utils-py/m-p/152761#M20495</link>
      <description>&lt;P&gt;We are running an 8 node virtualized cluster with 5 datanodes. Each datanode is allocated 8 vcores, 54 GB of RAM, and use shared SAN storage. The output of yarn-utils (v=8, m=54, d=4) is:&lt;/P&gt;&lt;P&gt;yarn.scheduler.minimum-allocation-mb=6656 &lt;/P&gt;&lt;P&gt;yarn.scheduler.maximum-allocation-mb=53248 &lt;/P&gt;&lt;P&gt;yarn.nodemanager.resource.memory-mb=53248 &lt;/P&gt;&lt;P&gt; mapreduce.map.memory.mb=6656 &lt;/P&gt;&lt;P&gt; mapreduce.map.java.opts=-Xmx5324m &lt;/P&gt;&lt;P&gt; mapreduce.reduce.memory.mb=6656&lt;/P&gt;&lt;P&gt;
 mapreduce.reduce.java.opts=-Xmx5324m &lt;/P&gt;&lt;P&gt; yarn.app.mapreduce.am.resource.mb=6656 &lt;/P&gt;&lt;P&gt; yarn.app.mapreduce.am.command-opts=-Xmx5324m &lt;/P&gt;&lt;P&gt; mapreduce.task.io.sort.mb=2662&lt;/P&gt;&lt;P&gt;Some questions I have is 1) what do you put for disks value when data node disks are running on shared SAN storage? and; 2) The maximum container size only shows 8 GB even though each node is assigned 54 GB. Does this have something to do with over commitment in the virtual environment? yarn-utils wants it set to 53 GB.&lt;/P&gt;</description>
      <pubDate>Tue, 23 Feb 2016 04:28:33 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-does-virtualization-affect-quot-python-yarn-utils-py/m-p/152761#M20495</guid>
      <dc:creator>SQLShaw</dc:creator>
      <dc:date>2016-02-23T04:28:33Z</dc:date>
    </item>
    <item>
      <title>Re: How does virtualization affect "python yarn-utils.py" output and settings?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-does-virtualization-affect-quot-python-yarn-utils-py/m-p/152762#M20496</link>
      <description>&lt;P&gt;It's making more sense. My yarn.nodemanager.resource.memory-mb was only set to 16 GB so this restricted my min and max settings. Still not clear what to set disks to in a virtual environment in order to get a good baseline setting.&lt;/P&gt;</description>
      <pubDate>Tue, 23 Feb 2016 04:54:27 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-does-virtualization-affect-quot-python-yarn-utils-py/m-p/152762#M20496</guid>
      <dc:creator>SQLShaw</dc:creator>
      <dc:date>2016-02-23T04:54:27Z</dc:date>
    </item>
    <item>
      <title>Re: How does virtualization affect "python yarn-utils.py" output and settings?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-does-virtualization-affect-quot-python-yarn-utils-py/m-p/152763#M20497</link>
      <description>&lt;P&gt;Scott, there's two layers of memory settings that you need to be aware of - NodeManager and Containers. NodeManager has all the available memory it can provide to containers. You want to have more containers with decent memory. Rule of thumb is to use 2048MB of memory per container. So if you have 53GB of available memory per node, then you have about 26 containers available per node to do the job. 8GB of memory per container IMO is too big. &lt;/P&gt;&lt;P&gt;We don't know how many disks are there to be used by Hadoop from the SAN storage. You can disregard the disks in the equation as the formula is typically done for on-premise clusters. But you can run a manual calculation of the memory settings since you have the minimum container per node and memory per container values (26, 2048MB respectively). You can use the formula below. Just replace the # of containers per node and RAM per container with your values. Please note that 53GB of available ram per vm is too big knowing it only has 54GB RAM. Typically, you would want to set aside about 8GB for other processes - OS, HBase, etc. which means available memory per node is just 46GB.&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="2327-scott-vm-png.png" style="width: 843px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/21081i5F4E4789D14609A3/image-size/medium?v=v2&amp;amp;px=400" role="button" title="2327-scott-vm-png.png" alt="2327-scott-vm-png.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;Hope this helps.
&lt;/P&gt;</description>
      <pubDate>Sun, 18 Aug 2019 12:43:27 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-does-virtualization-affect-quot-python-yarn-utils-py/m-p/152763#M20497</guid>
      <dc:creator>rgarcia</dc:creator>
      <dc:date>2019-08-18T12:43:27Z</dc:date>
    </item>
  </channel>
</rss>

