<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Yarn behaviour with external tools in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Yarn-behaviour-with-external-tools/m-p/205127#M68786</link>
    <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;There are several tools in HDP that doesn't use Yarn (Storm, HBase, etc). If I have OS, HBase, Storm and other tools taking resources on my cluster, how does Yarn know how much resources he owns for its application?&lt;/P&gt;&lt;P&gt;What are the best practices for multi-tenancy and isolation in this case?&lt;/P&gt;&lt;P&gt;How can I isolate IO with Yarn? is this something coming in future versions?&lt;/P&gt;&lt;P&gt;Thanks&lt;BR /&gt;&lt;/P&gt;</description>
    <pubDate>Sun, 01 Oct 2017 22:47:09 GMT</pubDate>
    <dc:creator>joe_harvyy</dc:creator>
    <dc:date>2017-10-01T22:47:09Z</dc:date>
    <item>
      <title>Yarn behaviour with external tools</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Yarn-behaviour-with-external-tools/m-p/205127#M68786</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;There are several tools in HDP that doesn't use Yarn (Storm, HBase, etc). If I have OS, HBase, Storm and other tools taking resources on my cluster, how does Yarn know how much resources he owns for its application?&lt;/P&gt;&lt;P&gt;What are the best practices for multi-tenancy and isolation in this case?&lt;/P&gt;&lt;P&gt;How can I isolate IO with Yarn? is this something coming in future versions?&lt;/P&gt;&lt;P&gt;Thanks&lt;BR /&gt;&lt;/P&gt;</description>
      <pubDate>Sun, 01 Oct 2017 22:47:09 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Yarn-behaviour-with-external-tools/m-p/205127#M68786</guid>
      <dc:creator>joe_harvyy</dc:creator>
      <dc:date>2017-10-01T22:47:09Z</dc:date>
    </item>
    <item>
      <title>Re: Yarn behaviour with external tools</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Yarn-behaviour-with-external-tools/m-p/205128#M68787</link>
      <description>&lt;P&gt;Hi &lt;A rel="user" href="https://community.cloudera.com/users/15184/joeharvyy.html" nodeid="15184"&gt;@Joe Harvy&lt;/A&gt;,&lt;/P&gt;&lt;P&gt;Yarn/Other tenent Application not aware of any of the other tenents resource usage, this will be much bigger problem when there is swap defined, as the OS Terminates(technically "sacrifice" ) one of the process based out of  age and amount of resources free up for the sacrifice.&lt;/P&gt;&lt;P&gt;So it become much critical to organize the applications in a multi tenant Environment.&lt;/P&gt;&lt;P&gt;there are multiple things needs to be considered while managing these kind of environments, such as memory CPU and Disk bottlenecks.&lt;/P&gt;&lt;P&gt;Memory Usage :&lt;/P&gt;&lt;P style="margin-left: 20px;"&gt;Interns of the Memory usage, we need to subtract the each component's maximum Heap allocation (-xmx ) and add additional resources such as 2G- for OS, 2GB -For DataNode, 2GB - Ambari Metrics etc then &lt;/P&gt;&lt;P style="margin-left: 20px;"&gt;for HBASE additional BucketCache(off heap) + Region Server Heap Size, and similar for Accumulo and Storm etc ..&lt;/P&gt;&lt;P style="margin-left: 20px;"&gt;After all subtracted from total memory, remaining can be allocated for Yarn, example of this has been well documented at HBASE cache configuration &lt;A href="https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.5.6/bk_data-access/content/ch_managing-hbase.html"&gt;Here&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;A href="https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.5.6/bk_data-access/content/ch_managing-hbase.html"&gt;&lt;/A&gt;&lt;/P&gt;&lt;P&gt;CPU usage : &lt;/P&gt;&lt;P style="margin-left: 20px;"&gt;This is Bit tricky as, Configuration of this value upfront may not be straight forward. need to compute the SAR / Ambari Metrics information, with respect to CPU usage and allocate the remaining CPU for the Yarn.&lt;/P&gt;&lt;P style="margin-left: 20px;"&gt;At the same time verify the load average on the host, should not be exceed too high, in cases that should be controlled with amount of parallel work happening form apps/YARN according to the priority. - this is where yarn scheduler comes handy.&lt;/P&gt;&lt;P&gt;Disk Usage :&lt;/P&gt;&lt;P style="margin-left: 20px;"&gt;Have a keen eye on CPU wait IO, any of the increase in that value cased by the low disk latency, better option is not share the disk for multiple purposes ( ex : for data nodes other application activities ), will result in queuing up the resources.&lt;/P&gt;&lt;P&gt;Hope this helps!!&lt;/P&gt;</description>
      <pubDate>Mon, 02 Oct 2017 15:21:53 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Yarn-behaviour-with-external-tools/m-p/205128#M68787</guid>
      <dc:creator>bkosaraju</dc:creator>
      <dc:date>2017-10-02T15:21:53Z</dc:date>
    </item>
  </channel>
</rss>

