<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Speeding up deployment of app binaries in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Speeding-up-deployment-of-app-binaries/m-p/30605#M6919</link>
    <description>&lt;P&gt;Are there any recommendations to speed up deployment of app binaries to YARN?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I've been using RM REST APIs to submit apps to it with binaries located on HDFS. This tends to take a lot of time when the size of binaries to be deployed as YARN app are big in size (say, &amp;gt;500MB&amp;nbsp;or more),&amp;nbsp;and also&amp;nbsp;when number of&amp;nbsp;containers that I need are high.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I could probably speed this up by&amp;nbsp;:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;1. Turning off default 3 copies needed on HDFS&lt;/P&gt;&lt;P&gt;2. Using&amp;nbsp;HDFS cluster-wide cache which can help avoid block reads&lt;/P&gt;&lt;P&gt;3. Using YARN resource localization&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Do&amp;nbsp;you have any recommendations which&amp;nbsp;are definitely known to speed this up?&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;Sumit&lt;/P&gt;</description>
    <pubDate>Fri, 16 Sep 2022 09:37:20 GMT</pubDate>
    <dc:creator>sumit.nigam</dc:creator>
    <dc:date>2022-09-16T09:37:20Z</dc:date>
    <item>
      <title>Speeding up deployment of app binaries</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Speeding-up-deployment-of-app-binaries/m-p/30605#M6919</link>
      <description>&lt;P&gt;Are there any recommendations to speed up deployment of app binaries to YARN?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I've been using RM REST APIs to submit apps to it with binaries located on HDFS. This tends to take a lot of time when the size of binaries to be deployed as YARN app are big in size (say, &amp;gt;500MB&amp;nbsp;or more),&amp;nbsp;and also&amp;nbsp;when number of&amp;nbsp;containers that I need are high.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I could probably speed this up by&amp;nbsp;:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;1. Turning off default 3 copies needed on HDFS&lt;/P&gt;&lt;P&gt;2. Using&amp;nbsp;HDFS cluster-wide cache which can help avoid block reads&lt;/P&gt;&lt;P&gt;3. Using YARN resource localization&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Do&amp;nbsp;you have any recommendations which&amp;nbsp;are definitely known to speed this up?&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;Sumit&lt;/P&gt;</description>
      <pubDate>Fri, 16 Sep 2022 09:37:20 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Speeding-up-deployment-of-app-binaries/m-p/30605#M6919</guid>
      <dc:creator>sumit.nigam</dc:creator>
      <dc:date>2022-09-16T09:37:20Z</dc:date>
    </item>
    <item>
      <title>Re: Speeding up deployment of app binaries</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Speeding-up-deployment-of-app-binaries/m-p/30639#M6920</link>
      <description>&lt;P&gt;I also think we can probably compress the binaries before being copied to&amp;nbsp;HDFS and&amp;nbsp;have YARN uncompress them somehow?&lt;/P&gt;</description>
      <pubDate>Mon, 10 Aug 2015 16:35:32 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Speeding-up-deployment-of-app-binaries/m-p/30639#M6920</guid>
      <dc:creator>sumit.nigam</dc:creator>
      <dc:date>2015-08-10T16:35:32Z</dc:date>
    </item>
    <item>
      <title>Re: Speeding up deployment of app binaries</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Speeding-up-deployment-of-app-binaries/m-p/30661#M6921</link>
      <description>Do the binaries change job-to-job, or are they static? If they do not change, you could benefit from the new feature of proper-shared resource caches: &lt;A href="https://issues.apache.org/jira/browse/YARN-1492" target="_blank"&gt;https://issues.apache.org/jira/browse/YARN-1492&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;Otherwise, compression is your answer. The APIs allow you to add "archives" (zips) which will automatically be unwrapped at the attempt's root, when localising.&lt;BR /&gt;&lt;BR /&gt;Have you also profiled which part is worrying to you more? Is it the initial upload-to-HDFS period, or the localising (per NM) period instead?</description>
      <pubDate>Tue, 11 Aug 2015 02:15:35 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Speeding-up-deployment-of-app-binaries/m-p/30661#M6921</guid>
      <dc:creator>Harsh J</dc:creator>
      <dc:date>2015-08-11T02:15:35Z</dc:date>
    </item>
  </channel>
</rss>

