<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Yarn applications hang forever if run in parallel in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Yarn-applications-hang-foreever-if-run-in-parallel/m-p/17146#M43971</link>
    <description>&lt;P&gt;We are still experiencing periodic problems with applications hanging when a number of jobs are submitted in parallel. We have reduced 'maxRunningApps', increased the virtual core count, and also increased 'oozie.service.callablequeueservice.threads' to 40. In many cases the applications do not hang, but the behavior is not consistent.&lt;/P&gt;&lt;P&gt;Regarding YARN-1913 (&lt;A href="https://issues.apache.org/jira/browse/YARN-1913" target="_blank"&gt;https://issues.apache.org/jira/browse/YARN-1913&lt;/A&gt;): is this patch incorporated in CDH 5.1.0, the version we are using? YARN-1913 lists 2.3.0 as the affected version and 2.5.0 as the fix version. The Hadoop version in CDH 5.1.0 is 2.3.0.&lt;/P&gt;&lt;P&gt;Thank you,&lt;/P&gt;&lt;P&gt;Michael Reynolds&lt;/P&gt;</description>
    <pubDate>Mon, 18 Aug 2014 22:29:38 GMT</pubDate>
    <dc:creator>Urantian</dc:creator>
    <dc:date>2014-08-18T22:29:38Z</dc:date>
    <item>
      <title>Yarn applications hang forever if run in parallel</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Yarn-applications-hang-foreever-if-run-in-parallel/m-p/15184#M43966</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;We have an 8-node cluster on CDH 5 (5.0.2) running YARN (MRv2), and a serious problem that is probably caused by our configuration.&lt;/P&gt;&lt;P&gt;In addition to Hadoop we also run Impala, so we cannot give all resources to YARN. Each node has 128 GB of RAM and 12 cores.&lt;/P&gt;&lt;P&gt;The current memory configuration for YARN is as follows:&lt;/P&gt;&lt;P&gt;mapreduce.map.memory.mb = 8 GiB&lt;/P&gt;&lt;P&gt;mapreduce.reduce.memory.mb = 8 GiB&lt;/P&gt;&lt;P&gt;yarn.app.mapreduce.am.resource.mb = 8 GiB&lt;/P&gt;&lt;P&gt;mapreduce.map.java.opts.max.heap = 6960 MiB&lt;/P&gt;&lt;P&gt;mapreduce.reduce.java.opts.max.heap = 6960 MiB&lt;/P&gt;&lt;P&gt;"Java Heap Size in bytes of NodeManager" = 8 GiB&lt;/P&gt;&lt;P&gt;yarn.nodemanager.resource.memory-mb = 80 GiB&lt;/P&gt;&lt;P&gt;When we run multiple applications in parallel, they all stop making progress and none of them finishes; they appear to hang forever. I see no exceptions or errors in "/var/log/hadoop-yarn" (log level DEBUG).&lt;/P&gt;&lt;P&gt;I would be glad if someone could help. 🙂&lt;/P&gt;&lt;P&gt;BG&lt;/P&gt;</description>
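      <content:encoded><![CDATA[
The memory settings listed in this post could be written as plain Hadoop properties roughly as follows. This is only a sketch: the values are the GiB figures converted to MB, and mapping the Cloudera Manager "max.heap" fields to -Xmx flags in mapreduce.*.java.opts is an assumption, not something stated in the post.

```xml
<configuration>
  <!-- mapred-site.xml: per-container sizes for MapReduce tasks and the ApplicationMaster -->
  <property>
    <name>mapreduce.map.memory.mb</name>
    <value>8192</value>
  </property>
  <property>
    <name>mapreduce.reduce.memory.mb</name>
    <value>8192</value>
  </property>
  <property>
    <name>yarn.app.mapreduce.am.resource.mb</name>
    <value>8192</value>
  </property>
  <!-- Assumed equivalent of the "max.heap" fields: JVM heap inside each container -->
  <property>
    <name>mapreduce.map.java.opts</name>
    <value>-Xmx6960m</value>
  </property>
  <property>
    <name>mapreduce.reduce.java.opts</name>
    <value>-Xmx6960m</value>
  </property>
  <!-- yarn-site.xml: memory each NodeManager offers to containers (80 GiB) -->
  <property>
    <name>yarn.nodemanager.resource.memory-mb</name>
    <value>81920</value>
  </property>
</configuration>
```

With these numbers, memory alone allows at most 10 containers of 8 GiB per node (80 GiB / 8 GiB), and every running MapReduce application holds one of them for its ApplicationMaster.
]]></content:encoded>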
      <pubDate>Fri, 16 Sep 2022 09:02:29 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Yarn-applications-hang-foreever-if-run-in-parallel/m-p/15184#M43966</guid>
      <dc:creator>scubMUC</dc:creator>
      <dc:date>2022-09-16T09:02:29Z</dc:date>
    </item>
    <item>
      <title>Re: Yarn applications hang forever if run in parallel</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Yarn-applications-hang-foreever-if-run-in-parallel/m-p/15474#M43967</link>
      <description>Can you post more details on what you mean by 'multiple applications' (and how many, exactly), as well as your scheduler configuration?&lt;BR /&gt;&lt;BR /&gt;What behaviour do you notice exactly when you say they all 'stop'? Do you mean that their AppMasters run but the actual application containers (i.e. map or reduce tasks) do not, or do you mean that they all just fail?</description>
      <pubDate>Sun, 20 Jul 2014 06:19:05 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Yarn-applications-hang-foreever-if-run-in-parallel/m-p/15474#M43967</guid>
      <dc:creator>Harsh J</dc:creator>
      <dc:date>2014-07-20T06:19:05Z</dc:date>
    </item>
    <item>
      <title>Re: Yarn applications hang forever if run in parallel</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Yarn-applications-hang-foreever-if-run-in-parallel/m-p/16804#M43968</link>
      <description>&lt;P&gt;I am experiencing the same problem described earlier. We have a 4-node cluster using YARN on CDH 5.1.0. I have an Oozie workflow that uses Sqoop to import from MySQL, which is sharded across 10 tables. I therefore have a coordinator that executes the same workflow in 10 simultaneous (parallel) sessions, one per sharded table.&lt;/P&gt;&lt;P&gt;However, some time after the workflows reach the Sqoop action step, they stop running. The jobs do not fail; rather, they stop processing, even though their status shows "Running" in the Hue workflow dashboard. None of the jobs has had a status update in the SysLog for more than 12 hours.&lt;/P&gt;&lt;P&gt;Further, if other, unrelated jobs are submitted, they also appear to hang. A job that had been running successfully for several days, executing a DistCp command to import S3 data, also hung after the 10 parallel workflows were submitted.&lt;/P&gt;&lt;P&gt;Is there a configuration that must be set to allow the same workflow to be processed in parallel?&lt;/P&gt;&lt;P&gt;Thank you!&lt;/P&gt;&lt;P&gt;Michael Reynolds&lt;/P&gt;</description>
      <pubDate>Tue, 12 Aug 2014 17:19:08 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Yarn-applications-hang-foreever-if-run-in-parallel/m-p/16804#M43968</guid>
      <dc:creator>Urantian</dc:creator>
      <dc:date>2014-08-12T17:19:08Z</dc:date>
    </item>
    <item>
      <title>Re: Yarn applications hang forever if run in parallel</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Yarn-applications-hang-foreever-if-run-in-parallel/m-p/16816#M43969</link>
      <description>&lt;P&gt;On a small cluster, sometimes all the resources are occupied by AMs (ApplicationMasters), and no real work gets done. See &lt;A href="https://issues.apache.org/jira/browse/YARN-1913" target="_blank"&gt;https://issues.apache.org/jira/browse/YARN-1913&lt;/A&gt;. One workaround is to set 'maxRunningApps' to a smaller number. See &lt;A href="http://hadoop.apache.org/docs/r2.4.1/hadoop-yarn/hadoop-yarn-site/FairScheduler.html" target="_blank"&gt;http://hadoop.apache.org/docs/r2.4.1/hadoop-yarn/hadoop-yarn-site/FairScheduler.html&lt;/A&gt;.&lt;/P&gt;</description>
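      <content:encoded><![CDATA[
As a minimal sketch of the 'maxRunningApps' workaround, a Fair Scheduler allocation file might look like the fragment below. The queue name "default" and both limits are hypothetical placeholders, not values taken from this thread; choose limits that leave room for task containers once every ApplicationMaster is running.

```xml
<allocations>
  <!-- Hypothetical queue: cap how many applications may run in it at once -->
  <queue name="default">
    <maxRunningApps>4</maxRunningApps>
  </queue>
  <!-- Illustrative default limit for queues that do not set their own -->
  <queueMaxAppsDefault>4</queueMaxAppsDefault>
</allocations>
```

Applications beyond the limit wait in the queue instead of starting an ApplicationMaster, which keeps AMs from occupying all of the cluster's containers.
]]></content:encoded>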
      <pubDate>Tue, 12 Aug 2014 18:38:07 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Yarn-applications-hang-foreever-if-run-in-parallel/m-p/16816#M43969</guid>
      <dc:creator>bcwalrus</dc:creator>
      <dc:date>2014-08-12T18:38:07Z</dc:date>
    </item>
    <item>
      <title>Re: Yarn applications hang forever if run in parallel</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Yarn-applications-hang-foreever-if-run-in-parallel/m-p/16828#M43970</link>
      <description>Thank you very much for your assistance! It is now working fine. Michael Reynolds</description>
      <pubDate>Tue, 12 Aug 2014 21:34:06 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Yarn-applications-hang-foreever-if-run-in-parallel/m-p/16828#M43970</guid>
      <dc:creator>Urantian</dc:creator>
      <dc:date>2014-08-12T21:34:06Z</dc:date>
    </item>
    <item>
      <title>Re: Yarn applications hang forever if run in parallel</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Yarn-applications-hang-foreever-if-run-in-parallel/m-p/17146#M43971</link>
      <description>&lt;P&gt;We are still experiencing periodic problems with applications hanging when a number of jobs are submitted in parallel. We have reduced 'maxRunningApps', increased the virtual core count, and also increased 'oozie.service.callablequeueservice.threads' to 40. In many cases the applications do not hang, but the behavior is not consistent.&lt;/P&gt;&lt;P&gt;Regarding YARN-1913 (&lt;A href="https://issues.apache.org/jira/browse/YARN-1913" target="_blank"&gt;https://issues.apache.org/jira/browse/YARN-1913&lt;/A&gt;): is this patch incorporated in CDH 5.1.0, the version we are using? YARN-1913 lists 2.3.0 as the affected version and 2.5.0 as the fix version. The Hadoop version in CDH 5.1.0 is 2.3.0.&lt;/P&gt;&lt;P&gt;Thank you,&lt;/P&gt;&lt;P&gt;Michael Reynolds&lt;/P&gt;</description>
      <pubDate>Mon, 18 Aug 2014 22:29:38 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Yarn-applications-hang-foreever-if-run-in-parallel/m-p/17146#M43971</guid>
      <dc:creator>Urantian</dc:creator>
      <dc:date>2014-08-18T22:29:38Z</dc:date>
    </item>
  </channel>
</rss>

