<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Why is Spark2 running on only one node? in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Why-is-Spark2-running-on-only-one-node/m-p/209036#M71713</link>
    <description>&lt;P&gt;The answer is that I am an idiot. Only s3 had the DataNode and NodeManager installed, so YARN could only place executors there. Hopefully this helps someone.&lt;/P&gt;</description>
    <pubDate>Sat, 25 Nov 2017 04:17:38 GMT</pubDate>
    <dc:creator>ed_day</dc:creator>
    <dc:date>2017-11-25T04:17:38Z</dc:date>
    <item>
      <title>Why is Spark2 running on only one node?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Why-is-Spark2-running-on-only-one-node/m-p/209033#M71710</link>
      <description>&lt;P&gt;Hi.&lt;/P&gt;&lt;P&gt;I am running Spark2 from Zeppelin (0.7 in HDP 2.6) and I am doing an IDF transformation which crashes after many hours. It runs on a cluster with a master and 3 datanodes: s1, s2 and s3. All nodes have a Spark2 client and each has 8 cores and 16GB RAM.&lt;/P&gt;&lt;P&gt;I just noticed it is only running on one node, s3, with 5 executors.&lt;/P&gt;&lt;P&gt;In zeppelin-env.sh I have set zeppelin.executor.instances to 32 and zeppelin.executor.mem to 12g, and it has the line:&lt;/P&gt;&lt;PRE&gt;export MASTER=yarn-client&lt;/PRE&gt;&lt;P&gt;I have set yarn.resourcemanager.scheduler.class to org.apache.hadoop.yarn.server.resourcemanager.scheduler.fair.FairScheduler.&lt;/P&gt;&lt;P&gt;I also set spark.executor.instances to 32 in the Spark2 interpreter.&lt;/P&gt;&lt;P&gt;Does anyone have any ideas what else I can try to get the other nodes doing their share?&lt;/P&gt;</description>
      <pubDate>Fri, 24 Nov 2017 19:59:21 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Why-is-Spark2-running-on-only-one-node/m-p/209033#M71710</guid>
      <dc:creator>ed_day</dc:creator>
      <dc:date>2017-11-24T19:59:21Z</dc:date>
    </item>
    <item>
      <title>Re: Why is Spark2 running on only one node?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Why-is-Spark2-running-on-only-one-node/m-p/209034#M71711</link>
      <description>&lt;P&gt;Hello&lt;/P&gt;&lt;P&gt;This seems to be happening because your Spark interpreter is configured to use master = [local].&lt;/P&gt;&lt;P&gt;1) Take a look at the link below:&lt;/P&gt;&lt;P&gt;&lt;A href="https://zeppelin.apache.org/docs/latest/manual/interpreters.html#what-is-interpreter-group" target="_blank"&gt;https://zeppelin.apache.org/docs/latest/manual/interpreters.html#what-is-interpreter-group&lt;/A&gt;&lt;/P&gt;&lt;P&gt;2) Change master from local to yarn-client if it is still set to local in your interpreter.&lt;/P&gt;&lt;P&gt;3) If your application shows up in the Resource Manager, it is likely running on YARN.&lt;/P&gt;&lt;P&gt;Regards,&lt;/P&gt;</description>
      <pubDate>Fri, 24 Nov 2017 22:52:13 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Why-is-Spark2-running-on-only-one-node/m-p/209034#M71711</guid>
      <dc:creator>dperez</dc:creator>
      <dc:date>2017-11-24T22:52:13Z</dc:date>
    </item>
    <item>
      <title>Re: Why is Spark2 running on only one node?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Why-is-Spark2-running-on-only-one-node/m-p/209035#M71712</link>
      <description>&lt;P&gt;Thanks Danilo, but it is set to yarn-client.&lt;/P&gt;</description>
      <pubDate>Sat, 25 Nov 2017 00:49:09 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Why-is-Spark2-running-on-only-one-node/m-p/209035#M71712</guid>
      <dc:creator>ed_day</dc:creator>
      <dc:date>2017-11-25T00:49:09Z</dc:date>
    </item>
    <item>
      <title>Re: Why is Spark2 running on only one node?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Why-is-Spark2-running-on-only-one-node/m-p/209036#M71713</link>
      <description>&lt;P&gt;The answer is that I am an idiot. Only s3 had the DataNode and NodeManager installed, so YARN could only place executors there. Hopefully this helps someone.&lt;/P&gt;</description>
      <pubDate>Sat, 25 Nov 2017 04:17:38 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Why-is-Spark2-running-on-only-one-node/m-p/209036#M71713</guid>
      <dc:creator>ed_day</dc:creator>
      <dc:date>2017-11-25T04:17:38Z</dc:date>
    </item>
  </channel>
</rss>

