<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Spark in yarn-cluster mode on Zeppelin in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Spark-in-yarn-cluster-mode-on-Zeppelin/m-p/242725#M204524</link>
    <description>&lt;P&gt;&lt;EM&gt;&lt;A href="@Jeremy Jean-Jean"&gt; @Jeremy Jean-Jean&lt;/A&gt;&lt;BR /&gt;&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;There is no sense in installing zeppelin on all the nodes, Do you have YARN Client installed on the data nodes? Then  submit using&lt;/EM&gt;&lt;/P&gt;&lt;PRE&gt;spark-submit --class &amp;lt;clasname&amp;gt; --master yarn --deploy-mode cluster &amp;lt;jars&amp;gt; &amp;lt;args&amp;gt;&lt;I&gt; &lt;/I&gt;&lt;/PRE&gt;&lt;P&gt;&lt;I&gt;&lt;BR /&gt;&lt;/I&gt;&lt;/P&gt;&lt;P&gt;&lt;I&gt;HTH&lt;/I&gt;&lt;/P&gt;</description>
    <pubDate>Tue, 22 Jan 2019 18:55:44 GMT</pubDate>
    <dc:creator>Shelton</dc:creator>
    <dc:date>2019-01-22T18:55:44Z</dc:date>
    <item>
      <title>Spark in yarn-cluster mode on Zeppelin</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Spark-in-yarn-cluster-mode-on-Zeppelin/m-p/242724#M204523</link>
      <description>&lt;P&gt;By default with ambari installation, Zeppelin is set to have yarn client mode for Spark Interpreter which means the driver runs in the same host of Zeppelin Server. This incur high memory pressure on the Zeppelin Server host especially when Spark Interpreter is ran in isolated mode.&lt;/P&gt;&lt;P&gt;I'm trying to switch to yarn-cluster mode which would let yarn decide on where spark driver should be executed depending of the available resources in the cluster. This mode is supported by Zeppelin since the version 0.8.0 but I'm facing the following issue &lt;A href="https://issues.apache.org/jira/browse/ZEPPELIN-3633"&gt;https://issues.apache.org/jira/browse/ZEPPELIN-3633&lt;/A&gt;. Basically, the node where yarn decided to run spark driver doesn't have zeppelin installed so is unable to start. &lt;/P&gt;&lt;P&gt;There is a fix on Zeppelin's github &lt;A href="https://github.com/apache/zeppelin/pull/3181"&gt;https://github.com/apache/zeppelin/pull/3181 &lt;/A&gt; but I can't find the files that I need to change. Any chance that this can be fixed easily or should I just install zeppelin on every nodes?&lt;/P&gt;</description>
      <pubDate>Tue, 22 Jan 2019 18:24:54 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Spark-in-yarn-cluster-mode-on-Zeppelin/m-p/242724#M204523</guid>
      <dc:creator>jeremyjjea</dc:creator>
      <dc:date>2019-01-22T18:24:54Z</dc:date>
    </item>
    <item>
      <title>Re: Spark in yarn-cluster mode on Zeppelin</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Spark-in-yarn-cluster-mode-on-Zeppelin/m-p/242725#M204524</link>
      <description>&lt;P&gt;&lt;EM&gt;&lt;A href="@Jeremy Jean-Jean"&gt; @Jeremy Jean-Jean&lt;/A&gt;&lt;BR /&gt;&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;There is no sense in installing zeppelin on all the nodes, Do you have YARN Client installed on the data nodes? Then  submit using&lt;/EM&gt;&lt;/P&gt;&lt;PRE&gt;spark-submit --class &amp;lt;clasname&amp;gt; --master yarn --deploy-mode cluster &amp;lt;jars&amp;gt; &amp;lt;args&amp;gt;&lt;I&gt; &lt;/I&gt;&lt;/PRE&gt;&lt;P&gt;&lt;I&gt;&lt;BR /&gt;&lt;/I&gt;&lt;/P&gt;&lt;P&gt;&lt;I&gt;HTH&lt;/I&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 22 Jan 2019 18:55:44 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Spark-in-yarn-cluster-mode-on-Zeppelin/m-p/242725#M204524</guid>
      <dc:creator>Shelton</dc:creator>
      <dc:date>2019-01-22T18:55:44Z</dc:date>
    </item>
    <item>
      <title>Re: Spark in yarn-cluster mode on Zeppelin</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Spark-in-yarn-cluster-mode-on-Zeppelin/m-p/242726#M204525</link>
      <description>&lt;P&gt;Thank you for your fast answer!&lt;/P&gt;&lt;P&gt;Indeed it works after tweaking zeppelin's spark interpreter parameters&lt;/P&gt;&lt;P&gt; and changing:&lt;/P&gt;&lt;PRE&gt;master: yarn-cluster&lt;/PRE&gt;&lt;P&gt;to&lt;/P&gt;&lt;PRE&gt;master: yarn
spark.submit.deployMode: cluster
&lt;/PRE&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="97630-spark.jpg" style="width: 1010px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/13556iA1CE7208AEF59331/image-size/medium?v=v2&amp;amp;px=400" role="button" title="97630-spark.jpg" alt="97630-spark.jpg" /&gt;&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Sat, 17 Aug 2019 21:54:37 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Spark-in-yarn-cluster-mode-on-Zeppelin/m-p/242726#M204525</guid>
      <dc:creator>jeremyjjea</dc:creator>
      <dc:date>2019-08-17T21:54:37Z</dc:date>
    </item>
  </channel>
</rss>

