<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Spark &amp;quot;Master Node&amp;quot; and &amp;quot;Worker Node&amp;quot; in Cloudera? in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Spark-quot-Master-Node-quot-and-quot-Worker-Node-quot-in/m-p/364861#M239259</link>
    <description>&lt;P&gt;Hello &lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/102287"&gt;@quangbilly79&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks for using Cloudera Community. The "Spark Master" refers to the Resource Manager responsible for allocating resources. Since you are using YARN, Your Team needs to use "&lt;EM&gt;&lt;SPAN class="nt"&gt;--master&lt;/SPAN&gt; yarn&lt;/EM&gt;". The usage of "&lt;EM&gt;&lt;SPAN class="nt"&gt;--master&lt;/SPAN&gt; spark://&amp;lt;IP Address&amp;gt;:7077&lt;/EM&gt;"&amp;nbsp;is for Spark Standalone Cluster, which isn't the Case for your team.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;To your Observation concerning the "Driver Instance" &amp;amp; "Worker Instance" being added via "Add Role Instance", there is no such Option as YARN is the Resource Manager, which shall allocate the resources for Spark Driver &amp;amp; Executors.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Review [1] for the usage of "&lt;EM&gt;--master&lt;/EM&gt;" as well. Hope the above answers your Team's queries.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Regards, Smarak&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;[1] &lt;A href="https://spark.apache.org/docs/latest/submitting-applications.html#launching-applications-with-spark-submit" target="_blank" rel="noopener"&gt;https://spark.apache.org/docs/latest/submitting-applications.html#launching-applications-with-spark-submit&lt;/A&gt;&lt;/P&gt;</description>
    <pubDate>Wed, 01 Mar 2023 05:04:16 GMT</pubDate>
    <dc:creator>smdas</dc:creator>
    <dc:date>2023-03-01T05:04:16Z</dc:date>
    <item>
      <title>Spark "Master Node" and "Worker Node" in Cloudera?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Spark-quot-Master-Node-quot-and-quot-Worker-Node-quot-in/m-p/362724#M238832</link>
      <description>&lt;P&gt;I'm using a tool in which I have to point out the master node (driver node) of the Cloudera Spark Cluster (&lt;STRONG&gt;spark :// &amp;lt;some-spark-master&amp;gt; : 7077). &lt;/STRONG&gt;Also as I learned, Spark has "Master Node" (Driver Node) and "Worker Nodes".&lt;/P&gt;&lt;P&gt;So I decided to go to the Cloudera Web Manager and checked the Configuration Tab of the Spark service, but all I found are "Gateway instance" and "History Server instance". Where are the "Driver instance" and "Worker instance"? I can't add these two instances in the "Add Role Instances" too&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="quangbilly79_0-1675217417485.png" style="width: 400px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/36693i17C43CADAA570CCF/image-size/medium?v=v2&amp;amp;px=400" role="button" title="quangbilly79_0-1675217417485.png" alt="quangbilly79_0-1675217417485.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;My guess is that it's in Yarn service configuration, but I can't find anything related to "Master"/"Driver" or "Worker" either.&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="quangbilly79_1-1675217551253.png" style="width: 400px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/36694iEEC25FAAAAB4FE73/image-size/medium?v=v2&amp;amp;px=400" role="button" title="quangbilly79_1-1675217551253.png" alt="quangbilly79_1-1675217551253.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;So what is the link to "Spark Master" that ends with 7077? I can't find it anywhere in the Configuration tab&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 01 Feb 2023 02:14:29 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Spark-quot-Master-Node-quot-and-quot-Worker-Node-quot-in/m-p/362724#M238832</guid>
      <dc:creator>quangbilly79</dc:creator>
      <dc:date>2023-02-01T02:14:29Z</dc:date>
    </item>
    <item>
      <title>Re: Spark "Master Node" and "Worker Node" in Cloudera?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Spark-quot-Master-Node-quot-and-quot-Worker-Node-quot-in/m-p/364861#M239259</link>
      <description>&lt;P&gt;Hello &lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/102287"&gt;@quangbilly79&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks for using Cloudera Community. The "Spark Master" refers to the Resource Manager responsible for allocating resources. Since you are using YARN, Your Team needs to use "&lt;EM&gt;&lt;SPAN class="nt"&gt;--master&lt;/SPAN&gt; yarn&lt;/EM&gt;". The usage of "&lt;EM&gt;&lt;SPAN class="nt"&gt;--master&lt;/SPAN&gt; spark://&amp;lt;IP Address&amp;gt;:7077&lt;/EM&gt;"&amp;nbsp;is for Spark Standalone Cluster, which isn't the Case for your team.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;To your Observation concerning the "Driver Instance" &amp;amp; "Worker Instance" being added via "Add Role Instance", there is no such Option as YARN is the Resource Manager, which shall allocate the resources for Spark Driver &amp;amp; Executors.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Review [1] for the usage of "&lt;EM&gt;--master&lt;/EM&gt;" as well. Hope the above answers your Team's queries.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Regards, Smarak&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;[1] &lt;A href="https://spark.apache.org/docs/latest/submitting-applications.html#launching-applications-with-spark-submit" target="_blank" rel="noopener"&gt;https://spark.apache.org/docs/latest/submitting-applications.html#launching-applications-with-spark-submit&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 01 Mar 2023 05:04:16 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Spark-quot-Master-Node-quot-and-quot-Worker-Node-quot-in/m-p/364861#M239259</guid>
      <dc:creator>smdas</dc:creator>
      <dc:date>2023-03-01T05:04:16Z</dc:date>
    </item>
  </channel>
</rss>

