<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: How to tune spark job on (execution time wise and cluster utilization wise) in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/How-to-tune-spark-job-on-execution-time-wise-and-cluster/m-p/367299#M239861</link>
    <description>&lt;P&gt;Use the following tool to generate a starting configuration, including the number of executors:&lt;/P&gt;&lt;P&gt;&lt;A href="https://rangareddy.github.io/SparkConfigurationGenerator/" target="_blank"&gt;https://rangareddy.github.io/SparkConfigurationGenerator/&lt;/A&gt;&lt;/P&gt;&lt;P&gt;To size driver memory and executor memory, start with a small value and step up as needed (1g, 2g, 4g, 8g, ...). Set executor-cores to 3-5. The number of executors depends on how much data you are processing.&lt;/P&gt;</description>
    <pubDate>Thu, 30 Mar 2023 11:38:59 GMT</pubDate>
    <dc:creator>RangaReddy</dc:creator>
    <dc:date>2023-03-30T11:38:59Z</dc:date>
    <item>
      <title>How to tune spark job on (execution time wise and cluster utilization wise)</title>
      <link>https://community.cloudera.com/t5/Support-Questions/How-to-tune-spark-job-on-execution-time-wise-and-cluster/m-p/361966#M238679</link>
      <description>&lt;P&gt;Hi Team,&lt;/P&gt;&lt;P&gt;I would appreciate guidance on how to set memory for a Spark job so that each job uses only 1%-2% of the cluster's memory. Please share the math for calculating the settings from the cluster details below:&lt;/P&gt;&lt;P&gt;#1 How many worker nodes does the cluster currently have?&lt;BR /&gt;&amp;gt; NodeManagers: 166&lt;BR /&gt;&amp;gt; DataNodes: 159&lt;/P&gt;&lt;P&gt;#2 How many cores per node do we currently have?&lt;BR /&gt;&amp;gt; 64 cores&lt;/P&gt;&lt;P&gt;#3 How much RAM per node do we currently have?&lt;BR /&gt;&amp;gt; 503 GB&lt;/P&gt;&lt;P&gt;==== Settings to calculate for the Spark job ====&lt;/P&gt;&lt;P&gt;#1 driver-memory&lt;/P&gt;&lt;P&gt;#2 executor-memory&lt;/P&gt;&lt;P&gt;#3 driver-cores&lt;/P&gt;&lt;P&gt;#4 executor-cores&lt;/P&gt;&lt;P&gt;#5 num-executors&lt;/P&gt;&lt;P&gt;========================&lt;/P&gt;&lt;P&gt;Please also suggest any additional parameters that would help tune the Spark job for execution time and cluster utilization.&lt;/P&gt;</description>
      <pubDate>Mon, 23 Jan 2023 06:01:55 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/How-to-tune-spark-job-on-execution-time-wise-and-cluster/m-p/361966#M238679</guid>
      <dc:creator>pankshiv1809</dc:creator>
      <dc:date>2023-01-23T06:01:55Z</dc:date>
    </item>
    <item>
      <title>Re: How to tune spark job on (execution time wise and cluster utilization wise)</title>
      <link>https://community.cloudera.com/t5/Support-Questions/How-to-tune-spark-job-on-execution-time-wise-and-cluster/m-p/367299#M239861</link>
      <description>&lt;P&gt;Use the following tool to generate a starting configuration, including the number of executors:&lt;/P&gt;&lt;P&gt;&lt;A href="https://rangareddy.github.io/SparkConfigurationGenerator/" target="_blank"&gt;https://rangareddy.github.io/SparkConfigurationGenerator/&lt;/A&gt;&lt;/P&gt;&lt;P&gt;To size driver memory and executor memory, start with a small value and step up as needed (1g, 2g, 4g, 8g, ...). Set executor-cores to 3-5. The number of executors depends on how much data you are processing.&lt;/P&gt;</description>
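      <content:encoded><![CDATA[
The arithmetic the question asks for can be sketched roughly as below. This is a minimal illustration under stated assumptions, not the method used by the linked generator: the helper name plan_spark_resources and its parameters are hypothetical, the 4g driver and 8g executor sizes are just example picks from the 1g/2g/4g/8g ladder, all physical RAM is treated as schedulable (YARN normally exposes less), and per-container overhead follows Spark's default of max(10% of the heap, 384 MiB).

```python
import math


def plan_spark_resources(nodes, ram_per_node_gb, cores_per_node,
                         memory_fraction, executor_memory_gb, executor_cores,
                         driver_memory_gb=4, driver_cores=1):
    """Estimate how many executors fit in a memory budget (hypothetical helper)."""
    # Assumption: every GB of physical RAM is schedulable. In practice the
    # YARN NodeManager memory setting is lower than the physical total.
    cluster_memory_gb = nodes * ram_per_node_gb
    budget_gb = cluster_memory_gb * memory_fraction

    def container_gb(heap_gb):
        # Spark's default memory overhead: max(10% of the heap, 384 MiB).
        return heap_gb + max(heap_gb * 0.10, 384 / 1024)

    # Reserve the driver container first, then fill the rest with executors.
    remaining_gb = budget_gb - container_gb(driver_memory_gb)
    num_executors = max(0, math.floor(remaining_gb / container_gb(executor_memory_gb)))

    total_cores = num_executors * executor_cores + driver_cores
    return {
        "budget_gb": round(budget_gb, 2),
        "num_executors": num_executors,
        # Share of all cluster cores the job would hold at this size.
        "core_share": round(total_cores / (nodes * cores_per_node), 4),
        "flags": (
            f"--driver-memory {driver_memory_gb}g --driver-cores {driver_cores} "
            f"--executor-memory {executor_memory_gb}g "
            f"--executor-cores {executor_cores} --num-executors {num_executors}"
        ),
    }


# Figures from the question: 166 NodeManagers, 503 GB RAM and 64 cores per node,
# with a 1% memory budget and the example sizes named above.
plan = plan_spark_resources(nodes=166, ram_per_node_gb=503, cores_per_node=64,
                            memory_fraction=0.01, executor_memory_gb=8,
                            executor_cores=5)
print(plan["flags"])
```

With these inputs the 1% budget is about 835 GB, which leaves room for 94 executors of 8g each. Note that 94 executors at 5 cores each hold roughly 4.4% of the cluster's cores, so a job capped at 1%-2% of memory can still exceed that share of CPU; lowering executor-cores or num-executors brings the two closer.
]]></content:encoded>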
      <pubDate>Thu, 30 Mar 2023 11:38:59 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/How-to-tune-spark-job-on-execution-time-wise-and-cluster/m-p/367299#M239861</guid>
      <dc:creator>RangaReddy</dc:creator>
      <dc:date>2023-03-30T11:38:59Z</dc:date>
    </item>
  </channel>
</rss>

