<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: How is the memory allocated in CDP Machine Learning session for Spark? in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/How-is-the-memory-allocated-in-CDP-Machine-Learning-session/m-p/361256#M238539</link>
    <description>&lt;P&gt;Hello&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/102888"&gt;@Ryan_2002&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks for using Cloudera Community. To your Q, the Driver Cap is the Engine/Resource Profile &amp;amp; the Executor's Resource Usage is defined by the SparkSession or "spark-defaults.conf" file within the Project wherein the Workbench Session is being created.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Your Team can review the Pods in the User's Namespace &amp;amp; see the same i.e. upon a Workbench Session Creation, an Engine Pod is started with "Limits" set toEngine/Resource Profile Settings. After SparkSession is initialised, additional Pods are generated within the User's Namespace based on the Execution's Configs passed via&amp;nbsp;SparkSession or "spark-defaults.conf" file.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;You may configure the Executor's Configs as per your usage yet the same depends on the CML Workspace AutoScale Range &amp;amp; InstanceType. Say, an InstanceType supporting 8 vCPU &amp;amp; Executors requesting 8 vCPU won't work. Similarly, AutoScale Max of 5 yet requesting Executors collectively utilising the Resource Limit of 5 Nodes.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Hope the above helps answer your Post's queries. If Yes, Kindly mark the Post as Solved. If No, Feel free to share your concerns &amp;amp; we shall address accordingly.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Regards, Smarak&lt;/P&gt;</description>
    <pubDate>Fri, 13 Jan 2023 09:35:45 GMT</pubDate>
    <dc:creator>smdas</dc:creator>
    <dc:date>2023-01-13T09:35:45Z</dc:date>
    <item>
      <title>How is the memory allocated in CDP Machine Learning session for Spark?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/How-is-the-memory-allocated-in-CDP-Machine-Learning-session/m-p/361141#M238520</link>
      <description>&lt;P&gt;In CDP Public Cloud Machine Learning, we can create a new session with reserved resource, for example 4vCPU and 16 GiB Memory.&lt;/P&gt;&lt;P&gt;We can also create spark session inside the machine learning workbench with some memory configuration. For example:&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;&lt;FONT size="3"&gt;&lt;EM&gt;spark = (SparkSession&lt;/EM&gt;&lt;EM&gt;.builder&lt;/EM&gt;&lt;EM&gt;.appName(appName)&lt;/EM&gt;&lt;EM&gt;.config("spark.driver.memory", "16G")&lt;/EM&gt;&lt;EM&gt;.config("spark.executor.instances", "10")&lt;/EM&gt;&lt;EM&gt;.config("spark.executor.cores", "4")&lt;/EM&gt;&lt;EM&gt;.config("spark.executor.memory", "20G")&lt;/EM&gt;&lt;EM&gt;.getOrCreate())&lt;/EM&gt;&lt;/FONT&gt;&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT size="3"&gt;My question is, how will the memory be allocated to Spark session now? Is the reserved resource (4vCPU and 16 GiB Memory) in machine learning session the maximal limitation for total spark memory usage? How many work nodes and executors can I configure for the spark session?&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Ryan_2002_0-1673528198488.png" style="width: 400px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/36600i14EEB73E8A0E7F76/image-size/medium?v=v2&amp;amp;px=400" role="button" title="Ryan_2002_0-1673528198488.png" alt="Ryan_2002_0-1673528198488.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 21 Apr 2026 07:45:45 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/How-is-the-memory-allocated-in-CDP-Machine-Learning-session/m-p/361141#M238520</guid>
      <dc:creator>Ryan_2002</dc:creator>
      <dc:date>2026-04-21T07:45:45Z</dc:date>
    </item>
    <item>
      <title>Re: How is the memory allocated in CDP Machine Learning session for Spark?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/How-is-the-memory-allocated-in-CDP-Machine-Learning-session/m-p/361256#M238539</link>
      <description>&lt;P&gt;Hello&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/102888"&gt;@Ryan_2002&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks for using Cloudera Community. To your Q, the Driver Cap is the Engine/Resource Profile &amp;amp; the Executor's Resource Usage is defined by the SparkSession or "spark-defaults.conf" file within the Project wherein the Workbench Session is being created.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Your Team can review the Pods in the User's Namespace &amp;amp; see the same i.e. upon a Workbench Session Creation, an Engine Pod is started with "Limits" set toEngine/Resource Profile Settings. After SparkSession is initialised, additional Pods are generated within the User's Namespace based on the Execution's Configs passed via&amp;nbsp;SparkSession or "spark-defaults.conf" file.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;You may configure the Executor's Configs as per your usage yet the same depends on the CML Workspace AutoScale Range &amp;amp; InstanceType. Say, an InstanceType supporting 8 vCPU &amp;amp; Executors requesting 8 vCPU won't work. Similarly, AutoScale Max of 5 yet requesting Executors collectively utilising the Resource Limit of 5 Nodes.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Hope the above helps answer your Post's queries. If Yes, Kindly mark the Post as Solved. If No, Feel free to share your concerns &amp;amp; we shall address accordingly.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Regards, Smarak&lt;/P&gt;</description>
      <pubDate>Fri, 13 Jan 2023 09:35:45 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/How-is-the-memory-allocated-in-CDP-Machine-Learning-session/m-p/361256#M238539</guid>
      <dc:creator>smdas</dc:creator>
      <dc:date>2023-01-13T09:35:45Z</dc:date>
    </item>
    <item>
      <title>Re: How is the memory allocated in CDP Machine Learning session for Spark?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/How-is-the-memory-allocated-in-CDP-Machine-Learning-session/m-p/361260#M238540</link>
      <description>&lt;P&gt;Hi Smarak,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;thanks for your answer. That helps me!&lt;/P&gt;</description>
      <pubDate>Fri, 13 Jan 2023 09:43:40 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/How-is-the-memory-allocated-in-CDP-Machine-Learning-session/m-p/361260#M238540</guid>
      <dc:creator>Ryan_2002</dc:creator>
      <dc:date>2023-01-13T09:43:40Z</dc:date>
    </item>
  </channel>
</rss>

