<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Issues running Spark in CML in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Issues-running-Spark-in-CML/m-p/395313#M248914</link>
    <description>&lt;P&gt;I have a CML project using a JupyterLab Runtime with Python 3.10 and I want to start a Spark cluster with my CDP Datalake. I'm using the predefined Spark Data Lake Connection in CML which looks like this:&lt;/P&gt;&lt;P&gt;```&lt;/P&gt;&lt;P&gt;import cml.data_v1 as cmldata&lt;/P&gt;&lt;P&gt;# Sample in-code customization of spark configurations&lt;BR /&gt;#from pyspark import SparkContext&lt;BR /&gt;#SparkContext.setSystemProperty('spark.executor.cores', '1')&lt;BR /&gt;#SparkContext.setSystemProperty('spark.executor.memory', '2g')&lt;/P&gt;&lt;P&gt;CONNECTION_NAME = "hiaa-dl"&lt;BR /&gt;conn = cmldata.get_connection(CONNECTION_NAME)&lt;BR /&gt;spark = conn.get_spark_session()&lt;/P&gt;&lt;P&gt;# Sample usage to run query through spark&lt;BR /&gt;EXAMPLE_SQL_QUERY = "show databases"&lt;BR /&gt;spark.sql(EXAMPLE_SQL_QUERY).show()&lt;/P&gt;&lt;P&gt;```&amp;nbsp;&lt;/P&gt;&lt;P&gt;When I execute this I get the error:&lt;/P&gt;&lt;PRE&gt;&lt;SPAN class="ansi-red-fg"&gt;IllegalArgumentException&lt;/SPAN&gt;: The value of property spark.app.name must not be null&lt;/PRE&gt;&lt;P&gt;I'm using the predefined spark-defaults.conf which looks like this:&lt;/P&gt;&lt;P&gt;```&lt;/P&gt;&lt;P&gt;spark.executor.memory=1g&lt;BR /&gt;spark.executor.cores=1&lt;BR /&gt;spark.yarn.access.hadoopFileSystems=abfs://[container]@[storage-account].dfs.core.windows.net&lt;/P&gt;&lt;P&gt;```&lt;/P&gt;&lt;P&gt;Is there something else I need to configure in the CML session or at the data lake level?&lt;/P&gt;</description>
    <pubDate>Tue, 21 Apr 2026 06:24:46 GMT</pubDate>
    <dc:creator>ipson</dc:creator>
    <dc:date>2026-04-21T06:24:46Z</dc:date>
    <item>
      <title>Issues running Spark in CML</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Issues-running-Spark-in-CML/m-p/395313#M248914</link>
      <description>&lt;P&gt;I have a CML project using a JupyterLab Runtime with Python 3.10 and I want to start a Spark cluster with my CDP Datalake. I'm using the predefined Spark Data Lake Connection in CML which looks like this:&lt;/P&gt;&lt;P&gt;```&lt;/P&gt;&lt;P&gt;import cml.data_v1 as cmldata&lt;/P&gt;&lt;P&gt;# Sample in-code customization of spark configurations&lt;BR /&gt;#from pyspark import SparkContext&lt;BR /&gt;#SparkContext.setSystemProperty('spark.executor.cores', '1')&lt;BR /&gt;#SparkContext.setSystemProperty('spark.executor.memory', '2g')&lt;/P&gt;&lt;P&gt;CONNECTION_NAME = "hiaa-dl"&lt;BR /&gt;conn = cmldata.get_connection(CONNECTION_NAME)&lt;BR /&gt;spark = conn.get_spark_session()&lt;/P&gt;&lt;P&gt;# Sample usage to run query through spark&lt;BR /&gt;EXAMPLE_SQL_QUERY = "show databases"&lt;BR /&gt;spark.sql(EXAMPLE_SQL_QUERY).show()&lt;/P&gt;&lt;P&gt;```&amp;nbsp;&lt;/P&gt;&lt;P&gt;When I execute this I get the error:&lt;/P&gt;&lt;PRE&gt;&lt;SPAN class="ansi-red-fg"&gt;IllegalArgumentException&lt;/SPAN&gt;: The value of property spark.app.name must not be null&lt;/PRE&gt;&lt;P&gt;I'm using the predefined spark-defaults.conf which looks like this:&lt;/P&gt;&lt;P&gt;```&lt;/P&gt;&lt;P&gt;spark.executor.memory=1g&lt;BR /&gt;spark.executor.cores=1&lt;BR /&gt;spark.yarn.access.hadoopFileSystems=abfs://[container]@[storage-account].dfs.core.windows.net&lt;/P&gt;&lt;P&gt;```&lt;/P&gt;&lt;P&gt;Is there something else I need to configure in the CML session or at the data lake level?&lt;/P&gt;</description>
      <pubDate>Tue, 21 Apr 2026 06:24:46 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Issues-running-Spark-in-CML/m-p/395313#M248914</guid>
      <dc:creator>ipson</dc:creator>
      <dc:date>2026-04-21T06:24:46Z</dc:date>
    </item>
    <item>
      <title>Re: Issues running Spark in CML</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Issues-running-Spark-in-CML/m-p/395337#M248924</link>
      <description>&lt;P&gt;Resolved. I had ML Runtimes Addons disabled. Went into CML &amp;gt; Site Administrations &amp;gt; Settings and&amp;nbsp;&lt;SPAN class="ph cmd"&gt;Under&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN class="ph uicontrol"&gt;Feature Flags&lt;/SPAN&gt;, unchecked the checkbox next to&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN class="ph uicontrol"&gt;Allow users to Run ML Runtimes Addons&lt;/SPAN&gt;.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;Then, started a new session with Spark enabled&lt;/P&gt;</description>
      <pubDate>Wed, 16 Oct 2024 19:10:52 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Issues-running-Spark-in-CML/m-p/395337#M248924</guid>
      <dc:creator>ipson</dc:creator>
      <dc:date>2024-10-16T19:10:52Z</dc:date>
    </item>
    <item>
      <title>Re: Issues running Spark in CML</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Issues-running-Spark-in-CML/m-p/395393#M248938</link>
      <description>&lt;P&gt;Correction: 'Check the checkbox to Allow users to Run ML Runtimes'&lt;/P&gt;</description>
      <pubDate>Thu, 17 Oct 2024 11:56:36 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Issues-running-Spark-in-CML/m-p/395393#M248938</guid>
      <dc:creator>ipson</dc:creator>
      <dc:date>2024-10-17T11:56:36Z</dc:date>
    </item>
  </channel>
</rss>

