<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Error in Scala/Spark Project on Cloudera Data Science Workbench in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Error-in-Scala-Spark-Project-on-Cloudera-Data-Science/m-p/54973#M61630</link>
    <description>&lt;P&gt;We have a Hadoop cluster with ACLs for YARN resource pools.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I am trying to create a Scala/Spark project within CDSW, but it throws the following error as soon as the engine starts:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;PRE&gt;ERROR spark.SparkContext: Error initializing SparkContext.
org.apache.hadoop.yarn.exceptions.YarnException: Failed to submit application_1495197568507_9413 to YARN : Application rejected by queue placement policy&lt;/PRE&gt;&lt;P&gt;I know I need to tell it to use a specific Yarn resource pool, but I don't know how/where to put that parameter so that it can take effect. I tried setting it up as a parameter in engine settings, but that didn't work.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Does anyone any idea about it?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks in advance!&lt;/P&gt;</description>
    <pubDate>Tue, 21 Apr 2026 13:29:21 GMT</pubDate>
    <dc:creator>MGarg</dc:creator>
    <dc:date>2026-04-21T13:29:21Z</dc:date>
    <item>
      <title>Error in Scala/Spark Project on Cloudera Data Science Workbench</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Error-in-Scala-Spark-Project-on-Cloudera-Data-Science/m-p/54973#M61630</link>
      <description>&lt;P&gt;We have a Hadoop cluster with ACLs for YARN resource pools.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I am trying to create a Scala/Spark project within CDSW, but it throws the following error as soon as the engine starts:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;PRE&gt;ERROR spark.SparkContext: Error initializing SparkContext.
org.apache.hadoop.yarn.exceptions.YarnException: Failed to submit application_1495197568507_9413 to YARN : Application rejected by queue placement policy&lt;/PRE&gt;&lt;P&gt;I know I need to tell it to use a specific Yarn resource pool, but I don't know how/where to put that parameter so that it can take effect. I tried setting it up as a parameter in engine settings, but that didn't work.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Does anyone any idea about it?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks in advance!&lt;/P&gt;</description>
      <pubDate>Tue, 21 Apr 2026 13:29:21 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Error-in-Scala-Spark-Project-on-Cloudera-Data-Science/m-p/54973#M61630</guid>
      <dc:creator>MGarg</dc:creator>
      <dc:date>2026-04-21T13:29:21Z</dc:date>
    </item>
    <item>
      <title>Re: Error in Scala/Spark Project on Cloudera Data Science Workbench</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Error-in-Scala-Spark-Project-on-Cloudera-Data-Science/m-p/55018#M61631</link>
      <description>&lt;P&gt;Okay - After much research I found a way to configure Yarn resource pool within the Spark/Scala Project and here are the steps:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;1. Create Scala Project and start the engine&lt;/P&gt;&lt;P&gt;2. Engine startup will fail the very first time.&lt;/P&gt;&lt;P&gt;3. Open "Terminal" in the Workbench window and do the following:&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; i. Verify that you are in /home/cdsw directory.&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; ii. Create a file named "spark-defaults.conf" and add "spark.yarn.queue={QUEUE_NAME}"&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; iii. Save and exit.&lt;/P&gt;&lt;P&gt;4. Stop and start the engine again and the issue will be resolved.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Regards,&lt;/P&gt;&lt;P&gt;MG&lt;/P&gt;</description>
      <pubDate>Wed, 24 May 2017 20:31:49 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Error-in-Scala-Spark-Project-on-Cloudera-Data-Science/m-p/55018#M61631</guid>
      <dc:creator>MGarg</dc:creator>
      <dc:date>2017-05-24T20:31:49Z</dc:date>
    </item>
    <item>
      <title>Re: Error in Scala/Spark Project on Cloudera Data Science Workbench</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Error-in-Scala-Spark-Project-on-Cloudera-Data-Science/m-p/55072#M61632</link>
      <description>&lt;P&gt;MG,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I'm glad you figured this out. &amp;nbsp;You can configure the YARN queue, or any Spark option, either globally using Cloudera Manager or on a per project basis within Cloudera Data Science Workbench. &amp;nbsp;It sounds like you figured this out already, but the documentation for these two options is here:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;A href="https://www.cloudera.com/documentation/data-science-workbench/latest/topics/cdsw_spark_configuration.html#config_files" target="_blank"&gt;https://www.cloudera.com/documentation/data-science-workbench/latest/topics/cdsw_spark_configuration.html#config_files&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Configuring this option globally may make more sense, unless you're using a queue specifically for Cloudera Data Science Workbench launched Spark jobs.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Best,&lt;/P&gt;&lt;P&gt;Tristan&lt;/P&gt;</description>
      <pubDate>Thu, 25 May 2017 19:34:29 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Error-in-Scala-Spark-Project-on-Cloudera-Data-Science/m-p/55072#M61632</guid>
      <dc:creator>tristanzajonc</dc:creator>
      <dc:date>2017-05-25T19:34:29Z</dc:date>
    </item>
    <item>
      <title>Re: Error in Scala/Spark Project on Cloudera Data Science Workbench</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Error-in-Scala-Spark-Project-on-Cloudera-Data-Science/m-p/55074#M61633</link>
      <description>&lt;P&gt;Hi Tristan,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;You are right, configuring it globally was much easier, but we have tenant specific queues and we want to keep them contained within their pools, which is why we needed Engine/Project specific setting.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Anyways, thanks for your response.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Regards,&lt;/P&gt;&lt;P&gt;MG&lt;/P&gt;</description>
      <pubDate>Thu, 25 May 2017 19:39:05 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Error-in-Scala-Spark-Project-on-Cloudera-Data-Science/m-p/55074#M61633</guid>
      <dc:creator>MGarg</dc:creator>
      <dc:date>2017-05-25T19:39:05Z</dc:date>
    </item>
  </channel>
</rss>

