<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: CDH 5.8.3 and 5.9 Spark Action on Oozie issues. in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/CDH-5-8-3-and-5-9-Spark-Action-on-Oozie-issues/m-p/47508#M46440</link>
    <description>&lt;P&gt;Can anyone confirm/deny that they are able to reproduce the problem on their 5.8.3 or 5.9.0 environments?&lt;/P&gt;</description>
    <pubDate>Thu, 17 Nov 2016 15:27:05 GMT</pubDate>
    <dc:creator>chrisjackson</dc:creator>
    <dc:date>2016-11-17T15:27:05Z</dc:date>
    <item>
      <title>CDH 5.8.3 and 5.9 Spark Action on Oozie issues.</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/CDH-5-8-3-and-5-9-Spark-Action-on-Oozie-issues/m-p/47475#M46439</link>
      <description>&lt;P&gt;Hi All,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I've been developing some oozie workflows that include spark actions. As a smoke test I've been running the SparkPi application from spark examples jar to make sure everything is working as I would expect. Originally my testing was done on CDH 5.8.0 where this workflow was working fine. I've tested it against 5.8.3 and 5.9.0 as well and it does NOT&amp;nbsp;work. I get the following error when yarn tries to execute&amp;nbsp;the spark job:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;PRE&gt;Error: Could not find or load main class org.apache.spark.deploy.yarn.ApplicationMaster&lt;/PRE&gt;&lt;P&gt;&lt;BR /&gt;The workflow and its property file are configured and placed on HDFS as seen below.&lt;BR /&gt;&lt;BR /&gt;placement:&lt;/P&gt;&lt;PRE&gt;# hadoop fs -ls -R /foo/oozie/oozieApps/pi
drwxr-xr-x   - foo foo          0 2016-11-15 21:16 /foo/oozie/oozieApps/pi/lib
-rw-r--r--   3 foo foo  107864471 2016-11-15 21:16 /foo/oozie/oozieApps/pi/lib/spark-assembly.jar
-rw-r--r--   3 foo foo        655 2016-11-15 21:13 /foo/oozie/oozieApps/pi/workflow.xml&lt;/PRE&gt;&lt;P&gt;NOTE: The spark-assembly.jar is a copy of&amp;nbsp;/opt/cloudera/parcels/CDH/lib/spark/lib/spark-assembly-1.6.0-cdh5.8.3-hadoop2.6.0-cdh5.8.3.jar&lt;BR /&gt;&lt;BR /&gt;pi.properties:&lt;/P&gt;&lt;PRE&gt;master=yarn-master
mode=cluster
user.name=foo
nameNode=hdfs://example.com:8020
jobTracker=example.com:8050
queueName=default
oozie.wf.application.path=${nameNode}/foo/oozie/oozieApps/pi
sparkExampleJar=${nameNode}/foo/tmp/spark-examples.jar
argN=10
oozie.use.system.libpath=true&lt;/PRE&gt;&lt;P&gt;&lt;BR /&gt;workflow.xml&lt;/P&gt;&lt;PRE&gt;&amp;lt;workflow-app xmlns='uri:oozie:workflow:0.5' name='SparkPi'&amp;gt;
	&amp;lt;start to='spark-node' /&amp;gt;
	&amp;lt;action name='spark-node'&amp;gt;
		&amp;lt;spark xmlns="uri:oozie:spark-action:0.1"&amp;gt;
			&amp;lt;job-tracker&amp;gt;${jobTracker}&amp;lt;/job-tracker&amp;gt;
			&amp;lt;name-node&amp;gt;${nameNode}&amp;lt;/name-node&amp;gt;
			&amp;lt;master&amp;gt;${master}&amp;lt;/master&amp;gt;
			&amp;lt;mode&amp;gt;${mode}&amp;lt;/mode&amp;gt;
			&amp;lt;name&amp;gt;Spark-pi&amp;lt;/name&amp;gt;
			&amp;lt;class&amp;gt;org.apache.spark.examples.SparkPi&amp;lt;/class&amp;gt;
			&amp;lt;jar&amp;gt;${sparkExampleJar}&amp;lt;/jar&amp;gt;
			&amp;lt;arg&amp;gt;${argN}&amp;lt;/arg&amp;gt;
		&amp;lt;/spark&amp;gt;
		&amp;lt;ok to="end" /&amp;gt;
		&amp;lt;error to="fail" /&amp;gt;
	&amp;lt;/action&amp;gt;
	&amp;lt;kill name="fail"&amp;gt;
		&amp;lt;message&amp;gt;Workflow failed, error
			message[\${wf:errorMessage(wf:lastErrorNode())}]
		&amp;lt;/message&amp;gt;
	&amp;lt;/kill&amp;gt;
	&amp;lt;end name='end' /&amp;gt;
&amp;lt;/workflow-app&amp;gt;&lt;/PRE&gt;&lt;P&gt;&lt;BR /&gt;I'm guessing that this is some sort of classpath issue because I've checked the content of the spark-assembly.jar and the class that it says it cannot find is indeed included. Any ideas on how to fix this issue or troubleshoot it further?&lt;/P&gt;</description>
      <pubDate>Fri, 16 Sep 2022 10:47:57 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/CDH-5-8-3-and-5-9-Spark-Action-on-Oozie-issues/m-p/47475#M46439</guid>
      <dc:creator>chrisjackson</dc:creator>
      <dc:date>2022-09-16T10:47:57Z</dc:date>
    </item>
    <item>
      <title>Re: CDH 5.8.3 and 5.9 Spark Action on Oozie issues.</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/CDH-5-8-3-and-5-9-Spark-Action-on-Oozie-issues/m-p/47508#M46440</link>
      <description>&lt;P&gt;Can anyone confirm/deny that they are able to reproduce the problem on their 5.8.3 or 5.9.0 environments?&lt;/P&gt;</description>
      <pubDate>Thu, 17 Nov 2016 15:27:05 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/CDH-5-8-3-and-5-9-Spark-Action-on-Oozie-issues/m-p/47508#M46440</guid>
      <dc:creator>chrisjackson</dc:creator>
      <dc:date>2016-11-17T15:27:05Z</dc:date>
    </item>
    <item>
      <title>Re: CDH 5.8.3 and 5.9 Spark Action on Oozie issues.</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/CDH-5-8-3-and-5-9-Spark-Action-on-Oozie-issues/m-p/47532#M46441</link>
      <description>&lt;P&gt;So I found a solution for getting it to work on 5.8.3 and 5.9.&lt;BR /&gt;&lt;BR /&gt;I added the following configuration to the workflow:&lt;/P&gt;&lt;PRE&gt;&amp;lt;spark-opts&amp;gt;--conf spark.yarn.jar=local:/opt/cloudera/parcels/CDH/lib/spark/lib/spark-assembly.jar&amp;lt;/spark-opts&amp;gt;&lt;/PRE&gt;&lt;P&gt;&lt;BR /&gt;I don't know why that is necessary on 5.8.3 and 5.9 but not on 5.8.0.&lt;/P&gt;</description>
      <pubDate>Thu, 17 Nov 2016 19:50:41 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/CDH-5-8-3-and-5-9-Spark-Action-on-Oozie-issues/m-p/47532#M46441</guid>
      <dc:creator>chrisjackson</dc:creator>
      <dc:date>2016-11-17T19:50:41Z</dc:date>
    </item>
  </channel>
</rss>

