<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Pig on Spark (How to use spark as an execution engine for pig scripts) in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Pig-on-Spark-How-to-use-spark-as-an-execution-engine-for-pig/m-p/193841#M155901</link>
    <description>&lt;P&gt;Pig on spark is a very new feature, and it still not part of an official apache release. It is likely to be take more time before it would be widely used in production and reccomended for production use.&lt;/P&gt;&lt;P&gt; Pig on tez has been around for sometime and has been used in production many large installations. I would reccomend using that over pig on spark.&lt;/P&gt;</description>
    <pubDate>Fri, 02 Jun 2017 01:37:48 GMT</pubDate>
    <dc:creator>thejas</dc:creator>
    <dc:date>2017-06-02T01:37:48Z</dc:date>
    <item>
      <title>Pig on Spark (How to use spark as an execution engine for pig scripts)</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Pig-on-Spark-How-to-use-spark-as-an-execution-engine-for-pig/m-p/193840#M155900</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;I would like to execute pig script using spark as execution engine. Is there any way to do so. Below is the Jira link for the same issue but couldn't make it happen. Any help regarding this would be appreciated. &lt;/P&gt;&lt;P&gt;&lt;A href="https://issues.apache.org/jira/browse/PIG-4059" target="_blank"&gt;https://issues.apache.org/jira/browse/PIG-4059&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Regards,&lt;/P&gt;&lt;P&gt;Mitesh&lt;/P&gt;</description>
      <pubDate>Thu, 01 Jun 2017 13:49:27 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Pig-on-Spark-How-to-use-spark-as-an-execution-engine-for-pig/m-p/193840#M155900</guid>
      <dc:creator>galamitesh1005</dc:creator>
      <dc:date>2017-06-01T13:49:27Z</dc:date>
    </item>
    <item>
      <title>Re: Pig on Spark (How to use spark as an execution engine for pig scripts)</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Pig-on-Spark-How-to-use-spark-as-an-execution-engine-for-pig/m-p/193841#M155901</link>
      <description>&lt;P&gt;Pig on spark is a very new feature, and it still not part of an official apache release. It is likely to be take more time before it would be widely used in production and reccomended for production use.&lt;/P&gt;&lt;P&gt; Pig on tez has been around for sometime and has been used in production many large installations. I would reccomend using that over pig on spark.&lt;/P&gt;</description>
      <pubDate>Fri, 02 Jun 2017 01:37:48 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Pig-on-Spark-How-to-use-spark-as-an-execution-engine-for-pig/m-p/193841#M155901</guid>
      <dc:creator>thejas</dc:creator>
      <dc:date>2017-06-02T01:37:48Z</dc:date>
    </item>
    <item>
      <title>Re: Pig on Spark (How to use spark as an execution engine for pig scripts)</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Pig-on-Spark-How-to-use-spark-as-an-execution-engine-for-pig/m-p/193842#M155902</link>
      <description>&lt;P&gt;Hi Tejas,&lt;/P&gt;&lt;P&gt; Thanks for your response. Yes, agreed that it will be available when they will release pig 0.17 version, but as of now for development environment they have created github repo for pig 0.17 which we can use and implement spark as execution engine. I have already implemented spark as local mode but facing some issue with yarn-client mode. &lt;/P&gt;&lt;P&gt;Error for yarn-client mode:&lt;/P&gt;&lt;P&gt;sshuser@hn0-dfspar:~/pig/bin$ ./pig -x spark
Using Spark Home:  /usr/hdp/current/spark-client
17/06/02 05:41:54 INFO pig.ExecTypeProvider: Trying ExecType : LOCAL
17/06/02 05:41:54 INFO pig.ExecTypeProvider: Trying ExecType : MAPREDUCE
17/06/02 05:41:54 INFO pig.ExecTypeProvider: Trying ExecType : TEZ_LOCAL
17/06/02 05:41:54 INFO pig.ExecTypeProvider: Trying ExecType : TEZ
17/06/02 05:41:54 INFO pig.ExecTypeProvider: Trying ExecType : SPARK
17/06/02 05:41:54 INFO pig.ExecTypeProvider: Picked SPARK as the ExecType
17/06/02 05:41:54 ERROR pig.Main: ERROR 2998: Unhandled internal error. org/apache/spark/scheduler/SparkListener
17/06/02 05:41:54 WARN pig.Main: There is no log file to write to.
17/06/02 05:41:54 ERROR pig.Main: java.lang.NoClassDefFoundError: org/apache/spark/scheduler/SparkListener
        at org.apache.pig.backend.hadoop.executionengine.spark.SparkExecutionEngine.&amp;lt;init&amp;gt;(SparkExecutionEngine.java:35)
        at org.apache.pig.backend.hadoop.executionengine.spark.SparkExecType.getExecutionEngine(SparkExecType.java:42)
        at org.apache.pig.impl.PigContext.&amp;lt;init&amp;gt;(PigContext.java:269)
        at org.apache.pig.impl.PigContext.&amp;lt;init&amp;gt;(PigContext.java:256)
        at org.apache.pig.Main.run(Main.java:389)
        at org.apache.pig.Main.main(Main.java:175)
        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
        at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
        at java.lang.reflect.Method.invoke(Method.java:498)
        at org.apache.hadoop.util.RunJar.run(RunJar.java:233)
        at org.apache.hadoop.util.RunJar.main(RunJar.java:148)
Caused by: java.lang.ClassNotFoundException: org.apache.spark.scheduler.SparkListener
        at java.net.URLClassLoader.findClass(URLClassLoader.java:381)
        at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
        at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:331)
        at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
        ... 12 more&lt;/P&gt;&lt;P&gt;Regards,&lt;/P&gt;&lt;P&gt;Mitesh&lt;/P&gt;</description>
      <pubDate>Sat, 03 Jun 2017 03:17:06 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Pig-on-Spark-How-to-use-spark-as-an-execution-engine-for-pig/m-p/193842#M155902</guid>
      <dc:creator>galamitesh1005</dc:creator>
      <dc:date>2017-06-03T03:17:06Z</dc:date>
    </item>
    <item>
      <title>Re: Pig on Spark (How to use spark as an execution engine for pig scripts)</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Pig-on-Spark-How-to-use-spark-as-an-execution-engine-for-pig/m-p/193843#M155903</link>
      <description>&lt;P&gt;As pig-on-spark is not currently supported (its not in immediate plans either) in HDP, you might want to raise this question in the apache pig mailing list itself, where some developers who are working on it might be able to respond.&lt;/P&gt;&lt;P&gt;As mentioned previously, pig-on-tez is lot more mature as it has been in production use for few years now. &lt;/P&gt;</description>
      <pubDate>Sat, 03 Jun 2017 03:37:33 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Pig-on-Spark-How-to-use-spark-as-an-execution-engine-for-pig/m-p/193843#M155903</guid>
      <dc:creator>thejas</dc:creator>
      <dc:date>2017-06-03T03:37:33Z</dc:date>
    </item>
  </channel>
</rss>

