<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: cannot run pyspark (not using interactive shell) on cloudera vm in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/cannot-run-pyspark-not-using-interactive-shell-on-cloudera/m-p/43328#M36145</link>
    <description>Found the solution at:&lt;BR /&gt;&lt;BR /&gt;&lt;A href="https://community.cloudera.com/t5/Hadoop-101-Training-Quickstart/CDH-5-5-VirtualBox-unable-to-connect-to-Spark-Master-Worker/td-p/34491" target="_blank"&gt;https://community.cloudera.com/t5/Hadoop-101-Training-Quickstart/CDH-5-5-VirtualBox-unable-to-connect-to-Spark-Master-Worker/td-p/34491&lt;/A&gt;</description>
    <pubDate>Thu, 28 Jul 2016 18:19:48 GMT</pubDate>
    <dc:creator>wqp89324</dc:creator>
    <dc:date>2016-07-28T18:19:48Z</dc:date>
    <item>
      <title>cannot run pyspark (not using interactive shell) on cloudera vm</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/cannot-run-pyspark-not-using-interactive-shell-on-cloudera/m-p/43295#M36144</link>
      <description>&lt;P&gt;Dear Cloudera community,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;When I follow this example:&amp;nbsp;&lt;A href="http://www.cloudera.com/documentation/enterprise/5-5-x/topics/spark_develop_run.html" target="_blank"&gt;http://www.cloudera.com/documentation/enterprise/5-5-x/topics/spark_develop_run.html&lt;/A&gt; and run spark-submit inside the Cloudera VM environment, I consistently get the following error:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;ERROR spark.SparkContext: Error initializing SparkContext.&lt;BR /&gt;org.apache.hadoop.security.AccessControlException: Permission denied: user=cloudera, access=WRITE, inode="/user/spark/applicationHistory":spark:supergroup:drwxr-xr-x&lt;/P&gt;&lt;P&gt;....&lt;/P&gt;&lt;P&gt;Traceback (most recent call last):&lt;BR /&gt;File "/home/cloudera/wordcount.py", line 9, in &amp;lt;module&amp;gt;&lt;BR /&gt;sc = SparkContext(conf=conf)&lt;BR /&gt;File "/usr/lib/spark/python/lib/pyspark.zip/pyspark/context.py", line 115, in __init__&lt;BR /&gt;File "/usr/lib/spark/python/lib/pyspark.zip/pyspark/context.py", line 172, in _do_init&lt;BR /&gt;File "/usr/lib/spark/python/lib/pyspark.zip/pyspark/context.py", line 235, in _initialize_context&lt;BR /&gt;File "/usr/lib/spark/python/lib/py4j-0.9-src.zip/py4j/java_gateway.py", line 1064, in __call__&lt;BR /&gt;File "/usr/lib/spark/python/lib/py4j-0.9-src.zip/py4j/protocol.py", line 308, in get_return_value&lt;BR /&gt;py4j.protocol.Py4JJavaError: An error occurred while calling None.org.apache.spark.api.java.JavaSparkContext.&lt;BR /&gt;: org.apache.hadoop.security.AccessControlException: Permission denied: user=cloudera, access=WRITE, inode="/user/spark/applicationHistory":spark:supergroup:drwxr-xr-x&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I have tried these two commands:&lt;/P&gt;&lt;P&gt;1.&amp;nbsp;$ spark-submit --master yarn --deploy-mode client --executor-memory 1g --name wordcount --conf "spark.app.id=wordcount" wordcount.py hdfs://namenode_host:8020/path/to/inputfile.txt 2&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;2.&amp;nbsp;$ spark-submit --master yarn --deploy-mode client --executor-memory 1g --name wordcount --conf "spark.app.id=wordcount" wordcount.py inputfile.txt 2&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Can somebody help?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks!&lt;/P&gt;</description>
      <pubDate>Fri, 16 Sep 2022 10:31:56 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/cannot-run-pyspark-not-using-interactive-shell-on-cloudera/m-p/43295#M36144</guid>
      <dc:creator>wqp89324</dc:creator>
      <dc:date>2022-09-16T10:31:56Z</dc:date>
    </item>
    <item>
      <title>Re: cannot run pyspark (not using interactive shell) on cloudera vm</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/cannot-run-pyspark-not-using-interactive-shell-on-cloudera/m-p/43328#M36145</link>
      <description>Found the solution at:&lt;BR /&gt;&lt;BR /&gt;&lt;A href="https://community.cloudera.com/t5/Hadoop-101-Training-Quickstart/CDH-5-5-VirtualBox-unable-to-connect-to-Spark-Master-Worker/td-p/34491" target="_blank"&gt;https://community.cloudera.com/t5/Hadoop-101-Training-Quickstart/CDH-5-5-VirtualBox-unable-to-connect-to-Spark-Master-Worker/td-p/34491&lt;/A&gt;</description>
      <pubDate>Thu, 28 Jul 2016 18:19:48 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/cannot-run-pyspark-not-using-interactive-shell-on-cloudera/m-p/43328#M36145</guid>
      <dc:creator>wqp89324</dc:creator>
      <dc:date>2016-07-28T18:19:48Z</dc:date>
    </item>
  </channel>
</rss>

