<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: PySpark Connection remote server in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/PySpark-Connection-remote-server/m-p/195222#M65613</link>
    <description>&lt;P&gt;That was a problem of version compatibility between spark in Ambari and my spark version imported with python.&lt;/P&gt;</description>
    <pubDate>Mon, 31 Jul 2017 14:50:35 GMT</pubDate>
    <dc:creator>yoann_boyere95</dc:creator>
    <dc:date>2017-07-31T14:50:35Z</dc:date>
    <item>
      <title>PySpark Connection remote server</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/PySpark-Connection-remote-server/m-p/195221#M65612</link>
      <description>&lt;P&gt;I've install a cluster with one node on a amazon machine thanks to ambari. I'm trying to use spark from an other machine thanks to pySpark.&lt;/P&gt;&lt;P&gt;This is my code : &lt;/P&gt;&lt;PRE&gt;from pyspark import SparkConf, SparkContext
conf = SparkConf().setAppName('hello').setMaster('spark://MYIP:7077')
sc = SparkContext(conf=conf)&lt;/PRE&gt;&lt;P&gt;The problem is that I have a connection refused when I run the program : &lt;/P&gt;&lt;P&gt;WARN StandaloneAppClient$ClientEndpoint: Failed to connect to master "MYIP"&lt;/P&gt;&lt;P&gt;So, I tried this command to start the master : ./sbin/start-master.sh&lt;/P&gt;&lt;P&gt;And now, I have this error :&lt;/P&gt;&lt;BLOCKQUOTE&gt;
&lt;P&gt;17/07/27 12:07:15 WARN StandaloneAppClient$ClientEndpoint: Failed to connect to master XX.XXX.XXX.XX:7077
org.apache.spark.SparkException: Exception thrown in awaitResult: 
   at org.apache.spark.util.ThreadUtils$.awaitResult(ThreadUtils.scala:205)
   at org.apache.spark.rpc.RpcTimeout.awaitResult(RpcTimeout.scala:75)
   at org.apache.spark.rpc.RpcEnv.setupEndpointRefByURI(RpcEnv.scala:100)
   at org.apache.spark.rpc.RpcEnv.setupEndpointRef(RpcEnv.scala:108)
   at org.apache.spark.deploy.client.StandaloneAppClient$ClientEndpoint$$anonfun$tryRegisterAllMasters$1$$anon$1.run(StandaloneAppClient.scala:106)
   at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
   at java.util.concurrent.FutureTask.run(FutureTask.java:266)
   at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
   at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
   at java.lang.Thread.run(Thread.java:748)
Caused by: java.lang.RuntimeException: java.io.StreamCorruptedException: invalid stream header: 01000C31&lt;/P&gt;&lt;/BLOCKQUOTE&gt;&lt;P&gt;This is not a problem of port because the port 7077 is open. &lt;/P&gt;&lt;P&gt;I don't find any answer for that problem on the forum, do you have any idea ?&lt;/P&gt;</description>
      <pubDate>Thu, 27 Jul 2017 17:13:51 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/PySpark-Connection-remote-server/m-p/195221#M65612</guid>
      <dc:creator>yoann_boyere95</dc:creator>
      <dc:date>2017-07-27T17:13:51Z</dc:date>
    </item>
    <item>
      <title>Re: PySpark Connection remote server</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/PySpark-Connection-remote-server/m-p/195222#M65613</link>
      <description>&lt;P&gt;That was a problem of version compatibility between spark in Ambari and my spark version imported with python.&lt;/P&gt;</description>
      <pubDate>Mon, 31 Jul 2017 14:50:35 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/PySpark-Connection-remote-server/m-p/195222#M65613</guid>
      <dc:creator>yoann_boyere95</dc:creator>
      <dc:date>2017-07-31T14:50:35Z</dc:date>
    </item>
  </channel>
</rss>

