<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Spark shows all jobs completed. iPython still wait in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Spark-shows-all-jobs-completed-iPython-still-wait/m-p/51935#M23515</link>
    <description>&lt;P&gt;A bit more info.... (and this is cross-posted in project jupyter list)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;DIV&gt;I think that messaging is getting screwed up between Pyspark and Livy. When the last cell is executed, I will see this on the client side.&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;2017-03-08 22:24:48,505 INFO &amp;nbsp; &amp;nbsp;EventsHandler &amp;nbsp; InstanceId: 0e1c8fd2-047e-4337-b264-5b64ba74de5a,EventName: notebookStatementExecutionStart,Timestamp: 2017-03-08 22:24:48.504920,SessionGuid: 03d14478-6adc-4b&lt;/DIV&gt;&lt;DIV&gt;34-abef-b9b6fd400543,LivyKind: pyspark,SessionId: 8,StatementGuid: f1933b11-b767-4a18-b311-c48901ad8369&lt;/DIV&gt;&lt;DIV&gt;2017-03-08 22:24:48,788 DEBUG &amp;nbsp; Command Status of statement 8 is running.&lt;/DIV&gt;&lt;DIV&gt;2017-03-08 22:24:50,920 DEBUG &amp;nbsp; Command Status of statement 8 is running.&lt;/DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;...and it never comes back.&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;On the livy end, I see&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;&lt;DIV&gt;17/03/08 17:26:26 INFO ContextLauncher: 17/03/08 17:26:26 INFO scheduler.DAGScheduler: ResultStage 17 (collect at &amp;lt;stdin&amp;gt;:5) finished in 1.521 s&lt;/DIV&gt;&lt;DIV&gt;17/03/08 17:26:26 INFO ContextLauncher: 17/03/08 17:26:26 INFO scheduler.DAGScheduler: Job 8 finished: collect at &amp;lt;stdin&amp;gt;:5, took 3.729078 s&lt;/DIV&gt;&lt;DIV&gt;17/03/08 17:26:27 DEBUG RpcDispatcher: [ClientProtocol] Registered outstanding rpc 230 (com.cloudera.livy.rsc.BaseProtocol$GetReplJobResult).&lt;/DIV&gt;&lt;DIV&gt;17/03/08 17:26:27 DEBUG KryoMessageCodec: Encoded message of type com.cloudera.livy.rsc.rpc.Rpc$MessageHeader (6 bytes)&lt;/DIV&gt;&lt;DIV&gt;17/03/08 17:26:27 DEBUG KryoMessageCodec: Encoded message of type com.cloudera.livy.rsc.BaseProtocol$GetReplJobResult (91 bytes)&lt;/DIV&gt;&lt;DIV&gt;17/03/08 17:26:27 DEBUG KryoMessageCodec: Decoded message of type com.cloudera.livy.rsc.rpc.Rpc$MessageHeader (6 bytes)&lt;/DIV&gt;&lt;DIV&gt;17/03/08 17:26:27 DEBUG KryoMessageCodec: Decoded message of type com.cloudera.livy.rsc.rpc.Rpc$NullMessage (2 bytes)&lt;/DIV&gt;&lt;DIV&gt;17/03/08 17:26:27 DEBUG RpcDispatcher: [ClientProtocol] Received RPC message: type=REPLY id=230 payload=com.cloudera.livy.rsc.rpc.Rpc$NullMessage&lt;/DIV&gt;&lt;DIV&gt;17/03/08 17:26:28 DEBUG RpcDispatcher: [ClientProtocol] Registered outstanding rpc 231 (com.cloudera.livy.rsc.BaseProtocol$GetReplJobResult).&lt;/DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;ad infinitum&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;So, with my limited knowledge, it looks to me that Livy thinks it has sent a result to a finished job, but pyspark hasn't received it.&lt;/DIV&gt;&lt;DIV&gt;Anyone seen this before? Any thoughts?&lt;/DIV&gt;</description>
    <pubDate>Wed, 08 Mar 2017 22:55:18 GMT</pubDate>
    <dc:creator>wkupersa</dc:creator>
    <dc:date>2017-03-08T22:55:18Z</dc:date>
  </channel>
</rss>

