<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Spark error: org.apache.spark.SparkException: Failed to get broadcast_2_piece0 of broadcast_2 in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Spark-error-org-apache-spark-SparkException-Failed-to-get/m-p/230881#M192728</link>
    <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/18482/hugofelix.html" nodeid="18482"&gt;@Hugo Felix&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/18482/hugofelix.html" nodeid="18482"&gt;&lt;/A&gt;It appears you are using Spark version lower than 2.1. &lt;/P&gt;&lt;P&gt;In your code you have the following line:&lt;/P&gt;&lt;PRE&gt;val sc =newSparkConf().setMaster("local[2]").setAppName("tweets").set("spark.cleaner.ttl","2000")&lt;/PRE&gt;&lt;P&gt;spark.cleaner.ttl basically triggers a cleanup after the time ( "2000") that you specify.&lt;/P&gt;&lt;P&gt;From the official &lt;A href="https://spark.apache.org/docs/1.6.1/configuration.html"&gt;Spark 1.6 documentation&lt;/A&gt; :&lt;/P&gt;&lt;P&gt;spark.cleaner.ttl - Duration (seconds) of how long Spark will remember any metadata (stages generated, tasks generated, etc.). Periodic cleanups will ensure that metadata older than this duration will be forgotten. This is useful for running Spark for many hours / days (for example, running 24/7 in case of Spark Streaming applications). Note that any RDD that persists in memory for more than this duration will be cleared as well.&lt;/P&gt;&lt;P&gt;Default is infinite.&lt;/P&gt;&lt;P&gt;In your case, it is quite possible that the cleanup is being triggered even before your job finishes.&lt;/P&gt;&lt;P&gt;Increase the value and try again.&lt;/P&gt;&lt;P&gt;Refer this &lt;A href="https://issues.apache.org/jira/browse/SPARK-5594"&gt;JIRA&lt;/A&gt; for an existing discussion to get more insight.&lt;/P&gt;</description>
    <pubDate>Wed, 05 Jul 2017 22:11:17 GMT</pubDate>
    <dc:creator>dineshc</dc:creator>
    <dc:date>2017-07-05T22:11:17Z</dc:date>
  </channel>
</rss>

