<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Questions Around Spark Cache/spillage to the disk in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Questions-Around-Spark-Cache-spillage-to-the-disk/m-p/115717#M38596</link>
    <description>&lt;P&gt;1. This can be controlled through configuration, please see &lt;A href="http://spark.apache.org/docs/latest/configuration.html#memory-management" target="_blank"&gt;http://spark.apache.org/docs/latest/configuration.html#memory-management&lt;/A&gt;&lt;/P&gt;&lt;P&gt;2. No, you cannot disable non-memory caching, but you could choose only MEMORY related storage level to avoid spilling to disk when memory is full.&lt;/P&gt;&lt;P&gt;3. No, the data is not encrypted, and there's no way to encrypt spilled data currently.&lt;/P&gt;&lt;P&gt;4. It depends on different streaming sources you choose. For Kafka it supports ssl or sasl encryption.&lt;/P&gt;&lt;P&gt;5. same as #2.&lt;/P&gt;</description>
    <pubDate>Tue, 23 Aug 2016 20:40:38 GMT</pubDate>
    <dc:creator>sshao</dc:creator>
    <dc:date>2016-08-23T20:40:38Z</dc:date>
    <item>
      <title>Questions Around Spark Cache/spillage to the disk</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Questions-Around-Spark-Cache-spillage-to-the-disk/m-p/115716#M38595</link>
      <description>&lt;P&gt;Guys,&lt;/P&gt;&lt;P&gt;I have a few questions related to Spark cache and would like to know your inputs on the same.&lt;/P&gt;&lt;P&gt;1) How much cache memory can available to each of the executor nodes? Is there a way to control it? &lt;/P&gt;&lt;P&gt;2) We want to restrict the developers from persisting any data to the disk. Is there any configuration can we change to disable non -memory caching? This is to make sure by mistake, any secure data is not spilled to the disk. &lt;/P&gt;&lt;P&gt;3) If point#2 cannot be achieved, is there a way to make sure that spillage (In case developers use Memory_And_Disk option) happens only to a secure directory and data is encrypted?&lt;/P&gt;&lt;P&gt;4) For streaming data, processing with Spark how secure is it, can encryption be applied to data in flight? &lt;/P&gt;&lt;P&gt;5) If the developers decide to cache steaming RDDs, how secure is it? And same case point#2 above. &lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;SS&lt;/P&gt;</description>
      <pubDate>Tue, 23 Aug 2016 20:31:43 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Questions-Around-Spark-Cache-spillage-to-the-disk/m-p/115716#M38595</guid>
      <dc:creator>smartninja723</dc:creator>
      <dc:date>2016-08-23T20:31:43Z</dc:date>
    </item>
    <item>
      <title>Re: Questions Around Spark Cache/spillage to the disk</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Questions-Around-Spark-Cache-spillage-to-the-disk/m-p/115717#M38596</link>
      <description>&lt;P&gt;1. This can be controlled through configuration, please see &lt;A href="http://spark.apache.org/docs/latest/configuration.html#memory-management" target="_blank"&gt;http://spark.apache.org/docs/latest/configuration.html#memory-management&lt;/A&gt;&lt;/P&gt;&lt;P&gt;2. No, you cannot disable non-memory caching, but you could choose only MEMORY related storage level to avoid spilling to disk when memory is full.&lt;/P&gt;&lt;P&gt;3. No, the data is not encrypted, and there's no way to encrypt spilled data currently.&lt;/P&gt;&lt;P&gt;4. It depends on different streaming sources you choose. For Kafka it supports ssl or sasl encryption.&lt;/P&gt;&lt;P&gt;5. same as #2.&lt;/P&gt;</description>
      <pubDate>Tue, 23 Aug 2016 20:40:38 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Questions-Around-Spark-Cache-spillage-to-the-disk/m-p/115717#M38596</guid>
      <dc:creator>sshao</dc:creator>
      <dc:date>2016-08-23T20:40:38Z</dc:date>
    </item>
  </channel>
</rss>

