<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Spark and Kafka broker with SSL (or Kerberos) authentication [CDH5.7] in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Spark-and-Kafka-broker-with-SSL-or-Kerberos-authentication/m-p/41263#M29508</link>
    <description>Yes all that is correct regarding how the CDH build works vs upstream and Kafka version. Although I admit I have not tried it directly it is my understanding that all this is so you can use security with Kafka and Spark Streaming.</description>
    <pubDate>Tue, 24 May 2016 23:13:39 GMT</pubDate>
    <dc:creator>srowen</dc:creator>
    <dc:date>2016-05-24T23:13:39Z</dc:date>
    <item>
      <title>Spark and Kafka broker with SSL (or Kerberos) authentication [CDH5.7]</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Spark-and-Kafka-broker-with-SSL-or-Kerberos-authentication/m-p/41259#M29507</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Before I go to the effort of setting up a Cloudera 5.7 cluster with Kafka, Spark and Kerberos enabled to test it out, can anyone give me the answer to the following:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;- Does Cloudera's distribution of Spark 1.6.0 support SSL or Kerberos on a Kafka broker?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;It looks like vanilla&amp;nbsp;Spark 1.6.0 (and spark-streaming-kafka jar) builds against Kafka 0.8, while I assume CDH's Spark is built against 0.9 (as this is the version that ships with CDH Kafka 2.0.1). &amp;nbsp;It looks like vanilla Spark doesn't support SSL or Kerberos authentication to Kafka topics.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Many thanks&lt;/P&gt;</description>
      <pubDate>Fri, 16 Sep 2022 10:21:36 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Spark-and-Kafka-broker-with-SSL-or-Kerberos-authentication/m-p/41259#M29507</guid>
      <dc:creator>hadooptom</dc:creator>
      <dc:date>2022-09-16T10:21:36Z</dc:date>
    </item>
    <item>
      <title>Re: Spark and Kafka broker with SSL (or Kerberos) authentication [CDH5.7]</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Spark-and-Kafka-broker-with-SSL-or-Kerberos-authentication/m-p/41263#M29508</link>
      <description>Yes all that is correct regarding how the CDH build works vs upstream and Kafka version. Although I admit I have not tried it directly it is my understanding that all this is so you can use security with Kafka and Spark Streaming.</description>
      <pubDate>Tue, 24 May 2016 23:13:39 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Spark-and-Kafka-broker-with-SSL-or-Kerberos-authentication/m-p/41263#M29508</guid>
      <dc:creator>srowen</dc:creator>
      <dc:date>2016-05-24T23:13:39Z</dc:date>
    </item>
    <item>
      <title>Re: Spark and Kafka broker with SSL (or Kerberos) authentication [CDH5.7]</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Spark-and-Kafka-broker-with-SSL-or-Kerberos-authentication/m-p/41293#M29509</link>
      <description>&lt;P&gt;Thanks&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/6698"&gt;@srowen﻿&lt;/a&gt;.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Is the source-code to Cloudera's Spark distribution publically available somewhere, so I can take a look at how to configure it?&lt;/P&gt;</description>
      <pubDate>Wed, 25 May 2016 08:24:53 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Spark-and-Kafka-broker-with-SSL-or-Kerberos-authentication/m-p/41293#M29509</guid>
      <dc:creator>hadooptom</dc:creator>
      <dc:date>2016-05-25T08:24:53Z</dc:date>
    </item>
    <item>
      <title>Re: Spark and Kafka broker with SSL (or Kerberos) authentication [CDH5.7]</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Spark-and-Kafka-broker-with-SSL-or-Kerberos-authentication/m-p/41327#M29510</link>
      <description>&lt;P&gt;Apache spark source code has a mirror on github:&amp;nbsp;&lt;A href="https://github.com/apache/spark" target="_blank"&gt;https://github.com/apache/spark&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Cloudera also exposes source code for each component on github with a branch for the different cdh versions:&amp;nbsp;&lt;A href="https://github.com/cloudera" target="_blank"&gt;https://github.com/cloudera&lt;/A&gt;&lt;/P&gt;&lt;P&gt;CDH 5.7 Spark 1.6 source code:&amp;nbsp;&lt;A href="https://github.com/cloudera/spark/tree/cdh5-1.6.0_5.7.0" target="_blank"&gt;https://github.com/cloudera/spark/tree/cdh5-1.6.0_5.7.0&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;As for configurations, you can find the security documentation for kafka here:&amp;nbsp;&lt;A href="http://www.cloudera.com/documentation/kafka/latest/topics/kafka_security.html" target="_blank"&gt;http://www.cloudera.com/documentation/kafka/latest/topics/kafka_security.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;and spark here:&amp;nbsp;&lt;A href="http://www.cloudera.com/documentation/enterprise/latest/topics/sg_spark_auth.html" target="_blank"&gt;http://www.cloudera.com/documentation/enterprise/latest/topics/sg_spark_auth.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;and here:&amp;nbsp;&lt;A href="http://www.cloudera.com/documentation/enterprise/latest/topics/sg_spark_encryption.html" target="_blank"&gt;http://www.cloudera.com/documentation/enterprise/latest/topics/sg_spark_encryption.html&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 25 May 2016 18:22:41 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Spark-and-Kafka-broker-with-SSL-or-Kerberos-authentication/m-p/41327#M29510</guid>
      <dc:creator>hubbarja</dc:creator>
      <dc:date>2016-05-25T18:22:41Z</dc:date>
    </item>
    <item>
      <title>Re: Spark and Kafka broker with SSL (or Kerberos) authentication [CDH5.7]</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Spark-and-Kafka-broker-with-SSL-or-Kerberos-authentication/m-p/41384#M29511</link>
      <description>&lt;P&gt;Thanks&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/16433"&gt;@hubbarja﻿&lt;/a&gt;.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Spent the afternoon trying this out on the CDH 5.7.0 QuickStart VM, with a kerberos enabled cluster and Cloudera Kafka 2.0.0. &amp;nbsp;I think perhaps I didn't quite phrase my question clearly, but what I was trying to ask was whether the spark-streaming-kafka client would support consuming from a Kafka cluster that has client SSL authentication required enabled.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;For anyone else who tries this, the summary is it won't work due to upstream Spark issue &lt;A href="https://issues.apache.org/jira/browse/SPARK-12177" target="_blank"&gt;[SPARK-12177]&lt;/A&gt;, which deals with support for the new Kafka 0.9 consumer / producer API. &amp;nbsp;SSL, SASL_PLAINTEXT or SASL_SSL connections to Kafka all require use of the new API.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;In fact, this issue is referenced in the &lt;A href="http://www.cloudera.com/documentation/enterprise/release-notes/topics/cdh_rn_spark_ki.html#concept_zkq_4n5_gv" target="_self"&gt;known issues&lt;/A&gt; released with CDH 5.7.0, I just didn't spot it in time.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;There's a pull request which appears to support SSL (but no form of Kerberos client authentication) in Github &lt;A href="https://github.com/apache/spark/pull/11863" target="_self"&gt;here&lt;/A&gt;, if anyone feels brave.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Looking at the comments on the Spark ticket, it's going to be at least post Spark 2.0.0 release that this feature gets merged in, and probably not until 2.1.0.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Back to the drawing board for me!&lt;/P&gt;</description>
      <pubDate>Thu, 26 May 2016 19:58:55 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Spark-and-Kafka-broker-with-SSL-or-Kerberos-authentication/m-p/41384#M29511</guid>
      <dc:creator>hadooptom</dc:creator>
      <dc:date>2016-05-26T19:58:55Z</dc:date>
    </item>
  </channel>
</rss>

