<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Issue with Kafka after Kerberos was enabled: does the data go to trash because the node is unavailable? in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Issue-with-Kafka-after-Kerberos-was-enabled-does-the-data-go/m-p/129653#M39330</link>
    <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/505/dsharma.html" nodeid="505"&gt;@deepak sharma&lt;/A&gt; &lt;/P&gt;&lt;P&gt;Crazy enough: I just reached out to this customer, and a simple restart of the Kafka service addressed the issue. Kerberos was enabled recently, and this service was probably not restarted afterwards. Not much to learn.&lt;/P&gt;&lt;P&gt;Your symlink suggestion is an interesting approach that, while not applicable here, is worth remembering for other situations. Thank you for the suggestion.&lt;/P&gt;</description>
    <pubDate>Fri, 02 Sep 2016 01:32:13 GMT</pubDate>
    <dc:creator>cstanca</dc:creator>
    <dc:date>2016-09-02T01:32:13Z</dc:date>
    <item>
      <title>Issue with Kafka after Kerberos was enabled: does the data go to trash because the node is unavailable?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Issue-with-Kafka-after-Kerberos-was-enabled-does-the-data-go/m-p/129648#M39325</link>
      <description>&lt;H1&gt;Question:&lt;/H1&gt;&lt;P&gt;For the issue described below: does the data go to trash because the node is unavailable? What could cause this exception, given that Kerberos was recently enabled on the cluster?&lt;/P&gt;&lt;H1&gt;Issue Description&lt;/H1&gt;&lt;P&gt;Here is the issue an organization is facing with Kafka after recently enabling Kerberos in an HDP 2.4.2 cluster.&lt;/P&gt;&lt;P&gt;They are trying to build a pipeline from their data center to HDFS. The data is first mirrored to the cluster using MirrorMaker 0.8, as the data center uses Kafka 0.8. The data is then Avro-serialized using a Flume agent and written to HDFS through the Confluent HDFS connector.&lt;/P&gt;&lt;P&gt;However, in MirrorMaker they notice that only about half of the data is mirrored. Since Kerberos was enabled in their cluster, they have been seeing the following error in the Kafka logs:&lt;/P&gt;&lt;P&gt;[2016-08-29 16:51:28,479] INFO Returning HDFS Filesystem Config: Configuration: core-default.xml, core-site.xml, hdfs-default.xml, hdfs-site.xml (org.apache.ranger.audit.destination.HDFSAuditDestination)&lt;/P&gt;&lt;P&gt;[2016-08-29 16:51:28,496] ERROR Error writing to log file. 
(org.apache.ranger.audit.provider.BaseAuditHandler)&lt;/P&gt;&lt;P&gt;java.lang.IllegalArgumentException: java.net.UnknownHostException: xyzlphdpd1&lt;/P&gt;&lt;P&gt;        at org.apache.hadoop.security.SecurityUtil.buildTokenService(SecurityUtil.java:406)&lt;/P&gt;&lt;P&gt;        at org.apache.hadoop.hdfs.NameNodeProxies.createNonHAProxy(NameNodeProxies.java:311)&lt;/P&gt;&lt;P&gt;        at org.apache.hadoop.hdfs.NameNodeProxies.createProxy(NameNodeProxies.java:176)&lt;/P&gt;&lt;P&gt;        at org.apache.hadoop.hdfs.DFSClient.&amp;lt;init&amp;gt;(DFSClient.java:678)&lt;/P&gt;&lt;P&gt;        at org.apache.hadoop.hdfs.DFSClient.&amp;lt;init&amp;gt;(DFSClient.java:619)&lt;/P&gt;&lt;P&gt;        at org.apache.hadoop.hdfs.DistributedFileSystem.initialize(DistributedFileSystem.java:149)&lt;/P&gt;&lt;P&gt;        at org.apache.hadoop.fs.FileSystem.createFileSystem(FileSystem.java:2653)&lt;/P&gt;&lt;P&gt;        at org.apache.hadoop.fs.FileSystem.access$200(FileSystem.java:92)&lt;/P&gt;&lt;P&gt;        at org.apache.hadoop.fs.FileSystem$Cache.getInternal(FileSystem.java:2687)&lt;/P&gt;&lt;P&gt;        at org.apache.hadoop.fs.FileSystem$Cache.get(FileSystem.java:2669)&lt;/P&gt;&lt;P&gt;        at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:371)&lt;/P&gt;&lt;P&gt;        at org.apache.ranger.audit.destination.HDFSAuditDestination.getLogFileStream(HDFSAuditDestination.java:221)&lt;/P&gt;&lt;P&gt;        at org.apache.ranger.audit.destination.HDFSAuditDestination.logJSON(HDFSAuditDestination.java:123)&lt;/P&gt;&lt;P&gt;        at org.apache.ranger.audit.queue.AuditFileSpool.sendEvent(AuditFileSpool.java:890)&lt;/P&gt;&lt;P&gt;        at org.apache.ranger.audit.queue.AuditFileSpool.runDoAs(AuditFileSpool.java:838)&lt;/P&gt;&lt;P&gt;        at org.apache.ranger.audit.queue.AuditFileSpool$2.run(AuditFileSpool.java:759)&lt;/P&gt;&lt;P&gt;        at org.apache.ranger.audit.queue.AuditFileSpool$2.run(AuditFileSpool.java:757)&lt;/P&gt;&lt;P&gt;        at 
java.security.AccessController.doPrivileged(Native Method)&lt;/P&gt;&lt;P&gt;        at javax.security.auth.Subject.doAs(Subject.java:356)&lt;/P&gt;&lt;P&gt;        at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1637)&lt;/P&gt;&lt;P&gt;        at org.apache.ranger.audit.queue.AuditFileSpool.run(AuditFileSpool.java:765)&lt;/P&gt;&lt;P&gt;        at java.lang.Thread.run(Thread.java:745)&lt;/P&gt;&lt;P&gt;Caused by: java.net.UnknownHostException: xyzlphdpd1&lt;/P&gt;&lt;P&gt;        ... 22 more&lt;/P&gt;&lt;P&gt;[2016-08-29 16:51:28,496] ERROR Error sending logs to consumer. provider=kafka.async.summary.multi_dest.batch, consumer=kafka.async.summary.multi_dest.batch.hdfs (org.apache.ranger.audit.queue.AuditFileSpool)&lt;/P&gt;</description>
      <pubDate>Thu, 01 Sep 2016 04:03:43 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Issue-with-Kafka-after-Kerberos-was-enabled-does-the-data-go/m-p/129648#M39325</guid>
      <dc:creator>cstanca</dc:creator>
      <dc:date>2016-09-01T04:03:43Z</dc:date>
    </item>
    <item>
      <title>Re: Issue with Kafka after Kerberos was enabled: does the data go to trash because the node is unavailable?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Issue-with-Kafka-after-Kerberos-was-enabled-does-the-data-go/m-p/129649#M39326</link>
      <description>&lt;P&gt;It looks like audit to HDFS is enabled for the Ranger Kafka plugin, and the audit to HDFS is failing for Kafka.&lt;/P&gt;&lt;P&gt;If you look at the bottom of the trace, there is an error:&lt;/P&gt;&lt;PRE&gt;Caused by: java.net.UnknownHostException: xyzlphdpd1... 22 more&lt;/PRE&gt;&lt;P&gt;As it is an HA cluster, I think there is an issue with the configuration of the Ranger HDFS audit in Kafka.&lt;/P&gt;&lt;P&gt;I remember seeing such an issue with the test connection on the Ranger side, and symlinking hdfs-site.xml into /etc/ranger/admin/conf solved it. Can you please try the same with the Kafka conf? I mean, symlink hdfs-site.xml into the Kafka conf directory.&lt;/P&gt;</description>
      <pubDate>Fri, 02 Sep 2016 00:07:22 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Issue-with-Kafka-after-Kerberos-was-enabled-does-the-data-go/m-p/129649#M39326</guid>
      <dc:creator>dsharma</dc:creator>
      <dc:date>2016-09-02T00:07:22Z</dc:date>
    </item>
    <item>
      <title>Re: Issue with Kafka after Kerberos was enabled: does the data go to trash because the node is unavailable?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Issue-with-Kafka-after-Kerberos-was-enabled-does-the-data-go/m-p/129650#M39327</link>
      <description>&lt;P&gt;&lt;A href="https://community.hortonworks.com/users/505/dsharma.html"&gt;@deepak sharma&lt;/A&gt; &lt;/P&gt;&lt;P&gt;Thanks. I am aware that audit to HDFS is enabled for the Ranger Kafka plugin and that the audit to HDFS is failing for Kafka; that is how we extracted the exception from the Kafka logs. Let me check whether symlinking hdfs-site.xml into the Kafka conf fixes it. Stay tuned.&lt;/P&gt;</description>
      <pubDate>Fri, 02 Sep 2016 01:18:28 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Issue-with-Kafka-after-Kerberos-was-enabled-does-the-data-go/m-p/129650#M39327</guid>
      <dc:creator>cstanca</dc:creator>
      <dc:date>2016-09-02T01:18:28Z</dc:date>
    </item>
    <item>
      <title>Re: Issue with Kafka after Kerberos was enabled: does the data go to trash because the node is unavailable?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Issue-with-Kafka-after-Kerberos-was-enabled-does-the-data-go/m-p/129651#M39328</link>
      <description>&lt;P&gt;Sure, let me know if it works.&lt;/P&gt;</description>
      <pubDate>Fri, 02 Sep 2016 01:28:54 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Issue-with-Kafka-after-Kerberos-was-enabled-does-the-data-go/m-p/129651#M39328</guid>
      <dc:creator>dsharma</dc:creator>
      <dc:date>2016-09-02T01:28:54Z</dc:date>
    </item>
    <item>
      <title>Re: Issue with Kafka after Kerberos was enabled: does the data go to trash because the node is unavailable?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Issue-with-Kafka-after-Kerberos-was-enabled-does-the-data-go/m-p/129652#M39329</link>
      <description>&lt;P&gt;It seems that the data does not go to trash. A simple restart of the Kafka service addressed the issue. Kerberos was enabled recently, and this service was probably not restarted afterwards.&lt;/P&gt;&lt;P&gt;The symlink suggestion from deepak is an interesting approach that, while not applicable here, is worth remembering for other situations.&lt;/P&gt;</description>
      <pubDate>Fri, 02 Sep 2016 01:29:39 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Issue-with-Kafka-after-Kerberos-was-enabled-does-the-data-go/m-p/129652#M39329</guid>
      <dc:creator>cstanca</dc:creator>
      <dc:date>2016-09-02T01:29:39Z</dc:date>
    </item>
    <item>
      <title>Re: Issue with Kafka after Kerberos was enabled: does the data go to trash because the node is unavailable?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Issue-with-Kafka-after-Kerberos-was-enabled-does-the-data-go/m-p/129653#M39330</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/505/dsharma.html" nodeid="505"&gt;@deepak sharma&lt;/A&gt; &lt;/P&gt;&lt;P&gt;Crazy enough: I just reached out to this customer, and a simple restart of the Kafka service addressed the issue. Kerberos was enabled recently, and this service was probably not restarted afterwards. Not much to learn.&lt;/P&gt;&lt;P&gt;Your symlink suggestion is an interesting approach that, while not applicable here, is worth remembering for other situations. Thank you for the suggestion.&lt;/P&gt;</description>
      <pubDate>Fri, 02 Sep 2016 01:32:13 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Issue-with-Kafka-after-Kerberos-was-enabled-does-the-data-go/m-p/129653#M39330</guid>
      <dc:creator>cstanca</dc:creator>
      <dc:date>2016-09-02T01:32:13Z</dc:date>
    </item>
  </channel>
</rss>

