<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: GSS Initiate failed even with a valid kerberos service ticket. in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/GSS-Initiate-failed-even-with-a-valid-kerberos-service/m-p/140801#M103408</link>
    <description>&lt;P&gt;"GSS initiate failed" without any detailed error message is like a catch-all in the class com.sun.security.sasl.gsskerb.GssKrb5Server in the method evaluateResponse(byte[] responseData). There are many lines in that big try-catch that can throw this exception. &lt;/P&gt;&lt;P&gt;So, after I debugged the running hbase master process through a debugger I am able to clearly see that the stack trace leading to this GSSException is actually coming from method acceptSecContext(InputStream is, int mechTokenSize) in the class Krb5Context. The actual exception is KrbException being thrown as GSSException. This points to issues in keytab in master node vs details in service ticket that region server is presenting, assuming JCE and encryption types on both nodes are appropriate.&lt;/P&gt;</description>
    <pubDate>Thu, 02 Feb 2017 19:03:25 GMT</pubDate>
    <dc:creator>sumit_nigam</dc:creator>
    <dc:date>2017-02-02T19:03:25Z</dc:date>
    <item>
      <title>GSS Initiate failed even with a valid kerberos service ticket.</title>
      <link>https://community.cloudera.com/t5/Support-Questions/GSS-Initiate-failed-even-with-a-valid-kerberos-service/m-p/140793#M103400</link>
      <description>&lt;P&gt;I am launching hbase (1.1.2) on a kerberized cluster (AD). Hbase region server fails to connect to master with following error:&lt;/P&gt;&lt;PRE&gt;2017-01-20 18:17:23,944 WARN  [regionserver/a1.example.com/xxxxx] regionserver.HRegionServer: error telling master we are up
com.google.protobuf.ServiceException: java.io.IOException: Couldn't setup connection for srvuser/a1.example.com@ADC.EXAMPLE.COM to srvuser/a2.example.com@ADC.EXAMPLE.COM
at org.apache.hadoop.hbase.ipc.AbstractRpcClient.callBlockingMethod(AbstractRpcClient.java:223)
at org.apache.hadoop.hbase.ipc.AbstractRpcClient$BlockingRpcChannelImplementation.callBlockingMethod(AbstractRpcClient.java:287)
at org.apache.hadoop.hbase.protobuf.generated.RegionServerStatusProtos$RegionServerStatusService$BlockingStub.regionServerStartup(RegionServerStatusProtos.java:8982)
at org.apache.hadoop.hbase.regionserver.HRegionServer.reportForDuty(HRegionServer.java:2270)&lt;/PRE&gt;&lt;P&gt;...&lt;/P&gt;&lt;PRE&gt;Caused by: org.apache.hadoop.ipc.RemoteException(javax.security.sasl.SaslException): GSS initiate failed
at org.apache.hadoop.hbase.security.HBaseSaslRpcClient.readStatus(HBaseSaslRpcClient.java:153)
at org.apache.hadoop.hbase.security.HBaseSaslRpcClient.saslConnect(HBaseSaslRpcClient.java:189)&lt;/PRE&gt;&lt;P&gt;I turned ON detailed debug logs for kerberos as well as Hbase. I can see that the service ticket is successfully obtained by host a1 for a2:&lt;/P&gt;&lt;P&gt;&lt;EM&gt;Found ticket for srvuser/a1.example.com@ADC.EXAMPLE.COM to go to krbtgt/ADC.EXAMPLE.COM@ADC.EXAMPLE.COM expiring on Sat Jan 21 04:17:10 PST 2017 &lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;Found ticket for srvuser/a1.example.com@ADC.EXAMPLE.COM to go to srvuser/a2.example.com@ADC.EXAMPLE.COM expiring on Sat Jan 21 04:17:10 PST 2017&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;Client Principal = srvuser/a1.example.com@ADC.EXAMPLE.COM &lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;Server Principal = srvuser/a2.example.com@ADC.EXAMPLE.COM Session Key = EncryptionKey: keyType=23 keyBytes &lt;/EM&gt;&lt;/P&gt;&lt;P&gt;I do not see any errors post the above lines in detailed kerberos level logs so I assume that the problem of GSS Initiate failed has not anything to do with kerberos now else I would have seen some error reported (such as say ticket being corrupted?)&lt;/P&gt;&lt;P&gt;I notice that GSS Initiate failed message without any details reported is specified by experts as one of the most useless messages - &lt;A target="_blank" href="https://github.com/steveloughran/kerberos_and_hadoop/blob/master/sections/errors.md#gss-initiate-failed-no-further-details-provided "&gt;Steve's error messages to fear&lt;/A&gt;. &lt;/P&gt;&lt;P&gt;Already verified unlimited JCE policy files are present, and that both hosts are using the same encryption algorithm. Can anyone help here? Even if it is about what next steps I can take to debug this? &lt;/P&gt;&lt;P&gt;Thank you!&lt;/P&gt;</description>
      <pubDate>Sun, 22 Jan 2017 22:36:32 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/GSS-Initiate-failed-even-with-a-valid-kerberos-service/m-p/140793#M103400</guid>
      <dc:creator>sumit_nigam</dc:creator>
      <dc:date>2017-01-22T22:36:32Z</dc:date>
    </item>
    <item>
      <title>Re: GSS Initiate failed even with a valid kerberos service ticket.</title>
      <link>https://community.cloudera.com/t5/Support-Questions/GSS-Initiate-failed-even-with-a-valid-kerberos-service/m-p/140794#M103401</link>
      <description>&lt;P&gt;try to turn on the java kerberos logging by adding -Dsun.security.krb5.debug=true to the HADOOP_OPTS. Usually it helps to understand what exactly fails during GSS initialization. &lt;/P&gt;</description>
      <pubDate>Mon, 23 Jan 2017 03:24:46 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/GSS-Initiate-failed-even-with-a-valid-kerberos-service/m-p/140794#M103401</guid>
      <dc:creator>ssoldatov</dc:creator>
      <dc:date>2017-01-23T03:24:46Z</dc:date>
    </item>
    <item>
      <title>Re: GSS Initiate failed even with a valid kerberos service ticket.</title>
      <link>https://community.cloudera.com/t5/Support-Questions/GSS-Initiate-failed-even-with-a-valid-kerberos-service/m-p/140795#M103402</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/2924/ssoldatov.html" nodeid="2924"&gt;@Sergey Soldatov&lt;/A&gt; - Thanks for the suggestion. I had added that flag already and that only showed me that I have a valid service ticket (as mentioned above). The SecurityAuth.audit log of hbase master shows the following error:&lt;/P&gt;&lt;P&gt;2017-01-20 18:17:08,221 WARN SecurityLogger.org.apache.hadoop.hbase.Server: Auth failed for x.y.z.q:55872:null &lt;/P&gt;&lt;P&gt;2017-01-20 18:17:11,857 WARN SecurityLogger.org.apache.hadoop.hbase.Server: Auth failed for x.y.z.q:46270:null &lt;/P&gt;&lt;P&gt;Not sure if this can point to any problems. The IP for which auth shows as failed above is where HRegion server is running.&lt;/P&gt;</description>
      <pubDate>Mon, 23 Jan 2017 11:19:41 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/GSS-Initiate-failed-even-with-a-valid-kerberos-service/m-p/140795#M103402</guid>
      <dc:creator>sumit_nigam</dc:creator>
      <dc:date>2017-01-23T11:19:41Z</dc:date>
    </item>
    <item>
      <title>Re: GSS Initiate failed even with a valid kerberos service ticket.</title>
      <link>https://community.cloudera.com/t5/Support-Questions/GSS-Initiate-failed-even-with-a-valid-kerberos-service/m-p/140796#M103403</link>
      <description>&lt;P&gt;"I had added that flag already and that only showed me that I have a valid service ticket (as mentioned above)."&lt;/P&gt;&lt;P&gt;Would suggest sharing that information anyways. Sometimes the details printed by that option are very subtle yet telling.&lt;/P&gt;&lt;P&gt;You can also try setting the log4j level to DEBUG or TRACE for org.apache.hadoop.hbase.ipc to see if there is more context there.&lt;/P&gt;</description>
      <pubDate>Tue, 24 Jan 2017 00:32:25 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/GSS-Initiate-failed-even-with-a-valid-kerberos-service/m-p/140796#M103403</guid>
      <dc:creator>elserj</dc:creator>
      <dc:date>2017-01-24T00:32:25Z</dc:date>
    </item>
    <item>
      <title>Re: GSS Initiate failed even with a valid kerberos service ticket.</title>
      <link>https://community.cloudera.com/t5/Support-Questions/GSS-Initiate-failed-even-with-a-valid-kerberos-service/m-p/140797#M103404</link>
      <description>&lt;P&gt;Hi &lt;A href="https://community.hortonworks.com/questions/79084/gss-initiate-failed-even-with-a-valid-kerberos-ser.html#"&gt;@Josh Elser&lt;/A&gt; - I have attached 2 logs (after kerberos debug flag) for the HMaster and HRegion server. Overall logs are quite huge so I have tried to remove some portions. Like I removed the hex dump of tickets. However, do let me know if I need to attach full logs.&lt;/P&gt;&lt;P&gt;Some comments in general:
The error "GSS initiate failed" shows up even when master and region server come up on same host.&lt;/P&gt;&lt;P&gt;After enabling debug logs for HBase, HMaster shows:
2017-01-20 18:17:11,699 DEBUG [main-EventThread] zookeeper.RegionServerTracker: Added tracking of RS /srvuser/hbase/rs/a1.example.com,52412,1484889430172 &lt;/P&gt;&lt;P&gt;2017-01-20 18:17:11,823 DEBUG [RpcServer.listener,port=42263] ipc.RpcServer: RpcServer.listener,port=42263: connection from 10.64.130.53:46270; # active connections: 1 &lt;/P&gt;&lt;P&gt;2017-01-20 18:17:11,856 DEBUG [RpcServer.reader=2,bindAddress=a1.example.com,port=42263] ipc.RpcServer: Kerberos principal name is srvuser/a1.example.com@ADC.EXAMPLE.COM &lt;/P&gt;&lt;P&gt;2017-01-20 18:17:11,857 DEBUG [RpcServer.reader=2,bindAddress=a1.example.com,port=42263] ipc.RpcServer: Created SASL server with mechanism = GSSAPI &lt;/P&gt;&lt;P&gt;2017-01-20 18:17:11,857 DEBUG [RpcServer.reader=2,bindAddress=a1.example.com,port=42263] ipc.RpcServer: Have read input token of size 1824 for processing by saslServer.evaluateResponse() &lt;/P&gt;&lt;P&gt;2017-01-20 18:17:11,857 DEBUG [RpcServer.reader=2,bindAddress=a1.example.com,port=42263] ipc.RpcServer: RpcServer.listener,port=42263: &lt;STRONG&gt;Caught exception while reading:GSS initiate failed &lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;2017-01-20 18:17:11,857 DEBUG [RpcServer.reader=2,bindAddress=a1.example.com,port=42263] ipc.RpcServer: RpcServer.listener,port=42263: DISCONNECTING client 10.64.130.53:46270 because read count=-1. Number of active connections: 1&lt;A href="https://community.cloudera.com/legacyfs/online/attachments/11719-hbase-regionsvr-kerberos-output.txt"&gt;hbase-regionsvr-kerberos-output.txt&lt;/A&gt;&lt;A href="https://community.cloudera.com/legacyfs/online/attachments/11720-hmaster-kerberos-flag-output.txt"&gt;hmaster-kerberos-flag-output.txt&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 25 Jan 2017 14:08:01 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/GSS-Initiate-failed-even-with-a-valid-kerberos-service/m-p/140797#M103404</guid>
      <dc:creator>sumit_nigam</dc:creator>
      <dc:date>2017-01-25T14:08:01Z</dc:date>
    </item>
    <item>
      <title>Re: GSS Initiate failed even with a valid kerberos service ticket.</title>
      <link>https://community.cloudera.com/t5/Support-Questions/GSS-Initiate-failed-even-with-a-valid-kerberos-service/m-p/140798#M103405</link>
      <description>&lt;P&gt;Hi &lt;A rel="user" href="https://community.cloudera.com/users/223/jelser.html" nodeid="223"&gt;@Josh Elser&lt;/A&gt; - I hope the logs I attached show some issue. I am not able to find anything amiss. Plus, with a simple socket server test program I notice that we are able to successfully get service ticket and also send data back and forth. So, I'd assume that from kerberos side the issue is not there. Then, something from hbase side is messing things up? Or maybe some permissions of user / user groups in HDFS, LDAP?&lt;/P&gt;</description>
      <pubDate>Mon, 30 Jan 2017 11:47:33 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/GSS-Initiate-failed-even-with-a-valid-kerberos-service/m-p/140798#M103405</guid>
      <dc:creator>sumit_nigam</dc:creator>
      <dc:date>2017-01-30T11:47:33Z</dc:date>
    </item>
    <item>
      <title>Re: GSS Initiate failed even with a valid kerberos service ticket.</title>
      <link>https://community.cloudera.com/t5/Support-Questions/GSS-Initiate-failed-even-with-a-valid-kerberos-service/m-p/140799#M103406</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/5605/sumitnigam.html" nodeid="5605"&gt;@Sumit Nigam&lt;/A&gt; your trimming of the logs removes the ability for us to actually figure out what's going on. You have redacted much of the important information.&lt;/P&gt;</description>
      <pubDate>Tue, 31 Jan 2017 00:04:10 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/GSS-Initiate-failed-even-with-a-valid-kerberos-service/m-p/140799#M103406</guid>
      <dc:creator>elserj</dc:creator>
      <dc:date>2017-01-31T00:04:10Z</dc:date>
    </item>
    <item>
      <title>Re: GSS Initiate failed even with a valid kerberos service ticket.</title>
      <link>https://community.cloudera.com/t5/Support-Questions/GSS-Initiate-failed-even-with-a-valid-kerberos-service/m-p/140800#M103407</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/223/jelser.html" nodeid="223"&gt;@Josh Elser&lt;/A&gt; - I've uploaded the kerberos out logs from Hbase master and region servers - &lt;A href="https://community.cloudera.com/legacyfs/online/attachments/11975-hbase-kerberos-logs.zip"&gt;hbase-kerberos-logs.zip&lt;/A&gt;&lt;/P&gt;&lt;P&gt;For security reasons, I have had to mask the host names, etc.&lt;/P&gt;</description>
      <pubDate>Thu, 02 Feb 2017 01:16:48 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/GSS-Initiate-failed-even-with-a-valid-kerberos-service/m-p/140800#M103407</guid>
      <dc:creator>sumit_nigam</dc:creator>
      <dc:date>2017-02-02T01:16:48Z</dc:date>
    </item>
    <item>
      <title>Re: GSS Initiate failed even with a valid kerberos service ticket.</title>
      <link>https://community.cloudera.com/t5/Support-Questions/GSS-Initiate-failed-even-with-a-valid-kerberos-service/m-p/140801#M103408</link>
      <description>&lt;P&gt;"GSS initiate failed" without any detailed error message is like a catch-all in the class com.sun.security.sasl.gsskerb.GssKrb5Server in the method evaluateResponse(byte[] responseData). There are many lines in that big try-catch that can throw this exception. &lt;/P&gt;&lt;P&gt;So, after I debugged the running hbase master process through a debugger I am able to clearly see that the stack trace leading to this GSSException is actually coming from method acceptSecContext(InputStream is, int mechTokenSize) in the class Krb5Context. The actual exception is KrbException being thrown as GSSException. This points to issues in keytab in master node vs details in service ticket that region server is presenting, assuming JCE and encryption types on both nodes are appropriate.&lt;/P&gt;</description>
      <pubDate>Thu, 02 Feb 2017 19:03:25 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/GSS-Initiate-failed-even-with-a-valid-kerberos-service/m-p/140801#M103408</guid>
      <dc:creator>sumit_nigam</dc:creator>
      <dc:date>2017-02-02T19:03:25Z</dc:date>
    </item>
    <item>
      <title>Re: GSS Initiate failed even with a valid kerberos service ticket.</title>
      <link>https://community.cloudera.com/t5/Support-Questions/GSS-Initiate-failed-even-with-a-valid-kerberos-service/m-p/140802#M103409</link>
      <description>&lt;P&gt;Hi &lt;A rel="user" href="https://community.cloudera.com/users/223/jelser.html" nodeid="223"&gt;@Josh Elser&lt;/A&gt; - I hope you are able to spot a problem in data that I uploaded. Is there any hint in that which corroborates with my assessment above or points to a different problem altogether? Any input is appreciated. &lt;/P&gt;&lt;P&gt;Thanks,
Sumit&lt;/P&gt;</description>
      <pubDate>Mon, 06 Feb 2017 14:50:15 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/GSS-Initiate-failed-even-with-a-valid-kerberos-service/m-p/140802#M103409</guid>
      <dc:creator>sumit_nigam</dc:creator>
      <dc:date>2017-02-06T14:50:15Z</dc:date>
    </item>
    <item>
      <title>Re: GSS Initiate failed even with a valid kerberos service ticket.</title>
      <link>https://community.cloudera.com/t5/Support-Questions/GSS-Initiate-failed-even-with-a-valid-kerberos-service/m-p/140803#M103410</link>
      <description>&lt;P&gt;Nothing jumped out at me from your logs. Sorry I can't be of help with the information you provided. You should consider exercising your HDP support contract if you need more help.&lt;/P&gt;</description>
      <pubDate>Tue, 07 Feb 2017 08:12:58 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/GSS-Initiate-failed-even-with-a-valid-kerberos-service/m-p/140803#M103410</guid>
      <dc:creator>elserj</dc:creator>
      <dc:date>2017-02-07T08:12:58Z</dc:date>
    </item>
    <item>
      <title>Re: GSS Initiate failed even with a valid kerberos service ticket.</title>
      <link>https://community.cloudera.com/t5/Support-Questions/GSS-Initiate-failed-even-with-a-valid-kerberos-service/m-p/140804#M103411</link>
      <description>&lt;A rel="user" href="https://community.cloudera.com/users/740/vrathor.html" nodeid="740"&gt;&lt;/A&gt;&lt;P&gt;&lt;A href="https://community.hortonworks.com/questions/79084/gss-initiate-failed-even-with-a-valid-kerberos-ser.html#"&gt;@Vipin Rathor&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/740/vrathor.html" nodeid="740"&gt;@krajgur&lt;/A&gt;u&lt;/P&gt;</description>
      <pubDate>Tue, 07 Feb 2017 14:17:25 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/GSS-Initiate-failed-even-with-a-valid-kerberos-service/m-p/140804#M103411</guid>
      <dc:creator>KuldeepK</dc:creator>
      <dc:date>2017-02-07T14:17:25Z</dc:date>
    </item>
    <item>
      <title>Re: GSS Initiate failed even with a valid kerberos service ticket.</title>
      <link>https://community.cloudera.com/t5/Support-Questions/GSS-Initiate-failed-even-with-a-valid-kerberos-service/m-p/140805#M103412</link>
      <description>&lt;P&gt;Sure Josh, thanks a lot for taking time to look at these logs. &lt;/P&gt;</description>
      <pubDate>Fri, 10 Feb 2017 18:07:53 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/GSS-Initiate-failed-even-with-a-valid-kerberos-service/m-p/140805#M103412</guid>
      <dc:creator>sumit_nigam</dc:creator>
      <dc:date>2017-02-10T18:07:53Z</dc:date>
    </item>
  </channel>
</rss>

