Member since 12-15-2015 | 66 Posts | 32 Kudos Received | 1 Solution
My Accepted Solutions
Title | Views | Posted |
---|---|---|
1895 | 07-22-2016 08:15 PM |
04-01-2016 01:50 AM
@Josh Elser, is this related to bug https://issues.apache.org/jira/browse/ACCUMULO-4069? These logs are from another environment where we have the same issue. It looks like the master was unable to get the tablet server status report from the TServer three times; before that, it failed to find any Kerberos ticket. From the TServer:
2016-03-29 22:48:53,052 [tserver.TabletServer] [server.TThreadPoolServer] ERROR: Error occurred during processing of message.
java.lang.RuntimeException: org.apache.thrift.transport.TTransportException
at org.apache.thrift.transport.TSaslServerTransport$Factory.getTransport(TSaslServerTransport.java:219)
at org.apache.accumulo.core.rpc.UGIAssumingTransportFactory$1.run(UGIAssumingTransportFactory.java:51)
at org.apache.accumulo.core.rpc.UGIAssumingTransportFactory$1.run(UGIAssumingTransportFactory.java:48)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Subject.java:360)
at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1637)
...skipping...
2016-03-30 21:56:49,881 [tserver.TabletServer] INFO : Master requested tablet server halt
From Master server:
unable to get tablet server status XXXYYYY XXX.com:9997[352d68b0c3801b6] org.apache.thrift.transport.TTransportException: GSS initiate failed
2016-03-30 21:56:17,937 [master.Master] ERROR: master:XXXYYYY.XXX.com unable to get tablet server status XXXYYYY1213.fg.XXX.com:9997[152d68b041401b8] org.apache.thrift.transport.TTransportException: GSS initiate failed
2016-03-30 21:56:17,938 [master.Master] ERROR: master:XXXYYYY1 unable to get tablet server status
From Monitor log:
2016-03-30 21:56:47,403 [transport.TSaslTransport] ERROR: SASL negotiation failure
javax.security.sasl.SaslException: GSS initiate failed [Caused by GSSException: No valid credentials provided (Mechanism level: Failed to find any Kerberos tgt)]
at com.sun.security.sasl.gsskerb.GssKrb5Client.evaluateChallenge(GssKrb5Client.java:211)
at org.apache.thrift.transport.TSaslClientTransport.handleSaslStartMessage(TSaslClientTransport.java:94)
at org.apache.thrift.transport.TSaslTransport.open(TSaslTransport.java:253)
at org.apache.thrift.transport.TSaslClientTransport.open(TSaslClientTransport.java:37)
at org.apache.accumulo.core.rpc.UGIAssumingTransport$1.run(UGIAssumingTransport.java:53)
at org.apache.accumulo.core.rpc.UGIAssumingTransport$1.run(UGIAssumingTransport.java:49)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Subject.java:422)
at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1657)
at org.apache.accumulo.core.rpc.UGIAssumingTransport.open(UGIAssumingTransport.java:49)
at org.apache.accumulo.core.rpc.ThriftUtil.createClientTransport(ThriftUtil.java:298)
at org.apache.accumulo.core.client.impl.ThriftTransportPool.createNewTransport(ThriftTransportPool.java:478)
at org.apache.accumulo.core.client.impl.ThriftTransportPool.getTransport(ThriftTransportPool.java:410)
at org.apache.accumulo.core.client.impl.ThriftTransportPool.getTransport(ThriftTransportPool.java:388)
at org.apache.accumulo.core.rpc.ThriftUtil.getClient(ThriftUtil.java:135)
at org.apache.accumulo.core.rpc.ThriftUtil.getClientNoTimeout(ThriftUtil.java:102)
at org.apache.accumulo.core.client.impl.MasterClient.getConnection(MasterClient.java:69)
at org.apache.accumulo.monitor.Monitor.fetchData(Monitor.java:252)
at org.apache.accumulo.monitor.Monitor$1.run(Monitor.java:486)
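To line up the failures across the TServer, master, and monitor logs, a small script can pull out the timestamps of the Kerberos-related lines. This is only a sketch: the marker strings are copied from the log excerpts in this thread, the names `LOG_LINE`, `MARKERS`, `parse_line`, and `find_events` are made up for illustration, and the regex assumes the `timestamp [logger] LEVEL: message` layout seen in these logs.

```python
import re
from datetime import datetime

# Assumed layout, taken from the excerpts in this thread, e.g.:
#   2016-03-30 21:56:17,937 [master.Master] ERROR: ...
#   2016-03-29 22:48:53,052 [tserver.TabletServer] [server.TThreadPoolServer] ERROR: ...
LOG_LINE = re.compile(
    r"^(?P<ts>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2},\d{3}) "
    r"(?P<loggers>(?:\[[^\]]+\] )+)"
    r"(?P<level>[A-Z]+)\s*: (?P<msg>.*)$"
)

# Strings worth tracking; all of them appear verbatim in the pasted logs.
MARKERS = (
    "GSS initiate failed",
    "Failed to find any Kerberos tgt",
    "Master requested tablet server halt",
    "Lost tablet server lock",
)


def parse_line(line):
    """Return (timestamp, loggers, level, message), or None for lines
    without a leading timestamp (for example stack trace frames)."""
    m = LOG_LINE.match(line)
    if m is None:
        return None
    ts = datetime.strptime(m.group("ts"), "%Y-%m-%d %H:%M:%S,%f")
    loggers = re.findall(r"\[([^\]]+)\]", m.group("loggers"))
    return ts, loggers, m.group("level"), m.group("msg")


def find_events(lines):
    """Collect (timestamp, marker) pairs. Lines without their own
    timestamp are attributed to the last timestamp seen before them."""
    events = []
    last_ts = None
    for line in lines:
        parsed = parse_line(line)
        if parsed is not None:
            last_ts = parsed[0]
            text = parsed[3]
        else:
            text = line
        for marker in MARKERS:
            if marker in text:
                events.append((last_ts, marker))
    return events
```

Passing the lines of each log file to `find_events` gives a list of `(timestamp, marker)` pairs, which makes it easier to check whether the GSS failures begin before the master requests the tablet server halt.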
03-31-2016 06:57 PM
@Josh Elser, 22 days ago the errors below were logged on all TServers, and after 22 days all the TServers went down:
ERROR: Lost tablet server lock (reason = LOCK_DELETED), exiting
at org.apache.accumulo.fate.util.LoggingRunnable.run(LoggingRunnable.java:35)
at java.lang.Thread.run(Thread.java:745)
Caused by: org.apache.thrift.transport.TTransportException: Peer indicated failure: GSS initiate failed
at org.apache.thrift.transport.TSaslTransport.receiveSaslMessage(TSaslTransport.java:190)
at org.apache.thrift.transport.TSaslServerTransport.handleSaslStartMessage(TSaslServerTransport.java:125)
at org.apache.thrift.transport.TSaslTransport.open(TSaslTransport.java:253)
at org.apache.thrift.transport.TSaslServerTransport.open(TSaslServerTransport.java:41)
at org.apache.thrift.transport.TSaslServerTransport$Factory.getTransport(TSaslServerTransport.java:216)
... 11 more
2016-03-09 20:35:36,971 [tserver.TabletServer] INFO : Master requested tablet server halt
03-31-2016 06:29 PM
@Josh Elser @Artem Ervits, is there a timeline for when this bug, https://issues.apache.org/jira/browse/ACCUMULO-4059, will be fixed? We are seeing the same error (TServers crashing often):
[server.TThreadPoolServer] ERROR: Error occurred during processing of message.
java.lang.RuntimeException: org.apache.thrift.transport.TTransportException: Peer indicated failure: GSS initiate failed
at org.apache.thrift.transport.TSaslServerTransport$Factory.getTransport(TSaslServerTransport.java:219)
at org.apache.accumulo.core.rpc.UGIAssumingTransportFactory$1.run(UGIAssumingTransportFactory.java:51)
at org.apache.accumulo.core.rpc.UGIAssumingTransportFactory$1.run(UGIAssumingTransportFactory.java:48)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Subject.java:360)
at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1637)
at org.apache.accumulo.core.rpc.UGIAssumingTransportFactory.getTransport(UGIAssumingTransportFactory.java:48)
at org.apache.thrift.server.TThreadPoolServer$WorkerProcess.run(TThreadPoolServer.java:208)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
at org.apache.accumulo.fate.util.LoggingRunnable.run(LoggingRunnable.java:35)
at java.lang.Thread.run(Thread.java:745)
Caused by: org.apache.thrift.transport.TTransportException: Peer indicated failure: GSS initiate failed
at org.apache.thrift.transport.TSaslTransport.receiveSaslMessage(TSaslTransport.java:190)
at org.apache.thrift.transport.TSaslServerTransport.handleSaslStartMessage(TSaslServerTransport.java:125)
at org.apache.thrift.transport.TSaslTransport.open(TSaslTransport.java:253)
at org.apache.thrift.transport.TSaslServerTransport.open(TSaslServerTransport.java:41)
at org.apache.thrift.transport.TSaslServerTransport$Factory.getTransport(TSaslServerTransport.java:216)
... 11 more
03-11-2016 07:19 PM
1 Kudo
Thanks @Andrew Watson @andrew watson @Vperiasamy. Do we know in which version of HDP saving Ranger audits to DB will become unsupported? https://community.hortonworks.com/questions/2202/ranger-audit-options-is-db-audit-still-supported-i.html
03-11-2016 03:25 PM
2 Kudos
What is the difference between sending Ranger audits to DB and sending them to HDFS, and what are the pros and cons of each?
Labels: Apache Ranger
03-09-2016 01:59 AM
1 Kudo
Thanks, Artem.
03-09-2016 01:41 AM
3 Kudos
Can you please let me know whether there is a size limit on what is sent to trash after a delete? We are trying to delete a file of around 6 GB, but it is not getting deleted. Please advise.
Labels: Apache Hadoop
03-07-2016 07:21 PM
@Neeraj Sabharwal @Jonas Straub @vperiasamy The Ranger UI configuration input box for FSDefaultfs defaults to the “clustername”, but the documentation says to enter the “NameNode”. (Our current assumption is that “NameNode” is likely the correct parameter; it works when configured according to the Hortonworks HDP 2.3.2 doc, Chap. 9: “Special Requirements for High Availability Environments”.) Question: is there a solution or fix that would allow the Ranger HDFS Repository configuration to use fs.defaultFS instead of the active NameNode?