About Hrishi1

Hrishi1 · ‎06-21-2018

The root cause is that Hadoop runs the id command against the user without placing -- in front of the username, so a username that starts with a hyphen is parsed as a set of options.

1. Per the POSIX standard, a username should not start with a hyphen ("-"). That is what causes this problem.
2. Authentication is done over Kerberos, which is working fine.
3. This user ID appears to need its Unix groups to perform some action on HDFS (e.g. writing to a directory using group permissions). That action throws this error because the group lookup fails due to the issue above.
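The option-parsing behaviour described above can be reproduced with a small sketch. It uses Python's standard getopt module as a stand-in for the option parser inside id. The option string "Gn" and the helper names parse_id_args and build_id_command are assumptions made for illustration; this is not Hadoop's actual code.

```python
import getopt


def parse_id_args(args):
    """Parse arguments the way a getopt-style tool would, accepting only -G and -n."""
    opts, operands = getopt.getopt(args, "Gn")
    return [flag for flag, _ in opts], operands


def build_id_command(user):
    """Hypothetical helper: put "--" before the username so it is never read as options."""
    return ["id", "-Gn", "--", user]


# Without "--", the leading hyphen makes the parser read "-SERV-CHEF" as options.
try:
    parse_id_args(["-Gn", "-SERV-CHEF"])
    failed = False
except getopt.GetoptError as err:
    failed = True
    print("parse error:", err)

# With "--", option parsing stops and the username is treated as an operand.
flags, operands = parse_id_args(build_id_command("-SERV-CHEF")[1:])
print(flags, operands)
```

The first call fails on the letter S, which matches the "invalid option -- 'S'" message in the NameNode log, while the second call leaves the username intact as an operand.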

Hrishi1 · ‎05-21-2018

Any luck?

Hrishi1 · ‎05-17-2018

If I use the -- option from the command line, I get the group details, so I am not sure why this error is being logged. Any thoughts?

# groups -- -SERV-CHEF
groups: -SERV-CHEF: no such user
# groups -- -SERV-CHEF@domain
success!

Hrishi1 · ‎05-17-2018

We are getting a continuous PartialGroupNameException for a specific user account. Below is a sample error snippet.

++++++
WARN org.apache.hadoop.security.ShellBasedUnixGroupsMapping: unable to return groups for user -SERV-CHEF
PartialGroupNameException The user name '-SERV-CHEF' is not found. id: invalid option -- 'S'
Try 'id --help' for more information.
Try 'id --help' for more information.
id: invalid option -- 'S'
Try 'id --help' for more information.
    at org.apache.hadoop.security.ShellBasedUnixGroupsMapping.resolvePartialGroupNames(ShellBasedUnixGroupsMapping.java:212)
    at org.apache.hadoop.security.ShellBasedUnixGroupsMapping.getUnixGroups(ShellBasedUnixGroupsMapping.java:133)
    at org.apache.hadoop.security.ShellBasedUnixGroupsMapping.getGroups(ShellBasedUnixGroupsMapping.java:72)
    at org.apache.hadoop.security.Groups$GroupCacheLoader.fetchGroupList(Groups.java:368)
    at org.apache.hadoop.security.Groups$GroupCacheLoader.load(Groups.java:309)
    at org.apache.hadoop.security.Groups$GroupCacheLoader.load(Groups.java:267)
    at com.google.common.cache.LocalCache$LoadingValueReference.loadFuture(LocalCache.java:3568)
    at com.google.common.cache.LocalCache$Segment.loadSync(LocalCache.java:2350)
    at com.google.common.cache.LocalCache$Segment.lockedGetOrLoad(LocalCache.java:2313)
    at com.google.common.cache.LocalCache$Segment.get(LocalCache.java:2228)
    at com.google.common.cache.LocalCache.get(LocalCache.java:3965)
    at com.google.common.cache.LocalCache.getOrLoad(LocalCache.java:3969)
    at com.google.common.cache.LocalCache$LocalManualCache.get(LocalCache.java:4829)
    at org.apache.hadoop.security.Groups.getGroups(Groups.java:225)
    at org.apache.hadoop.security.UserGroupInformation.getGroups(UserGroupInformation.java:1778)
    at org.apache.hadoop.security.UserGroupInformation.getGroupNames(UserGroupInformation.java:1766)
    at org.apache.hadoop.hdfs.server.namenode.FSPermissionChecker.<init>(FSPermissionChecker.java:66)
    at org.apache.hadoop.hdfs.server.namenode.FSDirectory.getPermissionChecker(FSDirectory.java:3468)
    at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.getPermissionChecker(FSNamesystem.java:4079)
    at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.getFileInfo(FSNamesystem.java:4269)
    at org.apache.hadoop.hdfs.server.namenode.NameNodeRpcServer.getFileInfo(NameNodeRpcServer.java:901)
    at org.apache.hadoop.hdfs.server.namenode.AuthorizationProviderProxyClientProtocol.getFileInfo(AuthorizationProviderProxyClientProtocol.java:528)
    at org.apache.hadoop.hdfs.protocolPB.ClientNamenodeProtocolServerSideTranslatorPB.getFileInfo(ClientNamenodeProtocolServerSideTranslatorPB.java:839)
    at org.apache.hadoop.hdfs.protocol.proto.ClientNamenodeProtocolProtos$ClientNamenodeProtocol$2.callBlockingMethod(ClientNamenodeProtocolProtos.java)
    at org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:617)
    at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:1073)
    at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2216)
    at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2212)
    at java.security.AccessController.doPrivileged(Native Method)
    at javax.security.auth.Subject.doAs(Subject.java:415)
    at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1920)
    at org.apache.hadoop.ipc.Server$Handler.run(Server.java:2210)
++++++

Source: /logs/hadoop-hdfs/hadoop-cmf-CD-HDFS-pFXTJXEX-NAMENODE-.log.out

Command output from the name node:

# id -Gn -SERV-CHEF
id: invalid option -- 'S'
Try 'id --help' for more information.
You have mail in /var/spool/mail/root
#
# hdfs groups -SERV-CHEF
-SERV-CHEF :
#

Could you help me with some insights, please?
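To see which accounts are affected and how often, the WARN lines can be tallied with a short script. This is a minimal sketch: the regular expression is based only on the message text shown above, the rest of the log layout is assumed, and the sample lines (including the INFO line) are made up for illustration.

```python
import re
from collections import Counter

# Pattern taken from the WARN message above; everything else about the log is assumed.
PATTERN = re.compile(r"unable to return groups for user (\S+)")


def count_failed_lookups(lines):
    """Count group-lookup failures per username found in the given log lines."""
    counts = Counter()
    for line in lines:
        match = PATTERN.search(line)
        if match:
            counts[match.group(1)] += 1
    return counts


# Made-up sample input standing in for the NameNode log.
sample = [
    "WARN org.apache.hadoop.security.ShellBasedUnixGroupsMapping: "
    "unable to return groups for user -SERV-CHEF",
    "INFO some unrelated line",
    "WARN org.apache.hadoop.security.ShellBasedUnixGroupsMapping: "
    "unable to return groups for user -SERV-CHEF",
]

counts = count_failed_lookups(sample)
for user, n in counts.items():
    note = " (starts with a hyphen)" if user.startswith("-") else ""
    print(f"{user}: {n}{note}")
```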

Hrishi1 · ‎05-15-2018

I am using Splunk in my environment. Is anything logged when a query spills to disk? If so, I can create a Splunk alert that notifies me with the query details whenever a spill to disk happens.

Hrishi1 · ‎05-15-2018

Hi Tim,

Thank you for your input. I have checked the metrics links you gave. Is the unit "queries per second" a list of individual queries or a count of queries? For example:

Metric Name: queries_spilled_memory_rate
Description: Impala queries that spilled to disk
Unit: queries per second
Parents: cluster
CDH Version: CDH 5

I need to track the specific queries that spill to disk. That is, if I enable the spill-to-disk option, I need an alert with the query details whenever a specific query spills memory to disk, so that I can notify the owner of that query. I can set the scratch limit to a specific value to control spill space usage. However, I still need to track every query that spills to disk.
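Read literally, a unit of queries per second is a rate, so it gives a count over time and does not name the queries involved. Naming them needs per-query records. The sketch below shows the kind of filter I have in mind, assuming such records can be exported with a spilled-bytes value per query. The QueryRecord fields, the helper names, and the sample data are all hypothetical and are not a Cloudera Manager schema.

```python
from dataclasses import dataclass


@dataclass
class QueryRecord:
    # All field names are hypothetical placeholders for whatever the export provides.
    query_id: str
    user: str
    statement: str
    spilled_bytes: int


def spilled_queries(records, threshold_bytes=0):
    """Return the records whose spilled bytes exceed the threshold."""
    return [r for r in records if r.spilled_bytes > threshold_bytes]


def format_alert(record):
    """Build a one-line alert message that identifies the query and its owner."""
    return (
        f"Query {record.query_id} by {record.user} spilled "
        f"{record.spilled_bytes} bytes to disk: {record.statement}"
    )


# Made-up sample records for illustration only.
records = [
    QueryRecord("q1", "alice", "SELECT ...", 0),
    QueryRecord("q2", "bob", "SELECT ... GROUP BY ...", 1048576),
]

alerts = [format_alert(r) for r in spilled_queries(records)]
for alert in alerts:
    print(alert)
```

Each alert line carries the query ID and the user, which is what I would need in order to notify the owner of the query.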

Hrishi1 · ‎05-14-2018

Is spill-to-disk activity recorded in any logs? If yes, I can set up an alert in Splunk.

Hrishi1 · ‎05-14-2018

Thanks, Tim, for your reply. The first option displays the memory spilled per query in the Impala queries section, if that attribute is selected for display. The second option, under the utilization section, gives the average spill and maximum spill per resource pool. My requirement is this: if I enable the spill-to-disk feature in my cluster, I want to be notified whenever a spill to disk happens. Is there an option in Cloudera Manager to create this alert?

Hrishi1 · ‎05-11-2018

Hello Team,

As per the Impala release notes for Impala 2.5:

+++++
Spill-to-disk feature now always recommended. In earlier releases, the spill-to-disk feature could be turned off using a pair of configuration settings, enable_partitioned_aggregation=false and enable_partitioned_hash_join=false. The latest improvements in the spill-to-disk mechanism, and related features that interact with it, make this feature robust enough that disabling it is now no longer needed or supported. In particular, some new features in Impala 2.5 and higher do not work when the spill-to-disk feature is disabled.
+++++

If spill-to-disk is enabled, is there an option to monitor spill-to-disk events so that I can identify the query that is causing them?

Last Visited	‎09-16-2018 01:02 PM

Member Since	‎05-11-2018 07:23 AM
Posts	10

Cloudera Community

Re: PartialGroupNameException errors

Re: Monitoring Disk-to-spill from Cloudera Manager

Re: PartialGroupNameException errors

PartialGroupNameException errors

Re: Monitoring Disk-to-spill from Cloudera Manager

Re: Monitoring Disk-to-spill from Cloudera Manager

Re: Monitoring Disk-to-spill from Cloudera Manager

Re: Monitoring Disk-to-spill from Cloudera Manager

Monitoring Disk-to-spill from Cloudera Manager