Posts: 13
Registered: ‎01-11-2018

sentry and yarn queues



One of the prerequisites for enabling Sentry is disabling impersonation, which basically means that all queries are executed by user hive, not the user that actually executed the command in Hue. However, this requirement invalidates yarn queue placement, which so far used user's group name to select proper queue, and now assigns all jobs to hive's group instead. It's possible to avoid this behaviour by setting 'Specified' placement policy on top - this way we are able to ensure that job will be assigned to the queue based on the user's group, not the default group for hive. However, this solution opens possibility for some intentional misuse - it's possible that the user sets the parameter in Hue's session and circumvents yarn queue placement, potentially using more cluster resources than administrator intended to give them.


We've tried to use parameter

to prevent users from  setting but apparently it requires enabling  Standard Hive Authorization


whereas Cloudera doesn't support using native Hive authorization frameworks. Anyway, we failed to configure Standard Hive Authorization and Sentry at the same time in our CDH 5.9.3. Morever, the clarifies that we are not even able to use

to block using 'set' command at all. Taking this all together, we'd like to ask:

1. if using 'Specified' placement policy in Yarn queue configuration is the only way to activate mapping based on group of user logged in Hue, not the hive user?

2. do you know other ways to block setting specific variables in Hive session?

3. is it possible to configure Standard Hive Authorization in CDH at all?


I'll be thankful for any clue. Thanks!

Posts: 13
Registered: ‎01-11-2018

Re: sentry and yarn queues

I've found the solution - it's possible to use another parameter to prevent user from using mapred.job.queuename: hive.conf.restricted.list