What are the ideal / Minimum required ACL's that need to be applied on a HDFS directory containing Hive External Tables?
1. I have a directory '/user/devteam/custdata' with permissions 770.
hdfs dfs -getfacl /user/devteam/custdata
# file: /user/devteam/custdata
# owner: devteam
# group: devteam
user::rwx
group::rwx
other::---
2. I set ACL of...
hdfs dfs -setfacl -R -m group:hive:rwx,group:qateam:r-x /user/devteam/custdata
3. Sentry Roles
create role qateamrole;
grant select on database devdb to role qateamrole;
create role devteamrole;
grant all on database devdb to role devteamrole;
grant all on uri '/user/devteam/custdata' to role devteamteamrole;
By setting these two permissions with HDFS sentry sync enabled. Will I be able to run all my sqoop jobs and hive queries as the owner and qateam successfully? At the same time I want data to be visible only to the owner and teams that have permissions to query / cat / list them.
Thanks