Created on 06-23-2016 01:12 PM - edited 08-18-2019 05:34 AM
Freshly installed HDP 2.4 using Ambari 2.2.2.0 over RHEL7 machines.
I have tried to depict the usage scenario in a hand-drawn diagram, please bear with it 🙂
Description :
Questions/Confusions :
Created 06-23-2016 02:34 PM
1. Ranger takes care of authorization. You will need something for authentication which is where kerberos and AD can come up.
2. You can set a /user/<username> in hdfs which is a user home directory. You might still need common hdfs directories where collaboration happens.
3. If you have AD, it will have kerberos. If you have write access to an OU in AD, you can create all service level principals there. So, no separate kerberos/KDC will be required. But if you don't want to create service level principals on AD, you can have local kerberos/KDC and have a one way trust with AD.
4. If you enable group based authorizations, adding users could be as easy adding user to the right group and creating a home directory for the user.
5. Ranger can take care of most authorizations and you can avoid working with ACLs.
Created 06-23-2016 02:34 PM
1. Ranger takes care of authorization. You will need something for authentication which is where kerberos and AD can come up.
2. You can set a /user/<username> in hdfs which is a user home directory. You might still need common hdfs directories where collaboration happens.
3. If you have AD, it will have kerberos. If you have write access to an OU in AD, you can create all service level principals there. So, no separate kerberos/KDC will be required. But if you don't want to create service level principals on AD, you can have local kerberos/KDC and have a one way trust with AD.
4. If you enable group based authorizations, adding users could be as easy adding user to the right group and creating a home directory for the user.
5. Ranger can take care of most authorizations and you can avoid working with ACLs.
Created 06-23-2016 03:32 PM
Can you elaborate and help me understand :
Does this mean that every time a new user is to be added, someone has to log-in as 'hdfs' on cli and create a hdfs dir. /user/<username> and then change the ownership of that dir. ?
An OU can be created but what is 'service level principal' - is it creating groups(or users?) like hadoop, hdfs, hive, yarn,sqoop etc. in that OU manually ? The biggest concern I have here is that during cluster installation, under Misc, the 'Skip group modifications during install' was left unchecked so the users and groups were created locally, now is it reqd. to change it(how to do that in Ambari) and if yes, will the cluster function properly? Can you provide a documentation link ?
Unsure if I understood, I believe, the addition of users to a group has to be done at both Linux and HDFS levels, this will still involve creating /user/<username> dir. on HDFS manually. Can you provide some detailed inputs here ?
Created 06-23-2016 04:30 PM
1. If you need home directories for each of the users, then you need to create home directories. Ownership can be changed from CLI or you can set using Ranger (though I think changing from CLI is better than creating a new profile in Ranger for these things)
2. I am talking about principals here, not service users (like hdfs, hive, yarn) coming from AD (using SSSD or some other such too). So, with you setup local users are create on each node. But they still need to authenticate with your KDC. Ambari can create it for you on the OU once you give the credentials to ambari.
3. Its not mandatory to have /user/<username> for each user. We have cases where BI users how use ODBC/JDBC and don't even have login access to the nodes not needing /user/<username>. Even users that login don't need /user/<username> and could use something like /data/<group>/... to read/write to hdfs.