Member since
01-30-2020
5
Posts
0
Kudos Received
0
Solutions
11-04-2020
03:03 AM
Hello Everyone, Do HDFS ACLs apply on impala queries? If, for example, I want to give read permissions on specific directories in HDFS to a specific user and I modify the appropriate ACLs using "hdfs dfs setfacl" on the mentioned directories, will the user have only SELECT permissions when querying the files? I want to avoid using Kerberos auth and Sentry service.
... View more
Labels:
09-23-2020
01:31 AM
Thank you for your reply Tim. Just to clarify, security-wise, are we better off with our current configuration (default), with sentry service disabled, or with sentry enabled in testing mode? You mentioned that sentry in testing mode does not authenticate the clients, but in the documentation it is mentioned that testing mode uses weaker authentication mechanisms. We need this in order to prevent our analysts from doing accidental writes, drops, etc. on the data. Our cluster is in a secure isolated environment.
... View more
09-22-2020
07:47 AM
Hello Everyone,
Is there a way to permit SELECT only impala queries in HUE without enabling and configuring Sentry service? (maybe in OS level)
The problem with enabling Sentry is that I have to first enable Kerberos and before that renaming some of my nodes.
There is the option to enable Sentry testing mode but Cloudera does not recommend that in production environments.
... View more
02-04-2020
11:38 PM
Correction: On each old datanode we have 10 hdfs partitions one for every disk. Should we create 12 hdfs partitions on each new datanode or create 10 volumes using the 12 disks?
... View more
02-04-2020
12:41 AM
Hello,
At the organization I work for we have a hadoop cluster. We recently purchased new datanodes to expand the cluster. The new datanodes have 12 disks (8TB) instead of 10 (6TB), as the older datanodes.
After some research (mainly this community) I came to understand that its not a requirement to have the same number of drives in each node or equally sized disks.
On each old datanode we have 8 hdfs partitions one for every disk.
Should we create 10 hdfs partitions on each new datanode or create 8 volumes using the 10 disks?
Should we utilize all the available storage?
And if we do is there something specific we should monitor?
... View more