Member since: 05-10-2017
Posts: 6
Kudos Received: 0
Solutions: 0
07-21-2017
05:32 AM
Thank you for your answer! I know that I can cover SQL/Impala through Hue and R + Python + Spark with Workbench, but I don't like that approach. Most of the time we use SQL (Impala/Hive) for quick data analysis, and then we go to Python/Spark to dig deeper or to dev/test our ETL/ELT parts. When everything is fine, we put the tested lines into our dev IDE (IntelliJ + sbt ... testing) and deploy them. So Zeppelin is more of an extension of our IDE dev process, in one tool. I know that the dev approach "should" be different when using the Workbench, but that's how our process is. Zeppelin/Jupyter solves it all in one for us. ("Blue elephant guys" should be funny for "Cloudera users" compared to the green elephant used by Hortonworks.)
07-21-2017
02:19 AM
Hi there, we are working on a CDH 5.11 cluster (ADS/LDAP, Kerberos, Sentry enabled). We are now evaluating a notebook solution. Workbench (sadly) does not support the same SQL + Spark + Impala + Hive features, so we need to look elsewhere. Hue seems to have stopped improving its notebook feature, so that is out. Jupyter seems to have everything, but running one instance per user is a bit ... I don't like this approach. Zeppelin looks nice, but I'm not sure how well it works together with CDH because the support seems to be poor. I managed to connect Zeppelin <-> ADS, but now I'm facing the Kerberos stuff. Is it possible for each Zeppelin step to use and manage the Kerberos tokens of the logged-in user, or do I have to provide one "technical principal" that Zeppelin uses? Besides that question: Hortonworks fully supports Zeppelin. What is Cloudera's answer? What notebook should we (blue elephant guys) use? Workbench is a good one, but it does not cover everything Zeppelin does and seems to target a somewhat different audience. Thanks and BR
Labels: Cloudera Hue
05-24-2017
12:25 AM
OK, Impala is not on top of YARN, and for YARN applications such as Spark or Hive every user needs to exist on the local OS on every node. But why does Impala work without those users when Kerberos and Sentry are activated?
05-18-2017
11:39 AM
Tested a bit further. Is there an easy answer why Impala works with Kerberos via Hue (count(*) ...) and Hive does not? Am I right that it is YARN which requires the local OS users?
05-10-2017
12:06 PM
Thanks for your answer. But that would mean we need to sync hundreds of users and groups to all of our cluster nodes to provide a data lake with full lineage, governance, and fine-grained security... Isn't there another solution? How do other enterprise customers solve this? I cannot imagine that big customers sync hundreds or thousands of users to their local OS machines.
05-10-2017
04:43 AM
Hi folks, I searched for an answer for an hour and did not find any clear solution. Here is my infrastructure:
- Cloudera CDH 5.10
- Kerberos against ADS
- Hue and CM against LDAP (ADS)
Here's the question: with the configuration I have right now, everything runs nicely when I use one of the technical users that exist in both LDAP and the local OS, but when I log in to Hue as an LDAP-only user I can only do my HDFS stuff. The moment I try to execute an Impala or a Hive/MR query, I get the "yarn: user not found" exception. The moment I add the user to all nodes, it works fine. Do I need to sync/create all my LDAP users on all my cluster nodes, or is it possible to log in as an LDAP user and have Hue use an internal technical account for Impala and MR? And of course: how and where does it need to be configured? Thanks in advance
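To narrow down which accounts trigger the "user not found" error on a given node, a small check like the following may help. It is only a sketch, assuming Python 3 on a Unix host; the function name `missing_local_users` is hypothetical and not part of any Cloudera tooling. `pwd.getpwnam` asks the operating system's name service, so users supplied by a directory integration on the node would count as resolvable too, not only entries in /etc/passwd.

```python
import pwd
import sys


def missing_local_users(usernames):
    """Return the usernames that this node's OS cannot resolve.

    Hypothetical helper for illustration only.
    """
    missing = []
    for name in usernames:
        try:
            # Raises KeyError when the OS has no account entry for this name.
            pwd.getpwnam(name)
        except KeyError:
            missing.append(name)
    return missing


if __name__ == "__main__":
    # Pass the usernames to check as command-line arguments.
    for name in missing_local_users(sys.argv[1:]):
        print(f"not resolvable on this node: {name}")
```

Running it on each node with the names of a few LDAP-only users shows where the OS lookup fails; it says nothing about how Hue, Impala, or YARN are configured.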
Labels: Cloudera Hue, Kerberos