Member since: 04-22-2014
Posts: 1218
Kudos Received: 341
Solutions: 157
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| | 26248 | 03-03-2020 08:12 AM |
| | 16395 | 02-28-2020 10:43 AM |
| | 4716 | 12-16-2019 12:59 PM |
| | 4472 | 11-12-2019 03:28 PM |
| | 6657 | 11-01-2019 09:01 AM |
01-01-2019
06:49 PM
Hi, I tried this and it still looks for the auto-TLS setting. I note that this auto-TLS feature can't be turned off: after saving the new setting in the CM security section and restarting the CM server, it still reverts to the original setting with auto-TLS enabled. As such, I've decided to use CDH5 & CM5 instead. Thanks for the assistance.
12-19-2018
03:59 PM
Hi Ben, But if we cannot find that file, what should we do? Why are these files missing? Thanks, Mo
12-19-2018
02:23 PM
@bgooley Thanks a bunch! This is good info. I do see the note below now, which confirms that /usr/lib/jvm is a valid path for OpenJDK: "Note: Cloudera strongly recommends installing the Oracle JDK at /usr/java/<jdk-version> and OpenJDK at /usr/lib/jvm (or /usr/lib64/jvm on SLES 12), which allows Cloudera Manager to auto-detect and use the correct JDK version." Unfortunately, the CDH 5.16 install guide doesn't clarify that /usr/lib/jvm is a valid path for OpenJDK; it makes the blanket statement that the JDK must be installed at /usr/java/jdk-version. Hopefully they will update the doc in the future. https://www.cloudera.com/documentation/enterprise/5-16-x/topics/cdh_ig_jdk_installation.html
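As a quick sanity check (not from the original post; just standard Debian/Ubuntu tooling), you can confirm where the JDK actually landed so Cloudera Manager can auto-detect it:

```bash
# List installed JDKs under the path Cloudera Manager scans for OpenJDK
ls -d /usr/lib/jvm/*/

# Show which java binary is active and where it resolves to
update-alternatives --display java
readlink -f "$(which java)"
```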
12-15-2018
11:57 PM
So the problem was with snapshots. I had configured snapshots a long time ago on the /user/hive/warehouse directory, and they were still being generated. I was tracking down the space usage with these commands:

hadoop fs -du -h /user/hive
hadoop fs -du -h /user/hive/warehouse

Snapshottable directories can be found with:

hdfs lsSnapshottableDir

and a snapshot can be deleted with:

hadoop fs -deleteSnapshot <path without .snapshot> <snapshotName>
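For anyone hitting the same issue, here is a minimal end-to-end sketch of the cleanup. The warehouse path matches the post above, but the snapshot name s20180101 is only a placeholder; use a name from your own listing:

```bash
# Find where the space is going
hadoop fs -du -h /user/hive/warehouse

# List all snapshottable directories in the cluster
hdfs lsSnapshottableDir

# List the snapshots kept under a snapshottable directory
hadoop fs -ls /user/hive/warehouse/.snapshot

# Delete a specific snapshot (placeholder name; use one from the listing above)
hadoop fs -deleteSnapshot /user/hive/warehouse s20180101

# Optionally disallow future snapshots once all are deleted (requires admin)
hdfs dfsadmin -disallowSnapshot /user/hive/warehouse
```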
11-18-2018
05:38 AM
Thank you @bgooley. That's what I did and it worked. Thank you, and I appreciate your response.
11-16-2018
09:04 AM
@bgooley @Tomas79 I ended up logging in a user while the EMR cluster is launching, in the bootstrap action. I did this via curl commands. This avoids any user being given superuser status. For anyone needing guidance on the workaround, you could follow the steps below. 1) A curl command to get the cookie.txt file (it has the session ID and CSRF token). 2) A curl command to log in (you have to grep the session ID and CSRF token from the cookie.txt file). If anyone has a better idea, please let me know.
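A hedged sketch of those two curl steps, assuming the service uses a Django-style login form with a CSRF cookie (as Hue does). The host, port, endpoint, and credentials are all illustrative assumptions, not values from the original post:

```bash
#!/usr/bin/env bash
# HOST, the /accounts/login/ endpoint, and the credentials below are
# assumptions for illustration -- adjust them for your service.
HOST="http://localhost:8888"

# Step 1: fetch the login page to obtain a session cookie and CSRF token
curl -s -c cookie.txt "${HOST}/accounts/login/" > /dev/null

# Extract the CSRF token value (last field) from the saved cookie jar
CSRF=$(grep -i csrftoken cookie.txt | awk '{print $NF}')

# Step 2: post the login form, sending the cookie and token back
curl -s -b cookie.txt -c cookie.txt \
     -d "username=firstuser&password=secret&csrfmiddlewaretoken=${CSRF}" \
     -H "Referer: ${HOST}/accounts/login/" \
     "${HOST}/accounts/login/"
```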
11-15-2018
05:59 AM
Hi, I want to install the CDH 5.15.1 Hadoop client on my Ubuntu 16.04 host. I have added the repo URL in the /etc/apt/sources.list.d/cloudera.list file. How can I specify a particular hadoop-client version in the apt-get command? I thought apt-get install hadoop-client=<version> should work, but I am unable to figure out the version number. Here is the link for the repo: http://archive.cloudera.com/cdh5/ubuntu/xenial/amd64/cdh/pool/contrib/h/hadoop/ Thanks 🙂
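Not part of the original question, but one way to discover the exact version string apt expects is with standard apt tooling; the version shown in the install line is only a placeholder for whatever madison prints:

```bash
# Refresh the package index after adding cloudera.list
sudo apt-get update

# List every hadoop-client version the configured repos offer
apt-cache madison hadoop-client
# (apt-cache policy hadoop-client shows the same candidates)

# Pin the install to one of the versions printed above (placeholder shown)
sudo apt-get install hadoop-client=<version-string-from-madison>
```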
11-14-2018
05:39 PM
2 Kudos
@orak, OpenLDAP is just fine for Hadoop LDAP purposes. Active Directory is part of many existing IT infrastructures, so it is often used because of the way it combines LDAP and Kerberos (along with other things). Users in your Kerberos KDC and LDAP server do not necessarily need to originate in the same object. Any true relationship between the two, where the KDC principal exists in an end-user object that is used for authentication, would exist due to some sort of integration at the KDC / LDAP server level. This is not necessary for Hadoop services to work.

In general, there are 3 needs if you are going to secure your cluster with Kerberos:

- Kerberos
- a means of mapping users to groups (usually OS shell-based, but it can be LDAP-based)
- OS users as which services will run, plus end-user OS users for YARN containers (running MR jobs)

If I kinit as bgooley@EXAMPLE.COM and then attempt to perform a listing on a directory that is restricted to its user/group and owned by someone else, the NameNode must be able to determine whether the user is a member of the group that has permission to list the files. The principal is trimmed to a "short name" by stripping off the realm, arriving at bgooley. The user bgooley's group membership is then determined (shell group mapping or LDAP group mapping). See the following for details: https://hadoop.apache.org/docs/current/hadoop-project-dist/hadoop-common/GroupsMapping.html

This mapping is used by several services, so it is part of core Hadoop. Then, you have the OS users that must exist at the OS level so that various processes can start as those users and files can be owned by them. YARN containers also store information in the OS file system as the user running the job, which means that users who run jobs need to exist on all nodes in the cluster. Some of these topics are covered in a bit more detail here: https://www.cloudera.com/documentation/enterprise/latest/topics/sg_auth_overview.html

That's a lot to process, so I'll stop there and wait to see if you have any questions.
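To make the group-mapping step concrete, here is a minimal sketch of checking what Hadoop resolves for a principal. It assumes a kerberized cluster where the example principal bgooley@EXAMPLE.COM from the answer above exists; the target path is hypothetical:

```bash
# Obtain a ticket for the principal (realm taken from the example above)
kinit bgooley@EXAMPLE.COM

# Show the short name's groups as Hadoop resolves them; this exercises
# the configured group mapping (shell-based or LDAP-based)
hdfs groups

# Attempt the listing; the NameNode authorizes against those groups
hdfs dfs -ls /user/someoneelse
```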
11-09-2018
09:39 AM
@bgooley Thanks for the explanation and help. Initially I had tried with self-signed certificates, and now I have received signed certificates. I fixed the issue after creating a truststore.
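For anyone landing here with the same problem, a minimal sketch of building a truststore from the CA certificate that signed the server certs, using keytool. The file names, alias, and password are placeholders, not values from the original post:

```bash
# Import the signing CA certificate into a new JKS truststore
# (file name, alias, and password are placeholders)
keytool -importcert \
    -alias cloudera-ca \
    -file ca-cert.pem \
    -keystore truststore.jks \
    -storepass changeit \
    -noprompt

# Verify the certificate landed in the truststore
keytool -list -keystore truststore.jks -storepass changeit
```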