About rvillanueva

Kartik_Agarwal · ‎10-03-2021

I was facing a similar error and resolved it by adding the Hadoop users to the passwd file.

resource_management.core.exceptions.ExecutionFailed: Execution of 'usermod -G hadoop -g hadoop hive' returned 6. usermod: user 'hive' does not exist in /etc/passwd
Error: Error: Unable to run the custom hook script ['/usr/bin/python', '/var/lib/ambari-agent/cache/stacks/HDP/2.0.6/hooks/before-ANY/scripts/hook.py', 'ANY', '/var/lib/ambari-agent/data/command-59009.json', '/var/lib/ambari-agent/cache/stacks/HDP/2.0.6/hooks/before-ANY', '/var/lib/ambari-agent/data/structured-out-59009.json', 'INFO', '/var/lib/ambari-agent/tmp', 'PROTOCOL_TLSv1_2', '']

>> File location: /etc/passwd
>> Adduser hadoop
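As a rough illustration of the check behind that fix, the sketch below parses the text of /etc/passwd directly and reports which required accounts are missing. It reads the local file rather than going through the system's name service because the usermod error above is specifically about /etc/passwd. The function names and the sample account list are hypothetical, not part of Ambari.

```python
from typing import Iterable, List, Set


def local_users(passwd_text: str) -> Set[str]:
    """Return the account names found in the text of a passwd file."""
    users = set()
    for line in passwd_text.splitlines():
        line = line.strip()
        # Skip blank lines and comments.
        if not line or line.startswith("#"):
            continue
        # The account name is the first colon-separated field.
        users.add(line.split(":", 1)[0])
    return users


def missing_users(required: Iterable[str], passwd_text: str) -> List[str]:
    """Return the required account names that are absent, in input order."""
    present = local_users(passwd_text)
    return [name for name in required if name not in present]
```

To use it on a node, read the file with `open("/etc/passwd").read()` and pass in whichever service accounts your cluster needs (for example `["hive", "hdfs"]`, an assumed list). Any name it returns would have to be added locally before the hook's usermod call can succeed.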

KPG1 · ‎06-24-2021

Thanks, ccibi75. This worked for me.

Madhur · ‎05-14-2020

Hello @rvillanueva, you can check how many threads a user is using by running ps -L -u <username> | wc -l. If the user's open files limit (ulimit -n, run as that user) is hit, the user can't spawn any more threads. The most likely causes in this case are: the same user is running other jobs and has open files on the node where it tries to launch the container, or system threads might have been excluded. Check which applications are running and how many files each currently has open. Please also check the application log (application_XXX), if available, to see in which phase the exception is thrown and on which node the issue occurs.
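The same check can be sketched in Python. This is a minimal sketch, assuming a Linux node where `ps -L -u <username>` prints one header line followed by one row per thread. The function names are hypothetical. It also reports the max user processes limit where the platform exposes it, since on Linux that limit counts threads as well.

```python
import resource
import subprocess


def count_threads(ps_output: str) -> int:
    """Count thread rows in `ps -L -u <user>` output, skipping the header line."""
    lines = [line for line in ps_output.splitlines() if line.strip()]
    return max(len(lines) - 1, 0)


def threads_for_user(username: str) -> int:
    """Run `ps -L -u <username>` and count the threads it lists."""
    result = subprocess.run(
        ["ps", "-L", "-u", username],
        capture_output=True,
        text=True,
        check=False,
    )
    return count_threads(result.stdout)


def current_limits() -> dict:
    """Return (soft, hard) limits for the calling process (Unix only)."""
    limits = {"open_files": resource.getrlimit(resource.RLIMIT_NOFILE)}
    # RLIMIT_NPROC is not defined on every platform, so look it up defensively.
    nproc = getattr(resource, "RLIMIT_NPROC", None)
    if nproc is not None:
        limits["user_processes"] = resource.getrlimit(nproc)
    return limits
```

Note that `current_limits()` reports the limits of the process that calls it, so it should be run as the affected user to be meaningful. Unlike `wc -l`, `count_threads` does not count the header line.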

ask_bill_brooks · ‎02-04-2020

@asmarz, As this is an older thread which was previously marked 'Solved', you would have a better chance of receiving a resolution by starting a new thread. This will also provide the opportunity to provide details specific to your environment (for example, what happened once you added the affected user accounts with "normal" command add user on all nodes) that could aid others in providing a more relevant, accurate answer to your question.

senthh · ‎12-25-2019

Hi, You can configure a proxy user using the property hadoop.proxyuser.$superuser.hosts along with either or both of hadoop.proxyuser.$superuser.groups and hadoop.proxyuser.$superuser.users. Refer: [1] https://hadoop.apache.org/docs/current/hadoop-project-dist/hadoop-common/Superusers.html
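To make the naming pattern concrete, here is a small sketch that builds the property names for a given superuser. The helper name and the sample values are hypothetical. The resulting key/value pairs would normally be entered in core-site.xml.

```python
from typing import Dict, Optional


def proxyuser_properties(
    superuser: str,
    hosts: str,
    groups: Optional[str] = None,
    users: Optional[str] = None,
) -> Dict[str, str]:
    """Build hadoop.proxyuser.* properties for one superuser.

    `hosts` is always set; at least one of `groups` or `users` must be given.
    """
    if groups is None and users is None:
        raise ValueError("set at least one of groups or users")
    prefix = f"hadoop.proxyuser.{superuser}"
    props = {f"{prefix}.hosts": hosts}
    if groups is not None:
        props[f"{prefix}.groups"] = groups
    if users is not None:
        props[f"{prefix}.users"] = users
    return props
```

For example, `proxyuser_properties("root", "host1,host2", groups="group1")` yields hadoop.proxyuser.root.hosts and hadoop.proxyuser.root.groups. The host and group names here are placeholders.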

kshimpi · ‎12-16-2019

@rvillanueva Please refer to this article: https://community.cloudera.com/t5/Customer/Unable-to-start-Pyspark-jobs-when-running-with-Python-3/ta-p/272990

rvillanueva · ‎12-11-2019

From the Ranger email list, this is another bit of information that I found helpful:

----------

I’ve configured Ranger using the following approach to control who must be synced with AD. Only users belonging to groups inside a specific OU will be synced.

I’ve created the OU OU=ArthurAmericasGroups,OU=Security Groups,OU=Groups,OU=SHARED,OU=Brazil,DC=domain,DC=com

Create a group called R2Users inside that OU. I put all desired sync users as its members. You can also add other groups as its members, and you can create other groups, such as R2TEAM, as well. Remember to update the property ranger.usersync.ldap.user.searchfilter to include more than one group.

I’ve configured Ranger to sync groups before users. Here is the configuration.

COMMON CONFIGS

Label | Property | Value
LDAP/AD URL | ranger.usersync.ldap.url | ldap://myacticedirectoryserver.domain.com:389
Bind User | ranger.usersync.ldap.binddn | CN=LDAP_AD_ACCOUNT,OU=Service Accounts,OU=LCB,OU=Brazil,DC=domain,DC=com
Bind User Password | ranger.usersync.ldap.ldapbindpassword | LDAP_AD_ACCOUNT user’s password
Incremental Sync | ranger.usersync.ldap.deltasync | Yes
Enable LDAP STARTTLS | ranger.usersync.ldap.starttls | No

GROUP CONFIGS

Label | Property | Value
Enable Group Sync | ranger.usersync.group.searchenable | Yes
Group Member Attribute | ranger.usersync.group.memberattributename | member
Group Name Attribute | ranger.usersync.group.nameattribute | Cn
Group Object Class | ranger.usersync.group.objectclass | Group
Group Search Base | ranger.usersync.group.searchbase | OU=ArthurAmericasGroups,OU=Security Groups,OU=Groups,OU=SHARED,OU=Brazil,DC=domain,DC=com
Group Search Filter | ranger.usersync.group.searchfilter |
Enable Group Search First | ranger.usersync.group.search.first.enabled | Yes
Sync Nested Groups | is_nested_groupsync_enabled | Yes
Group Hierarchy Levels | ranger.usersync.ldap.grouphierarchylevels | 5

USER CONFIGS

Label | Property | Value
Username Attribute | ranger.usersync.ldap.user.nameattribute | sAMAccountName
User Object Class | ranger.usersync.ldap.objectclass | User
User Search Base | ranger.usersync.ldap.searchbase | DC=domain,DC=com
User Search Filter | ranger.usersync.ldap.user.searchfilter | (memberOf=CN=R2Users,OU=ArthurAmericasGroups,OU=Security Groups,OU=Groups,OU=SHARED,OU=Brazil,DC=domain,DC=com)
User Search Scope | ranger.usersync.ldap.user.searchscope | Sub
User Group Name Attribute | ranger.usersync.ldap.groupnameattribute | sAMAccountName
Group User Map Sync | ranger.usersync.group.usermapsyncenabled | Yes
Enable User Search | ranger.usersync.user.searchenabled | Yes

ADVANCED

Ranger Settings

Label | Property | Value
Authentication method | | ACTIVE_DIRECTORY

AD Settings

Label | Property | Value
AD Bind Password | ranger.ldap.ad.bind.password | LDAP_AD_ACCOUNT user’s password
Domain Name (Only for AD) | ranger.ldap.ad.domain | DC=domain,DC=com
AD Base DN | ranger.ldap.ad.base.dn | DC=domain,DC=com
AD Referral | ranger.ldap.ad.referral | Follow
AD User Search Filter | ranger.ldap.ad.user.search | (sAMAccountName={0})

Advanced ranger-ugsync-site

Label | Property | Value
ranger.usersync.ldap.referral | ranger.usersync.ldap.referral | Follow
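The note about widening ranger.usersync.ldap.user.searchfilter to cover more than one group can be illustrated with standard LDAP filter syntax, where several memberOf clauses are combined with an OR (`|`). The sketch below is only an illustration: the helper name is hypothetical, and placing R2TEAM in the same OU as R2Users is an assumption. It does not escape special characters in DNs, which a real filter builder would need to do.

```python
from typing import Iterable


def member_of_filter(group_dns: Iterable[str]) -> str:
    """Build an LDAP filter matching users who are members of any given group."""
    clauses = [f"(memberOf={dn})" for dn in group_dns]
    if not clauses:
        raise ValueError("at least one group DN is required")
    if len(clauses) == 1:
        # A single group needs no OR wrapper.
        return clauses[0]
    return "(|" + "".join(clauses) + ")"


# Assumed base OU, copied from the group search base in the post.
BASE_OU = "OU=ArthurAmericasGroups,OU=Security Groups,OU=Groups,OU=SHARED,OU=Brazil,DC=domain,DC=com"

# Hypothetical: R2TEAM is assumed to live in the same OU as R2Users.
example_filter = member_of_filter([f"CN=R2Users,{BASE_OU}", f"CN=R2TEAM,{BASE_OU}"])
```

With a single DN the function returns the same shape as the User Search Filter value shown in the post. With two or more it wraps the clauses as `(|(memberOf=...)(memberOf=...))`.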

stevenmatison · ‎11-04-2019

Very good question here. Let me share some of my thoughts, as I have installed Ambari both from source and from the Hortonworks repos. Before I get started, you should know that Hortonworks was a major contributor to the Ambari project, and as such their documentation on how to install Ambari and its components is very detailed. In my opinion this is the preferred documentation. The Hortonworks repos are THE public repos for Ambari. Using them is much easier than building from source. The Ambari project page at ambari.apache.org is just the project page. Its documentation is specifically for Ambari, not necessarily for "hadoop", and does not include all the screenshots and deeper information you will find in the Hortonworks/Cloudera documentation for the same. Although the project page does not go into much detail, it does have the required artifacts and enough information to set up nodes and get into the Cluster Install Wizard. For organizations that are required to use private repos or to build their own, the Ambari project page is very important.

dinhhoanglong · ‎09-24-2019

OK. tested and successful

Member Since	‎07-11-2019 08:54 PM
Last Visited	‎10-31-2020 09:19 PM
Posts	102
Kudos received	4

Cloudera Community

Re: How to run spark-submit in virtualenv for pysp...

Re: LDAP/AD users not appearing in Ranger

Re: Ambari unable to run custom hook for modifying...

Re: Where are the spark2 binaries?

Re: What are HDFS NFS "access times"?

Re: Ambari unable to run custom hook for modifying...

Re: Timeline Service V2.0 Reader not starting

Re: Previously working spark jobs only now throwin...

Re: Create users in hadoop/HDP 2.5

Re: hadoop.proxyuser.root.groups config setting. W...

Re: How to run spark-submit in virtualenv for pysp...

Re: LDAP/AD users not appearing in Ranger

Re: Difference between installing from Apache vs H...

Re: HDP Ambari installation throws "org.postgresql...