Member since: 08-05-2014
Posts: 9
Kudos Received: 2
Solutions: 0
04-01-2015
09:33 AM
Hi Harsh, thanks for the reply. I didn't know about the "yarn.nodemanager.linux-container-executor.nonsecure-mode.limit-users" property. I will explore both options and choose the better one for the current experiments. Thanks, Michele
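For reference, here is a minimal sketch of how I understand the two non-secure-mode properties to interact. It is a simplified model, not the NodeManager code: the function name and its parameters are hypothetical, and only the two property names and the "nobody" default come from the YARN configuration.

```python
def resolve_run_as_user(submitting_user: str,
                        limit_users: bool = True,
                        local_user: str = "nobody") -> str:
    """Simplified model of the non-secure Linux Container Executor.

    limit_users models
      yarn.nodemanager.linux-container-executor.nonsecure-mode.limit-users
    local_user models
      yarn.nodemanager.linux-container-executor.nonsecure-mode.local-user
    """
    # With limit-users enabled, every container runs as the single local user;
    # with it disabled, the container runs as the user who submitted the job.
    return local_user if limit_users else submitting_user


print(resolve_run_as_user("m.giusto"))                     # limit-users on
print(resolve_run_as_user("m.giusto", limit_users=False))  # limit-users off
```

If this model is right, setting limit-users to false makes jobs run as the submitting user, which then has to exist on every NodeManager host.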
04-01-2015
03:13 AM
Hi Dave, thanks for the quick answer. You are right, the problem is that the "Always Use Linux Container Executor" flag is enabled. I have unchecked it and now things seem to be working. However, the Cloudera Manager description of the "Always Use Linux Container Executor" flag says "Cgroups enforcement only works when the Linux Container Executor is used", so if I want to use the desired "Static Resource Pool", where YARN gets only X% of the resources, I have to keep the flag enabled (now I also understand when the flag gets checked: after the first configuration of the resource pool). So I installed what is needed for cgroups (libcgroup) and re-enabled the flag. Now, if I run a YARN application (like a Hive query), everything works. If instead I run an Oozie job with a shell action inside, the shell action is executed by the "nobody" user (the real Oozie user is "m.giusto"). Normally shell actions are executed as "yarn", so I added "yarn" to "allowed.system.users" and removed it from "banned.users". The "nobody" user remains the MR user. Any idea? Michele
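To make the three container-executor settings concrete, here is a rough sketch of how I understand the user check to work. This is a simplified Python model under my own assumptions, not the native container-executor code: the function name, the UIDs, and the example lists are hypothetical, and the real check has more rules than shown here.

```python
def check_user(user, uid, min_user_id, banned_users, allowed_system_users):
    """Simplified model of the min.user.id / banned.users /
    allowed.system.users check. Returns (accepted, reason)."""
    # A UID below min.user.id is rejected unless the user is whitelisted
    # in allowed.system.users.
    if uid < min_user_id and user not in allowed_system_users:
        return False, f"{user} has id {uid}, below the minimum {min_user_id}"
    # A user listed in banned.users is rejected even when whitelisted.
    if user in banned_users:
        return False, f"{user} is banned"
    return True, "ok"


# Hypothetical UIDs and lists, for illustration only.
print(check_user("m.giusto", 1001, 1000, {"hdfs"}, set()))
print(check_user("yarn", 497, 1000, {"yarn"}, {"yarn"}))
```

In this model a system user needs both changes, adding it to "allowed.system.users" and removing it from "banned.users", before it is accepted.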
03-31-2015
04:26 AM
1 Kudo
Hi, I am having some problems with YARN, and this is not the first cluster where this happens, so I don't understand what I am doing wrong. Every night I shut down the clusters (installed on AWS and SoftLayer) to avoid spending money while they are not in use. Also, sooner or later I need bigger machines, so I change the AWS instance type (or the SoftLayer equivalent).
At some point that I cannot pin down, after one of the restarts, YARN starts having problems with the NodeManager user cache directory (e.g. /bigdata1/yarn/nm/usercache/m.giusto), like in this case (https://community.cloudera.com/t5/Data-Ingestion-Integration/Sqoop-Error-Sqoop-2-not-working-through-hue/td-p/24261/page/2), and I am forced to remove everything from all the user cache directories (acceptable), otherwise jobs are unable to start.
However, the bigger problem is that YARN also starts applying an unwanted rule: each user that submits a job is considered not allowed, and YARN starts the job as "nobody" (the default value of yarn.nodemanager.linux-container-executor.nonsecure-mode.local-user). This happens for a non-super-user like "m.giusto" (UID over 1000) and also for "hdfs" (UID less than 500). I have tried moving "hdfs" from "banned.users" to "allowed.system.users" and setting "min.user.id" to 0, with no change. Moreover, the "nobody" user is not able to write to the real user's user cache folder (permission denied), so the job fails:
main : user is nobody
main : requested yarn user is m.giusto
Can't create directory /bigdata1/yarn/nm/usercache/m.giusto/appcache/application_1427799738120_0001 - Permission denied
Can't create directory /bigdata2/yarn/nm/usercache/m.giusto/appcache/application_1427799738120_0001 - Permission denied
Did not create any app directories
.Failing this attempt.. Failing the application.
What I do not understand is why the system starts applying these rules and how to fix it. At the moment the only solution is to reinstall the cluster.
Some other info: the OS is CentOS 6.6; the tested CDH versions are 5.2.1, 5.3.1 and 5.3.2. Thanks, Michele
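The manual cleanup of the user cache directories described above can be sketched as follows. This is only an illustration of what I do by hand: the function name is hypothetical, the local directories are the ones from my cluster, and I assume the NodeManager is stopped before running it.

```python
import shutil
from pathlib import Path


def clear_usercache(local_dirs):
    """Remove every per-user directory under <local_dir>/usercache.

    Returns the list of removed paths. The usercache directory itself
    is left in place so the NodeManager can repopulate it.
    """
    removed = []
    for local_dir in local_dirs:
        usercache = Path(local_dir) / "usercache"
        if not usercache.is_dir():
            continue  # nothing to clean on this disk
        for entry in sorted(usercache.iterdir()):
            if entry.is_dir():
                shutil.rmtree(entry)
                removed.append(str(entry))
    return removed


if __name__ == "__main__":
    # Paths taken from the log above; adjust to the configured local dirs.
    for path in clear_usercache(["/bigdata1/yarn/nm", "/bigdata2/yarn/nm"]):
        print("removed", path)
```

It has to be run on every NodeManager host by a user allowed to delete those directories.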
08-18-2014
06:16 AM
Hi Robert, thanks for the answer. You are right, that is the exception I get. I got it even when copying the entire "sqoop" dir into the "sqoop2" dir without replacing any jar. However, while testing I changed permissions and restarted the Oozie server, and now it is working. Thanks, Michele
08-12-2014
04:22 PM
Hi, due to the bug reported here: https://issues.apache.org/jira/browse/SQOOP-1420 I have created a custom version of Sqoop that forces the timezone to GMT when converting epoch to String. Now I need to use the custom jar in Oozie when launching the Sqoop action. Looking at this: http://blog.cloudera.com/blog/2012/12/how-to-use-the-sharelib-in-apache-oozie/ I chose to override the shared lib for the Sqoop action by adding: <configuration>
<property>
<name>oozie.action.sharelib.for.sqoop</name>
<value>sqoopPatched</value>
</property>
</configuration> to the Sqoop action (and creating the "sqoopPatched" folder in the shared lib path "/user/oozie/share/lib/lib_20140625170654/sqoopPatched" with the modified "sqoop-1.4.4-cdh5.0.2.jar"). However, the workflow fails because it doesn't find the main method of Sqoop. Does anyone know how to solve this? Could it be that with the new version of the shared lib (http://blog.cloudera.com/blog/2014/05/how-to-use-the-sharelib-in-apache-oozie-cdh-5/) it is no longer possible to override libs? Thanks, Michele PS: I do not want to change the standard Sqoop in the shared lib because I am not sure that the modified jar is right for every kind of timestamp column/database. But I still need it in 10 jobs, so I prefer to create a custom shared lib instead of adding a lib folder in each workflow folder.
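One check that may help narrow down the "main method not found" failure is to confirm that the patched jar still contains the expected entry class. The sketch below is only an assumption about where to look: the function name is hypothetical, and "org.apache.sqoop.Sqoop" is my guess at the class worth checking, not something taken from the error message.

```python
import zipfile


def jar_has_class(jar_path, class_name):
    """Return True if the jar contains the compiled class file."""
    # A jar is a zip archive; a class a.b.C is stored as a/b/C.class.
    entry = class_name.replace(".", "/") + ".class"
    with zipfile.ZipFile(jar_path) as jar:
        return entry in jar.namelist()


if __name__ == "__main__":
    # Local copy of the patched jar; the class name is an assumption.
    print(jar_has_class("sqoop-1.4.4-cdh5.0.2.jar", "org.apache.sqoop.Sqoop"))
```

If the class is present, the next thing to compare is the full list of jars in the standard "sqoop" shared lib folder against the contents of "sqoopPatched".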
Labels: Apache Oozie