<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Adding a new user to the cluster in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Adding-a-new-user-to-the-cluster/m-p/130328#M18471</link>
    <description>&lt;P&gt;please see my comment above. &lt;/P&gt;&lt;P&gt;In secure mode you need local user accounts on all Nodemanager nodes&lt;/P&gt;</description>
    <pubDate>Thu, 18 Feb 2016 15:00:21 GMT</pubDate>
    <dc:creator>jstraub</dc:creator>
    <dc:date>2016-02-18T15:00:21Z</dc:date>
    <item>
      <title>Adding a new user to the cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Adding-a-new-user-to-the-cluster/m-p/130318#M18461</link>
      <description>&lt;P&gt;Say users are allowed to access a cluster from the edge node of a cluster. If the user wants to run jobs on the cluster, does the user should have his account on all the nodes of the cluster or just having an account on the edge node is enough?&lt;/P&gt;</description>
      <pubDate>Mon, 08 Feb 2016 18:34:48 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Adding-a-new-user-to-the-cluster/m-p/130318#M18461</guid>
      <dc:creator>arunpoy</dc:creator>
      <dc:date>2016-02-08T18:34:48Z</dc:date>
    </item>
    <item>
      <title>Re: Adding a new user to the cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Adding-a-new-user-to-the-cluster/m-p/130319#M18462</link>
      <description>&lt;P&gt;Hi &lt;A rel="user" href="https://community.cloudera.com/users/2302/arunpoy.html" nodeid="2302"&gt;@ARUNKUMAR RAMASAMY&lt;/A&gt;&lt;/P&gt;&lt;P&gt;No. User should not have account on all the nodes of the cluster. He should only have account on edge node.&lt;/P&gt;&lt;P&gt;For a new user there are 2 types are directories we need to create before the user access the cluster.
1- User home directory [directory created on Linux Filesystem ie. /home/&amp;lt;username&amp;gt;]
2- User HDFS directory [directory created on HDFS filesystem ie. /user/&amp;lt;username&amp;gt;]&lt;/P&gt;&lt;P&gt;As per neeraj, you only need to create HDFS home directory[ie. /user/&amp;lt;username&amp;gt;] on edge node.
You can still run jobs with the new user on cluster, even if you havent created his home directory in linux.&lt;/P&gt;&lt;P&gt;==============&lt;/P&gt;&lt;P&gt;Below are 2 scenarios -&lt;/P&gt;&lt;P&gt;a. I added new user on edge node using command - 
#useradd &amp;lt;username&amp;gt;
Before launching job on cluster, i need to create hdfs directory for user
#sudo -u hdfs hadoop fs -mkdir &amp;lt;/user/{username}&amp;gt;
#sudo -u hdfs hadoop fs chown -R &amp;lt;username&amp;gt;:&amp;lt;grp_name&amp;gt; &amp;lt;/user/{username}&amp;gt;&lt;/P&gt;&lt;P&gt;b. If the user is coming from ldap server, then you only need to make your edge node as ldap client and create a directory in HDFS using below command -&lt;/P&gt;&lt;P&gt;#sudo -u hdfs hadoop fs -mkdir &amp;lt;/user/{username}&amp;gt;
#sudo -u hdfs hadoop fs chown -R &amp;lt;username&amp;gt;:&amp;lt;grp_name&amp;gt; &amp;lt;/user/{username}&amp;gt;&lt;/P&gt;&lt;P&gt;Let me know if this clears, what you are looking for.&lt;/P&gt;</description>
      <pubDate>Tue, 09 Feb 2016 10:30:25 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Adding-a-new-user-to-the-cluster/m-p/130319#M18462</guid>
      <dc:creator>sshimpi</dc:creator>
      <dc:date>2016-02-09T10:30:25Z</dc:date>
    </item>
    <item>
      <title>Re: Adding a new user to the cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Adding-a-new-user-to-the-cluster/m-p/130320#M18463</link>
      <description>&lt;P&gt;Thanks &lt;A rel="user" href="https://community.cloudera.com/users/2648/sshimpi.html" nodeid="2648"&gt;@Sagar Shimpi&lt;/A&gt;, i got a clear picture. one more question, what permission do you give for the root directory / on hdfs&lt;/P&gt;</description>
      <pubDate>Tue, 09 Feb 2016 14:35:25 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Adding-a-new-user-to-the-cluster/m-p/130320#M18463</guid>
      <dc:creator>arunpoy</dc:creator>
      <dc:date>2016-02-09T14:35:25Z</dc:date>
    </item>
    <item>
      <title>Re: Adding a new user to the cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Adding-a-new-user-to-the-cluster/m-p/130321#M18464</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/2302/arunpoy.html" nodeid="2302"&gt;@ARUNKUMAR RAMASAMY&lt;/A&gt; &lt;/P&gt;&lt;P&gt;The root directory "/" permissions are 755[ie. rwxr-xr-x], by default these permissions are as per linux standards [ie umask].Umask for hdfs user is "022"
And the owner and group are set to hdfs:hdfs.&lt;/P&gt;</description>
      <pubDate>Tue, 09 Feb 2016 15:01:45 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Adding-a-new-user-to-the-cluster/m-p/130321#M18464</guid>
      <dc:creator>sshimpi</dc:creator>
      <dc:date>2016-02-09T15:01:45Z</dc:date>
    </item>
    <item>
      <title>Re: Adding a new user to the cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Adding-a-new-user-to-the-cluster/m-p/130322#M18465</link>
      <description>&lt;P&gt;&lt;STRONG&gt;This is right in beginner setup, when your
hadoop cluster integrated with Kerberos security then authenticated user must exist in
the every node where the task runs.&lt;/STRONG&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 16 Feb 2016 16:59:48 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Adding-a-new-user-to-the-cluster/m-p/130322#M18465</guid>
      <dc:creator>Vikas1</dc:creator>
      <dc:date>2016-02-16T16:59:48Z</dc:date>
    </item>
    <item>
      <title>Re: Adding a new user to the cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Adding-a-new-user-to-the-cluster/m-p/130323#M18466</link>
      <description>&lt;P&gt;Hi &lt;A rel="user" href="https://community.cloudera.com/users/2284/vcgadade.html" nodeid="2284"&gt;@Vikas  Gadade&lt;/A&gt; - I think this is not the case. Even if you have kerberized cluster you still have only user added on Gateway/Client node. Make sure you have proper keyabs in place.&lt;/P&gt;&lt;P&gt;Hadoop Service always use Delegation token to nodes and access/execute jobs within kerberized cluster where it executes task.&lt;/P&gt;</description>
      <pubDate>Wed, 17 Feb 2016 12:26:40 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Adding-a-new-user-to-the-cluster/m-p/130323#M18466</guid>
      <dc:creator>sshimpi</dc:creator>
      <dc:date>2016-02-17T12:26:40Z</dc:date>
    </item>
    <item>
      <title>Re: Adding a new user to the cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Adding-a-new-user-to-the-cluster/m-p/130324#M18467</link>
      <description>&lt;P&gt;Hi &lt;A rel="user" href="https://community.cloudera.com/users/2648/sshimpi.html" nodeid="2648"&gt;@Sagar Shimpi&lt;/A&gt;, &lt;A rel="user" href="https://community.cloudera.com/users/2284/vcgadade.html" nodeid="2284"&gt;@Vikas  Gadade&lt;/A&gt; may be correct. This is the info i got from one of the hadoop admins.&lt;/P&gt;&lt;P&gt;Can someone clarify or validate&lt;/P&gt;&lt;H4&gt;
&lt;/H4&gt;&lt;P&gt;In non-security mode (without kerberos), an account on edgenode is sufficient because user's container will run on slave nodes under the yarn account. In secure mode (with kerberos), you should have accounts on all machines, because user's container should run on slave nodes under the real username.&lt;/P&gt;</description>
      <pubDate>Wed, 17 Feb 2016 12:51:26 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Adding-a-new-user-to-the-cluster/m-p/130324#M18467</guid>
      <dc:creator>arunpoy</dc:creator>
      <dc:date>2016-02-17T12:51:26Z</dc:date>
    </item>
    <item>
      <title>Re: Adding a new user to the cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Adding-a-new-user-to-the-cluster/m-p/130325#M18468</link>
      <description>&lt;P&gt;&lt;STRONG&gt;hi  &lt;/STRONG&gt;&lt;A href="https://community.hortonworks.com/users/2302/arunpoy.html"&gt;@ARUNKUMAR RAMASAMY&lt;/A&gt;, &lt;STRONG&gt; &lt;/STRONG&gt;&lt;A href="https://community.hortonworks.com/users/2648/sshimpi.html"&gt;@Sagar Shimpi&lt;/A&gt;  &lt;STRONG&gt;I&lt;/STRONG&gt; &lt;STRONG&gt;verified with removing user "vgadade"
from  datanode, please find below output
on secure hadoop cluster&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Here is sample job output..&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;16/02/17 01:47:24 INFO mapreduce.Job: Job
job_1455179809801_0007 running in uber mode : false&lt;/P&gt;&lt;P&gt;16/02/17 01:47:24 INFO mapreduce.Job:  map 0% reduce 0%&lt;/P&gt;&lt;P&gt;16/02/17 01:47:26 INFO mapreduce.Job: Task Id :
attempt_1455179809801_0007_m_000001_0, Status : FAILED&lt;/P&gt;&lt;P&gt;Application application_1455179809801_0007 initialization
failed (exitCode=255) with output: main : command provided 0&lt;/P&gt;&lt;P&gt;main : run as user is vgadade&lt;/P&gt;&lt;P&gt;main : requested yarn user is vgadade&lt;/P&gt;&lt;P&gt;User vgadade not found&lt;/P&gt;&lt;P&gt;16/02/17 01:47:29 INFO mapreduce.Job: Task Id :
attempt_1455179809801_0007_m_000001_1, Status : FAILED&lt;/P&gt;&lt;P&gt;Application application_1455179809801_0007 initialization
failed (exitCode=255) with output: main : command provided 0&lt;/P&gt;&lt;P&gt;main : run as user is vgadade&lt;/P&gt;&lt;P&gt;main : requested yarn user is vgadade&lt;/P&gt;&lt;P&gt;User vgadade not found&lt;/P&gt;&lt;P&gt;16/02/17 01:47:30 INFO mapreduce.Job:  map 50% reduce 0%&lt;/P&gt;&lt;P&gt;16/02/17 01:47:32 INFO mapreduce.Job: Task Id :
attempt_1455179809801_0007_m_000001_2, Status : FAILED&lt;/P&gt;&lt;P&gt;Application application_1455179809801_0007 initialization
failed (exitCode=255) with output: main : command provided 0&lt;/P&gt;&lt;P&gt;main : run as user is vgadade&lt;/P&gt;&lt;P&gt;main : requested yarn user is vgadade&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;User vgadade not found&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;16/02/17 01:47:37 INFO mapreduce.Job:  map 100% reduce 0%&lt;/P&gt;&lt;P&gt;16/02/17 01:47:43 INFO mapreduce.Job:  map 100% reduce 100%&lt;/P&gt;&lt;P&gt;16/02/17 01:47:44 INFO mapreduce.Job: Job
job_1455179809801_0007 completed successfully&lt;/P&gt;&lt;P&gt;  Job Counters&lt;/P&gt;&lt;P&gt;  Failed
map tasks=3&lt;/P&gt;&lt;P&gt; 
Launched map tasks=5&lt;/P&gt;&lt;P&gt; 
Launched reduce tasks=1&lt;/P&gt;&lt;P&gt;  Other
local map tasks=3&lt;/P&gt;&lt;P&gt;   Data-local map tasks=1&lt;/P&gt;&lt;P&gt; 
Rack-local map tasks=1&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Job Finished in 28.409 seconds&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Nodemanager log out file ouput&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;2016-02-17 01:47:24,944 WARN
org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor: Exit code
from container container_e21_1455179809801_0007_01_000002 startLocalizer is :
255&lt;/P&gt;&lt;P&gt;java.io.IOException: Application
application_1455179809801_0007 initialization failed (exitCode=255) with
output: main : command provided 0&lt;/P&gt;&lt;P&gt;main : run as user is vgadade&lt;/P&gt;&lt;P&gt;main : requested yarn user is vgadade&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;User vgadade not found&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;  at
org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor.startLocalizer(LinuxContainerExecutor.java:262)&lt;/P&gt;&lt;P&gt;2016-02-17 01:47:24,946 INFO
org.apache.hadoop.yarn.server.nodemanager.containermanager.container.Container:
Container container_e21_1455179809801_0007_01_000002 transitioned from
LOCALIZING to LOCALIZATION_FAILED&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;vgadade user added again on datanode and executed job (
container is successfully launched on previous Data node) &lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Here is sample job output..&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;16/02/17 01:55:46 INFO mapreduce.Job: Running job: job_1455179809801_0008&lt;/P&gt;&lt;P&gt;16/02/17 01:55:52 INFO mapreduce.Job: Job
job_1455179809801_0008 running in uber mode : false&lt;/P&gt;&lt;P&gt;16/02/17 01:55:52 INFO mapreduce.Job:  map 0% reduce 0%&lt;/P&gt;&lt;P&gt;16/02/17 01:55:58 INFO mapreduce.Job:  map 100% reduce 0%&lt;/P&gt;&lt;P&gt;16/02/17 01:56:04 INFO mapreduce.Job:  map 100% reduce 100%&lt;/P&gt;&lt;P&gt;16/02/17 01:56:04 INFO mapreduce.Job: Job
job_1455179809801_0008 completed successfully&lt;/P&gt;&lt;P&gt;16/02/17 01:56:05 INFO mapreduce.Job: Counters: 50&lt;/P&gt;&lt;P&gt;  Job Counters&lt;/P&gt;&lt;P&gt; 
Launched map tasks=2&lt;/P&gt;&lt;P&gt; 
Launched reduce tasks=1&lt;/P&gt;&lt;P&gt; 
Data-local map tasks=1&lt;/P&gt;&lt;P&gt; 
Rack-local map tasks=1&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Job Finished in 20.333 seconds&lt;/STRONG&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 17 Feb 2016 15:30:24 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Adding-a-new-user-to-the-cluster/m-p/130325#M18468</guid>
      <dc:creator>Vikas1</dc:creator>
      <dc:date>2016-02-17T15:30:24Z</dc:date>
    </item>
    <item>
      <title>Re: Adding a new user to the cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Adding-a-new-user-to-the-cluster/m-p/130326#M18469</link>
      <description>&lt;P&gt;Thanks &lt;A rel="user" href="https://community.cloudera.com/users/2284/vcgadade.html" nodeid="2284"&gt;@Vikas  Gadade&lt;/A&gt;. It helped.&lt;/P&gt;</description>
      <pubDate>Thu, 18 Feb 2016 13:14:23 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Adding-a-new-user-to-the-cluster/m-p/130326#M18469</guid>
      <dc:creator>arunpoy</dc:creator>
      <dc:date>2016-02-18T13:14:23Z</dc:date>
    </item>
    <item>
      <title>Re: Adding a new user to the cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Adding-a-new-user-to-the-cluster/m-p/130327#M18470</link>
      <description>&lt;P style="margin-left: 40px;"&gt;&lt;A rel="user" href="https://community.cloudera.com/users/2648/sshimpi.html" nodeid="2648"&gt;@Sagar Shimpi&lt;/A&gt; &lt;A rel="user" href="https://community.cloudera.com/users/2302/arunpoy.html" nodeid="2302"&gt;@ARUNKUMAR RAMASAMY&lt;/A&gt; I agree with &lt;A rel="user" href="https://community.cloudera.com/users/2284/vcgadade.html" nodeid="2284"&gt;@Vikas  Gadade&lt;/A&gt;, if you want to execute jobs with your user account, you have to make sure the user is available on every Nodemanager node! &lt;/P&gt;&lt;P style="margin-left: 40px;"&gt;Please see this =&amp;gt; &lt;EM&gt;"&lt;/EM&gt;YARN containers in a secure cluster use the operating system facilities to offer execution isolation for containers. Secure containers execute under the credentials of the job user. The operating system enforces access restriction for the container. The container must run as the user that submitted the application."&lt;EM&gt;  &lt;/EM&gt;more info =&amp;gt; &lt;A href="https://hadoop.apache.org/docs/stable/hadoop-yarn/hadoop-yarn-site/SecureContainer.html" target="_blank"&gt;https://hadoop.apache.org/docs/stable/hadoop-yarn/hadoop-yarn-site/SecureContainer.html&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 18 Feb 2016 14:53:36 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Adding-a-new-user-to-the-cluster/m-p/130327#M18470</guid>
      <dc:creator>jstraub</dc:creator>
      <dc:date>2016-02-18T14:53:36Z</dc:date>
    </item>
    <item>
      <title>Re: Adding a new user to the cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Adding-a-new-user-to-the-cluster/m-p/130328#M18471</link>
      <description>&lt;P&gt;please see my comment above. &lt;/P&gt;&lt;P&gt;In secure mode you need local user accounts on all Nodemanager nodes&lt;/P&gt;</description>
      <pubDate>Thu, 18 Feb 2016 15:00:21 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Adding-a-new-user-to-the-cluster/m-p/130328#M18471</guid>
      <dc:creator>jstraub</dc:creator>
      <dc:date>2016-02-18T15:00:21Z</dc:date>
    </item>
    <item>
      <title>Re: Adding a new user to the cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Adding-a-new-user-to-the-cluster/m-p/130329#M18472</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/113/jstraub.html" nodeid="113"&gt;@Jonas Straub&lt;/A&gt; &lt;A rel="user" href="https://community.cloudera.com/users/2284/vcgadade.html" nodeid="2284"&gt;@Vikas  Gadade&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Thank you for clarify. Is there any workaround for that? Or is it fixed in HDP2.6? I also use SSSD + Kerberos only on management nodes. On Nodemanager hosts AD users does not exists thus YARN is not working.&lt;/P&gt;</description>
      <pubDate>Wed, 28 Jun 2017 13:53:47 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Adding-a-new-user-to-the-cluster/m-p/130329#M18472</guid>
      <dc:creator>frank93</dc:creator>
      <dc:date>2017-06-28T13:53:47Z</dc:date>
    </item>
  </channel>
</rss>

