<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: BDR in secured cluster - Run as Username in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/BDR-in-secured-cluster-Run-as-Username/m-p/55116#M47206</link>
    <description>&lt;P&gt;The trick is to create a BDR specific user and add them to the hive or supergroup groups as relevant for hive or general hdfs backups. No facl or sticky bits are required.&lt;/P&gt;</description>
    <pubDate>Fri, 26 May 2017 13:28:45 GMT</pubDate>
    <dc:creator>ScottE</dc:creator>
    <dc:date>2017-05-26T13:28:45Z</dc:date>
    <item>
      <title>BDR in secured cluster - Run as Username</title>
      <link>https://community.cloudera.com/t5/Support-Questions/BDR-in-secured-cluster-Run-as-Username/m-p/54422#M47205</link>
      <description>&lt;P&gt;I am attempting to configure a BDR backup from a secured (kerberos &amp;amp; Sentry with HDFS permission synchronization enabled) CDH 5.9.0 cluster to S3. I can successfully use BDR to backup my own data (e.g. /users/myname) but now I want to backup some Hive/Impala data that is protected by Sentry. I am using HDFS rather than Hive replication (I don't believe this is material to the question).&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;If I configure BDR to run using my own userid, which happens to have full access according to Sentry permissions this results in an AccessControlException&lt;/P&gt;&lt;PRE&gt;org.apache.hadoop.security.AccessControlException: Permission denied: user=myuser, access=READ, inode="/data":hive:hive:drwxrwx--x&lt;/PRE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I would have thought that the fact that Sentry has been configured to synchronize HDFS permissions would have meant that I could run this.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;According to&amp;nbsp;&lt;A href="https://www.cloudera.com/documentation/enterprise/5-9-x/topics/cm_bdr_hive_replication.html" target="_blank"&gt;https://www.cloudera.com/documentation/enterprise/5-9-x/topics/cm_bdr_hive_replication.html&lt;/A&gt; when Kerberos is in use it is necessary to use a user with an ID greater than 1000, so this rules out the hdfs and hive users.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;It also states that read and execute permissions are needed on the source cluster for BDR to operate.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;So if my user cannot be used this would&amp;nbsp;means I need to create a BDR user account that has these permissions.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The directories I want to back up are protected with Sentry, so as per&amp;nbsp;&lt;A href="https://www.cloudera.com/documentation/enterprise/5-9-x/topics/sg_sentry_service_config.html#concept_z5b_42s_p4__section_lvc_4g4_rp" target="_blank"&gt;https://www.cloudera.com/documentation/enterprise/5-9-x/topics/sg_sentry_service_config.html#concept_z5b_42s_p4__section_lvc_4g4_rp&lt;/A&gt; these directories have permissions as follows&lt;/P&gt;&lt;PRE&gt;$ hdfs dfs -chown hive:hive /data
$ hdfs dfs -chmod 771 /data&lt;/PRE&gt;&lt;P&gt;Continuing down this path, to be able to use BDR I will need to use an extended ACL to assign&amp;nbsp;rx permissions on the relevant directories to the user.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;To cater for new directories that come along I am thinking that it would also&amp;nbsp;be necessary to&amp;nbsp;add the sticky bit on this operation. Does the following seem reasonable (running as hdfs user with relevant keytab)?&lt;/P&gt;&lt;PRE&gt;$ hdfs dfs -setfacl -R -m group:backup_users:r-xt /data&lt;/PRE&gt;&lt;P&gt;Information on using the sticky bit is thin on the ground; is this even supported and supported for&amp;nbsp;extended ACLs?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Is there something I am missing that makes BDR with a kerberos enabled cluster easier than this?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks, S.&lt;/P&gt;</description>
      <pubDate>Fri, 16 Sep 2022 11:33:49 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/BDR-in-secured-cluster-Run-as-Username/m-p/54422#M47205</guid>
      <dc:creator>ScottE</dc:creator>
      <dc:date>2022-09-16T11:33:49Z</dc:date>
    </item>
    <item>
      <title>Re: BDR in secured cluster - Run as Username</title>
      <link>https://community.cloudera.com/t5/Support-Questions/BDR-in-secured-cluster-Run-as-Username/m-p/55116#M47206</link>
      <description>&lt;P&gt;The trick is to create a BDR specific user and add them to the hive or supergroup groups as relevant for hive or general hdfs backups. No facl or sticky bits are required.&lt;/P&gt;</description>
      <pubDate>Fri, 26 May 2017 13:28:45 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/BDR-in-secured-cluster-Run-as-Username/m-p/55116#M47206</guid>
      <dc:creator>ScottE</dc:creator>
      <dc:date>2017-05-26T13:28:45Z</dc:date>
    </item>
  </channel>
</rss>

