<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Confusion about Hive &amp;amp; Impala behaviour for access to HDFS file with only HDFS user permissi in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Confusion-about-Hive-amp-Impala-behaviour-for-access-to-HDFS/m-p/46178#M43181</link>
    <description>&lt;P&gt;&lt;SPAN&gt;&amp;nbsp;When you run impala-shell &amp;nbsp;it would not run as "impala", it would run as the current user. Impala does not support HDFS-level user impersonation .if you need&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN&gt;grandular level authorization / user permission you might want to use Sentry .&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;please refer this link.&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;&lt;A href="https://www.cloudera.com/documentation/enterprise/5-2-x/topics/cm_sg_sentry_service.html" target="_blank"&gt;https://www.cloudera.com/documentation/enterprise/5-2-x/topics/cm_sg_sentry_service.html&lt;/A&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Wed, 12 Oct 2016 14:24:16 GMT</pubDate>
    <dc:creator>csguna</dc:creator>
    <dc:date>2016-10-12T14:24:16Z</dc:date>
    <item>
      <title>Confusion about Hive &amp; Impala behaviour for access to HDFS file with only HDFS user permissions</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Confusion-about-Hive-amp-Impala-behaviour-for-access-to-HDFS/m-p/46137#M43180</link>
      <description>&lt;P&gt;I need some clarity on my chosen solution.&lt;BR /&gt;&lt;BR /&gt;I have a CDH 5.3.9 cluster. After assigning roles, I wanted to add some custom UDF's to Hive and Impala. The .jar of the UDF is placed in &lt;FONT face="courier new,courier"&gt;/user/hdfs&lt;/FONT&gt; in HDFS. &lt;FONT face="courier new,courier"&gt;/user/hdfs&lt;/FONT&gt; has &lt;FONT face="courier new,courier"&gt;700&lt;/FONT&gt; for &lt;FONT face="courier new,courier"&gt;hdfs:supergroup&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;For Hive (NOT Hiveserver2), // Version 0.13&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;Login to Hive CLI as hdfs user&lt;/LI&gt;&lt;LI&gt;Execute create function statement&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;&lt;STRONG&gt;It works. It can access the .jar and create the function. I can test the UDF etc.&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;For Impala, // Version 2.1.7&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;Login to Impala CLI as hdfs user&lt;/LI&gt;&lt;LI&gt;Execute create function statement&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;&lt;STRONG&gt;It doesn't work since Impala doesnt have permissions to access &lt;FONT face="courier new,courier"&gt;/user/hdfs&lt;/FONT&gt;&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;If I add &lt;FONT face="courier new,courier"&gt;impala&lt;/FONT&gt; user to supergroup in Linux, it works since impala is added to HDFS superuser group&lt;/P&gt;&lt;P&gt;OR&lt;/P&gt;&lt;P&gt;If i give execute permissions to &lt;FONT face="courier new,courier"&gt;Other&lt;/FONT&gt; users on &lt;FONT face="courier new,courier"&gt;/user/hdfs&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;FONT face="terminal,monaco"&gt;&lt;FONT face="lucida sans unicode,lucida sans"&gt;If I&lt;/FONT&gt; do a ps aux to see how the CLI is handled for Hive as well as Impala cases, I can see it being run as hdfs user (since I logged in as hdfs) so I assumed it should have access to /user/hdfs for impala as well. But looks like that is not sufficient for impala but works for Hive somehow.&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;FONT face="terminal,monaco"&gt;Is it because for hive I am using a plain client? and that has access to&amp;nbsp;/user/hdfs since user for login is hdfs?&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT face="terminal,monaco"&gt;Impala has to run via impalad which runs as impala user and that doesnt have access to the /user/hdfs&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Can someone please clarify what is going on in here?&lt;/P&gt;</description>
      <pubDate>Fri, 16 Sep 2022 10:43:53 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Confusion-about-Hive-amp-Impala-behaviour-for-access-to-HDFS/m-p/46137#M43180</guid>
      <dc:creator>SuvP</dc:creator>
      <dc:date>2022-09-16T10:43:53Z</dc:date>
    </item>
    <item>
      <title>Re: Confusion about Hive &amp; Impala behaviour for access to HDFS file with only HDFS user permissi</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Confusion-about-Hive-amp-Impala-behaviour-for-access-to-HDFS/m-p/46178#M43181</link>
      <description>&lt;P&gt;&lt;SPAN&gt;&amp;nbsp;When you run impala-shell &amp;nbsp;it would not run as "impala", it would run as the current user. Impala does not support HDFS-level user impersonation .if you need&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN&gt;grandular level authorization / user permission you might want to use Sentry .&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;please refer this link.&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;&lt;A href="https://www.cloudera.com/documentation/enterprise/5-2-x/topics/cm_sg_sentry_service.html" target="_blank"&gt;https://www.cloudera.com/documentation/enterprise/5-2-x/topics/cm_sg_sentry_service.html&lt;/A&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 12 Oct 2016 14:24:16 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Confusion-about-Hive-amp-Impala-behaviour-for-access-to-HDFS/m-p/46178#M43181</guid>
      <dc:creator>csguna</dc:creator>
      <dc:date>2016-10-12T14:24:16Z</dc:date>
    </item>
    <item>
      <title>Re: Confusion about Hive &amp; Impala behaviour for access to HDFS file with only HDFS user permissi</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Confusion-about-Hive-amp-Impala-behaviour-for-access-to-HDFS/m-p/46187#M43182</link>
      <description>Actually that is exactly the missing piece I am trying to figure out. I am aware that impala shell will run as whatever user I login as. hdfs user as in my case. However that is not sufficient for the impala shell to access a jar present in HDFS with 700 hdfs permissions. Where hive client shell which similarly runs the shell as hdfs user in my case is able to access. So I am assuming the impalad daemon running as impala user the cause of this ? Authorization is not what I am looking for.</description>
      <pubDate>Wed, 12 Oct 2016 17:45:54 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Confusion-about-Hive-amp-Impala-behaviour-for-access-to-HDFS/m-p/46187#M43182</guid>
      <dc:creator>SuvP</dc:creator>
      <dc:date>2016-10-12T17:45:54Z</dc:date>
    </item>
    <item>
      <title>Re: Confusion about Hive &amp; Impala behaviour for access to HDFS file with only HDFS user permissi</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Confusion-about-Hive-amp-Impala-behaviour-for-access-to-HDFS/m-p/46219#M43183</link>
      <description>&lt;P&gt;&lt;SPAN&gt;impalad daemon is the one that is not able to access the jar for query processing since you have set the hdfs permission as 700. &amp;nbsp;Your assumption is right and thats what I was refering in my previous post by stating&amp;nbsp; Impala does not support HDFS-level user impersonation.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 13 Oct 2016 04:07:02 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Confusion-about-Hive-amp-Impala-behaviour-for-access-to-HDFS/m-p/46219#M43183</guid>
      <dc:creator>csguna</dc:creator>
      <dc:date>2016-10-13T04:07:02Z</dc:date>
    </item>
  </channel>
</rss>

