<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Impala Failing to Recognize Partitioning in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Impala-Failing-to-Recognize-Partitioning/m-p/91771#M12295</link>
    <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Your answer does not explain the reason for this behaviour.&lt;/P&gt;&lt;P&gt;In my case I used the partitioning field with the same casing, &lt;EM&gt;FieldID&lt;/EM&gt;.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;But the table was created properly only when I used a lower-case name.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The main undesirable effect, in my opinion, is that the user is not notified when they create a table and the partitions are not recovered properly.&lt;/P&gt;&lt;P&gt;Also, the impalad logs contain nothing about it (neither ERROR nor WARN).&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thank you&lt;/P&gt;</description>
    <pubDate>Wed, 19 Jun 2019 13:52:49 GMT</pubDate>
    <dc:creator>vaccarinicarlo</dc:creator>
    <dc:date>2019-06-19T13:52:49Z</dc:date>
    <item>
      <title>Impala Failing to Recognize Partitioning</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Impala-Failing-to-Recognize-Partitioning/m-p/60703#M12290</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I currently have data sitting in an HDFS location at, say, /location. The data is partitioned by YEAR/MONTH/DAY, and the subfolder structure looks like YEAR=2017/MONTH=8/DAY=2. I am attempting to create an external table on this data, but upon doing so the partitioning is not being recognized. The two command sequences I've tried are:&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT face="courier new,courier"&gt;drop table if exists db.table;&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier"&gt;create external table db.table like parquet '/location/file.parquet' partitioned by (YEAR int, MONTH int, DAY int) stored as parquet location '/location';&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier"&gt;alter table db.table recover partitions;&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier"&gt;compute incremental stats db.table;&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;And...&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;FONT face="courier new,courier"&gt;drop table if exists db.table;&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT face="courier new,courier"&gt;create external table db.table(&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT face="courier new,courier"&gt;field1 string,&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT face="courier new,courier"&gt;field2 string,&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT face="courier new,courier"&gt;...&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT face="courier new,courier"&gt;) partitioned by (YEAR int, MONTH int, DAY int) stored as parquet location '/location/';&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT face="courier new,courier"&gt;alter table db.table recover partitions;&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT face="courier new,courier"&gt;compute incremental stats db.table;&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;In both cases, I end up with an empty table that is correctly partitioned. 
Calling &lt;FONT face="courier new,courier"&gt;invalidate metadata;&lt;/FONT&gt; afterwards did not resolve the issue. I've verified that the impala user is listed in the file ACLs for these directories. Does anyone know why it is not finding the data?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I should point out that if I ignore partitioning and instead just try to build a table on top of the data from one day (i.e. YEAR=2017/MONTH=8/DAY=2), the data shows.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 16 Sep 2022 12:22:00 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Impala-Failing-to-Recognize-Partitioning/m-p/60703#M12290</guid>
      <dc:creator>mtrepanier</dc:creator>
      <dc:date>2022-09-16T12:22:00Z</dc:date>
    </item>
    <item>
      <title>Re: Impala Failing to Recognize Partitioning</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Impala-Failing-to-Recognize-Partitioning/m-p/60706#M12291</link>
      <description>&lt;P&gt;Not sure if this is the problem, but you might try using lower case names in the HDFS path, i.e.:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;year=2017/month=8/day=2&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;instead of&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;YEAR=2017/MONTH=8/DAY=2&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 06 Oct 2017 17:04:33 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Impala-Failing-to-Recognize-Partitioning/m-p/60706#M12291</guid>
      <dc:creator>alex.behm</dc:creator>
      <dc:date>2017-10-06T17:04:33Z</dc:date>
    </item>
    <item>
      <title>Re: Impala Failing to Recognize Partitioning</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Impala-Failing-to-Recognize-Partitioning/m-p/60709#M12292</link>
      <description>&lt;P&gt;Setting them to lower case didn't work immediately; what did work was going back, setting each HDFS file name to lowercase, and refreshing the partitions.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Lesson learned: always set partition column names to lowercase when you need to build an external table on them.&lt;/P&gt;</description>
      <pubDate>Fri, 06 Oct 2017 18:07:27 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Impala-Failing-to-Recognize-Partitioning/m-p/60709#M12292</guid>
      <dc:creator>mtrepanier</dc:creator>
      <dc:date>2017-10-06T18:07:27Z</dc:date>
    </item>
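    <!--
    Editorial sketch, not part of the original thread. The fix described in the
    post above (lowercasing the names in HDFS, then refreshing the partitions)
    needs a mapping from each existing directory to its lowercase equivalent.
    The Python below is a minimal sketch of that mapping only. The function
    names lowercase_partition_keys and plan_renames are hypothetical, and the
    code assumes that only the key of each key=value segment should be
    lowercased while the value stays as written. It does not touch HDFS; the
    renames themselves and the subsequent "alter table ... recover partitions"
    would still be run separately.

```python
from typing import List, Tuple


def lowercase_partition_keys(path: str) -> str:
    """Lowercase the key of every key=value path segment; values are left untouched."""
    segments = []
    for segment in path.split("/"):
        key, sep, value = segment.partition("=")
        # Only segments shaped like key=value are treated as partition directories.
        segments.append(key.lower() + sep + value if sep else segment)
    return "/".join(segments)


def plan_renames(partition_dirs: List[str]) -> List[Tuple[str, str]]:
    """Return (old, new) pairs for directories whose partition keys are not lowercase."""
    plan = []
    for old in partition_dirs:
        new = lowercase_partition_keys(old)
        if new != old:
            plan.append((old, new))
    return plan
```

    For the layout in the question, lowercase_partition_keys applied to
    /location/YEAR=2017/MONTH=8/DAY=2 yields /location/year=2017/month=8/day=2.
    -->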
    <item>
      <title>Re: Impala Failing to Recognize Partitioning</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Impala-Failing-to-Recognize-Partitioning/m-p/60718#M12293</link>
      <description>&lt;P&gt;Thanks for following up with the solution.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Sorry for the pain, I understand it's somewhat user-unfriendly. The explanation for the current behavior goes like this:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Column names are generally case insensitive from the Impala SQL perspective, but HDFS file paths are case sensitive. So it could cause confusion if you had paths like this in HDFS:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;YEAR=2000/MONTH=1&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;year=2000/month=1&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Year=2000/Month=1&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Are they different partitions? All the same partition? Can one partition point to multiple directories? You see where I am going :). It's just easier to accept one canonical casing.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 06 Oct 2017 23:19:15 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Impala-Failing-to-Recognize-Partitioning/m-p/60718#M12293</guid>
      <dc:creator>alex.behm</dc:creator>
      <dc:date>2017-10-06T23:19:15Z</dc:date>
    </item>
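    <!--
    Editorial sketch, not part of the original thread. The ambiguity described
    in the post above can be made concrete by grouping directory names that
    differ only in the casing of their partition keys. The Python below is a
    minimal sketch under that assumption; canonical_partition and
    find_case_collisions are hypothetical names, and nothing here reflects how
    Impala itself resolves partition directories.

```python
from collections import defaultdict
from typing import Dict, List


def canonical_partition(path: str) -> str:
    """Lowercase the key of each key=value segment so casing variants compare equal."""
    parts = []
    for segment in path.split("/"):
        key, sep, value = segment.partition("=")
        parts.append(key.lower() + sep + value if sep else segment)
    return "/".join(parts)


def find_case_collisions(paths: List[str]) -> Dict[str, List[str]]:
    """Group directories by canonical name; keep only groups with more than one member."""
    groups: Dict[str, List[str]] = defaultdict(list)
    for path in paths:
        groups[canonical_partition(path)].append(path)
    return {name: dirs for name, dirs in groups.items() if len(dirs) > 1}
```

    Given the three example paths from the post, all three land in a single
    group keyed by year=2000/month=1, which is exactly the "one partition,
    several directories" situation that a single canonical casing avoids.
    -->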
    <item>
      <title>Re: Impala Failing to Recognize Partitioning</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Impala-Failing-to-Recognize-Partitioning/m-p/91770#M12294</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I saw the same behaviour with Impala CREATE EXTERNAL TABLE, but with a camel-case partition name (i.e. '&lt;EM&gt;FieldID=123&lt;/EM&gt;').&amp;nbsp;&lt;BR /&gt;Using an &lt;STRONG&gt;all-lower-case partition name&lt;/STRONG&gt; ('&lt;EM&gt;fieldid=123&lt;/EM&gt;') solved the issue immediately.&lt;/P&gt;</description>
      <pubDate>Wed, 19 Jun 2019 13:45:04 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Impala-Failing-to-Recognize-Partitioning/m-p/91770#M12294</guid>
      <dc:creator>vaccarinicarlo</dc:creator>
      <dc:date>2019-06-19T13:45:04Z</dc:date>
    </item>
    <item>
      <title>Re: Impala Failing to Recognize Partitioning</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Impala-Failing-to-Recognize-Partitioning/m-p/91771#M12295</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Your answer does not explain the reason for this behaviour.&lt;/P&gt;&lt;P&gt;In my case I used the partitioning field with the same casing, &lt;EM&gt;FieldID&lt;/EM&gt;.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;But the table was created properly only when I used a lower-case name.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The main undesirable effect, in my opinion, is that the user is not notified when they create a table and the partitions are not recovered properly.&lt;/P&gt;&lt;P&gt;Also, the impalad logs contain nothing about it (neither ERROR nor WARN).&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thank you&lt;/P&gt;</description>
      <pubDate>Wed, 19 Jun 2019 13:52:49 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Impala-Failing-to-Recognize-Partitioning/m-p/91771#M12295</guid>
      <dc:creator>vaccarinicarlo</dc:creator>
      <dc:date>2019-06-19T13:52:49Z</dc:date>
    </item>
    <item>
      <title>Re: Impala Failing to Recognize Partitioning</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Impala-Failing-to-Recognize-Partitioning/m-p/91793#M12296</link>
      <description>&lt;P&gt;Hi&amp;nbsp;vaccarinicarlo,&lt;/P&gt;&lt;P&gt;In the Hadoop world, where different components may have different rules about case sensitivity, it may be best to do as Alex Behm said above: "It's just easier to accept one canonical casing".&lt;/P&gt;&lt;P&gt;I agree with you that it might be better to issue more warnings when anything other than lower case is used.&lt;/P&gt;</description>
      <pubDate>Wed, 19 Jun 2019 23:04:43 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Impala-Failing-to-Recognize-Partitioning/m-p/91793#M12296</guid>
      <dc:creator>Andrew_Sherman</dc:creator>
      <dc:date>2019-06-19T23:04:43Z</dc:date>
    </item>
  </channel>
</rss>

