<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Impala: Parquet error &amp;quot;Invalid file footer&amp;quot; on pipe-delimited file in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Impala-Parquet-error-quot-Invalid-file-footer-quot-on-pipe/m-p/34625#M15725</link>
    <description>&lt;P&gt;Hi&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;We are also facing the same issue of invalid file footer, the table is created as follows :&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;2 tables created&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;CREATE EXTERNAL TABLE ABC_TEXT (&lt;/P&gt;&lt;P&gt;NAME STRING,&lt;/P&gt;&lt;P&gt;ID INT,&lt;/P&gt;&lt;P&gt;PHONE INT)&lt;/P&gt;&lt;P&gt;PARTITION BY (Customer_id INT)&lt;/P&gt;&lt;P&gt;ROW FORMAT DELIMITED&lt;/P&gt;&lt;P&gt;FIELDS TERMINATED BY ';'&lt;/P&gt;&lt;P&gt;STORED AS TEXTFILE&lt;/P&gt;&lt;P&gt;LOCATION '/USER/ABC_TEXT ;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;CREATE EXTERNAL TABLE ABC_PARQUET (&lt;/P&gt;&lt;P&gt;NAME STRING,&lt;/P&gt;&lt;P&gt;ID INT,&lt;/P&gt;&lt;P&gt;PHONE INT )&lt;/P&gt;&lt;P&gt;PARTITION BY (Customer_id INT)&lt;/P&gt;&lt;P&gt;STORED AS PARQUET&lt;/P&gt;&lt;P&gt;&lt;SPAN class="s1"&gt;LOCATION '/USER/ABC_PARQUET' ;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Then run the insert script, which inserts data perfectly but when queried on parquet table getting following error&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN class="s1"&gt;Error:&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN class="s1"&gt;Caused by: java.sql.SQLException: [Simba][ImpalaJDBCDriver](500312) Error in fetching data rows:&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN class="s1"&gt;Invalid file footer&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Please let me know what I am doing wrong.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN class="s1"&gt;&amp;nbsp;&lt;SPAN class="Apple-converted-space"&gt;&amp;nbsp; &amp;nbsp;&amp;nbsp;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Tue, 01 Dec 2015 23:38:13 GMT</pubDate>
    <dc:creator>Jais</dc:creator>
    <dc:date>2015-12-01T23:38:13Z</dc:date>
    <item>
      <title>Impala: Parquet error "Invalid file footer" on pipe-delimited file</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Impala-Parquet-error-quot-Invalid-file-footer-quot-on-pipe/m-p/16586#M15721</link>
      <description>&lt;P&gt;I have pipe-delimited text files in HDFS (lines delimited by new line), and a parquet table:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;CREATE EXTERNAL TABLE IF NOT EXISTS table_parquet(&lt;BR /&gt;TRADE_DATE TIMESTAMP,&lt;/P&gt;&lt;P&gt;[removed for brevity]&lt;BR /&gt;FILLER STRING&lt;BR /&gt;)&lt;BR /&gt;PARTITIONED BY(REGION STRING)&lt;BR /&gt;ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '\n'&lt;BR /&gt;STORED AS PARQUET&lt;/P&gt;&lt;P&gt;LOCATION 'hdfs://path/to/location/';&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;ALTER TABLE table_parquet ADD PARTITION(REGION ='euro')&lt;BR /&gt;LOCATION 'hdfs://path/to/location/euro';&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;However, when I try to query the table, I get&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Backend 0:File hdfs://path/to/file.txt is invalid.&amp;nbsp; Invalid file footer: |&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;I have tried inserting spaces between the last pipe and the newline, and I've tried removing the last pipe, but no luck.&lt;/P&gt;&lt;P&gt;Any ideas what I'm doing wrong?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;Edit: This is Impala 1.4.0&lt;/P&gt;</description>
      <pubDate>Fri, 16 Sep 2022 09:04:28 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Impala-Parquet-error-quot-Invalid-file-footer-quot-on-pipe/m-p/16586#M15721</guid>
      <dc:creator>MB</dc:creator>
      <dc:date>2022-09-16T09:04:28Z</dc:date>
    </item>
    <item>
      <title>Re: Impala: Parquet error "Invalid file footer" on pipe-delimited file</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Impala-Parquet-error-quot-Invalid-file-footer-quot-on-pipe/m-p/17030#M15722</link>
      <description>&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;i read also the same issue with comma separaed csv format. a bit different is, I am running old 1.2.4 still. please could expert shed some light on this. thanks&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;thanks&amp;nbsp;&lt;/P&gt;&lt;P&gt;Jason&lt;/P&gt;</description>
      <pubDate>Fri, 15 Aug 2014 18:25:27 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Impala-Parquet-error-quot-Invalid-file-footer-quot-on-pipe/m-p/17030#M15722</guid>
      <dc:creator>jasonshih</dc:creator>
      <dc:date>2014-08-15T18:25:27Z</dc:date>
    </item>
    <item>
      <title>Re: Impala: Parquet error "Invalid file footer" on pipe-delimited file</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Impala-Parquet-error-quot-Invalid-file-footer-quot-on-pipe/m-p/18714#M15723</link>
      <description>&lt;P&gt;The ROW FORMAT clause only applies to tables using the text format. &amp;nbsp;STORED AS PARQUET means the table expects the data to already be in Parquet format. You will need to make 2 tables. &amp;nbsp;One table with no STORED AS clause but with ROW FORMAT DELIMITED etc.&amp;nbsp; You will be able to query this table after you move the delimited data files into the table directory and REFRESH the table). &amp;nbsp;Then another (empty) table&amp;nbsp;with the same columns and a STORED AS PARQUET clause. &amp;nbsp;Then to convert the data to Parquet, you do:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;insert into parquet_table select * from text_table;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;John&lt;/P&gt;</description>
      <pubDate>Sat, 13 Sep 2014 20:26:14 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Impala-Parquet-error-quot-Invalid-file-footer-quot-on-pipe/m-p/18714#M15723</guid>
      <dc:creator>John Russell</dc:creator>
      <dc:date>2014-09-13T20:26:14Z</dc:date>
    </item>
    <item>
      <title>Re: Impala: Parquet error "Invalid file footer" on pipe-delimited file</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Impala-Parquet-error-quot-Invalid-file-footer-quot-on-pipe/m-p/33170#M15724</link>
      <description>&lt;P&gt;In addition to what John Russell suggests (which works great), ensure you remove any partitions and the source HDFS files from any partitions/tables created incorrectly as parquet.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;In my case, I was inserting the data into a table stored as text, then transfering to a parquet type storage table successfully. &amp;nbsp;However, my query was still throwing an "invalid file footer" error because I had invalid partitions that hadn't been completely dropped/deleted from HDFS, &amp;nbsp;specfifically:&lt;/P&gt;&lt;PRE&gt;/user/hive/warehouse/&amp;lt;databasename&amp;gt;/&amp;lt;table_name&amp;gt;/&amp;lt;partition&amp;gt;&lt;/PRE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 20 Oct 2015 16:55:54 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Impala-Parquet-error-quot-Invalid-file-footer-quot-on-pipe/m-p/33170#M15724</guid>
      <dc:creator>GordINV</dc:creator>
      <dc:date>2015-10-20T16:55:54Z</dc:date>
    </item>
    <item>
      <title>Re: Impala: Parquet error "Invalid file footer" on pipe-delimited file</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Impala-Parquet-error-quot-Invalid-file-footer-quot-on-pipe/m-p/34625#M15725</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;We are also facing the same issue of invalid file footer, the table is created as follows :&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;2 tables created&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;CREATE EXTERNAL TABLE ABC_TEXT (&lt;/P&gt;&lt;P&gt;NAME STRING,&lt;/P&gt;&lt;P&gt;ID INT,&lt;/P&gt;&lt;P&gt;PHONE INT)&lt;/P&gt;&lt;P&gt;PARTITION BY (Customer_id INT)&lt;/P&gt;&lt;P&gt;ROW FORMAT DELIMITED&lt;/P&gt;&lt;P&gt;FIELDS TERMINATED BY ';'&lt;/P&gt;&lt;P&gt;STORED AS TEXTFILE&lt;/P&gt;&lt;P&gt;LOCATION '/USER/ABC_TEXT ;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;CREATE EXTERNAL TABLE ABC_PARQUET (&lt;/P&gt;&lt;P&gt;NAME STRING,&lt;/P&gt;&lt;P&gt;ID INT,&lt;/P&gt;&lt;P&gt;PHONE INT )&lt;/P&gt;&lt;P&gt;PARTITION BY (Customer_id INT)&lt;/P&gt;&lt;P&gt;STORED AS PARQUET&lt;/P&gt;&lt;P&gt;&lt;SPAN class="s1"&gt;LOCATION '/USER/ABC_PARQUET' ;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Then run the insert script, which inserts data perfectly but when queried on parquet table getting following error&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN class="s1"&gt;Error:&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN class="s1"&gt;Caused by: java.sql.SQLException: [Simba][ImpalaJDBCDriver](500312) Error in fetching data rows:&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN class="s1"&gt;Invalid file footer&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Please let me know what I am doing wrong.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN class="s1"&gt;&amp;nbsp;&lt;SPAN class="Apple-converted-space"&gt;&amp;nbsp; &amp;nbsp;&amp;nbsp;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 01 Dec 2015 23:38:13 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Impala-Parquet-error-quot-Invalid-file-footer-quot-on-pipe/m-p/34625#M15725</guid>
      <dc:creator>Jais</dc:creator>
      <dc:date>2015-12-01T23:38:13Z</dc:date>
    </item>
  </channel>
</rss>

