Reply
Expert Contributor
Posts: 87
Registered: ‎09-17-2014
Accepted Solution

Parquet index page for Impala

Hi dear experts!

 

could anybody know does parquet index page available for Impala(https://github.com/Parquet/parquet-format)? if so, where i can find more information about that?

 

thank you in advance!

Cloudera Employee
Posts: 307
Registered: ‎10-16-2013

Re: Parquet index page for Impala

Hi fil,

 

I'm afraid Impala currently does not take advantage of index pages when reading/writing data.

 

Index pages are part of the Parquet spec, so in theory, you shoud be able to write Parquet files with index pages (via some other tool) and have them be readable by Impala - but Impala will ignore the index pages.

 

We do plan on taking advantage of index pages and min/max/etc values, but we do not have a concrete target date for that feature yet.

 

Alex

 

Expert Contributor
Posts: 87
Registered: ‎09-17-2014

Re: Parquet index page for Impala

thanks for your comment!
Highlighted
Explorer
Posts: 10
Registered: ‎07-28-2015

Re: Parquet index page for Impala

https://github.com/Parquet/parquet-format/blob/f7ab552f569df63bdb59f751d0dd36e826682739/src/thrift/p...

 

Index pages are declared in Parquet format, but not actually implemented.

See code above. 

 

Announcements