Support Questions

Find answers, ask questions, and share your expertise
Announcements
Celebrating as our community reaches 100,000 members! Thank you!

Parquet index page for Impala

avatar
Rising Star

Hi dear experts!

 

could anybody know does parquet index page available for Impala(https://github.com/Parquet/parquet-format)? if so, where i can find more information about that?

 

thank you in advance!

1 ACCEPTED SOLUTION

avatar

Hi fil,

 

I'm afraid Impala currently does not take advantage of index pages when reading/writing data.

 

Index pages are part of the Parquet spec, so in theory, you shoud be able to write Parquet files with index pages (via some other tool) and have them be readable by Impala - but Impala will ignore the index pages.

 

We do plan on taking advantage of index pages and min/max/etc values, but we do not have a concrete target date for that feature yet.

 

Alex

 

View solution in original post

3 REPLIES 3

avatar

Hi fil,

 

I'm afraid Impala currently does not take advantage of index pages when reading/writing data.

 

Index pages are part of the Parquet spec, so in theory, you shoud be able to write Parquet files with index pages (via some other tool) and have them be readable by Impala - but Impala will ignore the index pages.

 

We do plan on taking advantage of index pages and min/max/etc values, but we do not have a concrete target date for that feature yet.

 

Alex

 

avatar
Rising Star
thanks for your comment!

avatar
Contributor

https://github.com/Parquet/parquet-format/blob/f7ab552f569df63bdb59f751d0dd36e826682739/src/thrift/p...

 

Index pages are declared in Parquet format, but not actually implemented.

See code above.