Created on 07-02-2015 03:11 PM - edited 09-16-2022 02:33 AM
Hi dear experts!
Does anybody know whether Parquet index pages (https://github.com/Parquet/parquet-format) are supported by Impala? If so, where can I find more information about that?
Thank you in advance!
Created 07-08-2015 08:21 AM
Hi fil,
I'm afraid Impala currently does not take advantage of index pages when reading/writing data.
Index pages are part of the Parquet spec, so in theory you should be able to write Parquet files with index pages (via some other tool) and have Impala read them, but Impala will ignore the index pages.
We do plan on taking advantage of index pages and min/max/etc values, but we do not have a concrete target date for that feature yet.
Alex
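To illustrate why min/max values are worth having, here is a rough sketch of the general idea of skipping pages based on per-page statistics. This is a toy model and not Impala or Parquet code: the `PageStats` class, the `pages_to_scan` function, and the sample values are all hypothetical names made up for the example.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class PageStats:
    """Hypothetical per-page summary: smallest and largest value in the page."""
    page_id: int
    min_value: int
    max_value: int


def pages_to_scan(pages: List[PageStats], lo: int, hi: int) -> List[int]:
    """Return ids of pages whose [min, max] range overlaps lo <= x <= hi.

    Pages whose range cannot contain a matching value are skipped entirely.
    """
    return [p.page_id for p in pages if p.max_value >= lo and p.min_value <= hi]


# Made-up statistics for three pages of one column.
pages = [
    PageStats(0, 1, 100),
    PageStats(1, 101, 200),
    PageStats(2, 201, 300),
]

# Equivalent of a filter like: WHERE x BETWEEN 150 AND 220
print(pages_to_scan(pages, 150, 220))  # page 0 is skipped
```

A reader without such statistics would have to decode all three pages to evaluate the same filter.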
Created 07-08-2015 10:44 AM
Created 11-20-2016 10:46 PM
Index pages are declared in the Parquet format, but not actually implemented.
See code above.