Support Questions
Find answers, ask questions, and share your expertise
Announcements
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

Impala has problems reading complex types from Parquet

SOLVED Go to solution

Impala has problems reading complex types from Parquet

Explorer

Hi all,

 

I reported IMPALA-4725  last week but it seems like it has not been triaged yet. I wanted to bring some more attention to this issue (and possible suggestions for workarounds) since it has a heavy impact on us.

 

To summarize it seems like Impala mixes-up values in arrays of structs which to me seems like a fundamental problem in the parquet reader. Alternatively the values gets mixed-up when presented as a result.

 

Either way, I would very much appreciated an initiated persons view on this issue.

 

We are running Impala that is bundled with CDH 5.8.3

 

Br,

Petter

1 ACCEPTED SOLUTION

Accepted Solutions
Highlighted

Re: Impala has problems reading complex types from Parquet

Master Collaborator
Hi Petter,
This was on our radar - we usually triage anything with a "correctness"
label (which you added) periodically - it's obviously a serious issue. I
updated the JIRA.

- Tim
2 REPLIES 2
Highlighted

Re: Impala has problems reading complex types from Parquet

Master Collaborator
Hi Petter,
This was on our radar - we usually triage anything with a "correctness"
label (which you added) periodically - it's obviously a serious issue. I
updated the JIRA.

- Tim

Re: Impala has problems reading complex types from Parquet

Explorer

Hi Tim,

 

thank you for taking the time to look at this issue!

 

Br,

Petter