Impala and virtual columns

Expert Contributor

Hi, imagine I have a table:

CREATE TABLE partitioned_table(....)
PARTITIONED BY (fulldate String)

And a query:

select distinct(fulldate) from partitioned_table order by fulldate desc limit 100;

What would Impala do? Only the "virtual" (partition) column appears in the query, so there's no need to fetch HDFS data.

It looks like right now Impala reads ALL partitions and calculates DISTINCT for the virtual column (virtual = not present in the data files; this is a metadata-only column).

1 ACCEPTED SOLUTION

Contributor
Sounds like something that could be done, so I've added a JIRA to track it.
https://issues.cloudera.org/browse/IMPALA-633
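
In the meantime, a rough sketch of how one might check the behaviour and work around it, using the table and column names from the question. Two assumptions here: that the partition list from metadata is enough for your purpose, and that your Impala version supports the OPTIMIZE_PARTITION_KEY_SCANS query option (it was added in later releases, so check your version's documentation before relying on it).

```sql
-- Inspect the plan to see whether the query scans HDFS
-- (look for a SCAN HDFS node covering all partitions).
EXPLAIN SELECT DISTINCT fulldate
FROM partitioned_table
ORDER BY fulldate DESC
LIMIT 100;

-- Workaround: list partitions from table metadata.
-- Each row carries the fulldate partition value plus file/size statistics;
-- this should not need to read the data files themselves.
SHOW PARTITIONS partitioned_table;

-- Assumption: only on Impala versions that support this query option.
-- It is intended to let queries that reference only partition key columns
-- be answered from partition metadata.
SET OPTIMIZE_PARTITION_KEY_SCANS=1;
SELECT DISTINCT fulldate
FROM partitioned_table
ORDER BY fulldate DESC
LIMIT 100;
```

SHOW PARTITIONS returns every partition rather than the top 100, so any ordering or limiting has to happen on the client side.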

