Archives of Support Questions (Read Only)

This is an archived board for historical reference. Information and links may no longer be available or relevant
Announcements
This board is archived and read-only for historical reference. To ask a new question, please post a new topic on the appropriate active board.

Impala and virtual columns

avatar
Expert Contributor

Hi, imagine I do have a table:

CREATE TABLE partitioned_table(....)

PARTITIONED BY (fulldate String)

 

And a query:

 

 select distinct(fulldate) from partitioned_table order by fulldate desc limit 100;

 

What would impala do? Only "virtual" (partition???) column takes place in query. Therese no need to fetch HDFS data.

It looks like right now Impala does read ALL partitions and calculated DISTINCT for a virtual column (virtual=is not present in data, this is metadata-only column)

 

1 ACCEPTED SOLUTION

avatar
Contributor
Sounds like something that could be done, so I've added a JIRA to track it.
https://issues.cloudera.org/browse/IMPALA-633


View solution in original post

1 REPLY 1

avatar
Contributor
Sounds like something that could be done, so I've added a JIRA to track it.
https://issues.cloudera.org/browse/IMPALA-633