Support Questions
Find answers, ask questions, and share your expertise
Announcements
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

Impala and virtual columns

SOLVED Go to solution
Highlighted

Impala and virtual columns

Expert Contributor

Hi, imagine I do have a table:

CREATE TABLE partitioned_table(....)

PARTITIONED BY (fulldate String)

 

And a query:

 

 select distinct(fulldate) from partitioned_table order by fulldate desc limit 100;

 

What would impala do? Only "virtual" (partition???) column takes place in query. Therese no need to fetch HDFS data.

It looks like right now Impala does read ALL partitions and calculated DISTINCT for a virtual column (virtual=is not present in data, this is metadata-only column)

 

1 ACCEPTED SOLUTION

Accepted Solutions

Re: Impala and virtual columns

Cloudera Employee
Sounds like something that could be done, so I've added a JIRA to track it.
https://issues.cloudera.org/browse/IMPALA-633


1 REPLY 1

Re: Impala and virtual columns

Cloudera Employee
Sounds like something that could be done, so I've added a JIRA to track it.
https://issues.cloudera.org/browse/IMPALA-633