Archives of Support Questions (Read Only)

This is an archived board for historical reference. Information and links may no longer be available or relevant
Announcements
This board is archived and read-only for historical reference. To ask a new question, please post a new topic on the appropriate active board.

can we match HBase partitions with Impala Partitions?

avatar
Champion Alumni

Hello,

 

Impala/ is having better performances on partitioned tables (if they are big enough).

However, I use Impala on an external HBase table. My HBase table is partitioned.

 

Therefore, I wonder if I can match the HBase partitions with the Impala/Hive ones?

If yes, How can I do this?

 

 

Thank you!

 

GHERMAN Alina
1 ACCEPTED SOLUTION

avatar

Hi!

 

HBase tables are range partitioned by the key and Impala will do the equivalent of "partition pruning" for HBase resions if suitable predicates are available in the query. The "PARTITION" concept of HDFS tables is not really applicable to HBase tables. The docs explain this in more detail, including how to interpred Impala's explain plan to see whether region pruning is taking place or not:

 

http://www.cloudera.com/content/www/en-us/documentation/archive/impala/2-x/2-1-x/topics/impala_hbase...

 

Does that answer your question?

View solution in original post

3 REPLIES 3

avatar

Hi!

 

HBase tables are range partitioned by the key and Impala will do the equivalent of "partition pruning" for HBase resions if suitable predicates are available in the query. The "PARTITION" concept of HDFS tables is not really applicable to HBase tables. The docs explain this in more detail, including how to interpred Impala's explain plan to see whether region pruning is taking place or not:

 

http://www.cloudera.com/content/www/en-us/documentation/archive/impala/2-x/2-1-x/topics/impala_hbase...

 

Does that answer your question?

avatar
Contributor

The input data set consists of three tables as shown with the following table creation statements in Impala SQL dialect

avatar
Contributor

Yes. Use a spark-hbase-connector.