Reply
New Contributor
Posts: 1
Registered: ‎02-27-2018

Understand Impala Query Plan

Hello,

 

I am currently tuning  requests with Impala, in the frame of a study where I will compare different storage formats. My queries are SELECT queries.

When I run these queries under Impala, I get the display on the screen for these SELECT results and a global time, let's say T. I can find this time T under cloudera Manager in 'Impala Query', at the line 'Unregister query' line of 'Query timeline' section.

 

I would like to know, among the quantity of information I can find there, how I can measure precisely the duration of thr SELECT. I suppose that the display time is also computed, but I would like to know how much the SELECT query costs, without taking into account the display time.

What duration is also the most appropriate to measure query performance in my case ?

 

Your help would be greatly appreciated.

 

Thank you in advance. Have a good day.

 

Best regards,

Jean-Luc.

Expert Contributor
Posts: 105
Registered: ‎07-17-2017

Re: Understand Impala Query Plan

Hi @JLB92

I belive that you will find the best answers of your questions by reading this cloudera document about Understanding Impala Query Performance - EXPLAIN Plans and Query Profiles

Good luck.

Highlighted
Cloudera Employee
Posts: 307
Registered: ‎10-16-2013

Re: Understand Impala Query Plan

I highly recommend reading and understanding the Impala Cookbook. It has a section on running benchmarks:

https://blog.cloudera.com/blog/2017/02/latest-impala-cookbook/

 

The ClientFetchWaitTimer in the query profile indicates how much time the server is waiting for the client to issue the next fetch. A long time may mean that the client is slow or is not fetching for some other reason.

Announcements