I am currently tuning requests with Impala, in the frame of a study where I will compare different storage formats. My queries are SELECT queries.
When I run these queries under Impala, I get the display on the screen for these SELECT results and a global time, let's say T. I can find this time T under cloudera Manager in 'Impala Query', at the line 'Unregister query' line of 'Query timeline' section.
I would like to know, among the quantity of information I can find there, how I can measure precisely the duration of thr SELECT. I suppose that the display time is also computed, but I would like to know how much the SELECT query costs, without taking into account the display time.
What duration is also the most appropriate to measure query performance in my case ?
The ClientFetchWaitTimer in the query profile indicates how much time the server is waiting for the client to issue the next fetch. A long time may mean that the client is slow or is not fetching for some other reason.