Hello,
I am currently tuning requests with Impala, in the frame of a study where I will compare different storage formats. My queries are SELECT queries.
When I run these queries under Impala, I get the display on the screen for these SELECT results and a global time, let's say T. I can find this time T under cloudera Manager in 'Impala Query', at the line 'Unregister query' line of 'Query timeline' section.
I would like to know, among the quantity of information I can find there, how I can measure precisely the duration of thr SELECT. I suppose that the display time is also computed, but I would like to know how much the SELECT query costs, without taking into account the display time.
What duration is also the most appropriate to measure query performance in my case ?
Your help would be greatly appreciated.
Thank you in advance. Have a good day.
Best regards,
Jean-Luc.