Support Questions
Find answers, ask questions, and share your expertise
Announcements
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

impala query kudu table very slow

impala query kudu table very slow

New Contributor

After a period of time in the kudu cluster, all queries have become slow. The following is the Impala query profile file, please help analyze it, thank you.

profile file

1 REPLY 1
Highlighted

Re: impala query kudu table very slow

Cloudera Employee

The exchange operators seem to be the bottleneck:

16:EXCHANGE 1 26m14s 26m14s 22.55K 0 0 0 KUDU(KuduPartition(shift_timekey)) 
13:EXCHANGE 1 26m13s 26m13s 21.46K -1 0 0 HASH(t.pnl_id,d.oper_code,d.factory)
12:EXCHANGE 1 24m17s 24m17s 108.37M -1 0 0 BROADCAST

Each take 20+ minutes.

More importantly, the profile says "WARNING: The following tables are missing relevant table and/or column statistics. rptpid.i_f_r_mes_hist_pnl_mod, rptpid.i_f_t_mes_hist_gop_inout_fab". You should compute stats on these tables. Having accurate statistics is important for Impala performance.

Don't have an account?
Coming from Hortonworks? Activate your account here