Support Questions
Find answers, ask questions, and share your expertise
Announcements
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

Impala Query is running for a long time and its consuming much time with Exchange Node.

Highlighted

Impala Query is running for a long time and its consuming much time with Exchange Node.

New Contributor

Hi All,

 

Impala Query is running for a long time and its consuming much time with Exchange Node.

 

Below is the query summary:

 

Operator #Hosts Avg Time Max Time #Rows Est. #Rows Peak Mem Est. Peak Mem Detail
----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
15:AGGREGATE 5 1h52m 2h18m 1.17B 0 15.48 GB 10.00 MB FINALIZE
14:EXCHANGE 5 1h25m 1h27m 3.09B 0 0 0 HASH(col1,col2,col3,col4,col5,col6,col7,col8,col9,col10,col11,col12,col13,col14,col15,col16,col17,col18,)
10:AGGREGATE 5 16m9s 19m1s 3.09B 0 117.71 GB 10.00 MB STREAMING
00:UNION 5 13m37s 17m14s 3.69B 9223372036.85B 10.36 MB 0
|--06:HASH JOIN 5 7s671ms 9s459ms 599.64M 4.10M 102.94 MB 77.90 MB INNER JOIN, BROADCAST
| |--12:EXCHANGE 5 162.939ms 182.292ms 239.52K 239.52K 0 0 BROADCAST
| | 05:SCAN HDFS 5 45.041ms 76.222ms 239.52K 239.52K 37.06 MB 224.00 MB table2 a1
| 04:SCAN HDFS 5 50.648ms 52.646ms 6.15M 6.15M 176.36 MB 552.00 MB table1 b1
|--09:HASH JOIN 5 44s163ms 51s467ms 3.09B 7.88M 100.54 MB 77.90 MB INNER JOIN, BROADCAST
| |--13:EXCHANGE 5 20s783ms 28s728ms 239.52K 239.52K 0 0 BROADCAST
| | 08:SCAN HDFS 5 44.140ms 71.896ms 239.52K 239.52K 37.46 MB 224.00 MB table2 a2
| 07:SCAN HDFS 5 197.715ms 222.330ms 6.15M 6.15M 177.07 MB 552.00 MB table1 b2
03:HASH JOIN 5 65.422ms 76.880ms 0 9223372036.85B 36.42 MB 77.90 MB INNER JOIN, BROADCAST
|--11:EXCHANGE 5 276.818ms 409.393ms 239.52K 239.52K 0 0 BROADCAST
| 02:SCAN HDFS 5 40.296ms 60.636ms 239.52K 239.52K 36.65 MB 224.00 MB table2 a
01:SCAN HDFS 5 153.891ms 164.360ms 6.15M 6.15M 61.61 MB 552.00 MB table1 b

1 REPLY 1

Re: Impala Query is running for a long time and its consuming much time with Exchange Node.

Super Collaborator

Hi @ravkar I see you AGG phase still running? how many rows the query should return? can you run the query with limit 5 and see if you will get the same behaviour? are you running it from CLI or another UI like Hue?

Don't have an account?
Coming from Hortonworks? Activate your account here