New Contributor
Posts: 1
Registered: ‎01-24-2019

Impala query runs for a long time and spends much of that time in the Exchange node

Hi All,

My Impala query runs for a long time, and much of that time is spent in the Exchange node.

Below is the query summary:

Operator          #Hosts  Avg Time   Max Time   #Rows    Est. #Rows      Peak Mem   Est. Peak Mem  Detail
----------------------------------------------------------------------------------------------------------------------
15:AGGREGATE      5       1h52m      2h18m      1.17B    0               15.48 GB   10.00 MB       FINALIZE
14:EXCHANGE       5       1h25m      1h27m      3.09B    0               0          0              HASH(col1,col2,col3,col4,col5,col6,col7,col8,col9,col10,col11,col12,col13,col14,col15,col16,col17,col18,)
10:AGGREGATE      5       16m9s      19m1s      3.09B    0               117.71 GB  10.00 MB       STREAMING
00:UNION          5       13m37s     17m14s     3.69B    9223372036.85B  10.36 MB   0
|--06:HASH JOIN   5       7s671ms    9s459ms    599.64M  4.10M           102.94 MB  77.90 MB       INNER JOIN, BROADCAST
| |--12:EXCHANGE  5       162.939ms  182.292ms  239.52K  239.52K         0          0              BROADCAST
| | 05:SCAN HDFS  5       45.041ms   76.222ms   239.52K  239.52K         37.06 MB   224.00 MB      table2 a1
| 04:SCAN HDFS    5       50.648ms   52.646ms   6.15M    6.15M           176.36 MB  552.00 MB      table1 b1
|--09:HASH JOIN   5       44s163ms   51s467ms   3.09B    7.88M           100.54 MB  77.90 MB       INNER JOIN, BROADCAST
| |--13:EXCHANGE  5       20s783ms   28s728ms   239.52K  239.52K         0          0              BROADCAST
| | 08:SCAN HDFS  5       44.140ms   71.896ms   239.52K  239.52K         37.46 MB   224.00 MB      table2 a2
| 07:SCAN HDFS    5       197.715ms  222.330ms  6.15M    6.15M           177.07 MB  552.00 MB      table1 b2
03:HASH JOIN      5       65.422ms   76.880ms   0        9223372036.85B  36.42 MB   77.90 MB       INNER JOIN, BROADCAST
|--11:EXCHANGE    5       276.818ms  409.393ms  239.52K  239.52K         0          0              BROADCAST
| 02:SCAN HDFS    5       40.296ms   60.636ms   239.52K  239.52K         36.65 MB   224.00 MB      table2 a
01:SCAN HDFS      5       153.891ms  164.360ms  6.15M    6.15M           61.61 MB   552.00 MB      table1 b
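
To compare the operators numerically, the duration strings in the summary can be converted to seconds and ranked. This is only a rough sketch in Python: it assumes the durations use just the unit suffixes that appear above (h, m, s, ms), and the helper name parse_duration is hypothetical, not part of Impala or any of its tools. The Avg Time values are copied from the slowest rows of the summary.

```python
import re

# Unit suffixes that appear in the summary above, mapped to seconds.
# Assumption: no other units occur in the Avg Time / Max Time columns.
UNIT_SECONDS = {"h": 3600.0, "m": 60.0, "s": 1.0, "ms": 0.001}

# "ms" is listed before "m" and "s" so that it is matched as one unit.
TOKEN = re.compile(r"(\d+(?:\.\d+)?)(ms|h|m|s)")


def parse_duration(text):
    """Convert a string such as '1h52m' or '7s671ms' to seconds (hypothetical helper)."""
    total = 0.0
    pos = 0
    for match in TOKEN.finditer(text):
        if match.start() != pos:
            raise ValueError(f"unrecognised duration: {text!r}")
        value, unit = match.groups()
        total += float(value) * UNIT_SECONDS[unit]
        pos = match.end()
    if pos == 0 or pos != len(text):
        raise ValueError(f"unrecognised duration: {text!r}")
    return total


# Avg Time per operator, copied from the query summary.
avg_times = {
    "15:AGGREGATE": "1h52m",
    "14:EXCHANGE": "1h25m",
    "10:AGGREGATE": "16m9s",
    "00:UNION": "13m37s",
    "09:HASH JOIN": "44s163ms",
    "06:HASH JOIN": "7s671ms",
}

# Rank operators from slowest to fastest by average time.
ranked = sorted(avg_times, key=lambda op: parse_duration(avg_times[op]), reverse=True)
for op in ranked:
    print(f"{op:<14} {parse_duration(avg_times[op]):>10.3f} s")
```

Ranked this way, the final aggregation (15) and the hash exchange feeding it (14) stand out from the rest of the plan.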

Expert Contributor
Posts: 357
Registered: ‎01-25-2017

Re: Impala query runs for a long time and spends much of that time in the Exchange node

Hi @ravkar, I see your AGGREGATE phase is still running? How many rows should the query return? Can you run the query with LIMIT 5 and check whether you get the same behaviour? Are you running it from the CLI or from a UI such as Hue?
