About Andreyeff

Andreyeff · ‎04-24-2019

Could you advise if there is a solution to the problem, when Impala assigns heavy query parts to busy executors. For example the following was faced at CDH 5.16 with Impala 2.12.0: Impala has several (let's say 5) executors each having ~100GB RAM. Impala admission control is used. The mem_limit is set default (or about default ~80%), e.g. 80GB. The first relatively long and heavy query (let's name it Query1) comes and one of its steps take ~70GB RAM at executor1, i.e. there is ~10GB available RAM at this executor for reservation. Other 4 executor servers are nearly idle. At the same time the second query (let's name Query2) comes, which requires 40GB RAM, and it might happen the Query2 is assigned to the executor1, which is busy. So the Query2 fails due to it cannot allocate/reserve the memory. Is there a way to configure Impala to assign fragments/query parts to less busy executors? So far the concurrency reduction or reservation removal (since reserved memory amount usually is larger than really used) might work, but I see it too inefficient to use only 1-2 executors out of 5. Impala on YARN potentially might help, but as far as I see, it requires Llama, which is deprecated and is going to be removed soon.

PranayMunshi · ‎04-10-2019

Thank you very much Tim for providing this insight. I have assumption that MEM_LIMIT option is asking for that amount of space for query.

Tim Armstrong · ‎04-05-2019

If you have more than a handful of users it becomes difficult to manage the large number of pools. Resource limits are also of limited use - you can limit the total consumption per user, but you can't guarantee that any group of users gets memory.

mpercy · ‎09-21-2018

@Andreyeff Another thing you can try doing is increasing the raft heartbeat interval from 500ms to 1500ms or even 3000ms, see https://kudu.apache.org/docs/configuration_reference.html#kudu-tserver_raft_heartbeat_interval_ms This will affect your recovery time by a few seconds if a leader fails since by default, elections don't happen for 3 missed heartbeat periods (controlled by https://kudu.apache.org/docs/configuration_reference.html#kudu-tserver_leader_failure_max_missed_heartbeat_periods )

Andreyeff · ‎07-19-2018

Oh, the issue appeared to be even in functions. Thanks for fast reply and raising a ticket.

Online	Offline
Last Visited	‎12-05-2019 10:43 AM

Member Since	‎05-03-2018 01:03 AM
Last Visited	‎12-05-2019 10:43 AM
Posts	25

Cloudera Community

Re: Protecting queries in Impala using pools and m...

Re: Impala mem_limit query option is not working

Re: Why Impala Admission Control root.[username] i...

Re: Kudu backpressure and service queue is full

Re: SHOW CREATE VIEW fails for owner