Created on 09-25-2018 07:25 AM - edited 09-16-2022 06:44 AM
My cluster configuration is as follows:
My questions are:
1) If I submit 5 queries how does impala executes them concurrently? (How does Impala distribute the load in the cluster?)
2) If the number of queries is increased above 10 How will the load balancing of query execution happen?
I am observing an unexpected behaviour of impala if I am trying to execute concurrent queries at a time. So just want to understand the distribution of queries and resources in Impala?
Created 10-10-2018 03:28 PM
Created 10-10-2018 11:43 PM
Thank you for your response. I want to know how impala distribute the resources if it has multiple concurrent queries to execute. In my case, I am facing a slow down in the query performance in case of concurrent execution. If there any property or something that i need to do to get high concurrent performance from Impala.
Created 10-11-2018 01:02 AM
If you dont have setup load balancer, then you need to setup that so that once you run the queries it will go to load balancer then it will distribute across all the nodes based on the available resources, so in this case load balancer would take the responsibility.
Created 10-16-2018 10:24 PM
can you suggest me some load balancer that can check Impala health or available resources and based on that it distributes the load to the Impala nodes.