My cluster configuration is as follows:
My questions are:
1) If I submit 5 queries, how does Impala execute them concurrently? (How does Impala distribute the load across the cluster?)
2) If the number of queries increases above 10, how is the query execution load balanced?
I am observing unexpected behaviour from Impala when I execute several queries concurrently, so I want to understand how Impala distributes queries and resources.
Thank you for your response. I want to know how Impala distributes resources when it has multiple concurrent queries to execute. In my case, query performance slows down under concurrent execution. Is there any property I need to set, or anything else I need to do, to get high concurrent performance from Impala?
If you don't have a load balancer set up, you need to set one up. Once it is in place, the queries you run go to the load balancer first, and it distributes them across all the nodes according to its balancing policy, so the load balancer takes responsibility for spreading the load.
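
To illustrate the difference a load balancer makes, here is a rough sketch in Python. It is not Impala code and does not talk to a cluster. It only simulates which node ends up as the coordinator for each query, since the impalad a client connects to coordinates that query. The host names and the round-robin policy are assumptions for the example; a real balancer may use a different policy.

```python
from collections import Counter
from itertools import cycle

# Hypothetical impalad host names, used only for this illustration.
IMPALAD_HOSTS = ["impalad-1", "impalad-2", "impalad-3"]


def assign_round_robin(num_queries, hosts):
    """Return the coordinator host picked for each query, in submission order.

    Assumes a simple round-robin policy; this is one possible balancer
    policy, not a description of how any particular proxy is configured.
    """
    rotation = cycle(hosts)
    return [next(rotation) for _ in range(num_queries)]


def coordinator_load(assignments):
    """Count how many queries each host coordinates."""
    return Counter(assignments)


if __name__ == "__main__":
    # Without a load balancer: every client connects to the same node,
    # so that one node coordinates all 10 queries.
    direct = [IMPALAD_HOSTS[0]] * 10
    print("direct:  ", dict(coordinator_load(direct)))

    # With a round-robin balancer: coordination is spread over the nodes.
    balanced = assign_round_robin(10, IMPALAD_HOSTS)
    print("balanced:", dict(coordinator_load(balanced)))
```

With 10 queries and 3 nodes, the direct case puts all 10 on one coordinator, while the balanced case gives each node 3 or 4. This only spreads the coordinator role; it does not by itself limit how much memory or CPU the concurrent queries use.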