12-05-2018 02:23 AM
Hello team, From our web application whenever we are trying to fetch data from Impala, it's doing so from one particular node. There are multiple data nodes. There is haproxy acting as a load balancer. Replication factor set to 3. Is there any configuration in Impala that would allow to fetch data from multiple nodes. Thanks and regards Sayak
12-05-2018 03:32 AM
12-05-2018 09:49 PM
12-05-2018 09:53 PM
when LB is configured then the control of which impala to submit the query is vested with LB and impala-shell will have no control over choosing the coordinator
if you are not using loadbalancer, then when you run impala-shell -i <impalad_hostname> then query gets submitted to impalad running on "<impalad_hostname>" -- This host will act as the coordinator for the query.