Created on 05-04-2017 05:24 AM - edited 09-16-2022 04:33 AM
According to this documentation, when a query is run from a DataNode via impala-shell, the Impala daemon on that node acts as the coordinator for the query, while in theory all nodes running Impala daemons work in parallel and transmit partial results back to it.
In our cluster, however, this does not seem to be working properly: CPU usage stays at only 2% and queries take a long time to complete.
Also, since the Llama role has been deprecated as of CDH 5.10, what is the right way to manage Impala resources? Changing the CPU shares in the configuration seems to have no effect.