I am considering HA configuration of Kerberized CDH cluster.
Is it possible to avoid SPoF of Load Balancer when constructing Load Balancer such as HAProxy and configuring HA of HiveServer2?
For example, is it possible to deploy two Load Balancer with Active-Active and specify two host and port for setting(HiveServer2 Load Balancer property)?
Or, should deploy Active and Stanby of Load Balancer be constructed, and when Active fails, should I change the setting(HiveServer2 Load Balancer property) to refer to Stanby?
Impalad thinks the same way.
I refer to the following procedure.
thank you for reading.
Thank you for your reply.
I understood about support for only one LB setting.
Please make sure that you can connect directly to HS2.
The following description in the procedure states that in a Kerberized CDH cluster, direct connection to HS2 fails, but is it possible to connect?
In the case of impalad, can you connect without using impala-shell? Can ODBC / JDBC and other connections be possible?
* ・On Kerberos-enabled clusters, you must use the load balancer for all connections. After you enable HiveServer2 high availability, direct connections to HiveServer2 instances fail.
* Once you enable a proxy server in a Kerberized cluster, users will not be able to connect to individual impala daemons directly from impala-shell.
Thank you for your quick reply.
In the Kerberized environment, when LB was set for HS2 and Impalad, I understood that I can connect directly to HS2, but I can not connect directly to Impalad.
Thank you for responding to DOC correction. Because I often refer to DOC, I help a lot.