Created on 04-29-2018 12:09 AM - edited 09-16-2022 06:09 AM
first time installing Hortwonworks cluster with 2 master node and 3 data node with High Availability, Need help on this.. do we have any step-wise document to follow..
Created 04-29-2018 07:42 AM
I think you will get a kick in trying it out by yourself. The most common HA components are NaneNode,RM and of course zookeeper minimum 3 instances to avoid the split brain decision.
When using Ambari it's quite straightforward. The HA components shouldn't reside on the same rack and each rack should have network redundancy Core Switch HA and further even have 2 ISP's, but goes beyond the responsibility of an Hadoop admin and should be the work of your network team though you have to be aware of the consequences when setting up you HA cluster.
So first questions first what components do you intend to have in HA?
Please revert !!
Created 05-01-2018 06:12 PM
@ Geoffrey Shelton
Thanks for the quick response, planning to go HA for these components Hive, oozie, hbase as of now. Can you please suggest me is this the right way to go with. I already setup Ambari server setup with 1 Master node and 2 Data nodes, it's working fine. First time will be going with HA. Is there step-wise documents will be great help.
Created 05-01-2018 06:29 PM
First, ensure you have ZooKeeper ensemble(3) running. then
Once all is fine, if you are using VMware take a snapshot and proceed
You will need a Loadbalancer, Virtual IP, or Round-Robin DNS. The load balancer should be configured for round-robin between the Oozie servers to distribute the requests
Created 05-02-2018 09:20 AM
@Geoffrey
Thank you. How is Zookeeper and YARN different in HA, can you please let me know, how exactly it works..