Support Questions

Find answers, ask questions, and share your expertise

How to configure Oozie and Hive High Availability without using Load Balancer

avatar
Rising Star

Can anyone share any documentation on configuring HA for Oozie and Hive without Load Balancer.

1 ACCEPTED SOLUTION

avatar
Master Mentor

@sanjeevan mahajan

Have a look at this document oozie and link

You can omit the HA setup steps.

View solution in original post

4 REPLIES 4

avatar
Rising Star

Sorry there is typo in the question its without using Load Balancer

avatar
Master Mentor

I have fixed it for you @sanjeevan mahajan

avatar
Master Mentor

@sanjeevan mahajan

Have a look at this document oozie and link

You can omit the HA setup steps.

avatar
Master Guru

This is actually well explained in the ambari docs. Be sure to add the services extensions to make sure the two servers don't duplicate jobs

http://docs.hortonworks.com/HDPDocuments/Ambari-2.2.0.0/bk_Ambari_Users_Guide/content/_adding_an_ooz...

http://docs.hortonworks.com/HDPDocuments/Ambari-2.2.0.0/bk_Ambari_Users_Guide/content/_hive_high_ava...

Just as some comments: Without Load Balancer Oozie will continue to schedule jobs even if one server is down. But administration (submitting jobs ... ) would require you to point to the surviving oozie.

Also what the documents not describe is how to configure the underlying database in DR, that is your job. Here is an example for Postgres:

https://www.digitalocean.com/community/tutorials/how-to-set-up-master-slave-replication-on-postgresq...

However databases like Mysql and Postgres seem to be very stable so making for example Oozie Server HA is much more important than doing that for the underlying database. So you could decide to just backup the database regularly.