Support Questions

Find answers, ask questions, and share your expertise
Announcements
Celebrating as our community reaches 100,000 members! Thank you!

Setup a CDH cluster for BDR purposes

avatar
Rising Star

Hello,

 

I have a kerberized + Sentry protected CDH cluster with:

 

1 x Edge

2 x Master

4 x Worker

 

nodes. I want to setup a secondary cluster for Hive replication purposes.

 

1. What should be the minimum topology for this task?

2. Should the secondary cluster be Sentry protected as well?

3. Should the 2 cluster share the same KDC principals? If so, can the secondary cluster use the KDC server currently installed on Master1 node?

 

Thank you,

Gerasimos

 

 

1 ACCEPTED SOLUTION

avatar
If you want to create a backup cluster, just for backup purposes, you can skip the Sentry. Number of nodes would be min 3. You can use the same KDC for the backup cluster (if those two cluster are on the same network - i.e. there is no overlap on the hostnames and IP addresses), but make sure you follow the configurations for distcp between kerberized environments. You dont need the edge node as well.

View solution in original post

3 REPLIES 3

avatar
If you want to create a backup cluster, just for backup purposes, you can skip the Sentry. Number of nodes would be min 3. You can use the same KDC for the backup cluster (if those two cluster are on the same network - i.e. there is no overlap on the hostnames and IP addresses), but make sure you follow the configurations for distcp between kerberized environments. You dont need the edge node as well.

avatar
Rising Star

Thank you @Tomas79

 

I am also searching for architecture designs for Active-Active or Active-Passive DR configurations using 2 clusters. This article  has some introductory info on this. I was wondring whether more resources are available on this topic.

 

Best regards,

Gerasimos

avatar
Master Guru

@gerasimos,

 

What are your goals for your failover or backup strategy?

 

BDR schedules only replicate on a schedule in one direction.

"Active-Active" concepts may not truly apply to CDH depending on what you mean by that.