Support Questions
Find answers, ask questions, and share your expertise
Announcements
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

Best practice for configurations/sync between two data centers

Best practice for configurations/sync between two data centers

New Contributor

Hi!

 

According with a multi-site deployment, with two data centers: one independent cluster in one data center regarding the other data center. I mean two separate Cloudera Manager based installations.

 

The data sync is solved with several aproaches, using Kafka mirroring, Cloudera Manager backup, Discp or event Apache Solr CDCR, we have a lot of different strategies.

 

Nevertheless, if we want to sync the configurations of Cloudera in DC1 and the configurations of Cloudera in DC2, in order to replicate the behaviour of both clusters. For instance try to replicate the configurations of Apache Flume morphline file in DC1, managed by Cloudera Manager interface, into the DC2. So any change in configurations files in DC1 will be replicated in the DC2.

 

It is that possible with standard configurations? 

Javi Roman

Twitter: @javiromanrh
Linkedin: es.linkedin.com/in/javiroman
Big Data Blog: dataintensive.info
1 REPLY 1

Re: Best practice for configurations/sync between two data centers

Champion
There isn't a way to replicate the configuration, just the data and the metadata. I imagine this could be accomplished through the CM API. You would have to fetch the configuration from DC1, possible modify (any thing related to the hosts will need to be updated with the DC2 host strings), and then push it to DC2.