How can we keep the data in 2 data centers in sync?
Hi Team,
We have a 12-node cluster hosted on premises in 2 different regions.
The question is how we can keep the data in the 2 data centers in sync, and what the latency will be.
Appreciate your help.
Thanks & Regards
Created ‎03-04-2020 12:57 AM
Hi @ARVINDR
I'd like to clarify your scenario. Do you have
A) 12 nodes in region A and 12 nodes in region B i.e. 2 distinct Cloudera clusters with a total of 24 nodes and you want to replicate data between these clusters?
or
B) 12 nodes split across regions A and B i.e. a single cluster of 12 nodes?
Also what version of Cloudera are you using?
Regards,
Steve
Created ‎03-04-2020 05:00 AM
Hi Steven,
Please consider scenario A:
A) 12 nodes in region A and 12 nodes in region B, i.e. 2 distinct Cloudera clusters with a total of 24 nodes, and we want to replicate data between these clusters.
We are using HDP version 3.0.1.
Thanks & Regards
Created ‎03-04-2020 06:43 AM
Hi @ARVINDR
In this case, you would use Data Lifecycle Manager to replicate data between the two clusters.
Here is a link to the documentation:
The latency will be a function of your network. I can share some general networking guidelines here:
Regards,
Steve
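To make "the latency will be a function of your network" concrete, here is a rough back-of-the-envelope sketch. It assumes replication runs as a periodic batch job that ships only the data changed since the previous run; the function names, the 70% link-efficiency default, and the example figures are illustrative assumptions, not values taken from this thread or from any product documentation.

```python
def transfer_seconds(changed_bytes: float, bandwidth_mbps: float,
                     efficiency: float = 0.7) -> float:
    """Estimate the time to ship `changed_bytes` over a WAN link.

    bandwidth_mbps is the nominal link speed in megabits per second.
    efficiency is an assumed fraction of that speed actually usable
    (protocol overhead, competing traffic); 0.7 is only a guess.
    """
    usable_bits_per_second = bandwidth_mbps * 1_000_000 * efficiency
    return changed_bytes * 8 / usable_bits_per_second


def worst_case_lag_seconds(interval_s: float, changed_bytes: float,
                           bandwidth_mbps: float,
                           efficiency: float = 0.7) -> float:
    """Worst-case staleness of the target cluster for scheduled replication.

    Data written just after a run starts waits one full interval for the
    next run, then still has to be transferred.
    """
    return interval_s + transfer_seconds(changed_bytes, bandwidth_mbps, efficiency)


if __name__ == "__main__":
    # Hypothetical workload: 50 GB changed per hourly run, 1 Gbps link.
    lag = worst_case_lag_seconds(3600, 50e9, 1000)
    print(f"Worst-case lag: {lag / 60:.1f} minutes")
```

The takeaway is that with scheduled replication the lag is usually dominated by the schedule interval plus the transfer time, so both the network and the job frequency need to be sized against how stale the second data center is allowed to be.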
Created ‎03-09-2020 05:28 AM
Hi @StevenOD,
Thanks for the details.
Just one query: we are building Hadoop on top of Isilon. Will the following still hold true in that case?
Thanks & Regards
Arvind.
Created ‎03-11-2020 06:37 AM
Hi @ARVINDR ,
I am not an expert in Isilon, and I'm not sure that Data Lifecycle Manager (DLM) supports using Isilon as the storage layer.
Isilon uses the OneFS file system. OneFS supports its own utilities for backing up data and replication so it might be better to use tools that are native to Isilon in this scenario.
Regards,
Steve
Created ‎03-12-2020 02:28 AM
Hi @ARVINDR ,
I suggest you reach out to Dell EMC for guidance around Isilon.
I found some documentation here:
that suggests some techniques. I'll caveat that I'm not an expert in Isilon so my responses here are best endeavors.
Regards,
Steve
Created ‎03-12-2020 03:47 AM
Thanks for sharing the documents, @StevenOD.
