<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Questions on Disaster Recovery in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Questions-on-Disaster-Recovery/m-p/169396#M29850</link>
    <description>&lt;P&gt;&lt;STRONG&gt;Teeing vs Copying - Which one is preferred over the other? I understand it's scenario dependent. But which has better adaptability and is more widely used in the industry? Copying?&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;
With teeing, you can split primary tasks between the two clusters and use the other cluster as DR for each task. For example, with clusters C1 and C2, you can use C1 as the primary cluster and C2 as DR for some teams/tasks, and use C2 as the primary cluster and C1 as DR for other users/tasks.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;
Is it necessary to have both the main and the DR cluster on the same version of HDP? If not, what are things to consider if same version is not possible?&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;
It is convenient to have them both on the same version. This is especially the case if you want to use the DR cluster with almost no code changes when the primary is down. &lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;
Should it be like for like topology between clusters in terms of component placement including gateway nodes and zookeeper services? &lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;This is not required.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt; 
How does security play out for DR? Should both the cluster nodes be part of the same Kerberos realm or can they be part of different realms?&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;
As a DR setup, the same realm is a lot easier to manage than cross-realm, but cross-realm is possible. &lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;
Can the replication factor be lower? Or is it recommended to keep it the same as on the primary cluster? &lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;I have seen replication factor 2 used on DR clusters, but if the DR cluster becomes the primary after a disaster, you may have to change the replication factor to 3 on all data sets. &lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;
Any specific network requirements in terms of latency, speed etc. between the clusters?&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;
For distcp, each node on one cluster should be able to communicate with each of the nodes on the second cluster.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;
Is there a need to run balancer on the DR cluster periodically?&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;
Yes. It is always good to run the balancer to keep a similar number of blocks across nodes. &lt;/P&gt;&lt;P&gt;
&lt;STRONG&gt;How does encryption play out between the primary and DR clusters? If encryption at rest is enabled in the primary one, how is it handled in the DR cluster? What are the implications of wire-encryption while transferring the data between the clusters? &lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;Wire encryption will slow down transfers a little bit.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;
When HDFS snapshots are enabled on the primary cluster, how does it work when data is being synced to the DR cluster? Can Snapshots be exported onto another cluster? I understand this is possible for HBase snapshots. But is it allowed in the HDFS case? For example, if a file is deleted on the primary cluster, but available in the snapshot, will that be synced to the snapshot directory on the DR cluster? &lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;If you are using snapshots, you can simply run distcp on the snapshots instead of the actual data set.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt; 
For services which involve databases (Hive, Oozie, Ambari), instead of backing up periodically from the primary cluster to the DR cluster, is it recommended to set up one HA master in the DR cluster directly? &lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;I don't think automating Ambari is a good idea. Configs don't change that much, so a simple process of duplicating them might be better. Backing up would mean you need to have the same hostnames and the same topology.
For Hive, instead of a complete backup, Falcon can take care of table-level replication. &lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;
For configurations and application data, instead of backing up at regular intervals, is there a way to keep them in sync between the primary and DR clusters?&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;
Not sure where your application data resides, but for configuration, since everything is managed by Ambari, you need to keep the Ambari configuration in sync.&lt;/P&gt;</description>
    <pubDate>Thu, 26 May 2016 21:46:55 GMT</pubDate>
    <dc:creator>ravi1</dc:creator>
    <dc:date>2016-05-26T21:46:55Z</dc:date>
    <item>
      <title>Questions on Disaster Recovery</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Questions-on-Disaster-Recovery/m-p/169394#M29848</link>
      <description>&lt;UL&gt;&lt;LI&gt;I have
seen a few articles and questions on the community around Disaster Recovery.
However, it's still not completely clear, hence this new question around
that:&lt;/LI&gt;
 
&lt;LI&gt;As I understand,  typically, these entities need to be
     backed-up / synced between the clusters&lt;/LI&gt; 
&lt;/UL&gt;&lt;UL&gt;
  
&lt;LI&gt;Primary Datasets&lt;/LI&gt;  
&lt;/UL&gt;&lt;UL&gt;
   &lt;/UL&gt;&lt;P style="margin-left: 40px;"&gt;HDFS Data&lt;/P&gt;&lt;P style="margin-left: 60px;"&gt;Teeing - Flume /
        Hortonworks Data Flow&lt;/P&gt;&lt;P style="margin-left: 60px;"&gt;Copying / Replication -
        distcp (invoking it manually), Falcon&lt;/P&gt;&lt;P style="margin-left: 40px;"&gt;Hive Data&lt;/P&gt;&lt;P style="margin-left: 60px;"&gt;Behind the scenes, Hive
        data is stored in HDFS. So I presume the teeing / copying techniques
        employed for HDFS above can be used here as well.&lt;/P&gt;&lt;P style="margin-left: 40px;"&gt;HBase Data&lt;/P&gt;&lt;P style="margin-left: 60px;"&gt;HBase native DR
        replication mechanism - master-slave, master-master and cyclic (&lt;A href="http://hbase.apache.org/book.html#_cluster_replication"&gt;http://hbase.apache.org/book.html#_cluster_replication&lt;/A&gt;)&lt;/P&gt;&lt;P style="margin-left: 40px;"&gt;Solr Indexes&lt;/P&gt;&lt;P style="margin-left: 60px;"&gt;If indexes are being
        stored in HDFS, HDFS techniques would cover Solr datasets as well&lt;/P&gt;&lt;LI&gt;DB backed services&lt;/LI&gt;&lt;P style="margin-left: 20px;"&gt;Hive Metadata - Periodic backup of the
        database from primary to DR cluster&lt;/P&gt;&lt;P style="margin-left: 20px;"&gt;Amber - Ambari DB contains
        configurations for other ecosystem components. Periodic backup of the
        database from primary to DR cluster&lt;/P&gt;&lt;P style="margin-left: 20px;"&gt;Oozie - Oozie database contains
        job and workflow level information. So this needs to be backed up
        regularly to the DR cluster&lt;/P&gt;&lt;P style="margin-left: 20px;"&gt;Ranger - Ranger policy DB contains
        info about the various policies impacting RBAC. It needs to be backed up to
        the DR cluster&lt;/P&gt;&lt;LI&gt;Configurations&lt;/LI&gt;  
&lt;UL&gt;
   
&lt;/UL&gt;&lt;P style="margin-left: 20px;"&gt;Periodic backup of Ambari
       Server and Agent configurations (Ambari folders under /etc and /var)&lt;/P&gt;&lt;P style="margin-left: 20px;"&gt;Periodic backup of
       Configuration files for each application or service under /etc directory&lt;/P&gt;&lt;P style="margin-left: 20px;"&gt;Periodic backup of binaries
       (/usr/hadoop/current)&lt;/P&gt;&lt;P style="margin-left: 20px;"&gt;Periodic backup of any OS
       specific changes at a node level in the primary cluster&lt;/P&gt;&lt;UL&gt;   
   
   
  &lt;/UL&gt;&lt;LI&gt;Application / User data&lt;/LI&gt;&lt;STRONG&gt;Queries on DR Strategy&lt;/STRONG&gt;
&lt;UL&gt;
 
&lt;LI&gt;Teeing vs Copying- Which one
     is preferred over the other? I understand it's scenario dependent. But which
     has better adaptability and is more widely used in the industry? Copying?&lt;/LI&gt; 
&lt;LI&gt;Is it necessary to have both
     the main and the DR cluster on the same version of HDP? If not, what are
     things to consider if same version is not possible?&lt;/LI&gt; 
&lt;LI&gt;Should it be like for like
     topology between clusters in terms of component placement including
     gateway nodes and zookeeper services?&lt;/LI&gt; 
&lt;LI&gt;How does security play out
     for DR? Should both the cluster nodes be part of the same Kerberos realm
     or can they be part of different realms?&lt;/LI&gt; 
&lt;LI&gt;Can the replication factor be
     lower? Or is it recommended to keep it the same as on the primary
     cluster?&lt;/LI&gt; 
&lt;LI&gt;Any specific network
     requirements in terms of latency, speed etc. between the clusters?&lt;/LI&gt; 
&lt;LI&gt;Is there a need to run
     balancer on the DR cluster periodically?&lt;/LI&gt; 
&lt;LI&gt;How does encryption play out
     between the primary and DR clusters? If encryption at rest is enabled in
     the primary one, how is it handled in the DR cluster? What are the
     implications of wire-encryption while transferring the data between the
     clusters?&lt;/LI&gt; 
&lt;LI&gt;When HDFS snapshots are
     enabled on the primary cluster, how does it work when data is being synced
     to the DR cluster? Can Snapshots be exported onto another cluster? I
     understand this is possible for HBase snapshots. But is it allowed in the HDFS
     case? For example, if a file is deleted on the primary cluster, but
     available in the snapshot, will that be synced to the snapshot directory
     on the DR cluster?&lt;/LI&gt; 
&lt;LI&gt;For services which involve
     databases (Hive, Oozie, Ambari), instead of backing up periodically from
     the primary cluster to the DR cluster, is it recommended to set up one HA
     master in the DR cluster directly?&lt;/LI&gt; 
&lt;LI&gt;For configurations and
     application data, instead of backing up at regular intervals, is there a
     way to keep them in sync between the primary and DR clusters?&lt;/LI&gt; 
&lt;LI&gt;What extra / different
     functionality will third party solutions like WANDisco provide in comparison to Falcon? I am trying to
     understand the "active-active" working of WANDisco and why it is
     not possible with Falcon.&lt;/LI&gt; 
&lt;LI&gt;What is the recommendation to
     ensure gateway node services like Knox and client libraries are kept in
     sync between the clusters?&lt;/LI&gt; 
&lt;LI&gt;What is the recommendation
     for keeping application data, for example, Spark / Sqoop job level
     information?&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;Apologies for the lengthy post, but want to cover all the areas around DR. Hence posted in a single question.&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;&lt;P&gt;Vijay&lt;/P&gt;</description>
      <pubDate>Thu, 26 May 2016 20:56:04 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Questions-on-Disaster-Recovery/m-p/169394#M29848</guid>
      <dc:creator>bhoomireddy_vij</dc:creator>
      <dc:date>2016-05-26T20:56:04Z</dc:date>
    </item>
    <item>
      <title>Re: Questions on Disaster Recovery</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Questions-on-Disaster-Recovery/m-p/169395#M29849</link>
      <description>&lt;P&gt;Hey Vijay, yep, this might be too big of a set of questions for HCC.  My suggestion is to search for particular topics to see if they are already being addressed and then ultimately, imagine these as separate discrete questions.  For example, see &lt;A href="https://community.hortonworks.com/questions/35539/snapshots-backup-and-dr.html" target="_blank"&gt;https://community.hortonworks.com/questions/35539/snapshots-backup-and-dr.html&lt;/A&gt; as a pointed set of questions around snapshots; ok... that one had a bunch of Q's in one, too.  &lt;span class="lia-unicode-emoji" title=":winking_face:"&gt;😉&lt;/span&gt;  Another alternative is to get hold of a solutions engineer from a company like (well, like Hortonworks!) to try to help you through all of these what-if questions.  Additionally, a consultant can help you build an operational "run book" that addresses all of these concerns in a customized version for your org.  Good luck!&lt;/P&gt;</description>
      <pubDate>Thu, 26 May 2016 21:28:17 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Questions-on-Disaster-Recovery/m-p/169395#M29849</guid>
      <dc:creator>LesterMartin</dc:creator>
      <dc:date>2016-05-26T21:28:17Z</dc:date>
    </item>
    <item>
      <title>Re: Questions on Disaster Recovery</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Questions-on-Disaster-Recovery/m-p/169396#M29850</link>
      <description>&lt;P&gt;&lt;STRONG&gt;Teeing vs Copying - Which one is preferred over the other? I understand it's scenario dependent. But which has better adaptability and is more widely used in the industry? Copying?&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;
With teeing, you can split primary tasks between the two clusters and use the other cluster as DR for each task. For example, with clusters C1 and C2, you can use C1 as the primary cluster and C2 as DR for some teams/tasks, and use C2 as the primary cluster and C1 as DR for other users/tasks.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;
Is it necessary to have both the main and the DR cluster on the same version of HDP? If not, what are things to consider if same version is not possible?&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;
It is convenient to have them both on the same version. This is especially the case if you want to use the DR cluster with almost no code changes when the primary is down. &lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;
Should it be like for like topology between clusters in terms of component placement including gateway nodes and zookeeper services? &lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;This is not required.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt; 
How does security play out for DR? Should both the cluster nodes be part of the same Kerberos realm or can they be part of different realms?&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;
As a DR setup, the same realm is a lot easier to manage than cross-realm, but cross-realm is possible. &lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;
Can the replication factor be lower? Or is it recommended to keep it the same as on the primary cluster? &lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;I have seen replication factor 2 used on DR clusters, but if the DR cluster becomes the primary after a disaster, you may have to change the replication factor to 3 on all data sets. &lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;
Any specific network requirements in terms of latency, speed etc. between the clusters?&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;
For distcp, each node on one cluster should be able to communicate with each of the nodes on the second cluster.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;
Is there a need to run balancer on the DR cluster periodically?&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;
Yes. It is always good to run the balancer to keep a similar number of blocks across nodes. &lt;/P&gt;&lt;P&gt;
&lt;STRONG&gt;How does encryption play out between the primary and DR clusters? If encryption at rest is enabled in the primary one, how is it handled in the DR cluster? What are the implications of wire-encryption while transferring the data between the clusters? &lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;Wire encryption will slow down transfers a little bit.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;
When HDFS snapshots are enabled on the primary cluster, how does it work when data is being synced to the DR cluster? Can Snapshots be exported onto another cluster? I understand this is possible for HBase snapshots. But is it allowed in the HDFS case? For example, if a file is deleted on the primary cluster, but available in the snapshot, will that be synced to the snapshot directory on the DR cluster? &lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;If you are using snapshots, you can simply run distcp on the snapshots instead of the actual data set.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt; 
For services which involve databases (Hive, Oozie, Ambari), instead of backing up periodically from the primary cluster to the DR cluster, is it recommended to set up one HA master in the DR cluster directly? &lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;I don't think automating Ambari is a good idea. Configs don't change that much, so a simple process of duplicating them might be better. Backing up would mean you need to have the same hostnames and the same topology.
For Hive, instead of a complete backup, Falcon can take care of table-level replication. &lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;
For configurations and application data, instead of backing up at regular intervals, is there a way to keep them in sync between the primary and DR clusters?&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;
Not sure where your application data resides, but for configuration, since everything is managed by Ambari, you need to keep the Ambari configuration in sync.&lt;/P&gt;</description>
      <pubDate>Thu, 26 May 2016 21:46:55 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Questions-on-Disaster-Recovery/m-p/169396#M29850</guid>
      <dc:creator>ravi1</dc:creator>
      <dc:date>2016-05-26T21:46:55Z</dc:date>
    </item>
  </channel>
</rss>

