<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question HDFS resiliency - DR - rack aware in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/HDFS-resiliency-DR-rack-aware/m-p/185587#M147694</link>
    <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;I am in the process of improving the resilience of our Hadoop clusters.&lt;/P&gt;&lt;P&gt;We are using a twin-datacenter architecture: the Hadoop cluster nodes are located in two different buildings separated by 10 km, with NameNode HA activated.&lt;/P&gt;&lt;P&gt;We are using a replication factor of 4 plus rack awareness with 2 racks (one rack per site).&lt;/P&gt;&lt;P&gt;A replication factor of 4 is probably a bit of a "luxury", but it might protect against the loss of an entire rack (the loss of a site) plus the loss of some nodes on the remaining site.&lt;/P&gt;&lt;P&gt;If we lose an entire rack, will HDFS try to re-replicate the data on the remaining rack, so that we end up with 4 replicas on the same rack and overconsume space there? Or will it "disable" the replicas that are supposed to be located on the failed rack?&lt;/P&gt;&lt;P&gt;Does it make sense to create 4 racks (one for each replica) in order to ensure that the data is replicated across both sites in a balanced way (2x2)?&lt;/P&gt;&lt;P&gt;Many thanks in advance for your feedback.&lt;/P&gt;&lt;P&gt;Regards&lt;/P&gt;&lt;P&gt;Laurent&lt;/P&gt;</description>
    <pubDate>Thu, 13 Jul 2017 16:06:48 GMT</pubDate>
    <dc:creator>laule75</dc:creator>
    <dc:date>2017-07-13T16:06:48Z</dc:date>
    <item>
      <title>HDFS resiliency - DR - rack aware</title>
      <link>https://community.cloudera.com/t5/Support-Questions/HDFS-resiliency-DR-rack-aware/m-p/185587#M147694</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;I am in the process of improving the resilience of our Hadoop clusters.&lt;/P&gt;&lt;P&gt;We are using a twin-datacenter architecture: the Hadoop cluster nodes are located in two different buildings separated by 10 km, with NameNode HA activated.&lt;/P&gt;&lt;P&gt;We are using a replication factor of 4 plus rack awareness with 2 racks (one rack per site).&lt;/P&gt;&lt;P&gt;A replication factor of 4 is probably a bit of a "luxury", but it might protect against the loss of an entire rack (the loss of a site) plus the loss of some nodes on the remaining site.&lt;/P&gt;&lt;P&gt;If we lose an entire rack, will HDFS try to re-replicate the data on the remaining rack, so that we end up with 4 replicas on the same rack and overconsume space there? Or will it "disable" the replicas that are supposed to be located on the failed rack?&lt;/P&gt;&lt;P&gt;Does it make sense to create 4 racks (one for each replica) in order to ensure that the data is replicated across both sites in a balanced way (2x2)?&lt;/P&gt;&lt;P&gt;Many thanks in advance for your feedback.&lt;/P&gt;&lt;P&gt;Regards&lt;/P&gt;&lt;P&gt;Laurent&lt;/P&gt;</description>
      <pubDate>Thu, 13 Jul 2017 16:06:48 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/HDFS-resiliency-DR-rack-aware/m-p/185587#M147694</guid>
      <dc:creator>laule75</dc:creator>
      <dc:date>2017-07-13T16:06:48Z</dc:date>
    </item>
    <item>
      <title>Re: HDFS resiliency - DR - rack aware</title>
      <link>https://community.cloudera.com/t5/Support-Questions/HDFS-resiliency-DR-rack-aware/m-p/185588#M147695</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/1327/laule75.html" nodeid="1327"&gt;@Laurent lau&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Let's take it one by one:&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;One big advantage of 4 replicas might actually be faster jobs when several big jobs are launched simultaneously.&lt;/LI&gt;&lt;LI&gt;With or without rack awareness, if data is lost and the number of replicas falls below the specified replication factor, HDFS will try to re-replicate the blocks and bring them back to the original replication factor.&lt;/LI&gt;&lt;LI&gt;The replicas will never all be on the same rack unless that is the only rack alive.&lt;/LI&gt;&lt;LI&gt;My suggestion: for best performance and availability, use a minimum of 3 racks per data center.&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;I wrote an article some time ago that might also help clarify some of your doubts:&lt;/P&gt;&lt;P&gt;&lt;A target="_blank" href="https://community.hortonworks.com/content/kbentry/43057/rack-awareness-1.html"&gt;https://community.hortonworks.com/content/kbentry/43057/rack-awareness-1.html&lt;/A&gt; &lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;</description>
      <pubDate>Fri, 14 Jul 2017 07:49:09 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/HDFS-resiliency-DR-rack-aware/m-p/185588#M147695</guid>
      <dc:creator>rbiswas1</dc:creator>
      <dc:date>2017-07-14T07:49:09Z</dc:date>
    </item>
    <item>
      <title>Re: HDFS resiliency - DR - rack aware</title>
      <link>https://community.cloudera.com/t5/Support-Questions/HDFS-resiliency-DR-rack-aware/m-p/185589#M147696</link>
      <description>&lt;P&gt;Thanks &lt;A rel="user" href="https://community.cloudera.com/users/3902/rbiswas.html" nodeid="3902"&gt;@rbiswas&lt;/A&gt; for your answer.&lt;/P&gt;&lt;P&gt;My concern is the speed of the replication if, let's say, one rack is unavailable for 24 to 48 hours for maintenance reasons and, in the meantime, HDFS tries to replicate all the data on the remaining rack, which might saturate the disk space on that rack!&lt;/P&gt;&lt;P&gt;I can't find any documentation mentioning this "HDFS rebalance speed".&lt;/P&gt;&lt;P&gt;Also, it looks to me that, if the number of replica factor is equal to the number of racks, there is no guarantee that there will be a replica spread in each rack.&lt;/P&gt;&lt;P&gt;Can you confirm this?&lt;/P&gt;&lt;P&gt;Thanks in advance.&lt;/P&gt;&lt;P&gt;Regards&lt;/P&gt;&lt;P&gt;Laurent&lt;/P&gt;</description>
      <pubDate>Mon, 17 Jul 2017 15:19:59 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/HDFS-resiliency-DR-rack-aware/m-p/185589#M147696</guid>
      <dc:creator>laule75</dc:creator>
      <dc:date>2017-07-17T15:19:59Z</dc:date>
    </item>
    <item>
      <title>Re: HDFS resiliency - DR - rack aware</title>
      <link>https://community.cloudera.com/t5/Support-Questions/HDFS-resiliency-DR-rack-aware/m-p/185590#M147697</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/1327/laule75.html" nodeid="1327"&gt;@Laurent lau&lt;/A&gt; &lt;/P&gt;&lt;P&gt;You can adjust the HDFS rebalance speed to suit your needs. Refer to this documentation:&lt;/P&gt;&lt;P&gt;&lt;A target="_blank" href="https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.6.0/bk_hdfs-administration/content/configuring_balancer.html"&gt;https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.6.0/bk_hdfs-administration/content/configuring_balancer.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;If you are worried about filling the disk space during a rack maintenance operation, you can configure the balancer to be really slow, so that over a 48-hour window virtually nothing will be replicated. Or, if the situation permits, you can put the NameNode in safe mode, which allows read operations but no writes.&lt;/P&gt;&lt;P&gt;This is correct: "if the number of replica factor is equal to the number of racks, there is no guarantee that there will be a replica spread in each rack." The policy is that the replicas will not all be on the same rack.&lt;/P&gt;</description>
      <pubDate>Mon, 17 Jul 2017 21:54:29 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/HDFS-resiliency-DR-rack-aware/m-p/185590#M147697</guid>
      <dc:creator>rbiswas1</dc:creator>
      <dc:date>2017-07-17T21:54:29Z</dc:date>
    </item>
    <item>
      <title>Re: HDFS resiliency - DR - rack aware</title>
      <link>https://community.cloudera.com/t5/Support-Questions/HDFS-resiliency-DR-rack-aware/m-p/185591#M147698</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/1327/laule75.html" nodeid="1327"&gt;@Laurent lau&lt;/A&gt; &lt;/P&gt;&lt;P&gt;Do you have any more questions on this? Otherwise, you can accept the answer to close the thread.&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;</description>
      <pubDate>Thu, 20 Jul 2017 00:18:37 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/HDFS-resiliency-DR-rack-aware/m-p/185591#M147698</guid>
      <dc:creator>rbiswas1</dc:creator>
      <dc:date>2017-07-20T00:18:37Z</dc:date>
    </item>
    <item>
      <title>Re: HDFS resiliency - DR - rack aware</title>
      <link>https://community.cloudera.com/t5/Support-Questions/HDFS-resiliency-DR-rack-aware/m-p/185592#M147699</link>
      <description>&lt;P&gt;Hello &lt;A rel="user" href="https://community.cloudera.com/users/3902/rbiswas.html" nodeid="3902" target="_blank"&gt;@rbiswas&lt;/A&gt;,&lt;/P&gt;&lt;P&gt;Sorry, I'm a bit confused by your last statement.&lt;/P&gt;&lt;P&gt;Could you please confirm that if I define a replication factor of 4 and 4 racks, I will get the following distribution of replicas (see diagram below)?&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="19517-hdfs-rack.png" style="width: 546px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/18878i47C61D032D8225FC/image-size/medium?v=v2&amp;amp;px=400" role="button" title="19517-hdfs-rack.png" alt="19517-hdfs-rack.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;Regards&lt;/P&gt;&lt;P&gt;Laurent&lt;/P&gt;</description>
      <pubDate>Sun, 18 Aug 2019 08:21:09 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/HDFS-resiliency-DR-rack-aware/m-p/185592#M147699</guid>
      <dc:creator>laule75</dc:creator>
      <dc:date>2019-08-18T08:21:09Z</dc:date>
    </item>
    <item>
      <title>Re: HDFS resiliency - DR - rack aware</title>
      <link>https://community.cloudera.com/t5/Support-Questions/HDFS-resiliency-DR-rack-aware/m-p/185593#M147700</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/1327/laule75.html" nodeid="1327"&gt;@Laurent lau&lt;/A&gt; that equal distribution of replicas is not guaranteed.&lt;/P&gt;&lt;P&gt;If you think about it at a high level, it would compromise the speed of writes as well as reads, so it is not recommended even if you are planning to do it programmatically.&lt;/P&gt;</description>
      <pubDate>Fri, 21 Jul 2017 23:34:00 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/HDFS-resiliency-DR-rack-aware/m-p/185593#M147700</guid>
      <dc:creator>rbiswas1</dc:creator>
      <dc:date>2017-07-21T23:34:00Z</dc:date>
    </item>
    <item>
      <title>Re: HDFS resiliency - DR - rack aware</title>
      <link>https://community.cloudera.com/t5/Support-Questions/HDFS-resiliency-DR-rack-aware/m-p/185594#M147701</link>
      <description>&lt;P&gt;Hello &lt;A rel="user" href="https://community.cloudera.com/users/3902/rbiswas.html" nodeid="3902"&gt;@rbiswas&lt;/A&gt;,&lt;/P&gt;&lt;P&gt;Sorry for the delay in getting back to you (I was on holiday).&lt;/P&gt;&lt;P&gt;Thanks for your answers. Yes, we can close the thread.&lt;/P&gt;&lt;P&gt;Regards&lt;/P&gt;&lt;P&gt;Laurent&lt;/P&gt;</description>
      <pubDate>Wed, 02 Aug 2017 21:39:44 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/HDFS-resiliency-DR-rack-aware/m-p/185594#M147701</guid>
      <dc:creator>laule75</dc:creator>
      <dc:date>2017-08-02T21:39:44Z</dc:date>
    </item>
  </channel>
</rss>

