<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Data Node recovery after a day while under replicated blocks are still replicating in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Data-Node-recovery-after-a-day-while-under-replicated-blocks/m-p/318761#M227534</link>
    <description>&lt;P&gt;Hello &lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/57620"&gt;@pauljoshiva&lt;/a&gt;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;The NameNode endeavors to ensure that each block always has the intended number of replicas. The NameNode detects that a block has become under- or over-replicated when a block report from a DataNode arrives. When a block becomes over replicated, the NameNode chooses a replica to remove. The NameNode will prefer not to reduce the number of racks that host replicas, and secondly prefer to remove a replica from the DataNode with the least amount of available disk space. The goal is to balance storage utilization across DataNodes without reducing the block's availability.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Hope this answers your query.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Regards,&lt;/P&gt;&lt;P&gt;Manoj&lt;/P&gt;</description>
    <pubDate>Wed, 16 Jun 2021 08:08:39 GMT</pubDate>
    <dc:creator>amk</dc:creator>
    <dc:date>2021-06-16T08:08:39Z</dc:date>
    <item>
      <title>Data Node recovery after a day while under replicated blocks are still replicating</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Data-Node-recovery-after-a-day-while-under-replicated-blocks/m-p/318744#M227521</link>
      <description>&lt;P&gt;I have 5 Data nodes, where 1 Data node crashed. Name node is still replicating the data blocks of the crashed Data node. What happens when I recover the crashed Data node. The data blocks which is under replicated will be reported but what happens to already replicated blocks. Will they be marked as corrupted blocks or will be deleted/removed since there will be 4 replicas?&lt;/P&gt;</description>
      <pubDate>Tue, 15 Jun 2021 16:51:03 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Data-Node-recovery-after-a-day-while-under-replicated-blocks/m-p/318744#M227521</guid>
      <dc:creator>pauljoshiva</dc:creator>
      <dc:date>2021-06-15T16:51:03Z</dc:date>
    </item>
    <item>
      <title>Re: Data Node recovery after a day while under replicated blocks are still replicating</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Data-Node-recovery-after-a-day-while-under-replicated-blocks/m-p/318760#M227533</link>
      <description>&lt;P&gt;Hello&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/57620"&gt;@pauljoshiva&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;If the crashed data node comes up, then the data replica count will be 4 that is the case of over replicated blocks.&lt;SPAN&gt;HDFS will automatically delete the excess replicas as the default replication factor has to be maintained 3. The replica from the now active datanode is going to be removed.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;&lt;A href="https://docs.cloudera.com/runtime/7.2.9/hdfs-overview/topics/hdfs-how-namenode-manages-blocks-on-a-failed-datanode.html" target="_blank"&gt;https://docs.cloudera.com/runtime/7.2.9/hdfs-overview/topics/hdfs-how-namenode-manages-blocks-on-a-failed-datanode.html&lt;/A&gt;&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 16 Jun 2021 07:57:41 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Data-Node-recovery-after-a-day-while-under-replicated-blocks/m-p/318760#M227533</guid>
      <dc:creator>tusharkathpal</dc:creator>
      <dc:date>2021-06-16T07:57:41Z</dc:date>
    </item>
    <item>
      <title>Re: Data Node recovery after a day while under replicated blocks are still replicating</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Data-Node-recovery-after-a-day-while-under-replicated-blocks/m-p/318761#M227534</link>
      <description>&lt;P&gt;Hello &lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/57620"&gt;@pauljoshiva&lt;/a&gt;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;The NameNode endeavors to ensure that each block always has the intended number of replicas. The NameNode detects that a block has become under- or over-replicated when a block report from a DataNode arrives. When a block becomes over replicated, the NameNode chooses a replica to remove. The NameNode will prefer not to reduce the number of racks that host replicas, and secondly prefer to remove a replica from the DataNode with the least amount of available disk space. The goal is to balance storage utilization across DataNodes without reducing the block's availability.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Hope this answers your query.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Regards,&lt;/P&gt;&lt;P&gt;Manoj&lt;/P&gt;</description>
      <pubDate>Wed, 16 Jun 2021 08:08:39 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Data-Node-recovery-after-a-day-while-under-replicated-blocks/m-p/318761#M227534</guid>
      <dc:creator>amk</dc:creator>
      <dc:date>2021-06-16T08:08:39Z</dc:date>
    </item>
  </channel>
</rss>

