<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: How does HDFS checkpointing work in a HA cluster ? in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-does-HDFS-checkpointing-work-in-a-HA-cluster/m-p/305684#M78777</link>
    <description>&lt;P&gt;&lt;A href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/15363" target="_blank" rel="noopener"&gt;@ni4ni&lt;/A&gt;&lt;SPAN&gt;&amp;nbsp;&lt;BR /&gt;&lt;/SPAN&gt;&lt;A href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/82842" target="_blank" rel="noopener"&gt;@Masood&lt;/A&gt;&lt;SPAN&gt;&amp;nbsp;&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Unfortunateéy I will dispute&amp;nbsp; &lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/82842"&gt;@Masood&lt;/a&gt;&amp;nbsp;resèponse with a reference to&amp;nbsp;hadoop.apache.org documentation see link below&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;In a HA setup, the standby does effectively do the checkpointing reference, to maintain correct documentation as a community reference please un-accept the answer&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;&lt;A href="https://hadoop.apache.org/docs/stable/hadoop-project-dist/hadoop-hdfs/HDFSHighAvailabilityWithQJM.html" target="_blank" rel="noopener"&gt;www.hadoop.org&amp;nbsp;&lt;/A&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;See extract a quote from the above website&lt;BR /&gt;&lt;BR /&gt;&lt;FONT color="#FF6600"&gt;"Note that, in an HA cluster, the Standby NameNodes also performs checkpoints of the namespace state, and thus it is not necessary to run a Secondary NameNode, CheckpointNode, or BackupNode in an HA cluster. In fact, to do so would be an error. This also allows one who is reconfiguring a non-HA-enabled HDFS cluster to be HA-enabled to reuse the hardware which they had previously dedicated to the Secondary NameNode."&lt;/FONT&gt;&lt;BR /&gt;&lt;BR /&gt;Happy hadooping&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Mon, 09 Nov 2020 18:28:32 GMT</pubDate>
    <dc:creator>Shelton</dc:creator>
    <dc:date>2020-11-09T18:28:32Z</dc:date>
    <item>
      <title>How does HDFS checkpointing work in a HA cluster ?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-does-HDFS-checkpointing-work-in-a-HA-cluster/m-p/67679#M78773</link>
      <description>&lt;P&gt;Hello&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;There is something I do not fully understand about HDFS and I would be glad if someone here can clarify it for me.&lt;/P&gt;&lt;P&gt;In regular setup, where there are a namenode and a secondary namenode, the secondary namenode is responsibe for checkpoints (merging edits file into fsimage).&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;In a High availability setup, where there are active namenode and a standby namenode, the standby namenode is doing the checkpointing.&lt;/P&gt;&lt;P&gt;But wht happens in a High availability setup when the active namenode is down or destroyed ? The standby namenode is promoted to be active but it is alone now. There is no standby/secondary NN.&lt;/P&gt;&lt;P&gt;And still the cluster should continue functioning as long as the remaining NN is up.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Who is doing the checkpointing in this case ? Is it the surviving Namenode ? Or maybe checkpointing halts unyil someone brings the secod namenode up ? How does it work ?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks you&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Guy&lt;/P&gt;</description>
      <pubDate>Sun, 27 May 2018 06:01:47 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-does-HDFS-checkpointing-work-in-a-HA-cluster/m-p/67679#M78773</guid>
      <dc:creator>ni4ni</dc:creator>
      <dc:date>2018-05-27T06:01:47Z</dc:date>
    </item>
    <item>
      <title>Re: How does HDFS checkpointing work in a HA cluster ?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-does-HDFS-checkpointing-work-in-a-HA-cluster/m-p/67719#M78774</link>
      <description>&lt;P&gt;&lt;SPAN&gt;Hello&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/15363"&gt;@ni4ni&lt;/a&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;As per my understanding if only one Namenode is there, checkpointing can't be achieved&amp;nbsp;until and unless you restart the namenode machine.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;For performing checkpoint you either need HA enable or SNN configured to the cluster.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;If only&amp;nbsp;one Nanenode can we able to perform the checkpoint process then from starting only there will be only one namenode and no secondary namenode. As initially&amp;nbsp;the role of secondary namenode is to keep the Namenode updated.&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;As now we are increasing&amp;nbsp;the number the of worker nodes to suffice our needs the standby namenode comes into picture, which can perform the checkpoint process and also act as active namenode and cater the incoming request as and when needed.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;I hope this will give you some more clarity&lt;BR /&gt;Thanks.&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 29 May 2018 09:45:41 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-does-HDFS-checkpointing-work-in-a-HA-cluster/m-p/67719#M78774</guid>
      <dc:creator>prabhat10</dc:creator>
      <dc:date>2018-05-29T09:45:41Z</dc:date>
    </item>
    <item>
      <title>Re: How does HDFS checkpointing work in a HA cluster ?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-does-HDFS-checkpointing-work-in-a-HA-cluster/m-p/305319#M78775</link>
      <description>&lt;P&gt;&lt;SPAN&gt;No checkpointing takes place. Periodic checkpointing is suspended in an HA setup when the Standby NameNode is down.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 03 Nov 2020 14:12:05 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-does-HDFS-checkpointing-work-in-a-HA-cluster/m-p/305319#M78775</guid>
      <dc:creator>Masood</dc:creator>
      <dc:date>2020-11-03T14:12:05Z</dc:date>
    </item>
    <item>
      <title>Re: How does HDFS checkpointing work in a HA cluster ?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-does-HDFS-checkpointing-work-in-a-HA-cluster/m-p/305342#M78776</link>
      <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/15363"&gt;@ni4ni&lt;/a&gt;&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/82842"&gt;@Masood&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Checkpointing is a process that takes an fsimage and edit log and compacts them into a new fsimage. This way, instead of replaying a potentially unbounded edit log, the NameNode can load the final in-memory state directly from the fsimage. This is a far more efficient operation and reduces NameNode startup time.&lt;/P&gt;&lt;P&gt;Checkpointing is one of the most important activites of the standby or secondary Namenode in a cluster.&lt;BR /&gt;In an HA cluster, all connections and cluster activity is managed by the Active namenode and the &lt;A href="https://hadoop.apache.org/docs/stable/hadoop-project-dist/hadoop-hdfs/HDFSHighAvailabilityWithNFS.html" target="_blank" rel="noopener"&gt;Standby NameNode takes the responsibility of compacting the edits logs and fsimage&lt;/A&gt; it does also performs checkpoints of the namespace state, and thus it is not necessary to run a Secondary NameNode&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Hope that helps&lt;/P&gt;</description>
      <pubDate>Tue, 03 Nov 2020 20:48:18 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-does-HDFS-checkpointing-work-in-a-HA-cluster/m-p/305342#M78776</guid>
      <dc:creator>Shelton</dc:creator>
      <dc:date>2020-11-03T20:48:18Z</dc:date>
    </item>
    <item>
      <title>Re: How does HDFS checkpointing work in a HA cluster ?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-does-HDFS-checkpointing-work-in-a-HA-cluster/m-p/305684#M78777</link>
      <description>&lt;P&gt;&lt;A href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/15363" target="_blank" rel="noopener"&gt;@ni4ni&lt;/A&gt;&lt;SPAN&gt;&amp;nbsp;&lt;BR /&gt;&lt;/SPAN&gt;&lt;A href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/82842" target="_blank" rel="noopener"&gt;@Masood&lt;/A&gt;&lt;SPAN&gt;&amp;nbsp;&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Unfortunateéy I will dispute&amp;nbsp; &lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/82842"&gt;@Masood&lt;/a&gt;&amp;nbsp;resèponse with a reference to&amp;nbsp;hadoop.apache.org documentation see link below&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;In a HA setup, the standby does effectively do the checkpointing reference, to maintain correct documentation as a community reference please un-accept the answer&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;&lt;A href="https://hadoop.apache.org/docs/stable/hadoop-project-dist/hadoop-hdfs/HDFSHighAvailabilityWithQJM.html" target="_blank" rel="noopener"&gt;www.hadoop.org&amp;nbsp;&lt;/A&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;See extract a quote from the above website&lt;BR /&gt;&lt;BR /&gt;&lt;FONT color="#FF6600"&gt;"Note that, in an HA cluster, the Standby NameNodes also performs checkpoints of the namespace state, and thus it is not necessary to run a Secondary NameNode, CheckpointNode, or BackupNode in an HA cluster. In fact, to do so would be an error. This also allows one who is reconfiguring a non-HA-enabled HDFS cluster to be HA-enabled to reuse the hardware which they had previously dedicated to the Secondary NameNode."&lt;/FONT&gt;&lt;BR /&gt;&lt;BR /&gt;Happy hadooping&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 09 Nov 2020 18:28:32 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-does-HDFS-checkpointing-work-in-a-HA-cluster/m-p/305684#M78777</guid>
      <dc:creator>Shelton</dc:creator>
      <dc:date>2020-11-09T18:28:32Z</dc:date>
    </item>
    <item>
      <title>Re: How does HDFS checkpointing work in a HA cluster ?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-does-HDFS-checkpointing-work-in-a-HA-cluster/m-p/306452#M78778</link>
      <description>&lt;P&gt;&lt;FONT color="#FF0000"&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/15363"&gt;@ni4ni&lt;/a&gt;&amp;nbsp; mentions,&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT color="#FF0000"&gt;There is no standby/secondary NN.&lt;/FONT&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 25 Nov 2020 10:35:34 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-does-HDFS-checkpointing-work-in-a-HA-cluster/m-p/306452#M78778</guid>
      <dc:creator>Masood</dc:creator>
      <dc:date>2020-11-25T10:35:34Z</dc:date>
    </item>
  </channel>
</rss>

