<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: checkpoint is not occurring in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/checkpoint-is-not-occuring/m-p/15442#M2048</link>
    <description>It is difficult to say whether you are hitting a bug without looking at the relevant entries logged by the Checkpointer in the Standby NameNode (SBN) logs.&lt;BR /&gt;&lt;BR /&gt;There may be a problem transferring the file between the SBN and the NN, possibly because of timeouts or some other cause.</description>
    <pubDate>Sun, 20 Jul 2014 05:23:23 GMT</pubDate>
    <dc:creator>Harsh J</dc:creator>
    <dc:date>2014-07-20T05:23:23Z</dc:date>
    <item>
      <title>checkpoint is not occurring</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/checkpoint-is-not-occuring/m-p/14050#M2047</link>
      <description>&lt;P&gt;Dear all,&lt;/P&gt;&lt;P&gt;I recently enabled HA on my NameNode and started to see a problem with the checkpoint process: no checkpoint has occurred for the past 5 hours.&lt;BR /&gt;&lt;BR /&gt;My observations are below. Has anyone seen this case before, or am I hitting a bug? Kindly share your advice on how to resolve this issue.&lt;/P&gt;&lt;DIV&gt;As I understand the checkpoint process, when the updated fsimage is downloaded to the NameNode from the Standby NameNode, the file "fsimage.ckpt_txid" must be renamed to "fsimage_txid". That is not happening in my case.&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;I do not see any file named "fsimage_txid" on my NameNode; all of them look like "fsimage.ckpt_txid".&lt;/DIV&gt;&lt;DIV&gt;So I compared "fsimage.ckpt_txid" and "fsimage_txid", and both have the same checksum value.&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;fsimage.ckpt_txid is from the NameNode.&lt;/DIV&gt;&lt;DIV&gt;fsimage_txid is from the Standby NameNode (host "secondary-namenode").&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;namenode:&lt;BR /&gt;=========&lt;/DIV&gt;&lt;DIV&gt;root@namenode:/mnt/sdb/name/current# cksum fsimage.ckpt_0000000000604392126&lt;/DIV&gt;&lt;DIV&gt;3708522794 2148716968 fsimage.ckpt_0000000000604392126&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;secondary-namenode:&lt;BR /&gt;================&lt;/DIV&gt;&lt;DIV&gt;root@secondary-namenode:/mnt/sdd/name/current# cksum fsimage_0000000000604392126&lt;/DIV&gt;&lt;DIV&gt;3708522794 2148716968 fsimage_0000000000604392126&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;NOTE: I did not see any network issue; I am able to download the fsimage using the "wget" command.&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;I am using CDH 4.1.3 and Cloudera Enterprise 4.6.3.&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;Best regards,&lt;BR /&gt;Bommuraj&lt;/DIV&gt;</description>
      <pubDate>Tue, 24 Jun 2014 00:37:20 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/checkpoint-is-not-occuring/m-p/14050#M2047</guid>
      <dc:creator>Bommuraj Paramaraj</dc:creator>
      <dc:date>2014-06-24T00:37:20Z</dc:date>
    </item>
    <item>
      <title>Re: checkpoint is not occurring</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/checkpoint-is-not-occuring/m-p/15442#M2048</link>
      <description>It is difficult to say whether you are hitting a bug without looking at the relevant entries logged by the Checkpointer in the Standby NameNode (SBN) logs.&lt;BR /&gt;&lt;BR /&gt;There may be a problem transferring the file between the SBN and the NN, possibly because of timeouts or some other cause.</description>
      <pubDate>Sun, 20 Jul 2014 05:23:23 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/checkpoint-is-not-occuring/m-p/15442#M2048</guid>
      <dc:creator>Harsh J</dc:creator>
      <dc:date>2014-07-20T05:23:23Z</dc:date>
    </item>
    <item>
      <title>Re: checkpoint is not occurring</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/checkpoint-is-not-occuring/m-p/15744#M2049</link>
      <description>Thank you, Harsh, for your email! I was hitting the issue below. Increasing "dfs.image.transfer.timeout" fixed it.&lt;BR /&gt;&lt;BR /&gt;&lt;A target="_blank" href="https://issues.apache.org/jira/browse/HDFS-4301"&gt;https://issues.apache.org/jira/browse/HDFS-4301&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;Checkpointing was working fine, but the issue started when my fsimage size reached 2.1 GB.&lt;BR /&gt;&lt;BR /&gt;Best regards,&lt;BR /&gt;Bommuraj</description>
      <pubDate>Mon, 21 Jul 2014 17:36:18 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/checkpoint-is-not-occuring/m-p/15744#M2049</guid>
      <dc:creator>Bommuraj Paramaraj</dc:creator>
      <dc:date>2014-07-21T17:36:18Z</dc:date>
    </item>
  </channel>
</rss>

