<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: journal node edit log issue in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/journal-node-edit-log-issue/m-p/139581#M43938</link>
    <description>&lt;P&gt;Hi &lt;A rel="user" href="https://community.cloudera.com/users/10616/sgowda.html" nodeid="10616"&gt;@Santhosh B Gowda&lt;/A&gt;,&lt;/P&gt;&lt;P&gt;Assuming that this is happening on a single JournalNode, you can try the following:&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;As a precaution, stop HDFS. This will shut down all JournalNodes as well.&lt;/LI&gt;&lt;LI&gt;On the node in question, move the edits directory (/hadoop/hdfs/journal/stanleyhotel/current) to an alternate location.&lt;/LI&gt;&lt;LI&gt;Copy the edits directory (/hadoop/hdfs/journal/stanleyhotel/current) from a functioning JournalNode to this node.&lt;/LI&gt;&lt;LI&gt;Start HDFS.&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;This should bring this JournalNode back in line with the others and return you to a properly functioning HA state.&lt;/P&gt;</description>
    <pubDate>Thu, 20 Oct 2016 05:22:40 GMT</pubDate>
    <dc:creator>bwilson</dc:creator>
    <dc:date>2016-10-20T05:22:40Z</dc:date>
    <item>
      <title>journal node edit log issue</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/journal-node-edit-log-issue/m-p/139580#M43937</link>
      <description>&lt;P&gt;The JournalNode is logging the WARN below, and Ambari is alerting that the JournalNode web UI is not accessible. Any idea how to recover from this?&lt;/P&gt;&lt;PRE&gt;
2016-10-19 12:36:20,353 WARN  namenode.FSImage (EditLogFileInputStream.java:scanEditLog(359)) - Caught exception after scanning through 0 ops from /hadoop/hdfs/journal/stanleyhotel/current/edits_inprogress_0000000000064985103 while determining its valid length. Position was 888832

java.io.IOException: Can't scan a pre-transactional edit log.
	at org.apache.hadoop.hdfs.server.namenode.FSEditLogOp$LegacyReader.scanOp(FSEditLogOp.java:4959)
	at org.apache.hadoop.hdfs.server.namenode.EditLogFileInputStream.scanNextOp(EditLogFileInputStream.java:245)
	at org.apache.hadoop.hdfs.server.namenode.EditLogFileInputStream.scanEditLog(EditLogFileInputStream.java:355)
	at org.apache.hadoop.hdfs.server.namenode.FileJournalManager$EditLogFile.scanLog(FileJournalManager.java:551)
	at org.apache.hadoop.hdfs.qjournal.server.Journal.scanStorageForLatestEdits(Journal.java:192)
	at org.apache.hadoop.hdfs.qjournal.server.Journal.&amp;lt;init&amp;gt;(Journal.java:152)
	at org.apache.hadoop.hdfs.qjournal.server.JournalNode.getOrCreateJournal(JournalNode.java:90)
	at org.apache.hadoop.hdfs.qjournal.server.JournalNode.getOrCreateJournal(JournalNode.java:99)
	at org.apache.hadoop.hdfs.qjournal.server.JournalNodeRpcServer.heartbeat(JournalNodeRpcServer.java:158)
	at org.apache.hadoop.hdfs.qjournal.protocolPB.QJournalProtocolServerSideTranslatorPB.heartbeat(QJournalProtocolServerSideTranslatorPB.java:172)
	at org.apache.hadoop.hdfs.qjournal.protocol.QJournalProtocolProtos$QJournalProtocolService$2.callBlockingMethod(QJournalProtocolProtos.java:25423)
	at org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:640)
	at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:982)
	at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2313)
	at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2309)
	at java.security.AccessController.doPrivileged(Native Method)
	at javax.security.auth.Subject.doAs(Subject.java:422)
	at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1724)
	at org.apache.hadoop.ipc.Server$Handler.run(Server.java:2307)
2016-10-19 12:36:20,353 WARN  namenode.FSImage (EditLogFileInputStream.java:scanEditLog(364)) - After resync, position is 888832
&lt;/PRE&gt;</description>
      <pubDate>Wed, 19 Oct 2016 19:49:35 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/journal-node-edit-log-issue/m-p/139580#M43937</guid>
      <dc:creator>sgowda</dc:creator>
      <dc:date>2016-10-19T19:49:35Z</dc:date>
    </item>
    <item>
      <title>Re: journal node edit log issue</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/journal-node-edit-log-issue/m-p/139581#M43938</link>
      <description>&lt;P&gt;Hi &lt;A rel="user" href="https://community.cloudera.com/users/10616/sgowda.html" nodeid="10616"&gt;@Santhosh B Gowda&lt;/A&gt;,&lt;/P&gt;&lt;P&gt;Assuming that this is happening on a single JournalNode, you can try the following:&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;As a precaution, stop HDFS. This will shut down all JournalNodes as well.&lt;/LI&gt;&lt;LI&gt;On the node in question, move the edits directory (/hadoop/hdfs/journal/stanleyhotel/current) to an alternate location.&lt;/LI&gt;&lt;LI&gt;Copy the edits directory (/hadoop/hdfs/journal/stanleyhotel/current) from a functioning JournalNode to this node.&lt;/LI&gt;&lt;LI&gt;Start HDFS.&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;This should bring this JournalNode back in line with the others and return you to a properly functioning HA state.&lt;/P&gt;</description>
      <pubDate>Thu, 20 Oct 2016 05:22:40 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/journal-node-edit-log-issue/m-p/139581#M43938</guid>
      <dc:creator>bwilson</dc:creator>
      <dc:date>2016-10-20T05:22:40Z</dc:date>
    </item>
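    <!--
    Editorial sketch of steps 2 and 3 from the reply above (move the damaged
    "current" directory aside, then install a copy taken from a healthy
    JournalNode). This is only an illustration under stated assumptions: HDFS
    and the JournalNode process are already stopped, the healthy node's
    "current" directory has already been transferred to a local staging path
    by some other means, and the function name, backup naming scheme, and
    staging path are hypothetical, not part of Hadoop or Ambari.

```python
import shutil
import time
from pathlib import Path


def replace_journal_dir(sick_current, healthy_copy, backup_root):
    """Move the damaged journal directory aside and install a healthy copy.

    sick_current: the damaged 'current' directory on this node.
    healthy_copy: a local copy of 'current' taken from a healthy JournalNode
                  (assumed to have been transferred here beforehand).
    backup_root:  directory that will receive the moved-aside damaged data.
    Returns the path of the backup directory that was created.
    """
    sick_current = Path(sick_current)
    healthy_copy = Path(healthy_copy)
    backup_root = Path(backup_root)

    # Validate both inputs before touching anything on disk.
    if not healthy_copy.is_dir():
        raise FileNotFoundError(f"healthy copy not found: {healthy_copy}")
    if not sick_current.is_dir():
        raise FileNotFoundError(f"journal directory not found: {sick_current}")

    # Step 2: move the damaged directory to an alternate location.
    backup_root.mkdir(parents=True, exist_ok=True)
    backup = backup_root / f"{sick_current.name}.bak.{int(time.time())}"
    shutil.move(str(sick_current), str(backup))

    # Step 3: install the healthy copy under the original path.
    shutil.copytree(healthy_copy, sick_current)
    return backup


# Hypothetical invocation, using the path quoted in the thread:
# replace_journal_dir(
#     "/hadoop/hdfs/journal/stanleyhotel/current",
#     "/tmp/healthy_current",          # assumed staging location
#     "/hadoop/hdfs/journal_backup",   # assumed backup location
# )
```

    shutil.copytree copies file contents and permission bits but not
    ownership, so the copied files may still need their owner changed to
    whichever account runs the JournalNode on your cluster.
    -->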
    <item>
      <title>Re: journal node edit log issue</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/journal-node-edit-log-issue/m-p/139582#M43939</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/99/bwilson.html" nodeid="99"&gt;@Brandon Wilson&lt;/A&gt; Thanks, it resolved the problem.&lt;/P&gt;</description>
      <pubDate>Tue, 25 Oct 2016 11:49:04 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/journal-node-edit-log-issue/m-p/139582#M43939</guid>
      <dc:creator>sgowda</dc:creator>
      <dc:date>2016-10-25T11:49:04Z</dc:date>
    </item>
    <item>
      <title>Re: journal node edit log issue</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/journal-node-edit-log-issue/m-p/139583#M43940</link>
      <description>&lt;P&gt;Hi &lt;A rel="user" href="https://community.cloudera.com/users/99/bwilson.html" nodeid="99"&gt;@Brandon Wilson&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Your solution works perfectly, but only if the "edits_inprogress_" file has the same name on both JournalNodes (JNs).&lt;/P&gt;&lt;P&gt;On my devcluster, I left the problem unattended for two months. During this time, the healthy JN created a new "edits_inprogress_" file, but the sick JN still asks for the old "edits_inprogress_" file. I followed all 4 steps of your procedure, but the sick JN again asks for the old file. The contents of &lt;EM&gt;/hadoop/hdfs/journal/devcluster/current&lt;/EM&gt; are the same on both nodes.&lt;/P&gt;&lt;P&gt;What should I do?&lt;/P&gt;&lt;P&gt;Log of healthy JN (edits_inprogress_0000000000016172345)&lt;/P&gt;&lt;PRE&gt;2017-02-02 10:15:12,513 INFO  namenode.FileJournalManager (FileJournalManager.java:finalizeLogSegment(133)) - Finalizing edits file /hadoop/hdfs/journal/devcluster/current/edits_inprogress_0000000000016172345 -&amp;gt; /hadoop/hdfs/journal/devcluster/current/edits_0000000000016172345-0000000000016172394
&lt;/PRE&gt;&lt;P&gt;Log of sick JN (edits_inprogress_0000000000011766543)&lt;/P&gt;&lt;PRE&gt;2017-02-02 10:15:57,744 WARN  namenode.FSImage (EditLogFileInputStream.java:scanEditLog(350)) - Caught exception after scanning through 0 ops from /hadoop/hdfs/journal/devcluster/current/edits_inprogress_0000000000011766543 while determining its valid length. Position was 1036288
java.io.IOException: Can't scan a pre-transactional edit log.
&lt;/PRE&gt;</description>
      <pubDate>Thu, 02 Feb 2017 16:59:54 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/journal-node-edit-log-issue/m-p/139583#M43940</guid>
      <dc:creator>aloha</dc:creator>
      <dc:date>2017-02-02T16:59:54Z</dc:date>
    </item>
    <item>
      <title>Re: journal node edit log issue</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/journal-node-edit-log-issue/m-p/139584#M43941</link>
      <description>&lt;P&gt;Solved it! The sick JN didn't stop when I stopped it in Ambari, or even when I stopped HDFS in Ambari. I killed the JN process manually, replaced its data with a copy from the healthy JN, and started HDFS. Now it works! &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 03 Feb 2017 15:14:53 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/journal-node-edit-log-issue/m-p/139584#M43941</guid>
      <dc:creator>aloha</dc:creator>
      <dc:date>2017-02-03T15:14:53Z</dc:date>
    </item>
    <item>
      <title>Re: journal node edit log issue</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/journal-node-edit-log-issue/m-p/139585#M43942</link>
      <description>&lt;P&gt;Thanks &lt;A rel="user" href="https://community.cloudera.com/users/99/bwilson.html" nodeid="99"&gt;@Brandon Wilson&lt;/A&gt;, it worked for me too.&lt;/P&gt;</description>
      <pubDate>Thu, 21 Jun 2018 18:16:24 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/journal-node-edit-log-issue/m-p/139585#M43942</guid>
      <dc:creator>erkansirin78</dc:creator>
      <dc:date>2018-06-21T18:16:24Z</dc:date>
    </item>
  </channel>
</rss>

