<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: how to recover the edits_inprogress_xxxxxx file ? in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/how-to-recover-the-edits-inprogress-xxxxxx-file/m-p/242508#M204309</link>
    <description>&lt;P&gt;so according to this info do you recommended to set the value as is ( 31 ) or other ?&lt;/P&gt;</description>
    <pubDate>Fri, 14 Jun 2019 13:44:52 GMT</pubDate>
    <dc:creator>mike_bronson7</dc:creator>
    <dc:date>2019-06-14T13:44:52Z</dc:date>
    <item>
      <title>how to recover the edits_inprogress_xxxxxx file ?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/how-to-recover-the-edits-inprogress-xxxxxx-file/m-p/242502#M204303</link>
      <description>&lt;P&gt;hi all&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;we have ambari cluster - HDP version - 2.6.4&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;on one of the master machine we cant start the &lt;STRONG&gt;name node&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;and we notice that&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;PRE&gt;  ls &amp;nbsp;/hadoop/hdfs/journal/hdfsha/current/ | grep edits_inprogress&amp;nbsp;
  edits_inprogress_0000000000018783114.empty&lt;/PRE&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;we not have the &lt;STRONG&gt;edits_inprogress_xxxxxx&lt;/STRONG&gt; file&lt;/P&gt;&lt;P&gt;what we have is only the file - &lt;STRONG&gt;edits_inprogress_0000000000018783114.empty&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;any idea how to recover the - edits_inprogress_xxxxxx file&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;PRE&gt;2019-06-13 19:45:42,473 FATAL namenode.FSEditLog (JournalSet.java:mapJournalsAndReportErrors(398)) - Error: recoverUnfinalizedSegments failed for required journal (JournalAndStream(mgr=QJM to&amp;nbsp;
java.io.IOException: Timed out waiting 120000ms for a quorum of nodes to respond.
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.qjournal.client.AsyncLoggerSet.waitForWriteQuorum(AsyncLoggerSet.java:137)
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.qjournal.client.QuorumJournalManager.createNewUniqueEpoch(QuorumJournalManager.java:183)
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.qjournal.client.QuorumJournalManager.recoverUnfinalizedSegments(QuorumJournalManager.java:436)
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.namenode.JournalSet$8.apply(JournalSet.java:624)
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.namenode.JournalSet.mapJournalsAndReportErrors(JournalSet.java:393)
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.namenode.JournalSet.recoverUnfinalizedSegments(JournalSet.java:621)
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.namenode.FSEditLog.recoverUnclosedStreams(FSEditLog.java:1521)
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.startActiveServices(FSNamesystem.java:1196)
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.namenode.NameNode$NameNodeHAContext.startActiveServices(NameNode.java:1951)
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.namenode.ha.ActiveState.enterState(ActiveState.java:61)
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.namenode.ha.HAState.setStateInternal(HAState.java:64)
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.namenode.ha.StandbyState.setState(StandbyState.java:49)
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.namenode.NameNode.transitionToActive(NameNode.java:1807)
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.namenode.NameNodeRpcServer.transitionToActive(NameNodeRpcServer.java:1656)
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.ha.protocolPB.HAServiceProtocolServerSideTranslatorPB.transitionToActive(HAServiceProtocolServerSideTranslatorPB.java:107)
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.ha.proto.HAServiceProtocolProtos$HAServiceProtocolService$2.callBlockingMethod(HAServiceProtocolProtos.java:4460)
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:640)
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:982)
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2351)
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2347)
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at java.security.AccessController.doPrivileged(Native Method)
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at javax.security.auth.Subject.doAs(Subject.java:422)
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1869)
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.ipc.Server$Handler.run(Server.java:2347)
2019-06-13 19:45:42,476 INFO &amp;nbsp;util.ExitUtil (ExitUtil.java:terminate(124)) - Exiting with status 1
&lt;/PRE&gt;</description>
      <pubDate>Fri, 14 Jun 2019 01:42:34 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/how-to-recover-the-edits-inprogress-xxxxxx-file/m-p/242502#M204303</guid>
      <dc:creator>mike_bronson7</dc:creator>
      <dc:date>2019-06-14T01:42:34Z</dc:date>
    </item>
    <item>
      <title>Re: how to recover the edits_inprogress_xxxxxx file ?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/how-to-recover-the-edits-inprogress-xxxxxx-file/m-p/242503#M204304</link>
      <description>&lt;P&gt;&lt;A rel="noopener noreferrer noopener noreferrer" href="http://@Michael%20Bronson" target="_blank"&gt;&lt;EM&gt;@Michael Bronson&lt;/EM&gt;&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;Yes its possible to recover from this situation, which happens sometimes in a Namenode HA setup. Journal nodes are distributed system to store edits. Active Namenode as a client writes edits to journal nodes and commit only when it's replicated to all the journal nodes in a distributed system. Standby NN needs to read data from edits to be in sync with Active one. It can read from any of the replica stored on journal nodes.&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;ZKFC will make sure that only one Namenode should be active at a time. However, when a failover occurs, it is still possible that the previous Active NameNode could serve read requests to clients, which may be out of date until that NameNode shuts down when trying to write to the JournalNodes. For this reason, we should configure fencing methods even when using the Quorum Journal Manager.&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;To work with a fencing journal manager uses epoc numbers. Epoc numbers are an integer which always gets increased and have unique value once assigned. Namenode generates epoc number using a simple algorithm and uses it while sending RPC requests to the QJM. When you configure Namenode HA, the first Active Namenode will get epoc value 1. In case of failover or restart, epoc number will get increased. The Namenode with higher epoc number is considered as newer than any Namenode with an earlier epoc number.&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;Now let's proceed with the real case, note the hostname of the healthy namenode&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;You will need to proceed as follows assuming you are logged on as root here is How do I fix one corrupted JN's edits? &lt;/EM&gt;&lt;/P&gt;&lt;PRE&gt;&lt;EM&gt;# su - hdfs&lt;/EM&gt;&lt;/PRE&gt;&lt;P&gt;&lt;EM&gt;1) Put both NN in safemode ( NN HA) &lt;/EM&gt;&lt;/P&gt;&lt;PRE&gt;&lt;EM&gt;$ hdfs dfsadmin -safemode enter &lt;/EM&gt;&lt;/PRE&gt;&lt;P&gt;&lt;EM&gt;&lt;STRONG&gt;Sample output&lt;/STRONG&gt;&lt;/EM&gt;&lt;/P&gt;&lt;PRE&gt;&lt;EM&gt;Safe mode is ON in namenode1/xxx.xxx.xx.xx:8020
Safe mode is ON in namenode2/xxx.xxx.xx.xx:8020&lt;/EM&gt;&lt;/PRE&gt;&lt;P&gt;&lt;EM&gt;2) Save Namespace &lt;/EM&gt;&lt;/P&gt;&lt;PRE&gt;&lt;EM&gt;$ hdfs dfsadmin -saveNamespace &lt;/EM&gt;&lt;/PRE&gt;&lt;P&gt;&lt;EM&gt;3) On the non-working name node change directory to &lt;STRONG&gt;/hadoop/hdfs/journal/hdfsha/current/*&lt;/STRONG&gt; Get the epoch and note the number it should be lower than the in the working name node cross check&lt;/EM&gt;&lt;/P&gt;&lt;PRE&gt;&lt;EM&gt;$ cat last-promised-epoch&lt;/EM&gt;&lt;/PRE&gt;&lt;P&gt;&lt;EM&gt;4) On the non-working name node backup all the files in journal dir &lt;STRONG&gt;/hadoop/hdfs/journal/hdfsha/current/*&lt;/STRONG&gt; they should look like below&lt;/EM&gt;&lt;/P&gt;&lt;PRE&gt;&lt;EM&gt;-rw-r--r-- 1 hdfs hadoop 1019566 Jun 10 09:45 edits_0000000000000928232-0000000000000935461
-rw-r--r-- 1 hdfs hadoop 1014516 Jun 10 15:45 edits_0000000000000935462-0000000000000942657
-rw-r--r-- 1 hdfs hadoop 1017540 Jun 10 21:46 edits_0000000000000942658-0000000000000949874
-rw-r--r-- 1 hdfs hadoop 1048576 Jun 10 23:36 edits_0000000000000949875-0000000000000952088
-rw-r--r-- 1 hdfs hadoop 1048576 Jun 13 22:27 edits_inprogress_0000000000000952089
-rw-r--r-- 1 hdfs hadoop &amp;nbsp;277083 Jun 10 21:46 fsimage_0000000000000949874
-rw-r--r-- 1 hdfs hadoop &amp;nbsp; &amp;nbsp; &amp;nbsp;62 Jun 10 21:46 fsimage_0000000000000949874.md5
-rw-r--r-- 1 hdfs hadoop &amp;nbsp;276740 Jun 13 22:13 fsimage_0000000000000952088
-rw-r--r-- 1 hdfs hadoop &amp;nbsp; &amp;nbsp; &amp;nbsp;62 Jun 13 22:13 fsimage_0000000000000952088.md5
-rw-r--r-- 1 hdfs hadoop &amp;nbsp; &amp;nbsp; &amp;nbsp; 7 Jun 13 22:13 seen_txid
-rw-r--r-- 1 hdfs hadoop &amp;nbsp; &amp;nbsp; 206 Jun 13 22:13 VERSION&lt;/EM&gt;&lt;/PRE&gt;&lt;P&gt;&lt;EM&gt;5) While in the current directory backup all the files note the (.) indicating current dir&lt;/EM&gt;&lt;/P&gt;&lt;PRE&gt;&lt;EM&gt;$ tar -zcvf editsbck.tar.gz .&lt;/EM&gt;&lt;/PRE&gt;&lt;P&gt;&lt;EM&gt;6) Move the editsbck.tar.gz to a safe location&lt;/EM&gt;&lt;/P&gt;&lt;PRE&gt;&lt;EM&gt;$ scp editsbck.tar.gz &amp;nbsp;/home/bronson &lt;/EM&gt;&lt;/PRE&gt;&lt;P&gt;&lt;EM&gt;7) Backup or move any directory therein eg &lt;/EM&gt;&lt;/P&gt;&lt;PRE&gt;&lt;EM&gt;$ mv paxos paxos.bck&lt;/EM&gt;&lt;/PRE&gt;&lt;P&gt;&lt;EM&gt;&lt;span class="lia-unicode-emoji" title=":smiling_face_with_sunglasses:"&gt;😎&lt;/span&gt; Delete all files in the &lt;STRONG&gt;/hadoop/hdfs/journal/hdfsha/current/&lt;/STRONG&gt; on the bad node remember you have a backup editsbck.tar.gz &lt;/EM&gt;&lt;/P&gt;&lt;PRE&gt;&lt;EM&gt;$ rm -rf /hadoop/hdfs/journal/hdfsha/current/*&lt;/EM&gt;&lt;/PRE&gt;&lt;P&gt;&lt;EM&gt;9) zip or tar the journal dir from a working JN node&lt;STRONG&gt; /hadoop/hdfs/journal/hdfsha/current/*&lt;/STRONG&gt;  &lt;/EM&gt;&lt;/P&gt;&lt;PRE&gt;&lt;EM&gt;$ tar -zcvf good_editsbck.tar.gz&lt;/EM&gt;&lt;/PRE&gt;&lt;P&gt;&lt;EM&gt;10) Copy and unzip/untar the good_editsbck.tar.gz to the non-working JN node to same path as the working namenode /hadoop/hdfs/journal/hdfsha/current/&lt;/EM&gt;&lt;/P&gt;&lt;PRE&gt;&lt;EM&gt;# scp good_editsbck.tar.gz root@namenode2:/hadoop/hdfs/journal/hdfsha/current/&lt;/EM&gt;&lt;/PRE&gt;&lt;P&gt;&lt;EM&gt;11) Unzip the files&lt;/EM&gt;&lt;/P&gt;&lt;PRE&gt;&lt;EM&gt;# tar xvzf something.tar.gz &amp;nbsp;-C /hadoop/hdfs/journal/hdfsha/current/&lt;/EM&gt;&lt;/PRE&gt;&lt;P&gt;&lt;EM&gt;12) Chown ownership to hdfs the &lt;STRONG&gt;-R&lt;/STRONG&gt; recursive in case you have directories&lt;/EM&gt;&lt;/P&gt;&lt;PRE&gt;&lt;EM&gt;# chown -R hdfs:hadoop /hadoop/hdfs/journal/hdfsha/current/*&lt;/EM&gt;&lt;/PRE&gt;&lt;P&gt;&lt;EM&gt;&lt;STRONG&gt;Log on the unhealthy name node &lt;/STRONG&gt;&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;13) Restarting the journal nodes&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;Start all 3 journal nodes note I run the command like root if the were running stop you will see &lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;&lt;STRONG&gt;journal node running as process xxxx. Stop it first.&lt;/STRONG&gt;&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;14) Stopping journal node&lt;/EM&gt;&lt;/P&gt;&lt;PRE&gt;&lt;EM&gt;# su -l hdfs -c "/usr/hdp/current/hadoop-hdfs-journalnode/../hadoop/sbin/hadoop-daemon.sh stop journalnode"&lt;/EM&gt;&lt;/PRE&gt;&lt;P&gt;&lt;EM&gt;15) Starting journal node&lt;/EM&gt;&lt;/P&gt;&lt;PRE&gt;&lt;EM&gt;# su -l hdfs -c "/usr/hdp/current/hadoop-hdfs-journalnode/../hadoop/sbin/hadoop-daemon.sh start journalnode"&lt;/EM&gt;&lt;/PRE&gt;&lt;P&gt;&lt;EM&gt;&lt;STRONG&gt;Restart HDFS from Ambari UI&lt;/STRONG&gt;&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;After some minutes the alerts should go and you should see a healthy Active &amp;amp; standby Namenodes. All should be fine now, the NameNode failover should now occur transparently and the below alerts should gradually disappear&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;HTH&lt;/EM&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 14 Jun 2019 04:38:08 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/how-to-recover-the-edits-inprogress-xxxxxx-file/m-p/242503#M204304</guid>
      <dc:creator>Shelton</dc:creator>
      <dc:date>2019-06-14T04:38:08Z</dc:date>
    </item>
    <item>
      <title>Re: how to recover the edits_inprogress_xxxxxx file ?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/how-to-recover-the-edits-inprogress-xxxxxx-file/m-p/242504#M204305</link>
      <description>&lt;P&gt;@Geoffrey Shelton Okot  thank you so much for your effort and for your time , tomorrow I will do your steps , in spite I still not do the tests ,  I will choose your answer as accept , and if I have comments then I will share them , again many many thanks &lt;/P&gt;</description>
      <pubDate>Fri, 14 Jun 2019 05:05:38 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/how-to-recover-the-edits-inprogress-xxxxxx-file/m-p/242504#M204305</guid>
      <dc:creator>mike_bronson7</dc:creator>
      <dc:date>2019-06-14T05:05:38Z</dc:date>
    </item>
    <item>
      <title>Re: how to recover the edits_inprogress_xxxxxx file ?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/how-to-recover-the-edits-inprogress-xxxxxx-file/m-p/242505#M204306</link>
      <description>&lt;P&gt;@Geoffrey Shelton Okot  - about &lt;EM&gt;cat last-promised-epoch&lt;/EM&gt; , I have the number 31 in non working node , do you mean that I need to decrease it to 30? &lt;/P&gt;</description>
      <pubDate>Fri, 14 Jun 2019 13:06:14 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/how-to-recover-the-edits-inprogress-xxxxxx-file/m-p/242505#M204306</guid>
      <dc:creator>mike_bronson7</dc:creator>
      <dc:date>2019-06-14T13:06:14Z</dc:date>
    </item>
    <item>
      <title>Re: how to recover the edits_inprogress_xxxxxx file ?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/how-to-recover-the-edits-inprogress-xxxxxx-file/m-p/242506#M204307</link>
      <description>&lt;P&gt;&lt;A rel="noopener noreferrer noopener noreferrer noopener noreferrer noopener noreferrer" href="http://michael%20bronson/" target="_blank"&gt;&lt;EM&gt;@Michael Bronson&lt;/EM&gt;&lt;/A&gt;&lt;EM&gt; &lt;BR /&gt;&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;Can you confirm all the other 2 journal nodes have the last-promised-epoch of 30?  That when the failure occurred, it's okay to replace the contents of the &lt;STRONG&gt;&lt;EM&gt;/hadoop/hdfs/journal/hdfsha/current/*&lt;/EM&gt; &lt;/STRONG&gt;with the contents of the good(active) namenode.&lt;/EM&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;EM&gt;Then proceed with the subsequent steps&lt;/EM&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 14 Jun 2019 13:34:48 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/how-to-recover-the-edits-inprogress-xxxxxx-file/m-p/242506#M204307</guid>
      <dc:creator>Shelton</dc:creator>
      <dc:date>2019-06-14T13:34:48Z</dc:date>
    </item>
    <item>
      <title>Re: how to recover the edits_inprogress_xxxxxx file ?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/how-to-recover-the-edits-inprogress-xxxxxx-file/m-p/242507#M204308</link>
      <description>&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;on the good name node we have the number 31&lt;/P&gt;&lt;P&gt;on the bad name node we have also the number 31&lt;/P&gt;&lt;P&gt;on the other &lt;EM&gt;journal node we have the number 28&lt;/EM&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 14 Jun 2019 13:43:29 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/how-to-recover-the-edits-inprogress-xxxxxx-file/m-p/242507#M204308</guid>
      <dc:creator>mike_bronson7</dc:creator>
      <dc:date>2019-06-14T13:43:29Z</dc:date>
    </item>
    <item>
      <title>Re: how to recover the edits_inprogress_xxxxxx file ?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/how-to-recover-the-edits-inprogress-xxxxxx-file/m-p/242508#M204309</link>
      <description>&lt;P&gt;so according to this info do you recommended to set the value as is ( 31 ) or other ?&lt;/P&gt;</description>
      <pubDate>Fri, 14 Jun 2019 13:44:52 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/how-to-recover-the-edits-inprogress-xxxxxx-file/m-p/242508#M204309</guid>
      <dc:creator>mike_bronson7</dc:creator>
      <dc:date>2019-06-14T13:44:52Z</dc:date>
    </item>
    <item>
      <title>Re: how to recover the edits_inprogress_xxxxxx file ?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/how-to-recover-the-edits-inprogress-xxxxxx-file/m-p/242509#M204310</link>
      <description>&lt;P&gt;&lt;A rel="noopener noreferrer noopener noreferrer noopener noreferrer noopener noreferrer noopener noreferrer noopener noreferrer" href="http://michael%20bronson/" target="_blank"&gt;&lt;EM&gt;@Michael Bronson&lt;/EM&gt;&lt;/A&gt;&lt;EM&gt; &lt;/EM&gt; &lt;/P&gt;&lt;P&gt;&lt;EM&gt;Get the journal node that is healthy  (active namenode)  aftter saving the &lt;STRONG&gt;Namespace&lt;/STRONG&gt; you also wipe out the other journal node which had &lt;STRONG&gt;edits_inprogress_0000000000018783114.empty&lt;/STRONG&gt;  remember to backup/zip all the journalnodes as good practice &lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;Once you have copied the good to all the 3 destinations proceed  and when you start the namenode after staring the journalnode one should become active and the other standby  thanks to ZKFailover.&lt;/EM&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 14 Jun 2019 14:57:37 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/how-to-recover-the-edits-inprogress-xxxxxx-file/m-p/242509#M204310</guid>
      <dc:creator>Shelton</dc:creator>
      <dc:date>2019-06-14T14:57:37Z</dc:date>
    </item>
    <item>
      <title>Re: how to recover the edits_inprogress_xxxxxx file ?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/how-to-recover-the-edits-inprogress-xxxxxx-file/m-p/242510#M204311</link>
      <description>&lt;P&gt;&lt;A rel="noopener noreferrer noopener noreferrer noopener noreferrer noopener noreferrer noopener noreferrer noopener noreferrer" href="http://michael%20bronson/" target="_blank"&gt;&lt;EM&gt;@Michael Bronson&lt;/EM&gt;&lt;/A&gt;&lt;EM&gt; &lt;/EM&gt; &lt;/P&gt;&lt;P&gt;&lt;EM&gt;Is all good?&lt;/EM&gt;&lt;/P&gt;</description>
      <pubDate>Sat, 15 Jun 2019 02:30:22 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/how-to-recover-the-edits-inprogress-xxxxxx-file/m-p/242510#M204311</guid>
      <dc:creator>Shelton</dc:creator>
      <dc:date>2019-06-15T02:30:22Z</dc:date>
    </item>
    <item>
      <title>Re: how to recover the edits_inprogress_xxxxxx file ?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/how-to-recover-the-edits-inprogress-xxxxxx-file/m-p/242511#M204312</link>
      <description>&lt;P&gt;@&lt;A rel="user" href="https://community.hortonworks.com/users/1271/sheltong.html"&gt;Geoffrey Shelton Okot&lt;/A&gt;  no both namenode started as standby and then the namenode stooped &lt;/P&gt;</description>
      <pubDate>Sun, 16 Jun 2019 11:02:12 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/how-to-recover-the-edits-inprogress-xxxxxx-file/m-p/242511#M204312</guid>
      <dc:creator>mike_bronson7</dc:creator>
      <dc:date>2019-06-16T11:02:12Z</dc:date>
    </item>
    <item>
      <title>Re: how to recover the edits_inprogress_xxxxxx file ?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/how-to-recover-the-edits-inprogress-xxxxxx-file/m-p/242512#M204313</link>
      <description>&lt;P&gt;also I cant start the journal node ( on the bade namenode )&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;PRE&gt;2019-06-16 05:29:39,734 WARN &amp;nbsp;namenode.FSImage (EditLogFileInputStream.java:scanEditLog(359)) - Caught exception after scanning through 0 ops from /hadoop/hdfs/journal/hdfsha/current/edits_inprogress_0000000000018783114 while determining its valid length. Position was 1032192
java.io.IOException: Can't scan a pre-transactional edit log.
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.namenode.FSEditLogOp$LegacyReader.scanOp(FSEditLogOp.java:4974)
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.namenode.EditLogFileInputStream.scanNextOp(EditLogFileInputStream.java:245)
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.namenode.EditLogFileInputStream.scanEditLog(EditLogFileInputStream.java:355)
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.namenode.FileJournalManager$EditLogFile.scanLog(FileJournalManager.java:551)
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.qjournal.server.Journal.scanStorageForLatestEdits(Journal.java:192)
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.qjournal.server.Journal.&amp;lt;init&amp;gt;(Journal.java:152)
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.qjournal.server.JournalNode.getOrCreateJournal(JournalNode.java:90)
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.qjournal.server.JournalNode.getOrCreateJournal(JournalNode.java:99)
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.qjournal.server.JournalNodeRpcServer.getJournalState(JournalNodeRpcServer.java:127)
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.qjournal.protocolPB.QJournalProtocolServerSideTranslatorPB.getJournalState(QJournalProtocolServerSideTranslatorPB.java:118)
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.qjournal.protocol.QJournalProtocolProtos$QJournalProtocolService$2.callBlockingMethod(QJournalProtocolProtos.java:25415)
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:640)
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:982)
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2351)
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2347)
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at java.security.AccessController.doPrivileged(Native Method)
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at javax.security.auth.Subject.doAs(Subject.java:422)
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1869)
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.ipc.Server$Handler.run(Server.java:2347)
2019-06-16 05:29:39,734 WARN &amp;nbsp;namenode.FSImage (EditLogFileInputStream.java:scanEditLog(364)) - After resync, position is 1032192&lt;/PRE&gt;</description>
      <pubDate>Sun, 16 Jun 2019 12:32:18 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/how-to-recover-the-edits-inprogress-xxxxxx-file/m-p/242512#M204313</guid>
      <dc:creator>mike_bronson7</dc:creator>
      <dc:date>2019-06-16T12:32:18Z</dc:date>
    </item>
  </channel>
</rss>

