Support Questions
Find answers, ask questions, and share your expertise
Announcements
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

JournalNode ( HDFS ) restarting all the time

Solved Go to solution
Highlighted

Re: JournalNode ( HDFS ) restarting all the time

now the journalnode stop for restarting when I start the matrix , bur when now I start the name-node we get

- ERROR namenode.NameNode (NameNode.java:main(1774)) - Failed to start namenode.
java.io.FileNotFoundException: No valid image files found
Michael-Bronson
Highlighted

Re: JournalNode ( HDFS ) restarting all the time

in the GUI - I search the

Custom hadoop-metrics2.properties , and I see only Add Property ... thats all !

Michael-Bronson
Highlighted

Re: JournalNode ( HDFS ) restarting all the time

Contributor

Can you redeploy the HA and see if there were any steps that you missed during the HA enabling process. Please follow the steps suggested by Hortonworks.

https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.6.4/bk_hadoop-high-availability/content/ch_HA-N...

Highlighted

Re: JournalNode ( HDFS ) restarting all the time

@Jay but as you know the main problem now is that we cant start the namenode on both machines ,

Michael-Bronson
Highlighted

Re: JournalNode ( HDFS ) restarting all the time

@JAY what we see from the log when we start the name node on master01/03 is that

ERROR namenode.NameNode (NameNode.java:main(1774)) - Failed to start namenode.
java.io.FileNotFoundException: No valid image files found
Michael-Bronson
Highlighted

Re: JournalNode ( HDFS ) restarting all the time

Super Mentor

@Michael Bronson

As you are getting the error:

ERROR namenode.NameNode (NameNode.java:main(1774)) - Failed to start namenode.
java.io.FileNotFoundException: No valid image files found

.

So can you please check of the following directory has any fsimage in it or not? Also if the fsimage file has proper read permission as following or not?

Example:

# ls -l /hadoop/hdfs/namenode/current/fsimage*

-rw-r--r--. 1 hdfs hadoop 195873 Jan 22 20:05 /hadoop/hdfs/namenode/current/fsimage_0000000000002711213
-rw-r--r--. 1 hdfs hadoop     62 Jan 22 20:05 /hadoop/hdfs/namenode/current/fsimage_0000000000002711213.md5
-rw-r--r--. 1 hdfs hadoop 195873 Jan 23 02:05 /hadoop/hdfs/namenode/current/fsimage_0000000000002718519
-rw-r--r--. 1 hdfs hadoop     62 Jan 23 02:05 /hadoop/hdfs/namenode/current/fsimage_0000000000002718519.md5

.

View solution in original post

Highlighted

Re: JournalNode ( HDFS ) restarting all the time

 I get that:



ls -l /hadoop/hdfs/journal/hdfsha/current/fsimage*
ls: cannot access /hadoop/hdfs/journal/hdfsha/current/fsimage*: No such file or directory
Michael-Bronson
Highlighted

Re: JournalNode ( HDFS ) restarting all the time

Super Mentor

@Michael Bronson

It looks like some one deleted the "fsimage" file from the NameNode host. I am not aware of any hadoop bug which will cause deletion of this file.

So please check the Operating System Audit log to find out who has deleted the file and when?


# less /var/log/audit/audit.log

.

Highlighted

Re: JournalNode ( HDFS ) restarting all the time

Ho NO , any option for backup files or how to restore this ?

Michael-Bronson
Highlighted

Re: JournalNode ( HDFS ) restarting all the time

this huge file what need to search with grep ?

Michael-Bronson
Don't have an account?
Coming from Hortonworks? Activate your account here