Member since
08-08-2017
1652
Posts
30
Kudos Received
11
Solutions
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| 1970 | 06-15-2020 05:23 AM | |
| 16071 | 01-30-2020 08:04 PM | |
| 2108 | 07-07-2019 09:06 PM | |
| 8250 | 01-27-2018 10:17 PM | |
| 4676 | 12-31-2017 10:12 PM |
10-23-2017
05:21 PM
we installed new HDP cluster version 2.6 meanwhile HDFS and yarn are down and when we start the HDFS we get: 2017-10-23 16:39:35,510 ERROR datanode.DataNode (DataNode.java:secureMain(2691)) - Exception in secureMain
org.apache.hadoop.util.DiskChecker$DiskErrorException: Too many failed volumes - current valid volumes: 0, volumes configured: 5, volumes failed: 5, volume fail
ures tolerated: 0
at org.apache.hadoop.hdfs.server.datanode.checker.StorageLocationChecker.check(StorageLocationChecker.java:216)
at org.apache.hadoop.hdfs.server.datanode.DataNode.makeInstance(DataNode.java:2583)
at org.apache.hadoop.hdfs.server.datanode.DataNode.instantiateDataNode(DataNode.java:2492)
at org.apache.hadoop.hdfs.server.datanode.DataNode.createDataNode(DataNode.java:2539)
at org.apache.hadoop.hdfs.server.datanode.DataNode.secureMain(DataNode.java:2684)
at org.apache.hadoop.hdfs.server.datanode.DataNode.main(DataNode.java:2708)
2017-10-23 16:39:35,512 INFO util.ExitUtil (ExitUtil.java:terminate(124)) - Exiting with status 1
2017-10-23 16:39:35,515 INFO datanode.DataNode (LogAdapter.java:info(47)) - SHUTDOWN_MSG: what we need to check in this case ?
... View more
Labels:
- Labels:
-
Apache Ambari
-
Apache Hadoop
10-23-2017
05:15 PM
Is this a kerberized cluster? - what you mean we have 3 masters machine + 2 workers machines
... View more
10-23-2017
05:14 PM
Do you have the correct "etc/hosts" - yes
... View more
10-23-2017
05:12 PM
Was this Node working fine earlier - yes
... View more
10-23-2017
04:22 PM
we are trying to start the "Standby NameNode (HDFS)" on master01 machine in ambari cluster version 2.6 and we cant start it we get the following logs: ERROR namenode.NameNode (NameNode.java:main(1774)) - Failed to start namenode.
org.apache.hadoop.hdfs.server.namenode.EditLogInputException: Error replaying edit log at offset 0. Expected transaction ID was 13361263
at org.apache.hadoop.hdfs.server.namenode.FSEditLogLoader.loadEditRecords(FSEditLogLoader.java:203)
at org.apache.hadoop.hdfs.server.namenode.FSEditLogLoader.loadFSEdits(FSEditLogLoader.java:143)
at org.apache.hadoop.hdfs.server.namenode.FSImage.loadEdits(FSImage.java:838)
at org.apache.hadoop.hdfs.server.namenode.FSImage.loadFSImage(FSImage.java:693)
at org.apache.hadoop.hdfs.server.namenode.FSImage.recoverTransitionRead(FSImage.java:289)
at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.loadFSImage(FSNamesystem.java:1045)
at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.loadFromDisk(FSNamesystem.java:703)
at org.apache.hadoop.hdfs.server.namenode.NameNode.loadNamesystem(NameNode.java:688)
at org.apache.hadoop.hdfs.server.namenode.NameNode.initialize(NameNode.java:752)
at org.apache.hadoop.hdfs.server.namenode.NameNode.<init>(NameNode.java:992)
at org.apache.hadoop.hdfs.server.namenode.NameNode.<init>(NameNode.java:976)
at org.apache.hadoop.hdfs.server.namenode.NameNode.createNameNode(NameNode.java:1701)
at org.apache.hadoop.hdfs.server.namenode.NameNode.main(NameNode.java:1769)
Caused by: org.apache.hadoop.hdfs.server.namenode.RedundantEditLogInputStream$PrematureEOFException: got premature end-of-file at txid 13361262; expected file to go up to 13361312 what chould be the problem m and how to fix , so the service will start?
... View more
Labels:
- Labels:
-
Apache Ambari
-
Apache Hadoop
10-20-2017
08:05 AM
thank you so much
... View more
10-20-2017
05:50 AM
we have ambari cluster - 2.6 we restart the masters machine and start the services when we start the zookeeper on master01 machine we get this from the log 2017-10-20 05:43:53,339 - INFO [main:NIOServerCnxnFactory@94] - binding to port 0.0.0.0/0.0.0.0:2181
2017-10-20 05:43:53,342 - ERROR [main:QuorumPeerMain@89] - Unexpected exception, exiting abnormally
java.net.BindException: Address already in use
at sun.nio.ch.Net.bind0(Native Method) so zookeeper cant start on master01 and master02 machine what chould be the problem here?
... View more
Labels:
- Labels:
-
Apache Ambari
-
Apache Hadoop
10-18-2017
03:27 PM
we have 3 masters machines on both masters machine yarn is active insted to be active stand by what is the procedure to change it to yarn standby on master01 and yarn active on master02
... View more
Labels:
- Labels:
-
Apache Ambari
-
Apache YARN
10-18-2017
08:58 AM
another question , as you know all workers are not appears cluster ( because all master machines are new ) , so not understand how we can use the API in that case
... View more
10-18-2017
08:50 AM
regarding to two API , can you give me example , because I not sure if some of them need to set as values or should be as default
... View more