About mike_bronson7

mike_bronson7 · ‎10-23-2017

we installed new HDP cluster version 2.6 meanwhile HDFS and yarn are down and when we start the HDFS we get: 2017-10-23 16:39:35,510 ERROR datanode.DataNode (DataNode.java:secureMain(2691)) - Exception in secureMain org.apache.hadoop.util.DiskChecker$DiskErrorException: Too many failed volumes - current valid volumes: 0, volumes configured: 5, volumes failed: 5, volume fail ures tolerated: 0 at org.apache.hadoop.hdfs.server.datanode.checker.StorageLocationChecker.check(StorageLocationChecker.java:216) at org.apache.hadoop.hdfs.server.datanode.DataNode.makeInstance(DataNode.java:2583) at org.apache.hadoop.hdfs.server.datanode.DataNode.instantiateDataNode(DataNode.java:2492) at org.apache.hadoop.hdfs.server.datanode.DataNode.createDataNode(DataNode.java:2539) at org.apache.hadoop.hdfs.server.datanode.DataNode.secureMain(DataNode.java:2684) at org.apache.hadoop.hdfs.server.datanode.DataNode.main(DataNode.java:2708) 2017-10-23 16:39:35,512 INFO util.ExitUtil (ExitUtil.java:terminate(124)) - Exiting with status 1 2017-10-23 16:39:35,515 INFO datanode.DataNode (LogAdapter.java:info(47)) - SHUTDOWN_MSG: what we need to check in this case ?

mike_bronson7 · ‎10-23-2017

Is this a kerberized cluster? - what you mean we have 3 masters machine + 2 workers machines

mike_bronson7 · ‎10-23-2017

Do you have the correct "etc/hosts" - yes

mike_bronson7 · ‎10-23-2017

Was this Node working fine earlier - yes

mike_bronson7 · ‎10-23-2017

we are trying to start the "Standby NameNode (HDFS)" on master01 machine in ambari cluster version 2.6 and we cant start it we get the following logs: ERROR namenode.NameNode (NameNode.java:main(1774)) - Failed to start namenode. org.apache.hadoop.hdfs.server.namenode.EditLogInputException: Error replaying edit log at offset 0. Expected transaction ID was 13361263 at org.apache.hadoop.hdfs.server.namenode.FSEditLogLoader.loadEditRecords(FSEditLogLoader.java:203) at org.apache.hadoop.hdfs.server.namenode.FSEditLogLoader.loadFSEdits(FSEditLogLoader.java:143) at org.apache.hadoop.hdfs.server.namenode.FSImage.loadEdits(FSImage.java:838) at org.apache.hadoop.hdfs.server.namenode.FSImage.loadFSImage(FSImage.java:693) at org.apache.hadoop.hdfs.server.namenode.FSImage.recoverTransitionRead(FSImage.java:289) at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.loadFSImage(FSNamesystem.java:1045) at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.loadFromDisk(FSNamesystem.java:703) at org.apache.hadoop.hdfs.server.namenode.NameNode.loadNamesystem(NameNode.java:688) at org.apache.hadoop.hdfs.server.namenode.NameNode.initialize(NameNode.java:752) at org.apache.hadoop.hdfs.server.namenode.NameNode.<init>(NameNode.java:992) at org.apache.hadoop.hdfs.server.namenode.NameNode.<init>(NameNode.java:976) at org.apache.hadoop.hdfs.server.namenode.NameNode.createNameNode(NameNode.java:1701) at org.apache.hadoop.hdfs.server.namenode.NameNode.main(NameNode.java:1769) Caused by: org.apache.hadoop.hdfs.server.namenode.RedundantEditLogInputStream$PrematureEOFException: got premature end-of-file at txid 13361262; expected file to go up to 13361312 what chould be the problem m and how to fix , so the service will start?

mike_bronson7 · ‎10-20-2017

thank you so much

mike_bronson7 · ‎10-20-2017

we have ambari cluster - 2.6 we restart the masters machine and start the services when we start the zookeeper on master01 machine we get this from the log 2017-10-20 05:43:53,339 - INFO [main:NIOServerCnxnFactory@94] - binding to port 0.0.0.0/0.0.0.0:2181 2017-10-20 05:43:53,342 - ERROR [main:QuorumPeerMain@89] - Unexpected exception, exiting abnormally java.net.BindException: Address already in use at sun.nio.ch.Net.bind0(Native Method) so zookeeper cant start on master01 and master02 machine what chould be the problem here?

mike_bronson7 · ‎10-18-2017

we have 3 masters machines on both masters machine yarn is active insted to be active stand by what is the procedure to change it to yarn standby on master01 and yarn active on master02

mike_bronson7 · ‎10-18-2017

another question , as you know all workers are not appears cluster ( because all master machines are new ) , so not understand how we can use the API in that case

mike_bronson7 · ‎10-18-2017

regarding to two API , can you give me example , because I not sure if some of them need to set as values or should be as default

Online	Offline
Last Visited	‎08-27-2024 09:17 AM

Member Since	‎08-08-2017 09:40 AM
Last Visited	‎08-27-2024 09:17 AM
Posts	1,652
Kudos received	29

Cloudera Community

Re: how to find number of CPU core on datanode ma...

Re: postgresql + ambari server failed to open port...

Re: how to stop the thrift servers by REST API

Re: namenode is in safe mode

Re: Directory /grid/sdg/hadoop/hdfs/data became un...

HDFS not start ( after new cluster installation ...

Re: Standby NameNode cant start in ambari cluster

Re: Standby NameNode cant start in ambari cluster

Re: Standby NameNode cant start in ambari cluster

Standby NameNode cant start in ambari cluster

Re: cant start zookeeper from ambari cluster

cant start zookeeper from ambari cluster

on both masters machine yarn is active how to enab...

Re: how to delete worker node from cluster

Re: how to delete worker node from cluster