Member since 08-08-2017
1652 Posts
30 Kudos Received
11 Solutions
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| | 1964 | 06-15-2020 05:23 AM |
| | 16016 | 01-30-2020 08:04 PM |
| | 2107 | 07-07-2019 09:06 PM |
| | 8236 | 01-27-2018 10:17 PM |
| | 4663 | 12-31-2017 10:12 PM |
01-22-2018 11:24 PM
@Jay, what do you recommend based on the output from grep?
01-22-2018 11:12 PM
Hi Jay, we checked the XML files on both master01 and master03, and they are the same.
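A comparison like the one described above could be scripted roughly as follows. This is only a minimal sketch, assuming Python 3 and that each host's hdfs-site.xml has been copied somewhere readable; the helper names `parse_props`, `diff_props`, and `diff_files` are made up for illustration and are not part of Hadoop or Ambari. It compares property by property, so ordering and whitespace differences between the two files are ignored.

```python
import xml.etree.ElementTree as ET


def parse_props(xml_text):
    """Return {name: value} for every <property> in a Hadoop *-site.xml document."""
    props = {}
    for prop in ET.fromstring(xml_text).iter("property"):
        name = prop.findtext("name")
        if name:
            props[name.strip()] = (prop.findtext("value") or "").strip()
    return props


def diff_props(left, right):
    """Return {name: (left_value, right_value)} for every property that differs.

    A value of None means the property is missing on that side.
    """
    return {
        name: (left.get(name), right.get(name))
        for name in sorted(set(left) | set(right))
        if left.get(name) != right.get(name)
    }


def diff_files(path_a, path_b):
    """Compare two *-site.xml files on disk (the paths are supplied by the caller)."""
    with open(path_a, "rb") as fa, open(path_b, "rb") as fb:
        return diff_props(parse_props(fa.read()), parse_props(fb.read()))
```

An empty result from `diff_files` would mean the two files define the same properties with the same values.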
01-22-2018 11:03 PM
@Jay, do you have any conclusions from the XML?
01-22-2018 10:52 PM
Dear Jay, can you explain how to do this: try disabling NameNode HA and then enabling it back?
01-22-2018 10:51 PM
grep -A 1 'dfs.namenode.http-address' /etc/hadoop/conf/hdfs-site.xml
<name>dfs.namenode.http-address.hdfsha.nn1</name>
<value>master01.sys57.com:50070</value>
--
<name>dfs.namenode.http-address.hdfsha.nn2</name>
<value>master03.sys57.com:50070</value>
01-22-2018 08:22 PM
For some unclear reason, the following services are down and we cannot start them: the standby NameNode and the ZKFailoverController.
NameNode log:
ERROR namenode.NameNode (NameNode.java:main(1774)) - Failed to start namenode.
java.lang.IllegalStateException: Could not determine own NN ID in namespace 'hdfsha'. Please ensure that this node is one of the machines listed as an NN RPC address, or configure dfs.ha.namenode.id
at com.google.common.base.Preconditions.checkState(Preconditions.java:172)
017-12-20 18:57:24,771 INFO zookeeper.ClientCnxn (ClientCnxn.java:logStartConnect(1019)) - Opening socket connection to server master02.sys56.com/100.4.22.18:2181. Will not attempt to authenticate using SASL (unknown error)
2017-12-21 02:48:29,403 INFO zookeeper.ClientCnxn (ClientCnxn.java:logStartConnect(1019)) - Opening socket connection to server master03.sys56.com/100.4.22.18:2181. Will not attempt to authenticate using SASL (unknown error)
CommandLine flags: -XX:CMSInitiatingOccupancyFraction=70 -XX:ErrorFile=/var/log/hadoop/hdfs/hs_err_pid%p.log -XX:InitialHeapSize=10468982784 -XX:MaxHeapSize=10468982784 -XX:MaxNewSize=1308622848 -XX:MaxTenuringThreshold=6 -XX:NewSize=1308622848 -XX:OldPLABSize=16 -XX:OnOutOfMemoryError="/usr/hdp/current/hadoop-hdfs-namenode/bin/kill-name-node" -XX:OnOutOfMemoryError
ambari-server log:
ERROR [ambari-heartbeat-processor-0] HeartbeatProcessor:554 - Operation failed - may be retried. Service component host: ZKFC, host: master03.sys57.com Action id 475-0 and taskId 1659
ZKFailoverController log:
Traceback (most recent call last):
  File "/var/lib/ambari-agent/cache/common-services/HDFS/2.1.0.2.0/package/scripts/zkfc_slave.py", line 230, in <module>
    ZkfcSlave().execute()
  File "/usr/lib/python2.6/site-packages/resource_management/libraries/script/script.py", line 314, in execute
    method(env)
  File "/var/lib/ambari-agent/cache/common-services/HDFS/2.1.0.2.0/package/scripts/zkfc_slave.py", line 70, in start
    ZkfcSlaveDefault.start_static(env, upgrade_type)
  File "/var/lib/ambari-agent/cache/common-services/HDFS/2.1.0.2.0/package/scripts/zkfc_slave.py", line 92, in start_static
    raise Fail("Could not initialize HA state in zookeeper")
resource_management.core.exceptions.Fail: Could not initialize HA state in zookeeper
2018-01-22 19:48:41,824 - HA state initialization in ZooKeeper failed with 1 error code. Will retry
Command failed after 1 tries
Please advise how to bring both services back up.
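The "Could not determine own NN ID" message in the NameNode log asks that this node be one of the machines listed as an NN RPC address, or that dfs.ha.namenode.id be set. A rough way to check that on the failing host might look like the sketch below. It assumes Python 3 and only compares host name strings, which is an approximation of what the NameNode does, so a match here is a hint and not proof. The helper names (`parse_props`, `nn_hosts`, `own_nn_id`, `local_names`) are hypothetical and not part of Hadoop or Ambari.

```python
import socket
import xml.etree.ElementTree as ET


def parse_props(xml_text):
    """Return {name: value} for every <property> in a Hadoop *-site.xml document."""
    props = {}
    for prop in ET.fromstring(xml_text).iter("property"):
        name = prop.findtext("name")
        if name:
            props[name.strip()] = (prop.findtext("value") or "").strip()
    return props


def nn_hosts(props, nameservice):
    """Return {nn_id: host} built from dfs.ha.namenodes.<ns> and
    dfs.namenode.rpc-address.<ns>.<nn_id>."""
    hosts = {}
    for nn_id in props.get("dfs.ha.namenodes." + nameservice, "").split(","):
        nn_id = nn_id.strip()
        if not nn_id:
            continue
        key = "dfs.namenode.rpc-address.{0}.{1}".format(nameservice, nn_id)
        addr = props.get(key, "")
        # Drop the ":port" suffix and normalise case before comparing.
        hosts[nn_id] = addr.rsplit(":", 1)[0].strip().lower()
    return hosts


def own_nn_id(props, nameservice, names):
    """Return the NN ID for this machine, or None if it cannot be determined.

    An explicit dfs.ha.namenode.id wins; otherwise exactly one configured
    RPC host must equal one of the given local host names.
    """
    explicit = props.get("dfs.ha.namenode.id")
    if explicit:
        return explicit
    local = set(n.lower() for n in names)
    matches = [i for i, h in nn_hosts(props, nameservice).items() if h and h in local]
    return matches[0] if len(matches) == 1 else None


def local_names():
    """Host names this machine reports for itself (short name and FQDN)."""
    return {socket.gethostname(), socket.getfqdn()}


# Example use on the failing host (path and nameservice taken from this thread):
#   with open("/etc/hadoop/conf/hdfs-site.xml", "rb") as f:
#       print(own_nn_id(parse_props(f.read()), "hdfsha", local_names()))
```

If this prints None on the standby NameNode host, the host name it reports for itself does not match any configured dfs.namenode.rpc-address entry, which would be worth comparing against the values in hdfs-site.xml.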
Labels: Apache Ambari, Apache Hadoop
01-19-2018 02:31 PM
We run an Ambari cluster with Hadoop 2.6 on Red Hat Linux 7.3 machines. After a few months of experience with Ambari and Hadoop 2.6, we have found some tools and scripts that help maintain them, but I am sure we have not covered all of them. If possible, we would like a list of tools, scripts, or third-party software that can help with the following tasks:
1. Ambari server / Hadoop maintenance
2. Verification/sanity utilities or scripts for both Ambari and Hadoop 2.6
3. Utilities or scripts that can help trace a problem
4. etc.
I would be happy to get information about these.
Labels: Apache Ambari, Apache Hadoop
01-19-2018 02:16 PM
Thank you for the long explanation. Regarding the PPT, I just wanted to save the time of creating this doc :), so I hoped that something already existed on the web.
01-19-2018 02:14 PM
Hi, the problem was in the ambari.repo and Hadoop repo files under the /etc/yum.repos.d folder. You said that the repo is also defined in the Ambari GUI? This is new to me. Can you show me how to access the repo from the Ambari GUI?
01-16-2018 06:00 PM
Hi, I need to give a simple course about the machines in an Ambari cluster: the master machines and the worker/Kafka machines. I would be happy to get a simple doc (preferably PowerPoint) that describes the nodes in the Ambari cluster and each node's purpose, including examples and a diagram, and the relationship between the master and worker/Kafka machines. The goal is to give a basic idea to employees who are new to the Hadoop world.
Labels: Apache Ambari, Apache Hadoop