Member since 04-06-2018 | 7 Posts | 2 Kudos Received | 1 Solution
My Accepted Solutions
Title | Views | Posted |
---|---|---|
3325 | 04-08-2018 03:32 PM |
04-18-2018
02:10 PM
1 Kudo
*Update* It turned out to be a hostname issue. The hostname -f command returned localhost instead of the FQDN, which is odd because both hostname and hostnamectl showed the FQDN. Updating the hosts file did the trick. Instead of

127.0.0.1 localhost localhost.localdomain localhost4 localhost4.localdomain4 FQDN

I changed the entry to

127.0.0.1 FQDN localhost localhost.localdomain localhost4 localhost4.localdomain4

This fixed the output of hostname -f, and the server registration completed successfully.
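To illustrate why the order matters: as far as I understand it, the resolver treats the first name after the IP address on a hosts line as the canonical name, and the remaining names as aliases. The sketch below only parses a line of text to show which name would be picked. The helper canonical_name and the host name node1.example.com are made up for illustration and stand in for the real FQDN.

```python
def canonical_name(hosts_line):
    """Return the first hostname after the IP on an /etc/hosts style line, or None."""
    # Drop any trailing comment, then split the rest on whitespace.
    fields = hosts_line.split("#", 1)[0].split()
    # fields[0] is the IP address; fields[1], if present, is the canonical name.
    return fields[1] if len(fields) >= 2 else None


# node1.example.com is a hypothetical placeholder for the real FQDN.
old_line = ("127.0.0.1 localhost localhost.localdomain "
            "localhost4 localhost4.localdomain4 node1.example.com")
new_line = ("127.0.0.1 node1.example.com localhost localhost.localdomain "
            "localhost4 localhost4.localdomain4")

print(canonical_name(old_line))  # localhost
print(canonical_name(new_line))  # node1.example.com
```

With the old ordering the FQDN is only an alias of localhost, which matches what hostname -f reported before the change.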
04-18-2018
01:29 PM
1 Kudo
I am facing the exact same issue; I asked a new question before I saw this one. In my case the versions are the same, but the problem persists. I could not find anything useful in the server or agent logs, so I decided to start from scratch: I removed ambari-agent and all of its files from the new node and tried to install it from the Ambari UI. It now errors out. The server log shows the following:
18 Apr 2018 13:22:52,496 INFO [pool-18-thread-1] BSHostStatusCollector:55 - Request directory /var/run/ambari-server/bootstrap/2
18 Apr 2018 13:22:52,496 INFO [pool-18-thread-1] BSHostStatusCollector:62 - HostList for polling on [ambaridn01.informatica.com]
18 Apr 2018 13:23:07,054 INFO [ambari-client-thread-39] BootStrapImpl:108 - BootStrapping hosts ambaridn01.informatica.com:
18 Apr 2018 13:23:07,055 INFO [Thread-32] BSRunner:189 - Kicking off the scheduler for polling on logs in /var/run/ambari-server/bootstrap/3
18 Apr 2018 13:23:07,055 INFO [Thread-32] BSRunner:258 - Host= ambaridn01.informatica.com bs=/usr/lib/python2.6/site-packages/ambari_server/bootstrap.py requestDir=/var/run/ambari-server/bootstrap/3 user=root sshPort=22 keyfile=/var/run/ambari-server/bootstrap/3/sshKey passwordFile null server=ip-172-31-10-185.ap-south-1.compute.internal version=2.6.1.5 serverPort=8080 userRunAs=root timeout=300
18 Apr 2018 13:23:07,055 INFO [pool-19-thread-1] BSHostStatusCollector:55 - Request directory /var/run/ambari-server/bootstrap/3
18 Apr 2018 13:23:07,057 INFO [pool-19-thread-1] BSHostStatusCollector:62 - HostList for polling on [ambaridn01.informatica.com]
18 Apr 2018 13:23:07,058 INFO [Thread-32] BSRunner:286 - Bootstrap output, log=/var/run/ambari-server/bootstrap/3/bootstrap.err /var/run/ambari-server/bootstrap/3/bootstrap.out at ip-172-31-10-185.ap-south-1.compute.internal
18 Apr 2018 13:23:17,057 INFO [pool-19-thread-1] BSHostStatusCollector:55 - Request directory /var/run/ambari-server/bootstrap/3
18 Apr 2018 13:23:17,057 INFO [pool-19-thread-1] BSHostStatusCollector:62 - HostList for polling on [ambaridn01.informatica.com]
18 Apr 2018 13:23:27,058 INFO [pool-19-thread-1] BSHostStatusCollector:55 - Request directory /var/run/ambari-server/bootstrap/3
18 Apr 2018 13:23:27,058 INFO [pool-19-thread-1] BSHostStatusCollector:62 - HostList for polling on [ambaridn01.informatica.com]
18 Apr 2018 13:23:28,062 INFO [Thread-32] BSRunner:310 - Script log Mesg
INFO:root:BootStrapping hosts ['ambaridn01.informatica.com'] using /usr/lib/python2.6/site-packages/ambari_server cluster primary OS: redhat7 with user 'root'with ssh Port '22' sshKey File /var/run/ambari-server/bootstrap/3/sshKey password File null using tmp dir /var/run/ambari-server/bootstrap/3 ambari: ip-172-31-10-185.ap-south-1.compute.internal; server_port: 8080; ambari version: 2.6.1.5; user_run_as: root
INFO:root:Executing parallel bootstrap
ERROR:root:ERROR: Bootstrap of host ambaridn01.informatica.com fails because previous action finished with non-zero exit code (255)
ERROR MESSAGE: Connection to ambaridn01.informatica.com closed.
STDOUT: tput: No value for $TERM and no -T specified
tput: No value for $TERM and no -T specified
Connection to ambaridn01.informatica.com closed.
INFO:root:Finished parallel bootstrap
18 Apr 2018 13:23:28,062 INFO [pool-19-thread-1] BSHostStatusCollector:55 - Request directory /var/run/ambari-server/bootstrap/3
18 Apr 2018 13:23:28,062 INFO [pool-19-thread-1] BSHostStatusCollector:62 - HostList for polling on [ambaridn01.informatica.com]
18 Apr 2018 13:23:38,870 INFO [ambari-hearbeat-monitor] HeartbeatMonitor:318 - KAFKA_BROKER is at INSTALLED adding more payload per agent ask
18 Apr 2018 13:24:38,948 INFO [ambari-hearbeat-monitor] HeartbeatMonitor:318 - KAFKA_BROKER is at INSTALLED adding more payload per agent ask
18 Apr 2018 13:24:47,794 INFO [ambari-heartbeat-processor-0] HeartbeatProcessor:607 - State of service component METRICS_COLLECTOR of service AMBARI_METRICS of cluster mycluster has changed from INSTALLED to STARTED at host ambariserv.informatica.com according to STATUS_COMMAND report
18 Apr 2018 13:24:59,091 ERROR [ambari-client-thread-37] MetricsRequestHelper:115 - Error getting timeline metrics : Connection refused (Connection refused)
18 Apr 2018 13:24:59,092 ERROR [ambari-client-thread-37] MetricsRequestHelper:122 - Cannot connect to collector: SocketTimeoutException for ambariserv.informatica.com
18 Apr 2018 13:25:51,885 INFO [pool-17-thread-1] MetricSinkWriteShardHostnameHashingStrategy:42 - Calculated collector shard ip-172-31-10-185.ap-south-1.compute.internal based on hostname: ip-172-31-10-185.ap-south-1.compute.internal
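Most of the excerpt above is INFO-level polling noise, so it can help to filter out just the error lines. This is a minimal sketch, assuming the two line shapes seen in the excerpt: timestamped server lines with a level field, and bootstrap script output starting with "ERROR:" or "ERROR MESSAGE:". The function name error_lines and the regular expression are my own and are not part of Ambari.

```python
import re

# Matches timestamped server lines such as
# "18 Apr 2018 13:24:59,091 ERROR [thread] Class:115 - message"
SERVER_LINE = re.compile(r"^\d{2} \w{3} \d{4} [\d:,]+\s+(?P<level>[A-Z]+)\s")


def error_lines(log_text):
    """Return the lines of a log excerpt that look like errors."""
    hits = []
    for line in log_text.splitlines():
        match = SERVER_LINE.match(line)
        if match and match.group("level") == "ERROR":
            hits.append(line)
        elif line.startswith(("ERROR:", "ERROR MESSAGE:")):
            # Bootstrap script output embedded in the server log.
            hits.append(line)
    return hits


# A few lines copied from the excerpt above.
sample = """18 Apr 2018 13:23:28,062 INFO [Thread-32] BSRunner:310 - Script log Mesg
INFO:root:Executing parallel bootstrap
ERROR:root:ERROR: Bootstrap of host ambaridn01.informatica.com fails because previous action finished with non-zero exit code (255)
ERROR MESSAGE: Connection to ambaridn01.informatica.com closed.
18 Apr 2018 13:24:59,091 ERROR [ambari-client-thread-37] MetricsRequestHelper:115 - Error getting timeline metrics : Connection refused (Connection refused)
"""

for line in error_lines(sample):
    print(line)
```

On this sample it keeps the bootstrap failure, its error message, and the metrics collector error, and drops the two INFO lines.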
04-18-2018
12:36 PM
I am trying to add an additional node to an Ambari server running on AWS. I have installed ambari-agent on the new node, and it is up and running. But when I try to add the node to my cluster through Ambari, it fails with an error (see screenshot-1). I am not able to find the relevant logs on the Ambari server. Any idea where the logs would be, and what might have gone wrong?
Labels: Apache Ambari
04-08-2018
03:32 PM
Geoffery, Jay, thanks a lot for your replies. I went through the logs and found that the JVM was not able to start because of insufficient memory. I bumped up the server, and now it is working. Thanks a lot.
04-07-2018
07:03 AM
Hello Geoffery, thank you for coming back. Yes, the folder is available and the permissions look correct:

drwxr-xr-x. 2 hdfs hadoop 6 Apr 6 20:43 /hadoop/hdfs/data/

This is a single-node cluster, so there are no separate data and name nodes.
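The same check can be done from a script, which is handy when several data directories need to be verified. This is only a rough sketch: describe_dir is a made-up helper, and it reports numeric uid and gid rather than the hdfs and hadoop names shown in the listing.

```python
import os
import stat


def describe_dir(path):
    """Report whether path is a directory and, if so, its mode and numeric owner."""
    if not os.path.isdir(path):
        return {"exists": False, "mode": None, "uid": None, "gid": None}
    info = os.stat(path)
    return {
        "exists": True,
        # stat.filemode renders the mode the way ls does, e.g. "drwxr-xr-x".
        "mode": stat.filemode(info.st_mode),
        "uid": info.st_uid,
        "gid": info.st_gid,
    }


# The path is the one named in the alert; adjust it for other data dirs.
print(describe_dir("/hadoop/hdfs/data"))
```

If "exists" comes back False while ls shows the directory, the script is probably running on a different host or as a user that cannot traverse the parent directories.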
04-06-2018
10:40 PM
Please tell me if you need the logs and I will attach them here. There is a file limitation, so I am not attaching anything for now.
04-06-2018
10:34 PM
I installed HDP 2.6 on CentOS using the web repo and tried to start four services: Hadoop, SmartSense, Ambari Metrics and ZooKeeper. During cluster deployment from the web UI, all the services were installed but none of them started. The ambari-agent log shows the following:

ERROR 2018-04-06 20:41:19,502 script_alert.py:123 - [Alert][datanode_unmounted_data_dir] Failed with result CRITICAL: ['The following data dir(s) were not found: /hadoop/hdfs/data\n']
WARNING 2018-04-06 20:41:19,508 base_alert.py:138 - [Alert][namenode_hdfs_blocks_health] Unable to execute alert. [Alert][namenode_hdfs_blocks_health] Unable to extract JSON from JMX response

I found no documentation for this in the Hortonworks community. Any help?
Labels: