Support Questions
Find answers, ask questions, and share your expertise
Announcements
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

ERROR 2017-11-02 05:55:40,989 Controller.py:415 - Connection to bdanode1.xxx.com was lost (details=Request to https://bdanode1.xxx..com:8441/agent/v1/heartbeat/bdanode1.xxxx.com failed due to Error occured during connecting to the server: '') 2.

Highlighted

ERROR 2017-11-02 05:55:40,989 Controller.py:415 - Connection to bdanode1.xxx.com was lost (details=Request to https://bdanode1.xxx..com:8441/agent/v1/heartbeat/bdanode1.xxxx.com failed due to Error occured during connecting to the server: '') 2.

New Contributor

how can i start ambari-server.i did not change IP address and my machine FDQN also perfect in 3 node cluster.

i restart ambari-agent and ambari-server.

ssh is perfect and firewall also stopped

ntp service also running properly@swami sangameshwar

3 REPLIES 3

Re: ERROR 2017-11-02 05:55:40,989 Controller.py:415 - Connection to bdanode1.xxx.com was lost (details=Request to https://bdanode1.xxx..com:8441/agent/v1/heartbeat/bdanode1.xxxx.com failed due to Error occured during connecting to the server: '') 2.

Super Mentor

@swami sangameshwar

1. Please check if you have any error mentioned in the "/var/log/ambari-agent/ambari-agent.log" To see if you are getting any error Specially any SSL error.

2. Try doing a telnet to see if you are able to connect to Ambari Server?

# nc -v $AMBARI_HOST 8441

.

3. Have you recently upgraded any OS package/kernel?

Re: ERROR 2017-11-02 05:55:40,989 Controller.py:415 - Connection to bdanode1.xxx.com was lost (details=Request to https://bdanode1.xxx..com:8441/agent/v1/heartbeat/bdanode1.xxxx.com failed due to Error occured during connecting to the server: '') 2.

New Contributor

Hi Kumar,

[root@bdanode1 ~]# nc -v bdanode1.xxx.com 8441

Ncat: Version 6.40 ( http://nmap.org/ncat )

Ncat: Connection refused.

No i did not upgrade os.

Re: ERROR 2017-11-02 05:55:40,989 Controller.py:415 - Connection to bdanode1.xxx.com was lost (details=Request to https://bdanode1.xxx..com:8441/agent/v1/heartbeat/bdanode1.xxxx.com failed due to Error occured during connecting to the server: '') 2.

Super Mentor

@swami sangameshwar

Sometimes the Heartbeat lost is temporary due to some network issue or load on ambari server the heartbeat messages ar enot processed properly Or Ambari Server is not able to respond to the agent's heartbeat message. So we will also need to see if the Ambari Server is running properly and has enough memory available or if there is any error logged in ambari-server.log around the mentioned timestamp.



But in this case it looks like the Ambari Port 8441 is not accessible from the agent machine hence, Please check if the Ambari Server has opened the port 8441 or some how that port is blocked for remote access?

On Ambari Server Host:

# netstat -tnlpa | grep 8441
# netstat -tnlpa | grep `cat /var/run/ambari-server/ambari-server.pid` 

# hostname -f
# service iptables stop

.

Try restarting ambari-server to see if the port (8441) is getting opened, And also chcking the ambari-server.log will really help to know if there are any errors.

.