YARN components fail after failed Ambari Metrics Collector migration

Expert Contributor

I tried to migrate the Ambari Metrics Collector (AMC) from one node to another, and the migration failed. Since then, all the YARN components have been failing with the following error message:

Connection failed to http://<<<<Host Name >>>>:8088 (urlopen error: Connection refused)

2 REPLIES

Cloudera Employee

Port 8088 is the ResourceManager web UI port. Is the ResourceManager running? Can you telnet to <<<<Host Name >>>>:8088 from the NodeManager host?
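If telnet is not installed on the node, the same reachability check can be sketched in Python. This is only a rough equivalent, assuming Python 3 is available on the host; the function name `port_is_open` and the host name `rm-host.example.com` are placeholders made up for illustration, so substitute your own ResourceManager host.

```python
import socket


def port_is_open(host, port, timeout=5.0):
    """Return True if a TCP connection to host:port succeeds, else False."""
    try:
        # create_connection resolves the host name and attempts a TCP connect,
        # which is essentially what a manual telnet to the port would do.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, timeouts, and name-resolution failures.
        return False


if __name__ == "__main__":
    host = "rm-host.example.com"  # placeholder: replace with your ResourceManager host
    port = 8088                   # ResourceManager web UI port mentioned above
    state = "open" if port_is_open(host, port) else "closed or unreachable"
    print(f"{host}:{port} is {state}")
```

A "closed or unreachable" result from the NodeManager host, combined with the same result when run on the ResourceManager host itself, would suggest the ResourceManager process is not listening at all rather than a network or firewall problem between the nodes.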

Expert Contributor

The following components are having issues:

NodeManager Web UI:

Connection failed to http://dev3.exp.caspian.rax.io:8042 (<urlopen error [Errno 111] Connection refused>)

NodeManager Health:

Connection failed to http://dev3.exp.caspian.rax.io:8042/ws/v1/node/info (Traceback (most recent call last):
  File "/var/lib/ambari-agent/cache/common-services/YARN/2.1.0.2.0/package/alerts/alert_nodemanager_health.py", line 165, in execute
    url_response = urllib2.urlopen(query, timeout=connection_timeout)
  File "/usr/lib64/python2.6/urllib2.py", line 126, in urlopen
    return _opener.open(url, data, timeout)
  File "/usr/lib64/python2.6/urllib2.py", line 391, in open
    response = self._open(req, data)
  File "/usr/lib64/python2.6/urllib2.py", line 409, in _open
    '_open', req)
  File "/usr/lib64/python2.6/urllib2.py", line 369, in _call_chain
    result = func(*args)
  File "/usr/lib64/python2.6/urllib2.py", line 1190, in http_open
    return self.do_open(httplib.HTTPConnection, req)
  File "/usr/lib64/python2.6/urllib2.py", line 1165, in do_open
    raise URLError(err)
URLError: <urlopen error [Errno 111] Connection refused>
)

ResourceManager Web UI:

Connection failed to http://dev3.exp.caspian.rax.io:8088 (<urlopen error [Errno 111] Connection refused>)
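For anyone wanting to reproduce these alerts by hand, here is a minimal sketch of the same kind of HTTP probe. The traceback shows the Ambari alert calling `urllib2.urlopen` under Python 2.6; the version below assumes Python 3 and uses `urllib.request` instead, and the function name `probe` and its return strings are my own, not part of Ambari.

```python
import urllib.error
import urllib.request

# Ignore any proxy environment variables so the request goes straight to the host.
_opener = urllib.request.build_opener(urllib.request.ProxyHandler({}))


def probe(url, timeout=5.0):
    """Return 'ok', 'refused', or a short description of another failure."""
    try:
        with _opener.open(url, timeout=timeout) as response:
            return "ok" if response.status == 200 else f"http {response.status}"
    except urllib.error.HTTPError as exc:
        # The server answered, but with an HTTP error status.
        return f"http {exc.code}"
    except urllib.error.URLError as exc:
        # [Errno 111] Connection refused: nothing is listening on that port.
        if isinstance(exc.reason, ConnectionRefusedError):
            return "refused"
        return f"error: {exc.reason}"
    except OSError as exc:
        # For example, a timeout while reading the response.
        return f"error: {exc}"


if __name__ == "__main__":
    # URLs copied from the alerts above.
    for url in (
        "http://dev3.exp.caspian.rax.io:8042",
        "http://dev3.exp.caspian.rax.io:8042/ws/v1/node/info",
        "http://dev3.exp.caspian.rax.io:8088",
    ):
        print(url, "->", probe(url))
```

A result of "refused" on all three URLs would match the alerts: the host is reachable, but no process is listening on ports 8042 and 8088.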